Spectral Clustering

What is this?

This is my final assignment in a course I took, "Software Project" (0368.2161)

The code performs Spectral Clustering on any given comma-delimited csv dataset - detailed documentation of the algorithm itself can be found in ./tests/resources/sp_project.pdf, which was provided by course staff

Omg wowa wiwa! How did you do this?

I reused my previous work on kmeans++
I significantly optimized some parts of the algorithm by hardcoding matrix multiplication on rotation matrices, which are very close in our case to the identity matrix - this greatly reduced computational complexity in Jacobi iterations
Most of the "heavy lifting" is done in C, with Python hooks in ./spkmeansmodule.c
You can find a bunch of unit tests in ./tests

But are there any limitations?

The data is loaded straight to memory, however it should be trivial, by modifying matrix.c, to allow for (slow) caching and even data manipulation on persistent storage, if we're dealing with extremely large amounts of data (i.e. many datapoints / large dimensionality)
Yes! Many more

Name		Name	Last commit message	Last commit date
Latest commit History 204 Commits
.vscode		.vscode
algorithms		algorithms
generics		generics
tests		tests
.gitignore		.gitignore
README.md		README.md
comp.sh		comp.sh
kmeans.c		kmeans.c
setup.py		setup.py
spkmeans.c		spkmeans.c
spkmeans.h		spkmeans.h
spkmeans.py		spkmeans.py
spkmeansmodule.c		spkmeansmodule.c
submit.sh		submit.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spectral Clustering

What is this?

Omg wowa wiwa! How did you do this?

But are there any limitations?

About

Releases

Packages

Languages

msx98/spectral_clustering

Folders and files

Latest commit

History

Repository files navigation

Spectral Clustering

What is this?

Omg wowa wiwa! How did you do this?

But are there any limitations?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages