clustering_dashplotly

Clusters the data and makes a dashboard with some basic plots

[UMAP, DBSCAN, Agglomerative, Dash, Plotly]

(Year 1 Data Science(HVE) course assignment 2022)

(Addressing outdated logic in this code to improve efficiency. Refactoring in progress to address god object concerns, inefficient loops in data processing, enhance modularity, etc)

Takes in pre-processed data (no NaNs, encoded);
Scales the data;
Makes 2D UMAP embedding;
Performs DBSCAN and AgglomerativeClusterer hyperparameter tuning (for-loops);
Runs DBSCAN and AgglomerativeClusterer on the data, appends obtained cluster labels to the original dataframe;
Plots the results (basic Dash Plotly dashboard)
- 3 exploratory scatterplots (UMAP data embedding, Dbscan clustering results on the embedding, Agglomerative clustering results on the embedding) with some clustering evaluation metrics displayed (Silhouette, Davis-Bouldin, Calinski-Harabasz)
- 2 callback-changeable plots: bar- and donut chart (displays feature distribution per chosen algorithm, per chosen cluster)

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Absenteeism_at_work.csv		Absenteeism_at_work.csv
README.md		README.md
Starbucks_cleaned.csv		Starbucks_cleaned.csv
getdashboard_.py		getdashboard_.py
main.py		main.py
requirements.txt		requirements.txt
wine-clustering.csv		wine-clustering.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

clustering_dashplotly

Clusters the data and makes a dashboard with some basic plots

[UMAP, DBSCAN, Agglomerative, Dash, Plotly]

About

Releases

Packages

Languages

juliazubko/clustering_dashplotly

Folders and files

Latest commit

History

Repository files navigation

clustering_dashplotly

Clusters the data and makes a dashboard with some basic plots

[UMAP, DBSCAN, Agglomerative, Dash, Plotly]

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages