Solutions to data mining course homeworks.
-
Updated
Aug 19, 2021 - Jupyter Notebook
Solutions to data mining course homeworks.
An R package for simple, customizable binning of flow cytometric data
Mock exercise as Chief Data Scientist analyzing student standardized testing data for school board.
Taxonomic classification and read binning of mitochondrial DNA
Optimal binning: monotonic binning with constraints
Binning method to allow performing point cloud classification tasks on low resources machines.
Photographic binning
hierarchical clustering of DNA sequence using upcxx
BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.
Feature engineering is the process of transforming raw data into features. Here are some basic ideas about feature engineering.
Tools for hexagonal binning (honeycomb plot) and visualisation.
recogniZing gEnome seqUences in metagenomic aSSemblies
Credit scoring toolkit with python
The function here is designed for binning continuous independent variables, in the way minimizing total entropy of corresponding response. Also this function can plot the change of the entropy in the process.
The feature engineering techniques discussed are - dimensionality reduction(pca), scaling(standard scaler, normalizer, minmaxscaler), categorical encoding(one hot/dummy), binning, clustering, feature selection. These are techniques performed on a dataset consisting of Californian House Prices.
This was my first project ever on Python. It's also my first attempt at EDA for my Executive PGP Course, with IIIT-B and UpGrad.
Binning biological data tracks and producing RDS containing data.tables
Add a description, image, and links to the binning topic page so that developers can more easily learn about it.
To associate your repository with the binning topic, visit your repo's landing page and select "manage topics."