Genetic-Data-PCA-implementation

Feature engineering using Principal Component Analysis

The PCA function takes two input files: an intensity file and a metadata file. Their formats are shown in the attached files gene_data.csv and meta.csv. The package should handle errors appropriately when the input files are not in the expected format. The output of the PCA function should be an interactive plot of PC1 versus PC2. The goal is to evaluate what PCA tells us about the given dataset (described in the following paragraph): do the different timepoints in the metadata file separate on the PCA plot, and what does that mean for the dataset?
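The PCA step and the PC1 versus PC2 plot described above can be sketched as follows. This is a minimal illustration, not the repository's implementation: the function names `pca_scores` and `plot_pca` are hypothetical, the PCA is computed with a NumPy SVD on mean-centred data, and the interactive plot assumes Plotly is installed. The input is assumed to be a matrix with one row per sample and one column per gene.

```python
import numpy as np


def pca_scores(matrix, n_components=2):
    """Project samples (rows) onto the leading principal components.

    Hypothetical helper: returns (scores, explained_variance_ratio).
    """
    x = np.asarray(matrix, dtype=float)
    if x.ndim != 2 or x.shape[0] < 2:
        raise ValueError("expected a 2-D samples x genes matrix with at least 2 samples")
    if not np.isfinite(x).all():
        raise ValueError("intensity values must be finite numbers")

    centered = x - x.mean(axis=0)  # centre each gene (column) at zero
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    variance = s ** 2
    if variance.sum() == 0:
        raise ValueError("all samples are identical; PCA is undefined")

    scores = u[:, :n_components] * s[:n_components]
    ratio = variance[:n_components] / variance.sum()
    return scores, ratio


def plot_pca(scores, sample_ids, timepoints):
    """Return an interactive PC1 vs PC2 scatter coloured by timepoint."""
    import plotly.express as px  # imported lazily; only needed for plotting

    data = {
        "PC1": scores[:, 0],
        "PC2": scores[:, 1],
        "Time": [str(t) for t in timepoints],  # strings give discrete colours
        "Sample": list(sample_ids),
    }
    return px.scatter(data, x="PC1", y="PC2", color="Time", hover_name="Sample")
```

Calling `plot_pca(scores, sample_ids, timepoints).show()` would open the plot. If samples from the same timepoint cluster together and apart from other timepoints, time is a major source of variation in the data; if the colours are mixed, other factors dominate the first two components.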

To understand the data, first look at the gene data. It contains 32 columns, with the gene names in a column called symbol. Each row corresponds to one gene and holds 30 values, one per sample; the column a value sits in identifies its sample. Next, look at the metadata in meta.csv. It has a column called sIdx, whose values correspond to the sample names in the gene data. The next column is Time, which gives the time at which each sample was taken. Note that there are multiple samples for each time point.
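The layout described above suggests how the two files could be read, validated, and aligned. The sketch below uses only the standard library and relies on the column names symbol, sIdx, and Time from the description; the function name `load_inputs` and the exact error messages are assumptions for illustration. It keeps only the gene-data columns whose headers appear in sIdx and transposes the values so that each row is a sample.

```python
import csv


def load_inputs(gene_path, meta_path):
    """Read both CSV files and return (genes, sample_ids, matrix, timepoints).

    matrix has one row per sample and one column per gene.
    Hypothetical helper; raises ValueError on malformed input.
    """
    with open(meta_path, newline="") as handle:
        reader = csv.DictReader(handle)
        missing = {"sIdx", "Time"} - set(reader.fieldnames or [])
        if missing:
            raise ValueError(f"metadata file is missing columns: {sorted(missing)}")
        time_by_sample = {row["sIdx"]: row["Time"] for row in reader}

    with open(gene_path, newline="") as handle:
        reader = csv.DictReader(handle)
        fields = reader.fieldnames or []
        if "symbol" not in fields:
            raise ValueError("gene file has no 'symbol' column")
        # Sample columns are those whose header matches an sIdx value.
        sample_ids = [name for name in fields if name in time_by_sample]
        if not sample_ids:
            raise ValueError("no gene-data columns match the sIdx values")

        genes, rows = [], []
        for row_no, row in enumerate(reader, start=1):
            try:
                rows.append([float(row[s]) for s in sample_ids])
            except (TypeError, ValueError) as exc:
                raise ValueError(f"non-numeric intensity in data row {row_no}") from exc
            genes.append(row["symbol"])

    # Transpose genes x samples into samples x genes.
    matrix = [list(column) for column in zip(*rows)]
    timepoints = [time_by_sample[s] for s in sample_ids]
    return genes, sample_ids, matrix, timepoints
```

Because several samples share a time point, `timepoints` contains repeated values; these are the groups to compare on the PCA plot.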

Repository files:

01-PCA+Scatter-Code.ipynb
02-Code-HTML.html
03-Code-Py.py
Meta-data-sheet.csv
README.md
gene_data-.csv
