Skip to content

This rep contains the initial but most important step, a data scientist must know

Notifications You must be signed in to change notification settings

Pari-singh/Getting-and-Cleaning-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

Accelerometer data analysis.

R script that analyze data gathered from smartphones accelerometers. Done as part of the Getting and Clening Data Coursera course.

Getting and Cleaning the data:

  1. Load the raw data from the working directory. Raw data is divided into train and test data sets.
  2. Merge the training and the test sets to create one data set.
  3. Load variable names from raw data separated file and assign them to the merged data
  4. Extract only the measurements on the mean and standard deviation for each measurement.
  5. Get variable names for the mean measurements
  6. Get variable names for the standard deviation measurements
  7. Subset data selecting columns that are among the previous variable names
  8. Use descriptive activity names to name the activities in the data set.
  9. Load activity names from raw data separated file
  10. Cast activity variable to factor
  11. Level activity variable to the loaded activity names
  12. Label the data set with descriptive variable names.
  13. Make variables names lower case
  14. Remove any dot from variable names
  15. Create an independent tidy data set with the average of each variable for each activity and each subject.
  16. Write the tidy data set to an output file.

Codebook

Dataset description is given in CodeBook

About

This rep contains the initial but most important step, a data scientist must know

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages