It consists of the following files:
- run_analysis.R reads the UCI HAR dataset as input and performs the required analysis
- readme.md gives an overview of the processing
- codebook.md describes the variables in the output file, i.e. the tidy dataset

The analysis performs the following steps:
- Read train data, X and Y variables along with subject data
- Read test data, X and Y variables along with subject data
- Read the feature names along with the activity labels
- Combine the train and test data into a single dataset and retain only the columns that have std or mean in their names
- Group the data by activity and subject and calculate the mean of every variable within each group
- Finally, write out the tidy dataset to a text file
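
The steps above can be sketched in base R as follows. This is a minimal illustration, not the contents of run_analysis.R: the helper names `read_split` and `tidy_means`, the output file name `tidy.txt`, and the small made-up data frames are all assumptions, and the file paths in `read_split` assume the standard layout of the UCI HAR archive (for example `train/X_train.txt`).

```r
# Hypothetical helper names; run_analysis.R may be organised differently.

# Read X, y and subject data for one split ("train" or "test").
# Assumes the standard archive layout, e.g. <dir>/train/X_train.txt.
read_split <- function(dir, split) {
  path <- function(prefix) {
    file.path(dir, split, paste0(prefix, "_", split, ".txt"))
  }
  list(
    x = read.table(path("X")),
    activity = read.table(path("y"))[[1]],
    subject = read.table(path("subject"))[[1]]
  )
}

# Combine both splits, keep the mean/std columns, then average every
# retained variable per activity and subject.
# `features` is a character vector of column names;
# `labels` maps activity ids (1, 2, ...) to activity names.
tidy_means <- function(train, test, features, labels) {
  x <- rbind(train$x, test$x)
  names(x) <- features
  keep <- grepl("mean|std", features)
  activity <- labels[c(train$activity, test$activity)]
  subject <- c(train$subject, test$subject)
  aggregate(x[, keep, drop = FALSE],
            by = list(activity = activity, subject = subject),
            FUN = mean)
}

# Tiny made-up data so the sketch runs without the real dataset.
features <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-max()-X")
labels <- c("WALKING", "SITTING")
train <- list(x = data.frame(V1 = c(1, 3), V2 = c(2, 4), V3 = c(9, 9)),
              activity = c(1, 1), subject = c(1, 1))
test <- list(x = data.frame(V1 = 5, V2 = 6, V3 = 9),
             activity = 2, subject = 2)

tidy <- tidy_means(train, test, features, labels)

# Write the tidy dataset to a text file (file name is an assumption).
write.table(tidy, file.path(tempdir(), "tidy.txt"), row.names = FALSE)
```

With the real dataset, `train` and `test` would presumably come from `read_split(dir, "train")` and `read_split(dir, "test")`, and `features` and `labels` from the second column of the archive's features.txt and activity_labels.txt. `aggregate` is called with `by =` rather than a formula because the feature names contain characters such as `-` and `()` that do not parse as R symbols.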