#Getting and Cleaning Data
###Introduction The purpose of this project is to demonstrate ability to collect, work with, and clean a data set. The goal is to prepare tidy data that can be used for later analysis.
###Course Requirements
- a tidy data set
- a link to a Github repository with script for performing the analysis
- a code book that describes the variables, the data, and any transformations or work performed to clean up the data called CodeBook.md
###Data Raw data used in this project can be obtained from the following link:
https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip
The zip file will extract to a number of data and meta data text files.
###The R Script
An R script reads data files and applies of transformations to obtain tidy data. Operations applied to the data include:
- Reading Training, Test and Features Data
- Merging of Training and Test Data
- Attaching Activity and Subjects Data to Merged Data
- Removal of Special Characters from Variable Names
- Calculation of avaerages of Activities per Subject
- Attaching Labels to Activity Values
- Writing the Tidy Data to file
###Tidy Data
The tidy data text file (tidy_data.txt) is the result of applying tidy data principles to the raw data. It contains Subjects grouped by all possible activities performed measuring the mean and standard deviations of a number of factors.
###Code Book A codeblock detailing the data, variables and code scripts used is also attached. It lists the input and output data, variables used as well as sequence of transformations applied to the data.