
# Coursera's Data Science Specialization Repo - Getting and Cleaning Data Course Project

This repo contains the three files required by the Course Project of the Getting and Cleaning Data course: readme.md, run_analysis.R and cookbook.md

The run_analysis.R script contains a run() function that performs the tasks required by the project:

1. Merges the training and the test sets to create one data set.
2. Extracts only the measurements on the mean and standard deviation for each measurement.
3. Uses descriptive activity names to name the activities in the data set.
4. Appropriately labels the data set with descriptive variable names.
5. From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.
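
To make the five steps concrete, here is a minimal sketch on invented toy data, written in base R. It is not the code from run_analysis.R: the feature names, values, the `make_set` helper and the `mean()`/`std()` matching pattern are all assumptions made for illustration.

```r
# Toy stand-ins for the UCI HAR tables; every name and value here is invented.
features <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-energy()-X")
activity_labels <- data.frame(id = 1:2, activity = c("WALKING", "SITTING"))

# Hypothetical helper that builds one small set (subject + activity id + measurements).
make_set <- function(subject, activity_id, offset) {
  x <- as.data.frame(matrix(seq_len(length(features) * length(subject)) + offset,
                            ncol = length(features), byrow = TRUE))
  names(x) <- features
  cbind(subject = subject, activity_id = activity_id, x)
}
train <- make_set(subject = c(1, 1, 2), activity_id = c(1, 1, 2), offset = 0)
test  <- make_set(subject = c(3, 3),    activity_id = c(2, 2),    offset = 100)

# Step 1: merge the training and test sets.
merged <- rbind(train, test)

# Step 2: keep only the mean() and std() measurements (assumed matching rule).
keep <- grepl("mean\\(\\)|std\\(\\)", names(merged))
selected <- merged[, c("subject", "activity_id", names(merged)[keep])]

# Step 3: replace activity ids with descriptive activity names.
selected$activity <- activity_labels$activity[match(selected$activity_id,
                                                    activity_labels$id)]
selected$activity_id <- NULL

# Step 4: turn the raw feature names into plain variable names.
names(selected) <- gsub("\\(\\)", "", names(selected))
names(selected) <- gsub("-", "_", names(selected))

# Step 5: average every variable for each subject and activity.
tidy <- aggregate(. ~ subject + activity, data = selected, FUN = mean)
print(tidy)
```

The result has one row per subject/activity pair and one column per retained measurement, which is the shape step 5 asks for.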

The function relies on two assumptions:

1. The "getdata-projectfiles-UCI HAR Dataset" zip file has been extracted in the working directory as it is (the read.table calls include paths to the necessary txt files).
1. Whoever runs the script has already installed the gdata/gplyr/reshape packages.
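
Both assumptions can be checked before calling run(). The sketch below is not part of run_analysis.R: `check_prerequisites` is a hypothetical helper, the folder name "UCI HAR Dataset" is an assumption about what the zip extracts to, and the default package list is only a guess based on the melt/cast functions mentioned in this readme.

```r
# Hypothetical helper: reports whether the data folder exists and
# which of the requested packages are not installed.
check_prerequisites <- function(data_dir = "UCI HAR Dataset",   # assumed folder name
                                packages = c("reshape2")) {     # assumed package list
  installed <- vapply(packages, requireNamespace, logical(1),
                      quietly = TRUE, USE.NAMES = FALSE)
  list(
    data_dir_found   = dir.exists(data_dir),
    missing_packages = packages[!installed]
  )
}

status <- check_prerequisites()
if (!status$data_dir_found) {
  message("Data folder not found in ", getwd())
}
if (length(status$missing_packages) > 0) {
  message("Missing packages: ", paste(status$missing_packages, collapse = ", "))
}
```

Pass your own `data_dir` and `packages` values if your setup differs from these defaults.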

The script also includes comments between code lines that explain its logic step by step. In overview, the script:

1. Reads the required data frames and stores them in variables for easy access.
1. Cross-references the data tables with the tables containing column names.
1. Selects all columns that contain mean or std values.
1. Binds the corresponding tables (X, Y and Subject).
1. Repeats this for both the Train and Test tables.
1. Concatenates the tables into a single large table.
1. Creates a tidy data file using the melt/cast functions from the reshape2 package.
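
The final melt/cast step can be illustrated with a small sketch. It assumes the reshape2 package is installed and uses an invented `combined` table in place of the real concatenated data; the column names are made up and the actual script may differ.

```r
library(reshape2)

# Invented stand-in for the concatenated Train + Test table.
combined <- data.frame(
  subject  = c(1, 1, 2, 2),
  activity = c("WALKING", "WALKING", "SITTING", "SITTING"),
  mean_x   = c(1, 3, 2, 4),
  std_x    = c(10, 30, 20, 40)
)

# melt: go from wide to long, one row per (subject, activity, variable) value.
long <- melt(combined, id.vars = c("subject", "activity"))

# dcast: go back to wide, averaging the values within each subject/activity pair.
tidy <- dcast(long, subject + activity ~ variable,
              fun.aggregate = mean, value.var = "value")
print(tidy)
```

Melting first means the averaging rule is stated once in `dcast` instead of once per measurement column.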

Wish me luck... and good luck to you too!

### arturocm
