Skip to content

mjplacroix/amazon_reviews

Repository files navigation

Amazon Reviews Analysis

This Jupyter notebook is a visual data analysis investigating a handful of correlations across product categories in this dataset of 230 million Amazon reviews.

The following are a series of draft notebook I used for EDA (exploratory data analysis). Many pieces of the functions in the above notebook were drafted in these.

  1. pandas_feather_benchmark - a progressively scaled benchmarking comparison of feather-format VS pd.read_csv and .apply VS pandas datetime
  2. Time_Analysis - a time based analysis of several product catagories to discover any trends across annual, monthly, or weekly reviews
  3. Review_Timing - comparing across a series of time increments (day of week, day of month, and monthly) the average rating
  4. Word_Count - comparing common language in reviews across several catagories, included an analysis of top words

Notebook viewer

About

Analysis of 233 million Amazon reviews

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published