-
Notifications
You must be signed in to change notification settings - Fork 7
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
13 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
reddis_data_viz | ||
=============== | ||
|
||
Data science work for RedditInsight. | ||
|
||
1. Segmented data by subreddit | ||
2. Used NLTK to separate the words in titles by their parts of speech | ||
3. Developed frequency analysis of nouns by subreddit | ||
4. Munged dataset for predictive model- extracted day of week, and hour of day the post was created. Developed categorical variable out of the subreddit and domain features. | ||
5. Evaluated predictive value of model, decided to focus on data visualizations. | ||
6. Developed clustering analysis of subreddit data for subreddits that had natural topic segmentation. | ||
7. Developed noun frequency analysis by subreddit | ||
8. Visualizations created from this work are in- https://github.com/sheltowt/redditD3 |