A project analyzing how the content of TED talks have evolved over the years in terms of the presentation styles of speakers as well as the topics discussed.
Submitted as the Final Project for MY459 (Quantitative Text Analysis) for the MSc Applied Social Data Science degree at LSE
├── LICENSE
├── README.md <- The top-level README for developers using this project.
├── data
│ ├── interim <- Intermediate data that has been transformed.
│ ├── processed <- The final processed data for analysis.
│ └── raw <- The original, immutable TED corpus.
│
├── notebooks <- Jupyter notebooks.
│
│
├── reports <- Generated analysis as HTML, PDF, LaTeX, etc.
│ └── figures <- Generated graphics and figures to be used in reporting
│
├── requirements.txt <- The requirements file for reproducing the analysis environment.
│
├── src <- Source code for use in this project.
├── __init__.py <- Makes src a Python module
│
├── web_scraping <- Scripts to scrape TED to collect data
└── scrape_ted.py
Project template based on the cookiecutter data science project template. #cookiecutterdatascience