Skip to content

AlephTaw/the-open-data-science-bootcamp

Repository files navigation

the-open-data-science-bootcamp

Introduction:

This work is an outgrowth of an intensive self-study program in data science and machine learning. The goal is to maintain a collection of tools, illustrative coding samples, and learning resources to augment a formal or self-directed data science 'bootcamp experience.' Enjoy!

** Note: Efforts have been made to include as much open-source content as possible. Feel free to suggest alternatives to paid content, where comparable alternatives exist.

Sample Schedule:

Contents:

  • Software Engineering:

    • General:

      • Object Oriented Design Principles
      • Software Engineering Design Patterns
      • Exception Handling
      • Regular Expressions
      • Else
    • Language (Python)

      • Style Guide
      • Documentation
      • Linting
      • Logging
      • Documenting Projects
      • Testing
      • Language Deep Dive (Idioms, and more)
      • Data Science Specific
      • Else
        • Packaging Projects
    • Language (JavaScript)

      • Style Guide
      • Documentation
      • Linting
      • Logging
      • Documenting Projects
      • Testing
      • Language Deep Dive (Idioms, and more)
      • Else
  • Web (*Data Science 'Core Web Topics.' Other topics included as boilerplate fullstack content.)

    • Tools
      • Server-Side Tools/Tech
        • Flask*
        • NodeJS, Express, Mongoose
      • Client-Side Tools/Tech
        • HTML*, CSS*, Sass, JSS
        • ReactJS
        • Bootstrap, Material Design
      • UI/UX
        • Figma
    • In Practice
      • Scraping
        • Beautiful Soup
      • Apis
  • Algorithms

    • Data Structures
    • Searching and Sorting
    • Heaps
    • Trees
    • Graphs
    • Else
  • Data

    • Databases and Libraries

      • ORMs
        • sqlalchemy
        • flask-sqlalchemy
    • Data Analysis

      • pandas
    • Data Mining

      • Scraping
    • Data Visualization

      • matplotlib
      • bokeh
      • altair
      • plotly
      • D3js
    • Else

    • Scientific Computing

      • numpy
      • sciPy
  • Machine Learning

    • General:
      • scikit-learn
    • Deep Learning:
      • keras
      • tensorflow
      • pytorch
  • Statistics

  • DevOps

  • Data and Computing at Scale:

    • Spark / PySpark
    • Kafka
    • Apache Storm
    • MapReduce
    • Else
      • hdf5, ...

Subject Courses, Lectures and Live Content:

Illustrative Projects:

  • Production Demo Pipelines: Scikit-Learn, Keras, Pytorch
  • Twitter Sentiment
  • Wiki
  • Atari Deep Reinforcement Learning
  • Business Case Study
  • Papers to Code
  • Reference Applications and Implementations

Else - Data Science in Practice ...

  • Case Studies
  • Interviews
  • Business Analytics:
    • Metrics
    • A/B Testing

Resource Type Key:

* Cheatsheets:
* Documentation:
* Live Content (forumns, blogs, news, etc.)
* Video Tutorials:
* Written Tutorials:
* Video Lectures:
* Notebooks:
* Code Examples:
* Illustrative Projects:

Dependencies:

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published