Skip to content

Enhancing Productivity and Collaboration with an Organized File and Folder Structure in Data Science Projects Using Jupyter Lab

Notifications You must be signed in to change notification settings

ncdingari/data-science-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science Project

This data science project is structured to support robust data analysis and machine learning workflows using Python and Jupyter Notebooks. Below is an overview of the project structure and instructions on how to get started.

Project Structure

data/

  • raw/: Contains raw data files that are immutable and unaltered directly from the source.
  • processed/: Contains cleaned and manipulated data, ready for analysis.
  • external/: Data from third-party sources.

notebooks/

  • exploratory/: Contains Jupyter Notebooks for initial data exploration and analysis.
  • report/: Contains finalized notebooks for reporting and presentation purposes.

src/

This directory houses all Python scripts organized by their functionality:

  • data/: Scripts for data acquisition and generation.
  • features/: Scripts for feature engineering.
  • models/: Scripts for model training and prediction.
  • visualization/: Scripts for generating visualizations.

reports/

  • figures/: Stores generated graphical content for use in reports.
  • logs/: Contains output logs from scripts and models, useful for debugging and tracking experiments.

requirements.txt

Lists all Python libraries required to run the project. Ensures environment consistency across different setups.

  1. Install Requirements:
    pip install -r requirements.txt
  2. Conda Package Install Requirements:
    conda env create --name data_science_project_env --file environment.yaml
    conda activate data_science_project_env 
    

.gitignore

Specifies intentionally untracked files that Git should ignore.

setup.py

Makes the project pip-installable, allowing its modules to be easily imported across different parts of the project.

Installation Instructions

  1. Clone the repository:
    git clone [email protected]:ncdingari/data-science-project.git
    cd data-science-project

About

Enhancing Productivity and Collaboration with an Organized File and Folder Structure in Data Science Projects Using Jupyter Lab

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published