Data Science Project

This project provides a directory layout for data analysis and machine learning workflows built on Python and Jupyter Notebooks. The sections below describe the project structure and explain how to get started.

Project Structure

`data/`

raw/: Contains the original data files exactly as obtained from the source. Treat these files as immutable and never edit them in place.
processed/: Contains cleaned and manipulated data, ready for analysis.
external/: Data from third-party sources.
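The split between raw/ and processed/ implies a one-way flow: scripts read from raw/ and write their results to processed/, leaving the originals untouched. A minimal sketch of that flow follows, using only the standard library. The function name `clean_raw_file` and the cleaning rule (dropping rows with empty fields) are illustrative assumptions, not part of this repository.

```python
import csv
from pathlib import Path


def clean_raw_file(raw_path: Path, processed_dir: Path) -> Path:
    """Write a cleaned copy of a raw CSV into processed_dir.

    Hypothetical helper: the raw file is only read, never modified.
    The cleaning rule here (drop incomplete rows) is just an example.
    """
    processed_dir.mkdir(parents=True, exist_ok=True)
    out_path = processed_dir / raw_path.name

    with raw_path.open(newline="") as src, out_path.open("w", newline="") as dst:
        reader = csv.DictReader(src)
        writer = csv.DictWriter(dst, fieldnames=reader.fieldnames or [])
        writer.writeheader()
        for row in reader:
            # Keep only rows where every column holds a non-empty string.
            if all(isinstance(v, str) and v.strip() for v in row.values()):
                writer.writerow(row)
    return out_path
```

Called as `clean_raw_file(Path("data/raw/example.csv"), Path("data/processed"))`, where `example.csv` is a placeholder file name, this would leave the raw file unchanged and produce a cleaned file of the same name under processed/.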

`notebooks/`

exploratory/: Contains Jupyter Notebooks for initial data exploration and analysis.
report/: Contains finalized notebooks for reporting and presentation purposes.

`src/`

This directory houses all Python scripts organized by their functionality:

data/: Scripts for data acquisition and generation.
features/: Scripts for feature engineering.
models/: Scripts for model training and prediction.
visualization/: Scripts for generating visualizations.

`reports/`

figures/: Stores generated graphical content for use in reports.
logs/: Contains output logs from scripts and models, useful for debugging and tracking experiments.
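The directory layout described above can be recreated in an empty checkout with a short script. This is a sketch under the assumption that only the subdirectories named in this overview are needed; `PROJECT_DIRS` and `create_layout` are hypothetical names, not files shipped with the project.

```python
from pathlib import Path
from typing import List

# Subdirectories named in the overview above, relative to the project root.
PROJECT_DIRS = [
    "data/raw",
    "data/processed",
    "data/external",
    "notebooks/exploratory",
    "notebooks/report",
    "src/data",
    "src/features",
    "src/models",
    "src/visualization",
    "reports/figures",
    "reports/logs",
]


def create_layout(root: Path) -> List[Path]:
    """Create every project subdirectory under root and return the paths."""
    created = []
    for rel in PROJECT_DIRS:
        path = root / rel
        # exist_ok=True makes the call safe to repeat on an existing tree.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```

Because existing directories are left alone, running `create_layout(Path("."))` a second time does not disturb any files already stored in them.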

`requirements.txt`

Lists the Python libraries required to run the project, so that the environment stays consistent across different setups.

Install Requirements:
```
pip install -r requirements.txt
```

Alternatively, create and activate a Conda environment from environment.yaml:

conda env create --name data_science_project_env --file environment.yaml
conda activate data_science_project_env
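After installing, it can be useful to confirm that the active environment actually contains the listed packages. The sketch below is one way to do that with the standard library's `importlib.metadata`. It assumes requirements.txt uses plain `name` or `name==version` style lines; the helper names `parse_requirement_names` and `missing_packages` are hypothetical, and options, URLs, and version constraints are deliberately not checked.

```python
import re
from importlib import metadata
from typing import Iterable, List

# Matches the leading distribution name of a simple requirement line.
_NAME_RE = re.compile(r"^[A-Za-z0-9][A-Za-z0-9._-]*")


def parse_requirement_names(text: str) -> List[str]:
    """Extract package names from simple requirements.txt content."""
    names = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        # Skip blanks, pip options such as "-r other.txt", and URL lines.
        if not line or line.startswith("-") or "://" in line:
            continue
        match = _NAME_RE.match(line)
        if match:
            names.append(match.group(0))
    return names


def missing_packages(names: Iterable[str]) -> List[str]:
    """Return the names that are not installed in the current environment."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing
```

A typical call would read the file and pass its text through both helpers, for example `missing_packages(parse_requirement_names(Path("requirements.txt").read_text()))`; an empty list means every named package was found.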

`.gitignore`

Specifies intentionally untracked files that Git should ignore.

`setup.py`

Makes the project pip-installable, allowing its modules to be easily imported across different parts of the project.

Installation Instructions

Clone the repository:

git clone git@github.com:ncdingari/data-science-project.git
cd data-science-project
