
# pytorch-deep-learning-exercise

Implementations of the exercise work from https://github.com/mrdbourke/pytorch-deep-learning.

The datasets live under the data directory. Food101 is a large dataset (around 4-5 GB), so I download the original dataset into the data folder and split the images into train and test folders with 04_custom_data_creation.ipynb.
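A minimal sketch of that download-and-split step (the 80/20 ratio, paths, and class list below are assumptions; the actual logic lives in 04_custom_data_creation.ipynb):

```python
import random
import shutil
from pathlib import Path

import torchvision

# Download the original Food101 archive into data/ (~5 GB on disk).
data_dir = Path("data")
torchvision.datasets.Food101(root=data_dir, download=True)

def split_class(food: str, train_ratio: float = 0.8, seed: int = 42) -> None:
    """Copy one class's images into data/train/<food> and data/test/<food>."""
    random.seed(seed)
    image_paths = sorted((data_dir / "food-101" / "images" / food).glob("*.jpg"))
    random.shuffle(image_paths)
    split_idx = int(train_ratio * len(image_paths))
    for split, paths in [("train", image_paths[:split_idx]), ("test", image_paths[split_idx:])]:
        target_dir = data_dir / split / food
        target_dir.mkdir(parents=True, exist_ok=True)
        for img in paths:
            shutil.copy2(img, target_dir / img.name)

for food in ["pizza", "steak", "sushi"]:  # assumed subset of the 101 classes
    split_class(food)
```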

I highly recommend that anyone who wants to learn PyTorch work through this author's tutorial and finish the exercises. Happy coding!

Tip: I modified the default TinyVGG architecture in 03_pytorch_computer_vision_exercise_solutions.ipynb by adding BatchNorm2d and Dropout. Compared with the original model, the modified one generalizes better on the test data because it mitigates the overfitting problem somewhat.
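The change amounts to inserting BatchNorm2d after each convolution and a Dropout layer after pooling. A sketch of one such block (the layer sizes and dropout rate here are illustrative, not the exact notebook values):

```python
import torch
from torch import nn

class TinyVGGBlock(nn.Module):
    """One conv block of the modified TinyVGG: Conv -> BatchNorm -> ReLU (twice), then pooling and dropout."""
    def __init__(self, in_channels: int, hidden_units: int, dropout: float = 0.25):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(in_channels, hidden_units, kernel_size=3, padding=1),
            nn.BatchNorm2d(hidden_units),   # normalise activations for more stable training
            nn.ReLU(),
            nn.Conv2d(hidden_units, hidden_units, kernel_size=3, padding=1),
            nn.BatchNorm2d(hidden_units),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=2),
            nn.Dropout(dropout),            # regularisation to curb overfitting
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.block(x)
```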

In the 04_pytorch_custom_datasets.ipynb exercise, after increasing the Food101 subset (steak, sushi and pizza) to 20% of the total images and training a TinyVGG with hidden_units=20 (including BatchNorm2D and Dropout) for 1,000 epochs, the model does a better job of classifying new food images downloaded from Google that it has never seen. You can give it a shot with even more epochs.
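A hedged sketch of that "predict on a downloaded image" step (the file name, 64x64 input size and class order are assumptions):

```python
import torch
from PIL import Image
from torchvision import transforms

def predict_image(model: torch.nn.Module,
                  image_path: str,
                  class_names=("pizza", "steak", "sushi"),
                  device: str = "cpu"):
    """Run one downloaded image through the trained TinyVGG and return (predicted class, probability)."""
    transform = transforms.Compose([
        transforms.Resize((64, 64)),   # TinyVGG in the course works on small square inputs
        transforms.ToTensor(),
    ])
    image = transform(Image.open(image_path)).unsqueeze(0).to(device)  # add batch dimension
    model.eval()
    with torch.inference_mode():
        probs = torch.softmax(model(image), dim=1)
    return class_names[probs.argmax(dim=1).item()], probs.max().item()
```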

For 05_pytorch_going_modular_script_mode.ipynb, to expose the training process's hyperparameters as command-line parameters, I created a separate script called train_cli_params.py; models can be saved and reloaded through PyTorch for prediction in predict.py.
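The idea behind train_cli_params.py is plain argparse; a minimal sketch (flag names and defaults are assumptions, not necessarily what the script uses):

```python
import argparse

parser = argparse.ArgumentParser(description="Train TinyVGG with hyperparameters from the command line")
parser.add_argument("--train_dir", type=str, default="data/train", help="path to training images")
parser.add_argument("--test_dir", type=str, default="data/test", help="path to test images")
parser.add_argument("--num_epochs", type=int, default=10, help="number of training epochs")
parser.add_argument("--batch_size", type=int, default=32, help="samples per batch")
parser.add_argument("--hidden_units", type=int, default=10, help="hidden units in the TinyVGG blocks")
parser.add_argument("--learning_rate", type=float, default=1e-3, help="optimizer learning rate")
args = parser.parse_args()

print(f"Training for {args.num_epochs} epochs with lr={args.learning_rate}")
# ... build dataloaders, model, loss and optimizer, then run the training loop ...
# Save for later prediction in predict.py, e.g.:
# torch.save(model.state_dict(), "models/tinyvgg_model.pth")
```

Invoked along the lines of `python train_cli_params.py --num_epochs 20 --hidden_units 20 --learning_rate 0.001`.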

In 06_pytorch_transfer_learning.ipynb you can fine-tune pre-trained models from torchvision.models such as efficientnet_b0 and efficientnet_b2 by replacing the final classification layer and tuning its hyperparameters. After 10 epochs of training, EfficientNet_B2 converges faster and more stably than EfficientNet_B0. The broader conclusion (as with EfficientNet_B7) is that models with more parameters and larger size perform noticeably better than smaller ones.

EfficientNet_B0 accuracy after 10 epochs of training: (accuracy plot)

The same training data and 10 epochs with EfficientNet_B2: (accuracy plot)
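The core of that fine-tuning recipe is freezing the backbone and swapping the classifier head; a sketch for EfficientNet_B2 (the 3-class output and dropout value are assumptions):

```python
import torch
import torchvision
from torch import nn

device = "cuda" if torch.cuda.is_available() else "cpu"

# Load EfficientNet_B2 with pre-trained ImageNet weights.
weights = torchvision.models.EfficientNet_B2_Weights.DEFAULT
model = torchvision.models.efficientnet_b2(weights=weights).to(device)

# Freeze the feature extractor so only the new head is trained.
for param in model.features.parameters():
    param.requires_grad = False

# Replace the classifier head for the 3-class pizza/steak/sushi task.
model.classifier = nn.Sequential(
    nn.Dropout(p=0.3, inplace=True),
    nn.Linear(in_features=1408, out_features=3),  # efficientnet_b2's feature extractor outputs 1408 features
).to(device)
```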

In 07_pytorch_experiment_tracking.ipynb I trained the classification downstream task on the whole Food101 dataset with an EfficientNet_B7 model on an RTX 4090 GPU; it took around 11 minutes. If you want more precise accuracy you can increase the number of training epochs, which will take longer. Either way, it's interesting to experiment with different raw data and fine-tuning setups, so give it a try.

(Screenshot of the training results, 2024-03-22)
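If you want to reproduce the experiment tracking, the approach boils down to giving each run its own TensorBoard SummaryWriter log directory; a minimal sketch (the directory naming scheme is an assumption):

```python
from datetime import datetime
from pathlib import Path

from torch.utils.tensorboard import SummaryWriter

def create_writer(experiment_name: str, model_name: str, extra: str = "") -> SummaryWriter:
    """One log directory per run: runs/YYYY-MM-DD/<experiment>/<model>/<extra>."""
    timestamp = datetime.now().strftime("%Y-%m-%d")
    log_dir = Path("runs") / timestamp / experiment_name / model_name / extra
    return SummaryWriter(log_dir=str(log_dir))

writer = create_writer("food101_full", "effnet_b7", "10_epochs")
# Inside the training loop, log per-epoch metrics:
# writer.add_scalars("Loss", {"train": train_loss, "test": test_loss}, global_step=epoch)
# writer.add_scalars("Accuracy", {"train": train_acc, "test": test_acc}, global_step=epoch)
writer.close()
```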

The Vision Transformer research paper is uploaded; you should read and learn it end-to-end. The self-attention formula below is, for the AI domain, the new holy grail in the way E=mc^2 is for mass-energy equivalence:

$$ \text{Attention}(Q, K, V) = \text{softmax}\!\left(\frac{QK^\top}{\sqrt{d_k}}\right)V $$
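The same formula in a few lines of PyTorch (shapes are illustrative):

```python
import torch

def scaled_dot_product_attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d_k**0.5   # (..., seq_q, seq_k)
    weights = torch.softmax(scores, dim=-1)        # attention weights sum to 1 over the keys
    return weights @ v                             # (..., seq_q, d_v)

q = k = v = torch.randn(1, 8, 197, 64)  # e.g. 8 heads, 197 patch tokens, head dim 64
out = scaled_dot_product_attention(q, k, v)
print(out.shape)  # torch.Size([1, 8, 197, 64])
```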

08_pytorch_paper_replicating.ipynb builds a Vision Transformer architecture from scratch. The pre-trained vit_b_16 model, built on the Vision Transformer, reaches higher accuracy than EfficientNet_B7, but it is roughly 10-11 times larger than EfficientNet_B2 and about 1.3 times larger than EfficientNet_B7. So there is a trade-off between better performance and faster loading/inference when you deploy online.

vit_b_16 pre-trained model, downstream-task training metrics: (metrics plot)

The vit_b_16 model's classification of unseen images is also much more confident (closer to deterministic): (prediction screenshot)
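The heart of the from-scratch replication is turning an image into a sequence of patch embeddings; a condensed sketch with ViT-B/16 sizes (16x16 patches, 768-dim tokens):

```python
import torch
from torch import nn

class PatchEmbedding(nn.Module):
    """Split a 224x224 image into 16x16 patches and project each one to a 768-dim token (ViT-B/16 sizes)."""
    def __init__(self, in_channels: int = 3, patch_size: int = 16, embedding_dim: int = 768):
        super().__init__()
        # A strided Conv2d is equivalent to "cut into patches, then apply a shared linear projection".
        self.proj = nn.Conv2d(in_channels, embedding_dim, kernel_size=patch_size, stride=patch_size)
        self.flatten = nn.Flatten(start_dim=2)  # flatten the 14x14 patch grid into 196 tokens

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.flatten(self.proj(x))          # (batch, 768, 196)
        return x.permute(0, 2, 1)               # (batch, 196, 768): one embedding per patch

tokens = PatchEmbedding()(torch.randn(1, 3, 224, 224))
print(tokens.shape)  # torch.Size([1, 196, 768])
```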

In 09_pytorch_model_deployment.ipynb, when you upload your trained Gradio model to a Hugging Face Space, a couple of adjustments are needed. First, when you `git push` your files, use a Hugging Face Access Token instead of your account password. Second, track binary files such as images or videos with Git LFS (`git lfs track "your_file"` or `git lfs track "*.your_extension"`); if the push still fails, run `git filter-branch --tree-filter 'rm -rf path/to/your/file' HEAD` followed by `git push origin --force --all`. In the end the ViT architecture model clearly outperforms the EfficientNet model, so when you deploy you have to trade accuracy against inference speed.

ViT-B/16 fine-tuned on the full Food101 dataset (around 100k images) for 10 epochs: (training result plot)

You can check out my deployed mini Food101 model or the big Food101 model.
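The Space itself reduces to a small Gradio app.py wrapping the trained model; a sketch of the idea (the backbone choice, weight file name and class list are assumptions):

```python
import gradio as gr
import torch
import torchvision
from torch import nn

class_names = ["pizza", "steak", "sushi"]  # assumed; the big-model Space covers all 101 Food101 classes

# Rebuild the fine-tuned EfficientNet_B2 and load its saved weights (file name is an assumption).
weights = torchvision.models.EfficientNet_B2_Weights.DEFAULT
model = torchvision.models.efficientnet_b2(weights=weights)
model.classifier = nn.Sequential(nn.Dropout(p=0.3), nn.Linear(1408, len(class_names)))
model.load_state_dict(torch.load("effnetb2_feature_extractor.pth", map_location="cpu"))
model.eval()

transform = weights.transforms()  # the preprocessing the backbone was trained with

def predict(image) -> dict:
    """Return {class_name: probability}; Gradio's Label output renders it as a ranked list."""
    tensor = transform(image).unsqueeze(0)
    with torch.inference_mode():
        probs = torch.softmax(model(tensor), dim=1).squeeze()
    return {name: float(prob) for name, prob in zip(class_names, probs)}

demo = gr.Interface(fn=predict,
                    inputs=gr.Image(type="pil"),
                    outputs=gr.Label(num_top_classes=3),
                    title="FoodVision")
demo.launch()
```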

A_Quick_PyTorch2.0_Tutorial.ipynb trains and tests with the PyTorch 2.x feature torch.compile(model). The first epoch takes longer than the uncompiled model (warm-up while the compiler optimizes), but the remaining epochs are faster than the uncompiled run. So if you train for longer and take full advantage of your GPU (larger batch_size, more training data, as long as you don't hit OOM), the overall time will be shorter. (timing comparison plot)
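A minimal sketch of that compiled-vs-eager comparison (the model choice and step count are arbitrary; timings depend on your GPU):

```python
import time

import torch
import torchvision

device = "cuda" if torch.cuda.is_available() else "cpu"
model = torchvision.models.resnet50().to(device)   # any model works; resnet50 is just illustrative
compiled_model = torch.compile(model)              # PyTorch 2.x: JIT-compile the forward pass

x = torch.randn(32, 3, 224, 224, device=device)

for name, m in [("eager", model), ("compiled", compiled_model)]:
    start = time.time()
    for step in range(5):                           # the first compiled step is slow (compilation warm-up)
        out = m(x)
        out.sum().backward()
    if device == "cuda":
        torch.cuda.synchronize()                    # wait for GPU work before reading the clock
    print(f"{name}: {time.time() - start:.2f}s for 5 steps")
```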

Markdown examples (including LaTeX syntax):

# Header1

## Header2

### Header3

text

*italics*

**bold**

$$ z = \sum_{i=1}^{n} w_i x_i $$

$$ \sqrt{ \frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^2 } $$

$$ p_i = \text{softmax}(x_i) = \frac{e^{x_i}}{\sum_j e^{x_j}} $$

PyTorch 2.0 tutorial

> quote text

What are embeddings?

Nvidia Chat with RTX includes Llama2 13B int4 and Mistral 7B int4 models as well as a RAG implementation. You can also set up RAG on Windows using TensorRT-LLM and LlamaIndex.

You can contact me on WeChat (Weixin) in mainland China:

(WeChat QR code image)
