gentaiscool
Writing interesting code...

Highlights

  • Pro

Organizations

@HLTCHKUST @audioku @indobenchmark

273 stars written in Python

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 288,732 48,055 Updated Dec 2, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 138,945 27,885 Updated Feb 10, 2025

Magnificent app which corrects your previous console command.

Python 89,813 3,620 Updated Jul 19, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 86,705 23,326 Updated Feb 10, 2025

Models and examples built with TensorFlow

Python 77,357 45,693 Updated Feb 10, 2025

Inference code for Llama models

Python 57,547 9,688 Updated Jan 26, 2025

TensorFlow code and pre-trained models for BERT

Python 38,625 9,656 Updated Jul 23, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,650 4,229 Updated Feb 10, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,933 6,451 Updated Jan 9, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,946 3,434 Updated Feb 3, 2025

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,780 3,620 Updated Jul 28, 2024

A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.

Python 22,688 9,579 Updated Feb 9, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,712 2,592 Updated Feb 6, 2025

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 19,583 2,747 Updated Feb 5, 2025

Fully open reproduction of DeepSeek-R1

Python 18,288 1,534 Updated Feb 10, 2025

Open source code for AlphaFold 2.

Python 13,152 2,322 Updated Jan 29, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,066 2,665 Updated Feb 10, 2025

πŸ„ Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,558 2,075 Updated Jan 23, 2024

An open-source NLP research library, built on PyTorch.

Python 11,797 2,250 Updated Nov 22, 2022

Ongoing research training transformer models at scale

Python 11,296 2,535 Updated Feb 10, 2025

A PyTorch-based Speech Toolkit

Python 9,329 1,432 Updated Feb 10, 2025

End-to-End Speech Processing Toolkit

Python 8,760 2,211 Updated Feb 5, 2025

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Python 8,531 1,422 Updated Feb 5, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,290 1,033 Updated Feb 10, 2025

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python 8,269 959 Updated Feb 25, 2022

A Python implementation of global optimization with Gaussian processes.

Python 8,063 1,556 Updated Jan 2, 2025

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,507 1,707 Updated Apr 25, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,085 1,043 Updated Feb 8, 2025

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,814 2,244 Updated Jan 8, 2025

A natural language modeling framework based on PyTorch

Python 6,332 799 Updated Oct 17, 2022