- Capital One AI Foundations
- New York
- https://gentawinata.com
- @gentaiscool
Highlights
- Pro
Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Magnificent app which corrects your previous console command.
Tensors and dynamic neural networks in Python with strong GPU acceleration
Models and examples built with TensorFlow
TensorFlow code and pre-trained models for BERT
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Pretrain and finetune ANY AI model of ANY size on multiple GPUs and TPUs with zero code changes.
Repository to track progress in Natural Language Processing (NLP), including the datasets and the current state of the art for the most common NLP tasks.
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Fully open reproduction of DeepSeek-R1
Open source code for AlphaFold 2.
A scalable generative AI framework built for researchers and developers working on Large Language Models, multimodal AI, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Scalable embedding, reasoning, ranking for images and sentences with CLIP
An open-source NLP research library, built on PyTorch.
Ongoing research training transformer models at scale
A PyTorch extension: tools for easy mixed-precision and distributed training in PyTorch
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
A Python implementation of global optimization with Gaussian processes.
Chinese version of GPT-2 training code, using the BERT tokenizer.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
A natural language modeling framework based on PyTorch