-
Marlin Public
Marlin is a enhanced launcher for NeMo tailored to train LLMs at scale in Slurm-based clusters
Python UpdatedJan 22, 2025 -
NeMo Public
Forked from NVIDIA/NeMoA scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python Apache License 2.0 UpdatedDec 21, 2024 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMDebugging Megatron. 3D Parallelism, models, training and more!
-
nanotron Public
Forked from swiss-ai/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedNov 28, 2024 -
datatrove Public
Forked from huggingface/datatroveFreeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Python Apache License 2.0 UpdatedNov 22, 2024 -
NeMo-Framework-Launcher Public
Forked from NVIDIA/NeMo-Framework-LauncherProvides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
Python Apache License 2.0 UpdatedNov 15, 2024 -
torchtitan Public
Forked from pytorch/torchtitanA native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License UpdatedNov 6, 2024 -
-
-
MN5-Distributed-PyTorch Public
Entry point for developing distributed applications with PyTorch, Slurm and Singularity on MareNostrum5 Supercomputer
-
axolotl Public
Forked from axolotl-ai-cloud/axolotlGo ahead and axolotl questions
Python Apache License 2.0 UpdatedMay 10, 2024 -
Transformers training in a supercomputer with the 🤗 Stack and Slurm
-
-
FastChat Public
Forked from lm-sys/FastChatAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python Apache License 2.0 UpdatedApr 19, 2024 -
nanotron-debug Public
Minimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedMar 7, 2024 -
accelerate Public
Forked from huggingface/accelerate🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python Apache License 2.0 UpdatedFeb 20, 2024 -
datathon-2023-fashion-compatibility Public
Forked from data-science-mango/datathon-2023-fashion-compatibilityDocumentation for UPC 2023 Datathon Challenge
Jupyter Notebook UpdatedNov 30, 2023 -
audio-dataset Public
Forked from LAION-AI/audio-datasetAudio Dataset for training CLAP and other models
Python UpdatedDec 11, 2022 -
-
Suma-excels-por-carpetas Public
Un programa para sumar datos de diferentes archivos excels separados en diferentes carpetas
Jupyter Notebook UpdatedNov 22, 2020 -
Agarrar-tabla-url Public
Un programa para descargar datos de una tabla alojados en una web y copiarlos de una cierta manera ordenada en un excel
Jupyter Notebook UpdatedNov 22, 2020