Skip to content
View martinjaggi's full-sized avatar

Highlights

  • Pro

Organizations

@mlbench @epfml @amld @EPFLiGHT @CS-433

Block or report martinjaggi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
30 stars written in Python
Clear filter

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,701 6,274 Updated Dec 9, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,008 5,962 Updated Jan 24, 2025

Parallel computing with task scheduling

Python 12,875 1,731 Updated Jan 24, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,164 164 Updated Jan 24, 2025

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 1,939 179 Updated Apr 10, 2024

Large-scale linear classification, regression and ranking in Python

Python 1,729 214 Updated Jul 18, 2023

Reference implementations of MLPerf™ training benchmarks

Python 1,639 563 Updated Jan 15, 2025

PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.

Python 811 137 Updated Sep 10, 2024

Convolutional Neural Networks for Sentence Classification in Keras

Python 595 204 Updated Nov 13, 2018

distributed trainer for LLMs

Python 555 78 Updated May 20, 2024

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 420 36 Updated Dec 20, 2023
Python 412 15 Updated Nov 2, 2023

NMT Chatbot

Python 385 212 Updated Jun 6, 2020

Example code and applications for machine learning on Graphcore IPUs

Python 319 82 Updated Mar 5, 2024

Stochastic Gradient Push for Distributed Deep Learning

Python 160 37 Updated Apr 5, 2023

Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727

Python 145 33 Updated Oct 29, 2024

matrix factorization in PyTorch

Python 128 33 Updated Jul 6, 2023

Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research

Python 123 20 Updated Jan 24, 2025

Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Python 111 8 Updated Nov 28, 2024

nanoGPT-like codebase for LLM training

Python 83 25 Updated Jan 23, 2025

Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"

Python 67 2 Updated Oct 30, 2024
Python 52 3 Updated Nov 15, 2024
Python 47 11 Updated Feb 4, 2020

Code for WWW 2017 conference paper "Leveraging large amounts of weakly supervised data for multi-language sentiment classification"

Python 36 5 Updated Feb 11, 2019

CoLa - Decentralized Linear Learning: https://arxiv.org/abs/1808.04883

Python 20 5 Updated Nov 30, 2021

ColTraIn HBFP Training Emulator

Python 16 6 Updated Feb 16, 2023
Python 15 3 Updated Sep 6, 2020

Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"

Python 15 4 Updated Feb 29, 2024

Open Challenge - Automatic Training for Deep Learning

Python 4 1 Updated Oct 19, 2021