martinjaggi

Martin Jaggi martinjaggi

283 followers · 11 following

Achievements

x2 x3

Achievements

x2 x3

Highlights

Organizations

Stars

epfLLM / meditron

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 1,928 174 Updated Apr 10, 2024

ServiceNow / Fast-LLM

Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research

Python 118 18 Updated Jan 7, 2025

apple / ml-ademamix

Python 51 2 Updated Nov 15, 2024

keirp / OpenWebMath

XSLT 132 9 Updated May 2, 2024

epfml / disco

DISCO is a code-free and installation-free browser platform that allows any non-technical user to collaboratively train machine learning models without sharing any private data.

TypeScript 158 27 Updated Jan 8, 2025

Olivia-fsm / DoGE

Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"

Python 15 4 Updated Feb 29, 2024

cisnlp / GlotLID

Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Python 109 8 Updated Nov 28, 2024

epfml / schedules-and-scaling

Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"

Python 66 2 Updated Oct 30, 2024

bwasti / brr.js

trying to make WebGPU a bit easier to use

JavaScript 15 Updated Jan 9, 2024

huggingface / datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,142 159 Updated Jan 8, 2025

praeclarum / webgpu-torch

Tensor computation with WebGPU acceleration

TypeScript 601 17 Updated Jul 25, 2024

SkunkworksAI / hydra-moe

Python 411 15 Updated Nov 2, 2023

epfLLM / Megatron-LLM

distributed trainer for LLMs

Python 554 79 Updated May 20, 2024

epfml / landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 419 36 Updated Dec 20, 2023

zemlyansky / gpt-tfjs

GPT in TensorFlow.js

JavaScript 28 7 Updated Oct 16, 2023

Stability-AI / StableLM

StableLM: Stability AI Language Models

Jupyter Notebook 15,833 1,032 Updated Apr 8, 2024

epfml / llm-baselines

nanoGPT-like codebase for LLM training

Python 82 24 Updated Jan 8, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,352 6,189 Updated Dec 9, 2024

epfml / powersgd

Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727

Python 144 33 Updated Oct 29, 2024

Pialex99 / ML_ethics

Ren'Py 1 Updated Nov 17, 2021

parsa-epfl / HBFPEmulator

ColTraIn HBFP Training Emulator

Python 16 6 Updated Feb 16, 2023

epfml / Bi-Sent2Vec

Robust Cross-lingual Embeddings from Parallel Sentences

C++ 20 2 Updated Jun 27, 2020

DP-3T / documents

Decentralized Privacy-Preserving Proximity Tracing -- Documents

Shell 2,250 178 Updated Aug 22, 2022

graphcore / examples

Example code and applications for machine learning on Graphcore IPUs

Python 319 82 Updated Mar 5, 2024

facebookresearch / stochastic_gradient_push

Stochastic Gradient Push for Distributed Deep Learning

Python 159 38 Updated Apr 5, 2023

ahug / amld-pytorch-workshop

Introduction to PyTorch Workshop at the AMLD 2019

Jupyter Notebook 31 34 Updated Jun 10, 2019

epfml / autoTrain

Open Challenge - Automatic Training for Deep Learning

Python 3 1 Updated Oct 19, 2021

White-Link / UnsupervisedScalableRepresentationLearningTimeSeries

Unsupervised Scalable Representation Learning for Multivariate Time Series: Experiments

Jupyter Notebook 396 94 Updated Jul 31, 2024

epfl-dlab / Cr5

Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"

Jupyter Notebook 30 3 Updated Aug 17, 2019

GokuMohandas / Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 37,913 5,996 Updated Aug 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Martin Jaggi martinjaggi

Achievements

Achievements

Highlights

Organizations

Block or report martinjaggi

Stars

epfLLM / meditron

ServiceNow / Fast-LLM

apple / ml-ademamix

keirp / OpenWebMath

epfml / disco

Olivia-fsm / DoGE

cisnlp / GlotLID

epfml / schedules-and-scaling

bwasti / brr.js

huggingface / datatrove

praeclarum / webgpu-torch

SkunkworksAI / hydra-moe

epfLLM / Megatron-LLM

epfml / landmark-attention

zemlyansky / gpt-tfjs

Stability-AI / StableLM

epfml / llm-baselines

karpathy / nanoGPT

epfml / powersgd

Pialex99 / ML_ethics

parsa-epfl / HBFPEmulator

epfml / Bi-Sent2Vec

DP-3T / documents

graphcore / examples

facebookresearch / stochastic_gradient_push

ahug / amld-pytorch-workshop

epfml / autoTrain

White-Link / UnsupervisedScalableRepresentationLearningTimeSeries

epfl-dlab / Cr5

GokuMohandas / Made-With-ML