Skip to content
View martinjaggi's full-sized avatar

Highlights

  • Pro

Organizations

@mlbench @epfml @amld @EPFLiGHT @CS-433

Block or report martinjaggi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 1,939 179 Updated Apr 10, 2024

Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research

Python 123 20 Updated Jan 24, 2025
Python 52 3 Updated Nov 15, 2024
XSLT 134 9 Updated May 2, 2024

DISCO is a code-free and installation-free browser platform that allows any non-technical user to collaboratively train machine learning models without sharing any private data.

TypeScript 159 28 Updated Jan 23, 2025

Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"

Python 15 4 Updated Feb 29, 2024

Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Python 111 8 Updated Nov 28, 2024

Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"

Python 67 2 Updated Oct 30, 2024

trying to make WebGPU a bit easier to use

JavaScript 15 Updated Jan 9, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,164 164 Updated Jan 22, 2025

Tensor computation with WebGPU acceleration

TypeScript 602 17 Updated Jul 25, 2024
Python 412 15 Updated Nov 2, 2023

distributed trainer for LLMs

Python 555 78 Updated May 20, 2024

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 420 36 Updated Dec 20, 2023

GPT in TensorFlow.js

JavaScript 29 7 Updated Oct 16, 2023

StableLM: Stability AI Language Models

Jupyter Notebook 15,826 1,033 Updated Apr 8, 2024

nanoGPT-like codebase for LLM training

Python 83 25 Updated Jan 23, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,694 6,272 Updated Dec 9, 2024

Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727

Python 145 33 Updated Oct 29, 2024
Ren'Py 1 Updated Nov 17, 2021

ColTraIn HBFP Training Emulator

Python 16 6 Updated Feb 16, 2023

Robust Cross-lingual Embeddings from Parallel Sentences

C++ 21 2 Updated Jun 27, 2020

Decentralized Privacy-Preserving Proximity Tracing -- Documents

Shell 2,250 178 Updated Aug 22, 2022

Example code and applications for machine learning on Graphcore IPUs

Python 319 82 Updated Mar 5, 2024

Stochastic Gradient Push for Distributed Deep Learning

Python 160 37 Updated Apr 5, 2023

Introduction to PyTorch Workshop at the AMLD 2019

Jupyter Notebook 31 34 Updated Jun 10, 2019

Open Challenge - Automatic Training for Deep Learning

Python 4 1 Updated Oct 19, 2021

Unsupervised Scalable Representation Learning for Multivariate Time Series: Experiments

Jupyter Notebook 396 94 Updated Jul 31, 2024

Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"

Jupyter Notebook 30 3 Updated Aug 17, 2019

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 38,049 6,025 Updated Aug 18, 2024
Next