Organizations

@mlbench @epfml @amld @EPFLiGHT @CS-433

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 1,928 174 Updated Apr 10, 2024

Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research

Python 118 18 Updated Jan 7, 2025
Python 51 2 Updated Nov 15, 2024
XSLT 132 9 Updated May 2, 2024

DISCO is a code-free and installation-free browser platform that allows any non-technical user to collaboratively train machine learning models without sharing any private data.

TypeScript 158 27 Updated Jan 8, 2025

Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"

Python 15 4 Updated Feb 29, 2024

Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Python 109 8 Updated Nov 28, 2024

Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"

Python 66 2 Updated Oct 30, 2024

trying to make WebGPU a bit easier to use

JavaScript 15 Updated Jan 9, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,142 159 Updated Jan 8, 2025

Tensor computation with WebGPU acceleration

TypeScript 601 17 Updated Jul 25, 2024
Python 411 15 Updated Nov 2, 2023

distributed trainer for LLMs

Python 554 79 Updated May 20, 2024

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 419 36 Updated Dec 20, 2023

GPT in TensorFlow.js

JavaScript 28 7 Updated Oct 16, 2023

StableLM: Stability AI Language Models

Jupyter Notebook 15,833 1,032 Updated Apr 8, 2024

nanoGPT-like codebase for LLM training

Python 82 24 Updated Jan 8, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,352 6,189 Updated Dec 9, 2024

Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727

Python 144 33 Updated Oct 29, 2024
Ren'Py 1 Updated Nov 17, 2021

ColTraIn HBFP Training Emulator

Python 16 6 Updated Feb 16, 2023

Robust Cross-lingual Embeddings from Parallel Sentences

C++ 20 2 Updated Jun 27, 2020

Decentralized Privacy-Preserving Proximity Tracing -- Documents

Shell 2,250 178 Updated Aug 22, 2022

Example code and applications for machine learning on Graphcore IPUs

Python 319 82 Updated Mar 5, 2024

Stochastic Gradient Push for Distributed Deep Learning

Python 159 38 Updated Apr 5, 2023

Introduction to PyTorch Workshop at the AMLD 2019

Jupyter Notebook 31 34 Updated Jun 10, 2019

Open Challenge - Automatic Training for Deep Learning

Python 3 1 Updated Oct 19, 2021

Unsupervised Scalable Representation Learning for Multivariate Time Series: Experiments

Jupyter Notebook 396 94 Updated Jul 31, 2024

Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"

Jupyter Notebook 30 3 Updated Aug 17, 2019

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 37,913 5,996 Updated Aug 18, 2024