Skip to content
View jbochi's full-sized avatar

Organizations

@cobrateam

Block or report jbochi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Rust bindings for the C++ api of PyTorch.

Rust 4,633 365 Updated Jan 30, 2025

Sample Python extension using Rust/PyO3/tch to interact with PyTorch

Rust 34 Updated Feb 5, 2024

Sampling profiler for Python programs

Rust 13,324 448 Updated Feb 6, 2025
Python 394 36 Updated Jul 11, 2024
Jupyter Notebook 9 Updated Jun 3, 2024

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 481 69 Updated Feb 5, 2025

A simple, performant and scalable Jax LLM!

Python 1,637 327 Updated Mar 4, 2025

LLM <-> Python agentic runtime prototype

Python 31 8 Updated Mar 4, 2025

Tile primitives for speedy kernels

Cuda 2,094 120 Updated Mar 4, 2025

Arena-Hard-Auto: An automatic LLM benchmark.

Python 753 93 Updated Dec 29, 2024

AICI: Prompts as (Wasm) Programs

Rust 2,003 83 Updated Jan 22, 2025

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 708 64 Updated Dec 30, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 11,304 1,134 Updated Mar 4, 2025

A guidance language for controlling large language models.

Jupyter Notebook 19,801 1,086 Updated Mar 4, 2025

Structured Text Generation

Python 10,903 569 Updated Mar 4, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,447 168 Updated Jun 25, 2024

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,202 73 Updated Oct 14, 2024

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Jupyter Notebook 281 47 Updated Nov 29, 2024

Machine Learning Engineering Open Book

Python 13,053 794 Updated Mar 1, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 47,437 5,038 Updated Jan 22, 2025

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,622 819 Updated Sep 1, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,134 232 Updated Dec 12, 2024

Perf monitoring CLI tool for Apple Silicon

Python 3,879 159 Updated Apr 18, 2024

GGUF implementation in C as a library and a tools CLI program

C 258 17 Updated Jan 9, 2025

A small utility library for parsing GGUF file info

Rust 27 3 Updated Jan 27, 2025

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Jupyter Notebook 902 87 Updated Jun 22, 2024

MLX: An array framework for Apple silicon

C++ 19,392 1,104 Updated Mar 4, 2025

Examples in the MLX framework

Python 7,047 995 Updated Mar 4, 2025

Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)

Rust 2,768 221 Updated Dec 24, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 6,746 671 Updated Mar 3, 2025
Next