
Organizations

@huggingface


A unified evaluation framework for large language models

Python 2,409 179 Updated Sep 12, 2024
Python 103 4 Updated Aug 30, 2024

PyTorch native quantization and sparsity for training and inference

Python 1,327 129 Updated Oct 10, 2024

Formatron empowers everyone to control the format of language models' output with minimal overhead.

Python 141 4 Updated Oct 9, 2024

Efficient and general syntactical decoding for Large Language Models

Python 184 15 Updated Oct 4, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 9,934 537 Updated Oct 9, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 695 34 Updated Oct 9, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 17,928 1,726 Updated Oct 10, 2024

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 468 12 Updated Sep 16, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,980 138 Updated Oct 9, 2024

Large Action Model framework to develop AI Web Agents

Python 5,367 485 Updated Oct 9, 2024

A pytorch quantization backend for optimum

Python 788 57 Updated Oct 8, 2024

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 701 81 Updated Aug 14, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 82,800 22,319 Updated Oct 10, 2024

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 708 82 Updated Oct 9, 2024

PhotoMaker [CVPR 2024]

Jupyter Notebook 9,407 752 Updated Aug 15, 2024

Minimalistic large language model 3D-parallelism training

Python 1,159 108 Updated Oct 9, 2024

🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers

Python 85 15 Updated Oct 3, 2024

Structured Text Generation

Python 8,513 431 Updated Oct 8, 2024

Mamba SSM architecture

Python 12,794 1,081 Updated Oct 7, 2024

Machine Learning Engineering Open Book

Python 11,339 685 Updated Oct 10, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,587 510 Updated Oct 4, 2024
Jupyter Notebook 451 22 Updated Aug 23, 2024

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,116 66 Updated Feb 14, 2024

Minimum Bayes Risk Decoding for Hugging Face Transformers

Python 51 5 Updated Jun 3, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,554 281 Updated Jul 12, 2024

FRP Fork

Go 126 18 Updated Aug 30, 2024

Playing Pokemon Red with Reinforcement Learning

Jupyter Notebook 6,876 628 Updated Sep 4, 2024

🇵🇹 List of technology companies in Portugal.

1,311 204 Updated Oct 6, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,244 154 Updated Jun 25, 2024