Skip to content
View shuyhere's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report shuyhere

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AGI Glossary

2 Updated Jan 15, 2025

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,042 1,034 Updated Jan 15, 2025

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,339 175 Updated Dec 5, 2024

MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction

Python 73 13 Updated Oct 29, 2024

Mixed precision inference by Tensorrt-LLM

C++ 71 20 Updated Oct 23, 2024
Jupyter Notebook 10 2 Updated Aug 24, 2024

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Python 39 4 Updated Nov 25, 2024

Tools for understanding how transformer predictions are built layer-by-layer

Python 459 48 Updated Jun 2, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 7,342 706 Updated Jan 16, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,065 295 Updated Nov 5, 2024

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Python 158 27 Updated Dec 5, 2017
88 7 Updated Oct 19, 2022

Large language models to generate stable crystals.

Python 92 17 Updated Jun 18, 2024

🔥 Omni large models and datasets for understanding and generating multi-modalities.

12 Updated Oct 25, 2024

Open source replication of Anthropic's Crosscoders for Model Diffing

Python 28 11 Updated Oct 27, 2024

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 9,374 689 Updated Jan 16, 2025

Everything about the SmolLM & SmolLM2 family of models

Python 1,553 81 Updated Jan 7, 2025

Toolkit for attaching, training, saving and loading of new heads for transformer models

Jupyter Notebook 259 23 Updated Jan 8, 2025

Training SAEs for your LLM, and visualize it in one place

Python 6 Updated Nov 4, 2024
Python 20 Updated Nov 19, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,398 227 Updated Jan 14, 2025

A modern, high customizable, responsive Jekyll theme for documentation with built-in search.

SCSS 7,806 3,700 Updated Jan 16, 2025

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

HTML 176 36 Updated Dec 16, 2024

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

197 8 Updated Oct 17, 2024
Python 39 8 Updated Nov 16, 2021

Collection of Reverse Engineering in Large Model

31 Updated Jan 8, 2025
Jupyter Notebook 53 9 Updated Nov 17, 2024

Animation engine for explanatory math videos

Python 73,994 6,472 Updated Jan 8, 2025

Brain Dynamics Programming in Python

Python 553 94 Updated Dec 16, 2024
Next