Skip to content
View junwucs's full-sized avatar

Block or report junwucs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 2,539 121 Updated Dec 27, 2024
Jupyter Notebook 348 27 Updated Jul 22, 2024

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 687 92 Updated Dec 24, 2024

Go ahead and axolotl questions

Python 8,173 898 Updated Dec 27, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 1,981 157 Updated Mar 27, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,626 216 Updated Dec 20, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,582 490 Updated Dec 15, 2024
Python 1,056 36 Updated Nov 21, 2024

veRL: Volcano Engine Reinforcement Learning for LLM

Python 499 35 Updated Dec 23, 2024

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Python 41 1 Updated Oct 14, 2024

EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud

Python 22 Updated Mar 10, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,761 615 Updated Dec 27, 2024

[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 103 8 Updated Jul 17, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,055 484 Updated May 3, 2024

A robust web archive analytics toolkit

Cython 90 14 Updated Dec 5, 2024

Let your Claude able to think

TypeScript 10,449 1,196 Updated Dec 3, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,731 259 Updated Aug 9, 2024

The Open Cookbook for Top-Tier Code Large Language Model

Python 1,496 90 Updated Dec 8, 2024

A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering

Python 42 4 Updated Apr 5, 2022
Python 252 29 Updated Dec 25, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 7,986 804 Updated Dec 26, 2024

Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)

Python 828 65 Updated Aug 26, 2024

Code release for https://kovenyu.com/WonderWorld/

Python 365 13 Updated Dec 22, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 585 62 Updated Aug 30, 2024

具身智能入门指南

793 38 Updated Dec 27, 2024

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 973 180 Updated Jul 31, 2024

DataComp for Language Models

HTML 1,187 108 Updated Dec 11, 2024

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,210 464 Updated Aug 7, 2024

Fast and customizable text tokenization library with BPE and SentencePiece support

C++ 291 70 Updated Sep 3, 2024
Next