Skip to content
View ShuDun23's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report ShuDun23

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,113 456 Updated Jan 8, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,367 5,086 Updated Jan 8, 2025

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Python 903 94 Updated Jan 2, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,403 113 Updated Dec 26, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 37,495 4,813 Updated Jan 8, 2025

Pushes markdown documents from Github to Notion

Python 187 28 Updated Mar 29, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,903 1,676 Updated Jan 7, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,444 468 Updated Jan 7, 2025

[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Python 136 7 Updated Dec 3, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,131 326 Updated Dec 27, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,450 4,579 Updated Dec 26, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,509 4,625 Updated Jan 8, 2025

Resources of deep learning for mathematical reasoning (DL4MATH).

345 27 Updated Dec 22, 2023

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Python 23 1 Updated Apr 4, 2024

SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)

Python 26 1 Updated Nov 1, 2024

PyTorch native post-training library

Python 4,564 474 Updated Jan 8, 2025

code for "Hybrid Ordinary-Welsch Function based Robust Matrix Completion for MIMO Radar" accepted by TAES

MATLAB 1 Updated Nov 12, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,301 27,480 Updated Jan 8, 2025

Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc.…

Python 2,384 275 Updated Sep 26, 2024

📋 A list of open LLMs available for commercial use.

11,438 775 Updated Jul 5, 2024

The implementation of DeBERTa

Python 2,025 230 Updated Sep 29, 2023
Python 2,719 309 Updated Jan 7, 2025

GPT-3: Language Models are Few-Shot Learners

15,711 2,303 Updated Sep 18, 2020

TensorFlow code and pre-trained models for BERT

Python 38,489 9,635 Updated Jul 23, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,845 3,392 Updated Jul 23, 2024

The official Meta Llama 3 GitHub site

Python 27,853 3,189 Updated Aug 12, 2024

Inference code for Llama models

Python 57,134 9,646 Updated Aug 18, 2024

简单易懂的LLaMA微调指南。

Python 382 34 Updated Jul 5, 2023

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,230 1,231 Updated Dec 12, 2024

Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers g…

101 2 Updated Jul 12, 2024
Next