Dachuan Shi sdc17

18 followers · 2 following

dachuanshi.com

Lists (7)

💭 Chain of Thought

Let's think step by step.

🔥 ChatGPT and Beyond

All kinds of GPTs and GPT-derived applications.

21 repositories

🚀 Efficient Deep Learning

Model compression, acceleration, parameter efficient fine-tuning, etc.

94 repositories

🔬 Medical Image Analysis

AI-assisted deep learning models for medical image analysis.

11 repositories

🧐 Mixture of Experts

Two heads are better than one.

🚢 Pretrained Models

Vision, language, audio, and multimodal pretrained models.

78 repositories

⚡ Training Infrastructures

Frameworks for efficiently training or serving deep learning models.

11 repositories

Starred repositories

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 1,749 89 Updated Dec 22, 2024

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,097 104 Updated Dec 27, 2024

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 1,844 86 Updated Dec 23, 2024

deepseek-ai / DeepSeek-V3

Python 4,908 289 Updated Dec 27, 2024

mit-han-lab / duo-attention

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 413 23 Updated Oct 31, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 20,121 1,523 Updated Dec 28, 2024

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,347 226 Updated Dec 12, 2024

mlfoundations / dclm

DataComp for Language Models

HTML 1,187 108 Updated Dec 11, 2024

FranxYao / Long-Context-Data-Engineering

Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"

Python 447 29 Updated Mar 19, 2024

OpenLMLab / LEval

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive evaluation benchmark for long-context language models

Python 365 14 Updated Jul 9, 2024

OpenBMB / InfiniteBench

Code for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 299 25 Updated Sep 25, 2024

epfml / landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 419 36 Updated Dec 20, 2023

haoliuhl / ringattention

Large Context Attention

Python 660 53 Updated Aug 12, 2024

opendatalab / labelU

Data annotation toolbox that supports image, audio, and video data.

Python 923 92 Updated Dec 27, 2024

opendatalab / LabelLLM

The Open-Source Data Annotation Platform

TypeScript 608 49 Updated Nov 6, 2024

opendatalab / MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 22,242 1,607 Updated Dec 27, 2024

microsoft / MInference

[NeurIPS'24 Spotlight] To speed up long-context LLM inference, MInference computes attention with approximate and dynamic sparsity, which reduces inference latency by up to 10x for pre-filling on an A100 whil…

Python 853 39 Updated Dec 28, 2024

GATECH-EIC / mg-verilog

Python 34 6 Updated Oct 8, 2024

THUDM / LongBench

LongBench v2 and LongBench (ACL 2024)

Python 716 60 Updated Dec 24, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,553 2,576 Updated Dec 28, 2024

meta-llama / llama-stack-apps

Agentic components of the Llama Stack APIs

4,027 640 Updated Dec 21, 2024

test-time-training / ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,071 64 Updated Jul 14, 2024

LLMServe / DistServe

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 405 50 Updated Aug 19, 2024

jy-yuan / KIVI

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 261 23 Updated Oct 10, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,247 397 Updated Aug 7, 2024

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,250 684 Updated Dec 24, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,022 1,213 Updated Dec 12, 2024

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,873 175 Updated Sep 25, 2024

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,282 130 Updated Dec 26, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,811 118 Updated Oct 30, 2024

Starred topics

prompt-engineering

lora

knowledge-distillation

pruning

quantization

llm

large-language-models

vision-and-language

chain-of-thought

mixture-of-experts
