Skip to content
View sdc17's full-sized avatar

Highlights

  • Pro

Block or report sdc17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 1,749 89 Updated Dec 22, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,097 104 Updated Dec 27, 2024

Tile primitives for speedy kernels

Cuda 1,844 86 Updated Dec 23, 2024
Python 4,908 289 Updated Dec 27, 2024

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 413 23 Updated Oct 31, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 20,121 1,523 Updated Dec 28, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,347 226 Updated Dec 12, 2024

DataComp for Language Models

HTML 1,187 108 Updated Dec 11, 2024

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 447 29 Updated Mar 19, 2024

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 365 14 Updated Jul 9, 2024

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 299 25 Updated Sep 25, 2024

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 419 36 Updated Dec 20, 2023

Large Context Attention

Python 660 53 Updated Aug 12, 2024

Data annotation toolbox supports image, audio and video data.

Python 923 92 Updated Dec 27, 2024

The Open-Source Data Annotation Platform

TypeScript 608 49 Updated Nov 6, 2024

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 22,242 1,607 Updated Dec 27, 2024

[NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 whil…

Python 853 39 Updated Dec 28, 2024
Python 34 6 Updated Oct 8, 2024

LongBench v2 and LongBench (ACL 2024)

Python 716 60 Updated Dec 24, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,553 2,576 Updated Dec 28, 2024

Agentic components of the Llama Stack APIs

4,027 640 Updated Dec 21, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,071 64 Updated Jul 14, 2024

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 405 50 Updated Aug 19, 2024

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 261 23 Updated Oct 10, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,247 397 Updated Aug 7, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,250 684 Updated Dec 24, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,022 1,213 Updated Dec 12, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,873 175 Updated Sep 25, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,282 130 Updated Dec 26, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,811 118 Updated Oct 30, 2024
Next