Stars
Who's Who: Large Language Models Meet Knowledge Conflicts in Practice (EMNLP 2024 Findings)
Generative Representational Instruction Tuning
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
A resource repository for machine unlearning in large language models
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Awesome LLM compression research papers and tools.
A guidance language for controlling large language models.
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
CLI & Python API to easily summarize text-based files with transformers
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"
Related work and background techniques for OpenAI o1
AnchorAttention: Improved attention for LLMs long-context training
The Official Implementation of Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference
Updating collection of summarization datasets in 100+ languages, based on our survey "The State and Fate of Summarization Datasets".
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
[ICLR 2025] MagicPIG: LSH Sampling for Efficient LLM Generation
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]