Skip to content
View FYYFU's full-sized avatar

Block or report FYYFU

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Who's Who: Large Language Models Meet Knowledge Conflicts in Practice (EMNLP 2024 Findings)

7 1 Updated Nov 13, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 588 42 Updated Jan 19, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,802 222 Updated Jan 26, 2025

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,767 527 Updated Dec 14, 2024

A resource repository for machine unlearning in large language models

294 14 Updated Jan 16, 2025

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

HTML 16 Updated Jan 22, 2025

Awesome LLM compression research papers and tools.

1,334 88 Updated Jan 25, 2025

Go ahead and axolotl questions

Python 8,401 927 Updated Jan 27, 2025

A guidance language for controlling large language models.

Jupyter Notebook 19,524 1,063 Updated Jan 27, 2025

[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 2,340 400 Updated Jan 22, 2025

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python 1,585 155 Updated Oct 29, 2024

CLI & Python API to easily summarize text-based files with transformers

Python 130 9 Updated Nov 2, 2024

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Python 154 10 Updated Jun 13, 2024

The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"

Jupyter Notebook 54 3 Updated Jul 9, 2024

The related works and background techniques about Openai o1

195 7 Updated Jan 7, 2025

AnchorAttention: Improved attention for LLMs long-context training

Python 203 6 Updated Jan 15, 2025
Python 79 5 Updated Nov 6, 2024

The Official Implementation of Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference

Python 56 Updated Jan 23, 2025

Rectified Rotary Position Embeddings

Python 348 30 Updated May 20, 2024

Updating collection of summarization datasets in 100+ languages, based on our survey "The State and Fate of Summarization Datasets".

8 Updated Nov 8, 2024

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 868 61 Updated Dec 16, 2024

Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

Python 69 3 Updated Nov 25, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,127 492 Updated May 3, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,594 211 Updated Dec 5, 2024

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

276 5 Updated Jan 16, 2025

Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"

Python 149 5 Updated Dec 16, 2024

[ICLR2025] MagicPIG: LSH Sampling for Efficient LLM Generation

Python 181 12 Updated Dec 16, 2024

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]

Shell 251 27 Updated Jan 23, 2025
Next