Code for Attention-Gate.
This is an early-stage implementation; further improvements will follow in future updates.
Before training, replace modeling_llama.py in the transformers library with modeling_llama_with_ag.py from this repo.
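A minimal sketch of that replacement step, assuming transformers is pip-installed in the active environment and that modeling_llama_with_ag.py sits in the repository root (the repo-relative path is an assumption):

```python
# Sketch: overwrite the installed modeling_llama.py with the gated variant.
# Assumes transformers is installed in the current Python environment and that
# modeling_llama_with_ag.py is in the repository root (adjust the path if not).
import shutil

import transformers.models.llama.modeling_llama as modeling_llama

# Path of the installed file, e.g.
# .../site-packages/transformers/models/llama/modeling_llama.py
target = modeling_llama.__file__
shutil.copyfile("modeling_llama_with_ag.py", target)
print(f"Replaced {target}")
```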
Continual pre-training uses the redpajama_samples.jsonl dataset.
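A minimal loading sketch using the Hugging Face datasets library, assuming each line of redpajama_samples.jsonl is a JSON object with a "text" field (the field name is an assumption; adjust it to the actual schema):

```python
# Sketch: load the continual pre-training corpus from the JSON Lines file.
# Each line is assumed to be a JSON object with a "text" field.
from datasets import load_dataset

dataset = load_dataset("json", data_files="redpajama_samples.jsonl", split="train")
print(dataset[0]["text"][:200])  # peek at the first sample
```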