Stars
This repository contains code for the RAGonite project, a flexible RAG pipeline developed by the NLP team at Fraunhofer IIS, Erlangen, Germany. RAGONITE supports conversational question answering o…
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs
[ICML 2024] Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
Code from the winning submissions for the On Cloud N: Cloud Cover Detection Challenge
Sparsity-aware deep learning inference runtime for CPUs
MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
A curated list of resources about long-context in large-language models and video understanding.
Code base for "Target-Side Augmentation for Document-Level Machine Translation"
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
Running large language models on a single GPU for throughput-oriented scenarios.
Solution and experiments to the challenge CRI competition 2022 - text paraphrase detection