Stars
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
The official PyTorch implementation of Google's Gemma models
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
✨✨Latest Advances on Multimodal Large Language Models
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
An open-source framework for training large multimodal models.
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
Intuitive Annotation Tool for Information Extraction / Named Entity Recognition using localturk / Amazon Mechanical Turk
Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Simple image captioning model
LaTeX template for dissertations in Peking University
🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…