Skip to content
View VegB's full-sized avatar
🈚
🈚
  • UC Santa Barbara
  • Santa Barbara, CA

Organizations

@asyml

Block or report VegB

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 117 2 Updated Aug 23, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,261 504 Updated Jul 31, 2024
Python 40 3 Updated Jan 20, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,242 328 Updated Jul 21, 2024

Project webpage of LayoutGPT

JavaScript 2 Updated Jun 9, 2023

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,712 370 Updated Mar 14, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,059 770 Updated Oct 7, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,261 171 Updated Sep 23, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,337 2,912 Updated Sep 2, 2024

Official repo for LayoutGPT

Python 286 20 Updated Apr 10, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,561 241 Updated Mar 5, 2024

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 901 34 Updated Jun 11, 2024

An open-source framework for training large multimodal models.

Python 3,686 280 Updated Aug 31, 2024

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Jupyter Notebook 136 6 Updated Nov 27, 2023

Reverse engineered ChatGPT API

Python 28,010 4,477 Updated Aug 2, 2023

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

Python 1,251 119 Updated Dec 1, 2023

Intuitive Annotation Tool for Information Extraction / Named Entity Recognition using localturk / Amazon Mechanical Turk

JavaScript 266 27 Updated Aug 25, 2019

Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training

Python 163 16 Updated Apr 27, 2023

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Python 1,202 59 Updated Oct 18, 2022

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Python 1,197 66 Updated Jul 11, 2024

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,702 192 Updated May 20, 2024
Jupyter Notebook 216 27 Updated Dec 18, 2023

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,649 336 Updated Aug 7, 2024

Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.

Python 272 24 Updated Jul 12, 2024

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 2,836 585 Updated Jul 19, 2024

Simple image captioning model

Jupyter Notebook 1,290 215 Updated Jun 9, 2024

LaTeX template for dissertations in Peking University

TeX 535 186 Updated Apr 25, 2024

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 173,025 25,865 Updated Oct 8, 2024