Skip to content
View Confusezius's full-sized avatar
🥦
🥦

Highlights

  • Pro

Block or report Confusezius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Everything about the SmolLM2 and SmolVLM family of models

Python 1,972 111 Updated Feb 20, 2025

Evaluate Multimodal LLMs as Embodied Agents

Python 37 1 Updated Feb 14, 2025

Implementation of Alphafold 3 from Google Deepmind in Pytorch

Python 1,372 172 Updated Jan 22, 2025

Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]

Python 51 2 Updated Dec 10, 2024

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,040 125 Updated Mar 2, 2025

[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

Python 124 10 Updated Jan 27, 2025

[ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"

Python 12 Updated May 31, 2024
Jupyter Notebook 8 2 Updated Mar 23, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,679 171 Updated Feb 21, 2025

Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.

Python 247 24 Updated Aug 12, 2023

Official repository of Evolutionary Optimization of Model Merging Recipes

Python 1,292 99 Updated Nov 29, 2024

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,162 214 Updated Mar 1, 2025
Python 706 45 Updated Mar 6, 2024

(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life

Python 336 15 Updated Dec 2, 2024

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Python 103 10 Updated Sep 17, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 589 364 Updated Jul 4, 2024

Official repository for "Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model" [ICLR 2024 spotlight]

7 Updated Feb 20, 2024

Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning"

Jupyter Notebook 82 4 Updated Mar 21, 2024

[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"

Python 59 5 Updated Jul 4, 2024

✨✨Latest Advances on Multimodal Large Language Models

14,096 898 Updated Mar 4, 2025

Consistency Distilled Diff VAE

Python 2,160 76 Updated Nov 7, 2023

DataComp: In search of the next generation of multimodal datasets

Python 684 56 Updated Jan 2, 2024

A curated list of plugins that you can add to your FiftyOne install!

Python 112 18 Updated Mar 4, 2025

Refine high-quality datasets and visual AI models

Python 9,247 606 Updated Mar 4, 2025

Python package to download and use the SSB datasets

Python 11 3 Updated Aug 3, 2023

{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch

Python 211 22 Updated Mar 2, 2025

This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.

Jupyter Notebook 232 12 Updated Apr 4, 2024

Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".

Python 286 14 Updated Oct 22, 2024

Implementation of Discrete Key / Value Bottleneck, in Pytorch

Python 87 3 Updated Jul 9, 2023
Next