Highlights
- Pro
Stars
A bibliography and survey of the papers surrounding o1
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Official Code for Stable Cascade
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]
Official Open Source code for "Scaling Language-Image Pre-training via Masking"
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"