Highlights
- Pro
Stars
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…
User-friendly WebUI for AI (Formerly Ollama WebUI)
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
Frequency-aware Image Restoration for Industrial Visual anomaly detection
LLaMA: Open and Efficient Foundation Language Models
[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
[IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
[CVPR'19] Dataset and code used in the research project Scan2CAD: Learning CAD Model Alignment in RGB-D Scans
This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
(TPAMI 2024) A Survey on Open Vocabulary Learning
[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Monocular Depth Estimation Toolbox based on MMSegmentation.
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Humanoid Agents: Platform for Simulating Human-like Generative Agents
Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"
Refine high-quality datasets and visual AI models
🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"