Skip to content
View vvcatstar's full-sized avatar

Block or report vvcatstar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 5,274 476 Updated Jan 22, 2025

手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube

Jupyter Notebook 2,761 381 Updated Jul 15, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,143 1,822 Updated Apr 17, 2025

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 1,053 67 Updated Apr 15, 2025

Complete Open Source and Modular solution for MMO

C++ 7,007 2,804 Updated Apr 17, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 28,527 3,551 Updated Jul 23, 2024

Python scripts for the Segment Anythin 2 (SAM2) model in ONNX

Python 239 15 Updated Aug 29, 2024

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 16,507 2,259 Updated Oct 30, 2024

[CVPR 2024] Text-to-3D using Gaussian Splatting

Python 831 47 Updated Jan 7, 2024

The modified differential Gaussian rasterization in the CVPR 2024 highlight paper: GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting.

Cuda 174 13 Updated Nov 21, 2024

[CVPR 2024] Photo-SLAM: Real-time Simultaneous Localization and Photorealistic Mapping for Monocular, Stereo, and RGB-D Cameras

C++ 533 58 Updated Jun 12, 2024

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 855 162 Updated Dec 18, 2024

Grok open release

Python 50,248 8,349 Updated Aug 30, 2024

Algorithm to texture 3D reconstructions from multi-view stereo images

C++ 1,015 349 Updated Jul 25, 2023

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 26,720 3,377 Updated Apr 3, 2025

A collaboration friendly studio for NeRFs

Python 10,107 1,398 Updated Apr 4, 2025

Vector Quantized VAEs - PyTorch Implementation

Python 897 140 Updated Jul 12, 2023

Point cloud diffusion for 3D model synthesis

Python 6,699 787 Updated Jul 4, 2024

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

Python 1,055 72 Updated Nov 15, 2023
Python 386 37 Updated Nov 29, 2022

This repository contains the code for the paper "Occupancy Networks - Learning 3D Reconstruction in Function Space"

Python 1,595 298 Updated Jun 27, 2023

Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields

Python 165 17 Updated Mar 2, 2024

Let us control diffusion models!

Python 32,050 2,868 Updated Feb 25, 2024

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 55,153 5,421 Updated Apr 5, 2025

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Python 1,999 213 Updated Jan 20, 2024

Most popular metrics used to evaluate object detection algorithms.

Python 5,038 1,031 Updated Dec 29, 2024
Lua 7 Updated Oct 30, 2022
Next