Skip to content
View futurev's full-sized avatar

Block or report futurev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解

Java 32,292 7,887 Updated Dec 28, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Jupyter Notebook 6,458 430 Updated Dec 22, 2024

More relighting!

Python 7,156 415 Updated Nov 28, 2024

A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04LTS using Slurm and Munge. Created by the Quant Club @ UIowa.

238 21 Updated Apr 3, 2024

Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.

Python 73 2 Updated Nov 25, 2024

A robotics hardware platform for the integration sensors and end effectors into a common platform.

Python 60 5 Updated Oct 31, 2024

Sparsh Self-supervised touch representations for vision-based tactile sensing

Jupyter Notebook 110 7 Updated Nov 1, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,336 152 Updated Oct 21, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,811 118 Updated Oct 30, 2024

Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"

Python 736 72 Updated Apr 7, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,952 130 Updated Jul 2, 2024

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,441 248 Updated Apr 24, 2024

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 988 64 Updated Oct 6, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,871 470 Updated Nov 5, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 4 Updated Aug 3, 2024

This repository is created to retrain a Florence-2 model with your custom dataset

Python 1 Updated Jul 23, 2024

💯 Curated coding interview preparation materials for busy software engineers

TypeScript 120,388 14,866 Updated Oct 8, 2024

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,361 342 Updated Nov 3, 2024

Utilities intended for use with Llama models.

Python 5,439 904 Updated Dec 18, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,787 377 Updated Mar 14, 2024

An Open-source Toolkit for LLM Development

Python 2,740 176 Updated May 24, 2024

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,095 219 Updated Dec 3, 2024

Repository for synthetic RGB to Thermal Infrared translation module from "Edge-guided multidomain RGB to TIR translation", ICRA 2023 submission

Python 79 4 Updated Apr 15, 2024

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Python 761 53 Updated Mar 25, 2024

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++ 12,833 2,879 Updated Aug 8, 2024

A Household multimodal environment (HoME) based on the SUNCG indoor scenes dataset

Python 4 42 Updated Nov 30, 2017

🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024

Python 139 9 Updated Jun 13, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 604 33 Updated Oct 22, 2024

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks

Python 1,567 223 Updated Dec 28, 2024
Next