Stars
🔥LeetCode solutions in any programming language | Solutions to LeetCode, Coding Interviews (剑指 Offer, 2nd edition), and Cracking the Coding Interview (6th edition) in multiple programming languages
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04 LTS using Slurm and Munge. Created by the Quant Club @ UIowa.
Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
A robotics hardware platform for integrating sensors and end effectors into a common platform.
Sparsh: Self-supervised touch representations for vision-based tactile sensing
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Code release for "Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild"
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
transcengram / cambrian
Forked from cambrian-mllm/cambrian. Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
A repository for retraining a Florence-2 model on your custom dataset
💯 Curated coding interview preparation materials for busy software engineers
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Utilities intended for use with Llama models.
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
An Open-source Toolkit for LLM Development
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Repository for the synthetic RGB to thermal infrared (TIR) translation module from "Edge-guided multidomain RGB to TIR translation", an ICRA 2023 submission
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning, etc. (a minimal usage sketch follows this list)
A Household multimodal environment (HoME) based on the SUNCG indoor scenes dataset
🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
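
As referenced from the Bullet Physics SDK entry above: the SDK ships official pybullet Python bindings, and the sketch below shows the core simulation loop (connect to a physics server, load bodies, step, query state). The headless falling-cube scene is an illustrative assumption, not an example taken from the repository itself; it assumes `pip install pybullet`, which also provides the bundled `pybullet_data` assets.

```python
# Minimal pybullet sketch: drop a small cube onto a ground plane and
# read back its resting pose. Scene choice is illustrative only.
import pybullet as p
import pybullet_data

p.connect(p.DIRECT)  # headless physics server (use p.GUI for a viewer)
p.setAdditionalSearchPath(pybullet_data.getDataPath())  # find bundled URDFs
p.setGravity(0, 0, -9.81)

plane = p.loadURDF("plane.urdf")                          # static ground
cube = p.loadURDF("cube_small.urdf", basePosition=[0, 0, 1])

for _ in range(240):  # one simulated second at the default 240 Hz timestep
    p.stepSimulation()

pos, orn = p.getBasePositionAndOrientation(cube)
print("cube position after 1 s:", pos)
p.disconnect()
```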