XuYunqiu

YunqiuXu

9 followers · 2 following

Chongqing, China

Stars

NVlabs / dream-in-4d

Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]

Python · 72 stars · 4 forks · Updated Apr 23, 2024

Lakonik / MVEdit

3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation

JavaScript · 319 stars · 15 forks · Updated Dec 25, 2024

ALEEEHU / Awesome-Text2X-Resources

This is an open collection of state-of-the-art (SOTA), novel Text to X (X can be everything) methods (papers, codes and datasets).

168 stars · 10 forks · Updated Feb 4, 2025

datamllab / LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python · 637 stars · 61 forks · Updated Jun 1, 2024

zhangfaen / finetune-Qwen2-VL

Python · 301 stars · 36 forks · Updated Feb 5, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python · 39,638 stars · 4,862 forks · Updated Feb 6, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook · 6,771 stars · 488 forks · Updated Feb 7, 2025

UX-Decoder / LLaVA-Grounding

Python · 380 stars · 15 forks · Updated Jul 29, 2024

FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python · 534 stars · 61 forks · Updated Jun 7, 2024

shikras / d-cube

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

Python · 112 stars · 7 forks · Updated Mar 20, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python · 5,392 stars · 417 forks · Updated Aug 7, 2024

shenyunhang / APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python · 502 stars · 31 forks · Updated May 8, 2024

DirtyHarryLYL / LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

849 stars · 36 forks · Updated Jun 5, 2024

HumanSignal / label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript · 20,618 stars · 2,527 forks · Updated Feb 7, 2025

voxel51 / fiftyone

Refine high-quality datasets and visual AI models

Python · 9,145 stars · 594 forks · Updated Feb 7, 2025

cvat-ai / cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python · 13,128 stars · 3,115 forks · Updated Feb 7, 2025

OpenGVLab / Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, B…

Python · 487 stars · 36 forks · Updated Apr 21, 2024