encounter1997

Follow

🎯

Focusing

Wen Wang encounter1997

🎯

Focusing

Follow

162 followers · 545 following

China
@encounter19972

Achievements

Achievements

Highlights

Pro

Organizations

Lists (5)

Sort

DE-DETRs

ECCV22 submission

Detection

DETR-baselines

DETR baselines for comparison

FP-DETR

SFA

Starred repositories

stanford-oval / storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 13,879 1,286 Updated Dec 11, 2024

berkeley-hipie / segllm

Code release for "SegLLM: Multi-round Reasoning Segmentation"

Python 50 3 Updated Dec 30, 2024

EzioBy / edicho

[Arxiv 2024] Edicho: Consistent Image Editing in the Wild

24 Updated Dec 31, 2024

Yuanshi9815 / OminiControl

A minimal and universal controller for FLUX.1.

Python 1,016 62 Updated Dec 30, 2024

deepseek-ai / DeepSeek-V3

Python 11,631 793 Updated Dec 31, 2024

a-r-r-o-w / finetrainers

Memory-optimized training scripts for video models based on Diffusers

Python 625 63 Updated Dec 31, 2024

yzhang2016 / video-generation-survey

A reading list of video generation

459 29 Updated Dec 26, 2024

Johanan528 / DepthLab

Official implementation of "DepthLab: From Partial to Complete"

Python 312 13 Updated Dec 31, 2024

LMM101 / Awesome-Multimodal-Next-Token-Prediction

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

145 3 Updated Dec 31, 2024

kijai / ComfyUI-FramerWrapper

Python 66 Updated Dec 20, 2024

qiuyu96 / LeviTor

Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 96 2 Updated Dec 20, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 20,855 1,605 Updated Dec 31, 2024

sihyun-yu / REPA

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 761 38 Updated Dec 17, 2024

yihao-meng / AniDoc

Official Implementations for Paper - AniDoc: Animation Creation Made Easier

Python 376 21 Updated Dec 31, 2024

baaivision / NOVA

NOVA: Autoregressive Video Generation without Vector Quantization

Python 282 8 Updated Dec 31, 2024

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,049 810 Updated Dec 31, 2024

xiaomabufei / lumos

Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text

Python 26 Updated Dec 11, 2024

baaivision / See3D

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

Python 580 14 Updated Dec 21, 2024

showlab / MovieBench

Python 34 1 Updated Dec 24, 2024

Lightricks / LTX-Video

Official repository for LTX-Video

Python 2,270 169 Updated Dec 20, 2024

vietnh1009 / ASCII-generator

ASCII generator (image to text, image to image, video to video)

Python 7,545 572 Updated Nov 22, 2024

magic-quill / MagicQuill

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,480 228 Updated Dec 16, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,974 908 Updated Oct 22, 2024

LC044 / WeChatMsg

提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手

Python 35,803 3,726 Updated Nov 26, 2024

leofan90 / Awesome-World-Models

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

47 2 Updated Dec 26, 2024

google-deepmind / alphafold3

AlphaFold 3 inference pipeline.

Python 5,716 679 Updated Dec 20, 2024

foundation-model-stack / fms-fsdp

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Python 207 32 Updated Dec 19, 2024

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 722 97 Updated Dec 25, 2024

Tencent / Hunyuan3D-1

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 2,495 190 Updated Dec 26, 2024

NVIDIA / Cosmos-Tokenizer

A suite of image and video neural tokenizers

Python 1,038 27 Updated Dec 23, 2024

Starred topics

low-light-image

deep-image-prior

gcn

cvpr2019