Stars
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
A collection of awesome video generation studies.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Official inference repo for FLUX.1 models
A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, also covering new ideas, new problems, new resources, and new projects from work and research.
Diffusion model papers, survey, and taxonomy
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from a Machine Learning perspective.
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
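A practical interactive interface for large language models such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons & function plugins; project analysis & self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation & summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…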
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
VMamba: Visual State Space Models; code is based on Mamba
Official PyTorch Implementation of the Longhorn Deep State Space Model
[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality
✨✨Latest Advances on Multimodal Large Language Models