Stars
Lightweight coding agent that runs in your terminal
ProBench: Automatic Evaluation on Open-ended Multi-domain Expert Tasks
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
A minimal and universal controller for FLUX.1.
[CVPR 2025] Official Dataloader and Evaluation Scripts for VideoAutoBench.
Minimalistic large language model 3D-parallelism training
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Codebase for Aria - an Open Multimodal Native MoE
Convert PDF to markdown + JSON quickly with high accuracy
The Library for LLM-based multi-agent applications
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…
A collection of AWESOME things about mixture-of-experts
A Python package to estimate class prevalence in unlabeled datasets by specifying stability assumptions
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.
PyRCA: A Python Machine Learning Library for Root Cause Analysis
[TPAMI 2023] This is an official implementation for "Vicinity Vision Transformer".
EVA Series: Visual Representation Fantasies from BAAI
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning
🌇 A collection of links for free stock photography, video and Illustration websites