
Starred repositories
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"
SpaceSniffer is a freeware disk space analyzer for Windows that make use of the Treemap concept to view the current disk usage.
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
Generate diagrams from textual description
The official implementation of "MagicColor: Multi-Instance Sketch Colorization"
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
哔哩哔哩-API收集整理【不断更新中....】
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
Python tool for converting files and office documents to Markdown.
[CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation" . Project page: https://bizgen-msra.github.io/
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …
DuckDB is an analytical in-process SQL database management system
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
A WebUI app for Music-Source-Separation-Training and we packed UVR together!
Repository for training models for music source separation.
Integrate the DeepSeek API into popular softwares
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown