Stars
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
Effortless data labeling with AI support from Segment Anything and other awesome models.
An Open Source Machine Learning Framework for Everyone
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
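A visual no-code web crawler/spider. 易采集: a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Alias: ServiceWrapper, an intelligent service-wrapping system for web applications.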
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
A local chatbot fine-tuned on bilibili user comments.
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
A GUI application that uses YOLOv8 for object detection/tracking and human pose estimation/tracking on images, videos, or camera input.
Reference implementation for DPO (Direct Preference Optimization)
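"A Guide to Using Open-Source Large Models" (《开源大模型食用指南》): a tutorial tailored for Chinese users on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment.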
CoreNet: A library for training deep neural networks
General technology for enabling AI capabilities with LLMs and MLLMs
OpenMMLab Model Compression Toolbox and Benchmark.
A repository for individuals to experiment with and reproduce the LLM pre-training process.
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…
Unofficial implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment; code and models are fully open source; supports Windows, macOS, and Linux)
Official code of our CVPR paper "SASIC: Stereo Image Compression with Latent Shifts and Stereo Attention"