Starred repositories
Strong open-source foundation models for image recognition.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
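Updated January 2025: a roundup of Docker registry mirrors currently usable in mainland China, a list of DockerHub mirror accelerators for China, 🚀 DockerHub mirror accelerator
A plugin that improves ChatGPT's data security and efficiency. It also shares many innovative features for free, such as auto refresh, keep alive, data security, audit cancellation, conversation cloning, unrestricted replies, page cleanup, large-screen display, tracking interception, continuous updates, and fine-grained inspection. It makes the AI experience exceptionally safe, smooth, efficient, and clean.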
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
🚀 A very efficient Texas Holdem GTO solver
💬 Ready-to-use, flexible RAG Chatbot. A knowledge-base question-answering system built on large language models and RAG.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Dictionary of attack patterns and primitives for black-box application fault injection and resource discovery.
Real-time and accurate open-vocabulary end-to-end object detection
Deep-learning-based content moderation for text, audio, video, and image inputs.
Refine high-quality datasets and visual AI models
Chat凉宫春日 (Chat Haruhi Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.
Just 1 minute of voice data can be enough to train a good TTS model! (few-shot voice cloning)
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
A released dataset for drone-based detection and tracking, including images, videos, and annotations.
Implementation of popular deep learning networks with TensorRT network definition API
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Official Code for DragGAN (SIGGRAPH 2023)
Generative Models by Stability AI
SD-Trainer. LoRA & Dreambooth training scripts and GUI for diffusion models, built on kohya-ss's trainer.