Stars
Weakly Supervised Object Detection for Automatic Tooth-marked Tongue Recognition
Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics (ECCV2024)
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
HaMeR: Reconstructing Hands in 3D with Transformers
A natural language interface for computers
A course on aligning smol models.
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Empowering RAG with a memory-based data interface for all-purpose applications!
A simple, fast and user-friendly alternative to 'find'
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
A third-party component library based on Gradio.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
Full-sized drag & drop event calendar in JavaScript
Fast and extensible multi-platform HTTP/1-2-3 web server with automatic HTTPS
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
使用Github Action将国外的Docker镜像转存到阿里云私有仓库,供国内服务器使用,免费易用
stackblitz-labs / bolt.diy
Forked from stackblitz/bolt.newPrompt, run, edit, and deploy full-stack web applications using any LLM you want!
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory