Stars
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Cross-platform, customizable ML solutions for live and streaming media.
Robust Speech Recognition via Large-Scale Weak Supervision
OCR, layout analysis, reading order, table recognition in 90+ languages
Low-code platform for building business applications. Connect to databases, cloud storages, GraphQL, API endpoints, Airtable, Google sheets, OpenAI, etc and build apps using drag and drop applicatiβ¦
21 Lessons, Get Started Building with Generative AI π https://microsoft.github.io/generative-ai-for-beginners/
β‘ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
Free and Open Source Enterprise Resource Planning (ERP)
Hunt down social media accounts by username across social networks
Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yoβ¦
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
Langflow is a low-code app builder for RAG and multi-agent AI applications. Itβs Python-based and agnostic to any model, API, or database.
π 10x easier, π 140x lower storage cost, π high performance, π petabyte scale - Elasticsearch/Splunk/Datadog alternative for π (logs, metrics, traces, RUM, Error tracking, Session replay).
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
π₯ Blazing fast terminal file manager written in Rust, based on async I/O.
π A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.
Build interactive dashboards in minutes.
State-of-the-art 2D and 3D Face Analysis Project
Instant voice cloning by MIT and MyShell. Audio foundation model.
The official gpt4free repository | various collection of powerful language models | gpt-4o and deepseek v3 & r1
Interact with your documents using the power of GPT, 100% privately, no data leaks
A feature-rich command-line audio/video downloader
An innovative superfamily of fonts for code