GenAI
EZVitsDataset processes video datasets for VITS.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
SGLang is a fast serving framework for large language models and vision language models.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A high-throughput and memory-efficient inference and serving engine for LLMs
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
ETL, Analytics, Versioning for Unstructured Data
This is a repo with links to everything you'd ever want to learn about data engineering
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
Efficient Triton Kernels for LLM Training
Real-time video and audio processing on Streamlit
TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…
Flow is a custom node designed to provide a user-friendly interface for ComfyUI.
We collect papers about "large language models (LLMs) for table-related tasks", e.g., using LLMs for table QA. A curated list of "Table + LLM" papers.
OpenAI's CLIP model ported to JavaScript using the ONNX web runtime
Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP model and the web's new File System Access API)
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A generative world for general-purpose robotics & embodied AI learning.
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini, and open-source models.
Flexible and powerful framework for managing multiple AI agents and handling complex conversations