Stars
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Whisper realtime streaming for long speech-to-text transcription and translation
Virtual whiteboard for sketching hand-drawn like diagrams
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
A browser automation framework and ecosystem.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
The kitchen sink of Python utility libraries for doing "stuff" in a functional way. Based on the Lo-Dash Javascript library.
☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️
Desktop app for prototyping and debugging LangGraph applications locally.
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Educational blog posts for Rust beginners
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow
OCR, layout analysis, reading order, table recognition in 90+ languages
Pattern-based table discovery in Open Data CSV files
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.