Stars
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
๐ฆ ๐๐ฒ๐ฎ๐ฟ๐ป about ๐๐๐ ๐, ๐๐๐ ๐ข๐ฝ๐, and ๐๐ฒ๐ฐ๐๐ผ๐ฟ ๐๐๐ for free by designing, training, and deploying a real-time financial advisor LLM system ~ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ค๐ฐ๐ฅ๐ฆ + ๐ท๐ช๐ฅ๐ฆ๐ฐ & ๐ณ๐ฆ๐ข๐ฅ๐ช๐ฏ๐จ ๐ฎ๐ข๐ต๐ฆ๐ณ๐ช๐ข๐ญ๐ด
LLM powered development for VSCode
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Machine Learning Toolkit for Kubernetes