A natural language interface for computers
A terminal-based platform for experimenting with the AI Software Engineer. NOTE: Very different from https://gptengineer.app
LlamaIndex is the leading framework for building LLM-powered agents over your data.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
☁️ Build multimodal AI applications with a cloud-native stack
Chat with your documents on your local device using GPT models. No data leaves your device, and it is 100% private.
Python SDK and proxy server (LLM gateway) for calling 100+ LLM APIs in the OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, SageMaker, HuggingFace, Replicate, Groq]
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Large Language Model Text Generation Inference
Implementation of Nougat: Neural Optical Understanding for Academic Documents
Unified framework for building enterprise RAG pipelines with small, specialized models
Supercharge Your LLM Application Evaluations 🚀
Unified embedding generation and search engine. Also available in the cloud at cloud.marqo.ai
Adding guardrails to large language models.
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
A fast inference library for running LLMs locally on modern consumer-class GPUs
Interact with your SQL database in natural language: natural-language-to-SQL using LLMs
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Running Llama 2 and other open-source LLMs locally on CPU for document Q&A
Forward-Looking Active REtrieval-augmented generation (FLARE)