Skip to content
View jay-dox's full-sized avatar

Block or report jay-dox

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
48 stars written in Python
Clear filter

A natural language interface for computers

Python 58,582 5,002 Updated Jan 24, 2025

Inference code for Llama models

Python 57,812 9,716 Updated Jan 26, 2025

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 53,264 6,960 Updated Nov 17, 2024

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 39,662 5,649 Updated Mar 7, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,041 4,647 Updated Mar 1, 2025

The Memory layer for AI Agents

Python 25,207 2,360 Updated Mar 6, 2025

☁️ Build multimodal AI applications with cloud-native stack

Python 21,382 2,216 Updated Feb 27, 2025

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Python 20,351 2,261 Updated Mar 2, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,466 2,281 Updated Mar 7, 2025

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 12,504 405 Updated Mar 2, 2025

Unified framework for building enterprise RAG pipelines with small, specialized models

Python 10,409 1,731 Updated Mar 4, 2025

Large Language Model Text Generation Inference

Python 9,857 1,159 Updated Mar 6, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,303 600 Updated Feb 21, 2025

Go ahead and axolotl questions

Python 8,801 969 Updated Mar 7, 2025

Supercharge Your LLM Application Evaluations 🚀

Python 8,385 862 Updated Mar 4, 2025

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Python 4,791 198 Updated Mar 7, 2025

Adding guardrails to large language models.

Python 4,580 355 Updated Mar 3, 2025

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 4,486 446 Updated Mar 6, 2025

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 4,021 305 Updated Feb 28, 2025

Interact with your SQL database, Natural Language to SQL using LLMs

Python 3,432 244 Updated Jul 24, 2024

LLM as a Chatbot Service

Python 3,305 378 Updated Nov 20, 2023

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,836 222 Updated Sep 30, 2023

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

Python 2,807 236 Updated Aug 15, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,420 178 Updated Jan 23, 2025

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Python 2,012 202 Updated Nov 16, 2023

The web framework for building LLM microservices

Python 987 76 Updated Jul 6, 2024

Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A

Python 961 214 Updated Nov 6, 2023

Forward-Looking Active REtrieval-augmented generation (FLARE)

Python 612 56 Updated Nov 20, 2023

PaL: Program-Aided Language Models (ICML 2023)

Python 482 61 Updated Jun 30, 2023
Next