Skip to content
View shiftybit's full-sized avatar

Block or report shiftybit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI

25 repositories

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,707 5,428 Updated Jan 21, 2025

The Open Source Feature Store for Machine Learning

Python 5,733 1,018 Updated Jan 21, 2025

A guidance language for controlling large language models.

Jupyter Notebook 19,492 1,063 Updated Jan 16, 2025

πŸ€— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,847 27,634 Updated Jan 21, 2025

πŸ¦œπŸ”— Build context-aware reasoning applications

Jupyter Notebook 98,710 16,054 Updated Jan 22, 2025

Tool for chatting with your codebase and docs using OpenAI, LlamaCpp, and GPT-4-All

Python 508 42 Updated Nov 18, 2024

A RTSP audio server intended for the ESP32

C++ 43 5 Updated Mar 3, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,340 294 Updated Jan 7, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,825 467 Updated Dec 26, 2024

πŸ”Š Text-Prompted Generative Audio Model

Jupyter Notebook 36,719 4,322 Updated Aug 19, 2024

Bringing Characters to Life with Computer Brains in Unity

C++ 7,974 1,066 Updated Jul 23, 2024

Inference code for Llama models

Python 57,278 9,663 Updated Aug 18, 2024

Large Language Model Text Generation Inference

Python 9,615 1,123 Updated Jan 21, 2025

Speech recognition

C 665 109 Updated Jan 20, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,454 1,459 Updated Jan 20, 2025

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 5,592 444 Updated Nov 24, 2024

Public content repository for Windows Server content.

1,416 1,836 Updated Jan 21, 2025

Interactively explore unstructured datasets from your dataframe.

TypeScript 1,142 83 Updated Jan 10, 2025

LLM inference in C/C++

C++ 71,070 10,287 Updated Jan 21, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,895 234 Updated Jan 20, 2025

Example of local RAG QA with Langchain Agents

Python 27 1 Updated Sep 24, 2023

LLM Frontend for Power Users.

JavaScript 9,510 2,620 Updated Jan 20, 2025

Faster Whisper transcription with CTranslate2

Python 13,602 1,146 Updated Jan 1, 2025

Library to interface with an instance of ChromaDB

C# 7 1 Updated Dec 27, 2023

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 22,739 3,425 Updated Jan 22, 2025