Starred repositories
The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google
Implementation of Google's USM speech model in PyTorch
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Search Google, Bing, Yahoo, and other search engines with Python
A high-throughput and memory-efficient inference and serving engine for LLMs
Fine-tune Mistral-7B on 3090s, A100s, H100s
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
The fastest JavaScript BPE Tokenizer Encoder Decoder for OpenAI's GPT-2 / GPT-3 / GPT-4 / GPT-4o / GPT-o1. Port of OpenAI's tiktoken with additional features.
Plug-and-play implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that elevates model reasoning by at least 70%
AirLLM 70B inference with single 4GB GPU
Effortless plug-and-play optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
An optical music recognition (OMR) system. Converts sheet music to a machine-readable version.
Weekly visualization report of Open LLM model performance based on 4 metrics.
A collection of modular datasets generated by GPT-4: General-Instruct, Roleplay-Instruct, Code-Instruct, and Toolformer
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
Come join the best place on the internet to learn AI skills. Use code "mckayprompts" for an extra 20% off.
Graph-oriented live coding language and music/audio DSP library written in Rust
Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on HuggingFace with 8-bit or 4-bit quantization. Research only.
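A visual no-code/code-free web crawler/spider. 易采集 is a visual browser-automation testing, data collection, and web crawler tool that lets you design and run crawler tasks graphically, without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.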
Build Your own ChatGPT with OpenAI API and Streamlit
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Paper collection of methods that use language to interact with environments, including the real world, simulated worlds, and the WWW (🏄).
A trend starting from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
A timeline of the latest AI models for audio generation, starting in 2023!
Deep Performer: Score-to-audio music performance synthesis