peakji
🔜
Making progress

Highlights

  • Pro

Organizations

@Level @hyperonym


🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 15,527 1,831 Updated Apr 14, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 43,192 7,398 Updated Apr 14, 2025

How to optimize some algorithms in CUDA.

Cuda 2,095 186 Updated Apr 14, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,219 114 Updated Apr 14, 2025

Header-only C++/Python library for fast approximate nearest neighbors

C++ 4,635 698 Updated Aug 11, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,507 245 Updated Apr 7, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 710 63 Updated Mar 14, 2025

Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

Rust 6,155 448 Updated Apr 14, 2025

Everything we actually know about the Apple Neural Engine (ANE)

2,187 79 Updated Mar 7, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,087 264 Updated Apr 9, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 46,784 5,714 Updated Apr 14, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,068 2,440 Updated Apr 11, 2025

Minimalist ML framework for Rust

Rust 17,013 1,079 Updated Apr 14, 2025

Blazingly fast LLM inference.

Rust 5,427 390 Updated Apr 14, 2025

A natural language interface for computers

Python 59,094 5,035 Updated Mar 30, 2025

A powerful framework for building realtime voice AI agents 🤖🎙️📹

Python 5,578 763 Updated Apr 14, 2025

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 3,808 294 Updated Aug 10, 2024

A generative speech model for daily dialogue.

Python 35,750 3,879 Updated Mar 14, 2025

Minimal container for Chrome's headless shell, useful for automating / driving the web

Shell 536 66 Updated Mar 10, 2025

A collective list of free APIs

Python 335,604 35,508 Updated Oct 31, 2024

OpenGFW is a flexible, easy-to-use, open source implementation of GFW (Great Firewall of China) on Linux

Go 10,209 756 Updated Oct 28, 2024

📖 100 Go Mistakes and How to Avoid Them

Go 7,343 458 Updated Apr 12, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,576 912 Updated Jul 1, 2024

Detect file content types with deep learning

Python 8,534 443 Updated Apr 14, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,640 1,173 Updated Mar 14, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 44,204 6,140 Updated Apr 13, 2025

Make images smaller using best-in-class codecs, right in the browser.

TypeScript 22,796 1,637 Updated Nov 26, 2024

Leaked prompts of GPTs

29,639 4,013 Updated Sep 27, 2024

A blazing fast inference solution for text embeddings models

Rust 3,422 241 Updated Apr 14, 2025