Skip to content
View hexiay's full-sized avatar

Block or report hexiay

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Textbook on reinforcement learning from human feedback

TeX 795 68 Updated Apr 22, 2025

SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web

Python 1,657 78 Updated Apr 23, 2025

Client and server SDK for Golang

Go 244 113 Updated Apr 22, 2025
Python 672 30 Updated Apr 18, 2025

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 3,918 470 Updated Feb 12, 2025

Play with OpenAI's new Realtime API in your browser

TypeScript 322 128 Updated Dec 13, 2024

The Campsite monorepo

TypeScript 4,539 682 Updated Apr 21, 2025

优质AI开源项目周刊, 每周一更新

658 19 Updated Feb 23, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 45,619 6,407 Updated Apr 20, 2025

Make websites accessible for AI agents

Python 57,636 6,185 Updated Apr 23, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,116 671 Updated Apr 22, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 5,640 545 Updated Mar 24, 2025

BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.

C 16,250 629 Updated Mar 5, 2025

Python tool for converting files and office documents to Markdown.

Python 53,179 2,635 Updated Apr 13, 2025

Examples and guides for using the Gemini API

Jupyter Notebook 12,497 1,634 Updated Apr 23, 2025

Build your own AI friend

C++ 11,926 2,324 Updated Apr 23, 2025

Go bindings for the PortAudio audio I/O library

Go 752 99 Updated Feb 6, 2025

A microphone input stream for the gopxl/beep library

Go 43 7 Updated Nov 3, 2024

A little package that brings sound to any Go application. Suitable for playback and audio-processing.

Go 368 17 Updated Mar 31, 2025

公众号Write Prompt 发布的Prompt,同步记录于此

503 39 Updated Sep 28, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,646 208 Updated Apr 15, 2025

Run native macOS workloads on Kubernetes

Go 261 16 Updated Mar 25, 2025

♪ A low-level library to play sound on multiple platforms ♪

Go 1,701 141 Updated Apr 7, 2025

A little package that brings sound to any Go application. Suitable for playback and audio-processing.

Go 2,126 154 Updated Mar 19, 2024

2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,思想类,数学类,人物传记书籍

10,085 3,062 Updated Jun 11, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 80,557 9,668 Updated Jan 4, 2025

The world’s first real-time, distributed, cloud-edge collaborative multimodal AI Agent Framework that simultaneously supports C/C++/Go/Python/JS/TS

C 623 60 Updated Apr 24, 2025

📚 Freely available programming books

HTML 355,501 63,218 Updated Apr 23, 2025

AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording

TypeScript 13,614 977 Updated Apr 16, 2025
Next