- Berkeley, CA
- royh02.github.io
- https://orcid.org/0009-0009-3956-6603
Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
A script engine for "yu-gi-oh!" and sample gui
Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
Democratizing Reinforcement Learning for LLMs
Staging repo for development of native port of TypeScript
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
Open-source authentication protocol for agentic interactions. Let agents collaborate with Authed
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Sky-T1: Train your own O1 preview model within $450
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with…
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.
An Open-Ended Embodied Agent with Large Language Models
LlamaIndex is the leading framework for building LLM-powered agents over your data.
DSPy: The framework for programming—not prompting—language models
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
A high-throughput and memory-efficient inference and serving engine for LLMs
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)