Skip to content
View H121951153's full-sized avatar
😆
我太聰明了
😆
我太聰明了

Block or report H121951153

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Collection of LLM completions for reasoning-gym task datasets

7 1 Updated Feb 28, 2025

maze datasets for investigating OOD behavior of ML systems

Jupyter Notebook 30 5 Updated Feb 25, 2025

The original sources of MS-DOS 1.25, 2.0, and 4.0 for reference purposes

Assembly 31,012 4,436 Updated Apr 25, 2024

Assistive Gym, a physics-based simulation framework for physical human-robot interaction and robotic assistance.

Python 335 79 Updated Jan 26, 2024

Action for checking out a repo

TypeScript 6,304 1,887 Updated Jan 16, 2025

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

TypeScript 9,770 430 Updated Mar 10, 2025

Genome modeling and design across all domains of life

Jupyter Notebook 2,450 239 Updated Mar 6, 2025

A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.

Python 28 Updated Feb 3, 2025

An Extensible Deep Learning Library

Python 1,979 300 Updated Mar 8, 2025

POSIX compatibility layer for WASI builds

C 1 Updated Mar 12, 2024

Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions

Shell 82,691 8,271 Updated Feb 6, 2025

Create workflows that enable you to use Continuous Integration (CI) for your projects.

253 140 Updated Sep 5, 2024

Xcode extension for GitHub Copilot

Swift 3,344 508 Updated Mar 10, 2025

Flutter Sticky Headers - Lets you place "sticky headers" into any scrollable content in your Flutter app. No special wrappers or magic required. Maintainer: @slightfoot

Dart 1,115 129 Updated Mar 15, 2024

Minimal hackable GRPO implementation

Python 167 22 Updated Jan 31, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 84 20 Updated Mar 10, 2025
Python 10 Updated Feb 2, 2025

Scratch card widget which temporarily hides content from user.

Dart 610 70 Updated Nov 24, 2023

RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.

Python 1 Updated Feb 20, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 1 Updated Jan 28, 2025

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 314 21 Updated Dec 15, 2024

🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite

Python 861 71 Updated Mar 5, 2025

JSON library for nelua

1 Updated Feb 1, 2025
Python 3 1 Updated Feb 4, 2025

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Python 213 13 Updated Mar 10, 2025

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Python 10,520 669 Updated Mar 5, 2025
Python 13 8 Updated Nov 26, 2024

Perforator is a cluster-wide continuous profiling tool designed for large data centers

C++ 3,035 134 Updated Mar 10, 2025

Everything you need to build state-of-the-art foundation models, end-to-end.

Python 7,683 546 Updated Mar 10, 2025

Apptainer: Application containers for Linux

Go 1,235 144 Updated Mar 7, 2025
Next