Skip to content
View dtunai's full-sized avatar

Block or report dtunai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Compatibility tool for Steam Play based on Wine and additional components

C++ 26,050 1,137 Updated Apr 23, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,477 827 Updated Apr 23, 2025

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 483 45 Updated Apr 9, 2025
Python 425 43 Updated Jul 11, 2024

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 506 30 Updated Feb 21, 2025

Orbax provides common checkpointing and persistence utilities for JAX users

Python 369 47 Updated Apr 23, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen2.5, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3…

Python 7,117 605 Updated Apr 23, 2025

Curated list of open access books

43 4 Updated Mar 18, 2025

Optimizing inference proxy for LLMs

Python 2,172 170 Updated Apr 23, 2025

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 618 48 Updated Jan 20, 2025

Build resilient language agents as graphs.

Python 11,836 1,975 Updated Apr 23, 2025

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 265 27 Updated May 26, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,346 153 Updated Apr 17, 2025

This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.

TypeScript 5,444 630 Updated Mar 17, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6,386 628 Updated Apr 23, 2025

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Python 2,348 166 Updated Dec 11, 2024

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 8,697 1,483 Updated Apr 22, 2025

PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simple tasks to complex challenges. It provides a low-code soluti…

Jupyter Notebook 4,109 589 Updated Apr 8, 2025

MarinaBox is a toolkit for creating and managing secure, isolated environments for AI agents

Python 119 12 Updated Feb 20, 2025

Task-Aware Agent-driven Prompt Optimization Framework

Python 3,182 266 Updated Mar 21, 2025

This repository contains the python package for Helical

Python 108 15 Updated Apr 17, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 51,008 1,437 Updated Apr 23, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 10,370 1,094 Updated Apr 23, 2025

FoldFlow: SE(3)-Stochastic Flow Matching for Protein Backbone Generation

Jupyter Notebook 221 17 Updated Dec 10, 2024

Automating the Search for Artificial Life with Foundation Models!

Jupyter Notebook 408 45 Updated Jan 12, 2025

Repository for StripedHyena, a state-of-the-art beyond Transformer architecture

Python 363 26 Updated Mar 7, 2024

C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

C++ 677 234 Updated Apr 23, 2025
Next