Skip to content
View bhwangfy's full-sized avatar
  • USTC
  • Beijing, China

Block or report bhwangfy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
34 results for source starred repositories
Clear filter

Formal to Formal Mathematics Benchmark

Objective-C++ 325 45 Updated Aug 16, 2023

veRL: Volcano Engine Reinforcement Learning for LLM

Python 595 45 Updated Jan 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,238 5,057 Updated Jan 6, 2025
Lean 38 15 Updated Jan 4, 2025

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,175 381 Updated Dec 23, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,220 129 Updated Jan 6, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,561 337 Updated Jan 6, 2025

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 248 26 Updated May 26, 2024

AI for Mathematics (AI4Math) paper list

143 9 Updated Sep 29, 2024

The user home repository for the Mathematics in Lean tutorial.

HTML 294 196 Updated Jan 6, 2025

PyTorch implementation of AlphaZero Connect from scratch (with results)

Python 82 39 Updated Jan 9, 2020

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Python 2,040 179 Updated May 15, 2024
Python 16 1 Updated Apr 12, 2024

The math library of Lean 4

Lean 1,660 355 Updated Jan 6, 2025

Refine high-quality datasets and visual AI models

Python 9,035 587 Updated Jan 6, 2025

Making your benchmark of optimization algorithms simple and open

Python 254 62 Updated Dec 31, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,770 6,442 Updated Oct 18, 2024

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Jupyter Notebook 1,974 377 Updated Jun 7, 2022

Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.

Jupyter Notebook 232 47 Updated Apr 29, 2024

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,964 1,990 Updated Apr 16, 2024

Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.

Python 1,243 334 Updated Jun 18, 2024

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 22,585 9,564 Updated Nov 8, 2024

This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and …

257 24 Updated Apr 10, 2024

TSP算法全复现:遗传(GA)、粒子群(PSO)、模拟退火(SA)、禁忌搜索(ST)、蚁群算法(ACO)、自自组织神经网络(SOM)

Python 773 188 Updated Jul 23, 2021

Model the sudoku puzzle as an Integer Program using google's ortools package in Python

1 Updated Aug 13, 2019

[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark

Python 224 31 Updated Feb 13, 2024

Automatically exported from code.google.com/p/weiyanmin

MATLAB 235 91 Updated Oct 6, 2015

Summer course on mathematical theory of deep learning

TeX 52 5 Updated Jul 31, 2019
Next