Skip to content
View bhwangfy's full-sized avatar
  • USTC
  • Beijing, China

Block or report bhwangfy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Formal to Formal Mathematics Benchmark

Objective-C++ 325 45 Updated Aug 16, 2023

veRL: Volcano Engine Reinforcement Learning for LLM

Python 573 42 Updated Jan 3, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,136 5,040 Updated Jan 4, 2025
Lean 37 15 Updated Jan 4, 2025

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,173 381 Updated Dec 23, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,215 129 Updated Jan 3, 2025

Making Google Deepmind's AlphaGeometry accessible to the Masses

Python 35 5 Updated Jan 4, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,520 331 Updated Jan 3, 2025

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 248 26 Updated May 26, 2024

AI for Mathematics (AI4Math) paper list

141 9 Updated Sep 29, 2024

The user home repository for the Mathematics in Lean tutorial.

HTML 294 196 Updated Dec 2, 2024

PyTorch implementation of AlphaZero Connect from scratch (with results)

Python 82 39 Updated Jan 9, 2020

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Python 2,038 179 Updated May 15, 2024
Python 16 Updated Apr 12, 2024

The math library of Lean 4

Lean 1,654 354 Updated Jan 5, 2025

Refine high-quality datasets and visual AI models

Python 9,030 584 Updated Jan 4, 2025

Official DeiT repository

Python 4,106 562 Updated Mar 15, 2024

Making your benchmark of optimization algorithms simple and open

Python 254 62 Updated Dec 31, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,763 6,443 Updated Oct 18, 2024

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Jupyter Notebook 1,970 377 Updated Jun 7, 2022

Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.

Jupyter Notebook 232 47 Updated Apr 29, 2024

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,964 1,990 Updated Apr 16, 2024

Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.

Python 1,242 334 Updated Jun 18, 2024

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 22,579 9,563 Updated Nov 8, 2024

This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and …

257 24 Updated Apr 10, 2024

Python implementation of Tabu Search (TB), Genetic Algorithm (GA), and Simulated Annealing (SA) solving Travelling Salesman Problem (TSP). Term project of Intelligent Optimization Methods, UCAS cou…

Python 1 Updated May 9, 2022

TSP算法全复现:遗传(GA)、粒子群(PSO)、模拟退火(SA)、禁忌搜索(ST)、蚁群算法(ACO)、自自组织神经网络(SOM)

Python 772 188 Updated Jul 23, 2021

Model the sudoku puzzle as an Integer Program using google's ortools package in Python

1 Updated Aug 13, 2019
Next