Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.

Jupyter Notebook 232 47 Updated Apr 29, 2024

jadore801120 / attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,964 1,990 Updated Apr 16, 2024

akamaster / pytorch_resnet_cifar10

Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.

Python 1,242 334 Updated Jun 18, 2024

pytorch / examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 22,579 9,563 Updated Nov 8, 2024

zeke-xie / deep-learning-dynamics-paper-list

This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and …

257 24 Updated Apr 10, 2024

bhwangfy / tsp-meta-heuristic

Forked from CarlossShi/tsp-meta-heuristic

Python implementation of Tabu Search (TB), Genetic Algorithm (GA), and Simulated Annealing (SA) solving Travelling Salesman Problem (TSP). Term project of Intelligent Optimization Methods, UCAS cou…

Python 1 Updated May 9, 2022

kellenf / TSP_collection

TSP算法全复现：遗传(GA)、粒子群(PSO)、模拟退火(SA)、禁忌搜索(ST)、蚁群算法(ACO)、自自组织神经网络(SOM)

Python 772 188 Updated Jul 23, 2021

brianhuck / Integer-Programing-Sudoku-Solver

Model the sudoku puzzle as an Integer Program using google's ortools package in Python

1 Updated Aug 13, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bohan Wang bhwangfy

Block or report bhwangfy

Stars

openai / miniF2F

volcengine / verl

vllm-project / vllm

deepseek-ai / DeepSeek-Prover-V1.5

yangky11 / miniF2F-lean4

opendilab / DI-engine

opendilab / LightZero

tpgh24 / ag4masses

OpenRLHF / OpenRLHF

waterhorse1 / LLM_Tree_Search

j991222 / ai4math-papers

leanprover-community / mathematics_in_lean

plkmo / AlphaZero_Connect4

opendilab / PPOxFamily

LR32768 / DL_theory_exp

leanprover-community / mathlib4

voxel51 / fiftyone

facebookresearch / deit

benchopt / benchopt

locuslab / edge-of-stability

facebookresearch / fairseq

jeonsworld / ViT-pytorch

brandokoch / attention-is-all-you-need-paper