Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.

Jupyter Notebook 232 47 Updated Apr 29, 2024

jadore801120 / attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,964 1,990 Updated Apr 16, 2024

akamaster / pytorch_resnet_cifar10

Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.

Python 1,243 334 Updated Jun 18, 2024

pytorch / examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 22,585 9,564 Updated Nov 8, 2024

zeke-xie / deep-learning-dynamics-paper-list

This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and …

257 24 Updated Apr 10, 2024

kellenf / TSP_collection

TSP算法全复现：遗传(GA)、粒子群(PSO)、模拟退火(SA)、禁忌搜索(ST)、蚁群算法(ACO)、自自组织神经网络(SOM)

Python 773 188 Updated Jul 23, 2021

brianhuck / Integer-Programing-Sudoku-Solver

Model the sudoku puzzle as an Integer Program using google's ortools package in Python

1 Updated Aug 13, 2019

JieyuZ2 / wrench

[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark

Python 224 31 Updated Feb 13, 2024

61-- / weiyanmin

Automatically exported from code.google.com/p/weiyanmin

MATLAB 235 91 Updated Oct 6, 2015

leiwu0 / course.math_theory_nn

Summer course on mathematical theory of deep learning

TeX 52 5 Updated Jul 31, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bohan Wang bhwangfy

Block or report bhwangfy

Stars

openai / miniF2F

volcengine / verl

vllm-project / vllm

deepseek-ai / DeepSeek-Prover-V1.5

yangky11 / miniF2F-lean4

opendilab / DI-engine

opendilab / LightZero

OpenRLHF / OpenRLHF

waterhorse1 / LLM_Tree_Search

j991222 / ai4math-papers

leanprover-community / mathematics_in_lean

plkmo / AlphaZero_Connect4

opendilab / PPOxFamily

LR32768 / DL_theory_exp

leanprover-community / mathlib4

voxel51 / fiftyone

benchopt / benchopt

locuslab / edge-of-stability

facebookresearch / fairseq

jeonsworld / ViT-pytorch

brandokoch / attention-is-all-you-need-paper