Collection of machine learning research paper references
- Self-Rewarding Language Models
- Meta Large Language Model Compiler: Foundation Models of Compiler Optimization
- A Beginner's Guide to the Mathematics of Neural Networks
- Mathematics of Deep Learning
- The Matrix Calculus You Need For Deep Learning
- A guide to convolution arithmetic for deep learning
- Deep Learning: An Introduction for Applied Mathematicians - page 23
- Recent Advances in Deep Learning: An Overview
- Deep learning review
- Understanding deep learning requires rethinking generalization
- Learning the Number of Neurons in Deep Networks
- Lifelong Learning with Dynamically Expandable Networks
- Dropout: A Simple Way to Prevent Neural Networks from Overfitting
- StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks
- Self-Attention Generative Adversarial Networks
- Neural Architecture Search with Reinforcement Learning
- Large-Scale Evolution of Image Classifiers
- AutoAugment: Learning Augmentation Policies from Data
- Designing Neural Network Architectures using Reinforcement Learning
- Learning Transferable Architectures for Scalable Image Recognition
- Deep Neuroevolution: Genetic Algorithms are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
- MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks
- Distributed Representations of Words and Phrases and their Compositionality
- Linguistic Regularities in Continuous Space Word Representations
- A Neural Probabilistic Language Model
- GloVe: Global Vectors for Word Representation
- Efficient Estimation of Word Representations in Vector Space
- Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings
- FastText.zip: Compressing text classification models
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Siamese Neural Networks for One-shot Image Recognition
- ImageNet Classification with Deep Convolutional Neural Networks
- Multi-column Deep Neural Networks for Image Classification
- Very Deep Convolutional Networks for Large-Scale Image Recognition
- Rethinking the Inception Architecture for Computer Vision
- Deep Residual Learning for Image Recognition
- Network In Network
- Going Deeper with Convolutions
- OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks
- You Only Look Once: Unified, Real-Time Object Detection
- FaceNet: A Unified Embedding for Face Recognition and Clustering
- Visualizing and Understanding Convolutional Networks
- A Neural Algorithm of Artistic Style
- Convolutional Sequence to Sequence Learning
- Deformable Convolutional Networks
- Deep Photo Style Transfer
- Wide Residual Networks
- WaveNet: A Generative Model for Raw Audio
- Densely Connected Convolutional Networks
- Resnet in Resnet: Generalizing Residual Architectures
- DRAW: A Recurrent Neural Network For Image Generation
- Playing Atari with Deep Reinforcement Learning
- Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
- Sequence to Sequence Learning with Neural Networks
- Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
- Neural Machine Translation by Jointly Learning to Align and Translate
- SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning
- Relational inductive biases, deep learning, and graph networks
- Interaction Networks for Learning about Objects, Relations and Physics
- Graph neural networks - Page 7
- Recurrent Relational Networks
- Graph Capsule Convolutional Neural Networks
- Graph Neural Networks for Ranking Web Pages
- Graph Convolutional Neural Networks for Web-Scale Recommender Systems
- Neural Module Networks
- End-To-End Memory Networks
- Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
- Show and Tell: A Neural Image Caption Generator
- Neural Programmer-Interpreters
- Learning Simple Algorithms from Examples
- pix2code: Generating Code from a Graphical User Interface Screenshot
- DeepCoder: Learning to Write Programs
- A deep language model for software code
- Tree-to-tree Neural Networks for Program Translation
- Unsupervised Translation of Programming Languages
- TRANX: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation
- TransCoder-IR: Code Translation with Compiler Representations