-
ofirpress.github.io Public
Forked from barryclark/jekyll-now
Build a Jekyll blog in minutes, without touching the command line.
SCSS MIT License Updated Feb 21, 2025
-
SciCode Public
Forked from scicode-bench/SciCode
A benchmark that challenges language models to code solutions for scientific problems
Python Apache License 2.0 Updated Sep 16, 2024
-
self-ask Public
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
-
attention_with_linear_biases Public
Code for the ALiBi method for transformer language models (ICLR 2022)
-
0plot Public
Use 0plot to automatically build matplotlib plots using ChatGPT.
-
BIG-bench Public
Forked from google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Python Apache License 2.0 Updated Oct 24, 2022
-
composer Public
Forked from mosaicml/composer
Library of algorithms to speed up neural network training
Python Other Updated Apr 26, 2022
-
Megatron-DeepSpeed Public
Forked from bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python Other Updated Sep 24, 2021
-
tstl_t5_bias Public
This is our implementation of the T5 bias for fairseq.
-
shortformer Public
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
-
sandwich_transformer Public
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer Models by Reordering their Sublayers.
-
NLP-progress Public
Forked from sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Python MIT License Updated Apr 29, 2019
-
YouMayNotNeedAttention Public
Code for the Eager Translation Model from the paper You May Not Need Attention
-
awd-lstm-lm Public
Forked from salesforce/awd-lstm-lm
Python BSD 3-Clause "New" or "Revised" License Updated Dec 13, 2017
-
UsingTheOutputEmbedding Public
Code for the EACL paper "Using the Output Embedding to Improve Language Models" by Ofir Press and Lior Wolf
-
pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other Updated Sep 3, 2017
-
sockeye Public
Forked from awslabs/sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
Python Apache License 2.0 Updated Aug 27, 2017
-
the-gan-zoo Public
Forked from hindupuravinash/the-gan-zoo
A list of all named GANs!
Python MIT License Updated Jun 11, 2017
-
examples Public
Forked from pytorch/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Python BSD 3-Clause "New" or "Revised" License Updated Mar 13, 2017
-
RecurrentHighwayNetworks Public
Forked from jzilly/RecurrentHighwayNetworks
Recurrent Highway Networks - Author implementation for Tensorflow and Torch
Python MIT License Updated Oct 28, 2016
-
dl4mt-tutorial Public
Forked from nyu-dl/dl4mt-tutorial
Python BSD 3-Clause "New" or "Revised" License Updated Sep 25, 2016
-
tensorflow_with_latest_papers Public
Forked from KnHuq/tensorflow_with_latest_papers
Implementation of Newest RNN and Seq2Seq Features
Python Apache License 2.0 Updated Sep 20, 2016
-
tensorflow Public
Forked from tensorflow/tensorflow
Computation using data flow graphs for scalable machine learning
C++ Apache License 2.0 Updated Sep 5, 2016