Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

ofirpress Follow

Overview Repositories 26 Projects 0 Packages 0 Stars 18

More

Overview
Repositories
Projects
Packages
Stars

ofirpress

Follow

Ofir Press ofirpress

Follow

Modeling language

185 followers · 8 following

http://ofir.io/about
@ofirpress

Achievements

Achievements

Organizations

Block or report ofirpress

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Overview Repositories 26 Projects 0 Packages 0 Stars 18

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All SCSS Python Jupyter Notebook JavaScript Lua C++

Sort Last updated

Select order

Last updated Name Stars

ofirpress.github.io Public
Forked from barryclark/jekyll-now

Build a Jekyll blog in minutes, without touching the command line.

SCSS MIT License Updated Feb 21, 2025
SciCode Public
Forked from scicode-bench/SciCode

A benchmark that challenges language models to code solutions for scientific problems

Python Apache License 2.0 Updated Sep 16, 2024
self-ask Public

Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"

Jupyter Notebook 309 33 MIT License Updated Dec 28, 2023
attention_with_linear_biases Public

Code for the ALiBi method for transformer language models (ICLR 2022)

Python 516 38 MIT License Updated Oct 30, 2023
Snowballed_Hallucination Public
Forked from Nanami18/Snowballed_Hallucination

Updated May 24, 2023
0plot Public

Use 0plot to automatically build matplotlib plots using ChatGPT.

JavaScript 18 Apache License 2.0 Updated Apr 8, 2023
BIG-bench Public
Forked from google/BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python Apache License 2.0 Updated Oct 24, 2022
composer Public
Forked from mosaicml/composer

library of algorithms to speed up neural network training

Python Other Updated Apr 26, 2022
LeViT_ALiBi Public

LeViT + ALiBi

Python Apache License 2.0 Updated Mar 5, 2022
Megatron-DeepSpeed Public
Forked from bigscience-workshop/Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python Other Updated Sep 24, 2021
tstl_t5_bias Public

This is our implementation of the T5 bias for fairseq.

Python 2 MIT License Updated Aug 26, 2021
shortformer Public

Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.

Python 145 8 MIT License Updated Jul 26, 2021
sandwich_transformer Public

This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer Models by Reordering their Sublayers.

Python 55 2 Other Updated Jan 1, 2021
PartialShuffle Public

Python 14 2 Updated Jun 9, 2019
NLP-progress Public
Forked from sebastianruder/NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python MIT License Updated Apr 29, 2019
YouMayNotNeedAttention Public

Code for the Eager Translation Model from the paper You May Not Need Attention

Python 295 27 Updated Dec 17, 2018
awd-lstm-lm Public
Forked from salesforce/awd-lstm-lm

Python BSD 3-Clause "New" or "Revised" License Updated Dec 13, 2017
UsingTheOutputEmbedding Public

Code for the EACL paper "Using the Output Embedding to Improve Language Models" by Ofir Press and Lior Wolf

Lua 45 7 Updated Nov 28, 2017
pytorch Public
Forked from pytorch/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python Other Updated Sep 3, 2017
sockeye Public
Forked from awslabs/sockeye

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

Python Apache License 2.0 Updated Aug 27, 2017
the-gan-zoo Public
Forked from hindupuravinash/the-gan-zoo

A list of all named GANs!

Python MIT License Updated Jun 11, 2017
examples Public
Forked from pytorch/examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python BSD 3-Clause "New" or "Revised" License Updated Mar 13, 2017
RecurrentHighwayNetworks Public
Forked from jzilly/RecurrentHighwayNetworks

Recurrent Highway Networks - Author implementation for Tensorflow and Torch

Python MIT License Updated Oct 28, 2016
dl4mt-tutorial Public
Forked from nyu-dl/dl4mt-tutorial

Python BSD 3-Clause "New" or "Revised" License Updated Sep 25, 2016
tensorflow_with_latest_papers Public
Forked from KnHuq/tensorflow_with_latest_papers

Implementation of Newest RNN and Seq2Seq Features

Python Apache License 2.0 Updated Sep 20, 2016
tensorflow Public
Forked from tensorflow/tensorflow

Computation using data flow graphs for scalable machine learning

C++ Apache License 2.0 Updated Sep 5, 2016

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.