
MinLSTM Implementation from "Were RNNs All We Needed?"

This repository contains an implementation of MinLSTM, the minimal LSTM described in the paper Were RNNs All We Needed?.

This project is under active development: features may change and further optimizations are planned. Known bugs have been fixed, and contributions and feedback are welcome!
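
For orientation, minLSTM's defining property is that its forget and input gates depend only on the current input x_t, not on the previous hidden state, which turns the update h_t = f'_t ⊙ h_{t-1} + i'_t ⊙ h̃_t into a linear recurrence that can be parallelized. Below is a minimal sequential sketch of that recurrence, following the paper's equations; the class and layer names (MinLSTMSketch, to_f, to_i, to_h) are illustrative and not necessarily this repository's internals:

import torch
import torch.nn as nn

class MinLSTMSketch(nn.Module):
    # Sequential reference implementation of the minLSTM recurrence.
    # The paper's contribution is that this same update can also be
    # computed in parallel over time (see the scan sketch further down).
    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.to_f = nn.Linear(input_size, hidden_size)  # forget gate, from x_t only
        self.to_i = nn.Linear(input_size, hidden_size)  # input gate, from x_t only
        self.to_h = nn.Linear(input_size, hidden_size)  # candidate hidden state

    def forward(self, x):  # x: (batch, seq_len, input_size)
        h = x.new_zeros(x.size(0), self.to_h.out_features)
        outs = []
        for t in range(x.size(1)):
            x_t = x[:, t]
            f = torch.sigmoid(self.to_f(x_t))
            i = torch.sigmoid(self.to_i(x_t))
            denom = f + i  # normalize so the two gates sum to 1 (f'_t, i'_t)
            h = (f / denom) * h + (i / denom) * self.to_h(x_t)
            outs.append(h)
        return torch.stack(outs, dim=1)  # (batch, seq_len, hidden_size)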

Usage Example

from minLSTMNet import MinLSTM
import torch

# Define parameters
input_size = 3    # features per time step
hidden_size = 6   # hidden state dimension
seq_len = 100     # sequence length
batch_size = 64   # number of sequences per batch

# Create random input tensor
x = torch.randn(batch_size, seq_len, input_size)

# Initialize the MinLSTM model
model = MinLSTM(input_size=input_size, hidden_size=hidden_size)

# Forward pass through the model
output = model(x)

# Print output shape
print("Output shape:", output.shape)
# => torch.Size([64, 100, 6]), i.e. [batch_size, seq_len, hidden_size]
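
Because the gates do not depend on h_{t-1}, the update is a first-order linear recurrence h_t = a_t ⊙ h_{t-1} + b_t with a_t = f'_t and b_t = i'_t ⊙ h̃_t, so every time step can be computed at once with a prefix scan instead of a Python loop. A sketch of the closed form is below; parallel_scan is a hypothetical helper, not this repository's API, and the paper itself uses a numerically stabler log-space scan, since cumprod underflows on long sequences:

import torch

def parallel_scan(a, b, h0):
    # Solves h_t = a_t * h_{t-1} + b_t for all t at once.
    # a, b: (batch, seq_len, hidden); h0: (batch, hidden).
    # Closed form: h_t = A_t * (h0 + sum_{k<=t} b_k / A_k),
    # where A_t is the running product of the a's.
    A = torch.cumprod(a, dim=1)             # prefix products A_t
    weighted = torch.cumsum(b / A, dim=1)   # prefix sums of b_k / A_k
    return A * (h0.unsqueeze(1) + weighted)

In minLSTM, a is the normalized forget gate and b is the normalized input gate times the candidate state, so a single call produces the hidden states for the whole sequence.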

Citation

@inproceedings{Feng2024WereRA,
    title   = {Were RNNs All We Needed?},
    author  = {Leo Feng and Frederick Tung and Mohamed Osama Ahmed and Yoshua Bengio and Hossein Hajimirsadeghi},
    year    = {2024},
    url     = {https://arxiv.org/pdf/2410.01201}
}

About

A parallelizable LSTM from the paper Were RNNs All We Needed?.
