This project is a tensorflow implementation of the DenseNet model provided by jianshu's Github.
Mainly based on his two papers:
Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition.
In jianshu's model, the codes are in Theano, then I convert the codes into tensorflow version and the performance is comparative with jianshu's best model provided in his github. The performance of the codes can achieve 44%-45% ExpRate on single Tesla K80 GPU. The final results reported in jianshu's papers are obtained after five models ensembling. And the performance reported on single model is almost the same as ours as jianshu stated in his github issues
- Tensorflow v1.6
- Python 3
- cuda 9.0 [optional]
- cudnn 7 [optional]
python model-single-GPU.py --batch_size=4
If OOM Error occurs, please set batch_size to 4 or 2