In this project we attempt to reproduce the results of the following paper:
@article{lei2017sru,
title={Training RNNs as Fast as CNNs},
author={Lei, Tao and Zhang, Yu},
journal={arXiv preprint arXiv:1709.02755},
year={2017}
}
The code base is a fork of the source code provided by the authors: https://github.com/taolei87/sru
- Set up GCE instance for training
- Obtain GCP approval for additional GPUs
- Reproduce authors' SRU implementation
- Reproduce classification model
- Reproduce question answering model
- Reproduce language model
- Reproduce speech model (attempted, but failed to reproduce)
Install requirements via pip install -r requirements.txt. CuPy and pynvrtc are needed to compile the CUDA code into a callable function at runtime. Only single-GPU training is supported.
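Because the CUDA kernel is compiled at runtime, a missing CuPy or pynvrtc install only surfaces once training starts. A minimal sketch of a pre-flight check is given below; it is not part of the authors' code. The helper name missing_modules is hypothetical, and the import names cupy and pynvrtc are assumed to match the packages listed in requirements.txt.

```python
import importlib.util

# Import names assumed to correspond to the CuPy and pynvrtc packages.
REQUIRED_MODULES = ("cupy", "pynvrtc")


def missing_modules(names=REQUIRED_MODULES):
    """Return the names that cannot be located on the import path.

    Hypothetical helper: it only checks that each module can be found,
    it does not import it or verify that a GPU is available.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("CuPy and pynvrtc are importable.")
```

This check does not confirm that the CUDA toolkit or a GPU is present, only that the two Python packages are installed.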
Check the individual tests for the steps taken to reproduce the results.