Scalable and modularized RTL compilation of Convolutional Neural Networks onto FPGA