ALAMO: FPGA acceleration of deep learning algorithms with a modularized RTL compiler