Citation:
Tambe T, Hooper C, Pentecost L, Jia T, Yang E-Y, Donato M, Sanh V, Whatmough P, Rush A, Brooks D, et al. EdgeBERT: sentence-level energy optimizations for latency-aware multi-task NLP inference, in International Symposium on Microarchitecture (MICRO).; 2021.