EdgeBERT: sentence-level energy optimizations for latency-aware multi-task NLP inference