The abstract from the paper is the following:
As Transfer Learning from large-scale pre-trained models becomes more prevalent in Natural Language Processing (NLP),
operating these large models in on-the-edge and/or under constrained computational training or inference budgets
remains challenging.