To ensure a common evaluation scheme and
promote models that generalize to different NLU tasks, the benchmark includes datasets from varying domains and
applications.