nyu-mll/glue
Viewer • Updated • 1.49M • 437k • 507
The super-networks fine-tuned on BERT-base with GLUE benchmark using EFTNAS.
Results of the optimal sub-network discoverd from the super-network:
| Model | GFLOPs | GLUE Avg. | MNLI-m | QNLI | QQP | SST-2 | CoLA | MRPC | RTE |
|---|---|---|---|---|---|---|---|---|---|
| Development Set: | |||||||||
| EFTNAS-S1 | 5.7 | 82.9 | 84.6 | 90.8 | 91.2 | 93.5 | 60.6 | 90.8 | 69.0 |
| Test Set: | |||||||||
| EFTNAS-S1 | 5.7 | 77.7 | 83.7 | 89.9 | 71.8 | 93.4 | 52.6 | 87.6 | 65.0 |
@inproceedings{
eftnas2024,
title={Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks},
author={J. Pablo Munoz and Yi Zheng and Nilesh Jain},
booktitle={The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation},
year={2024},
url={}
}
Apache-2.0