Instructions to use abirmondalind/lc-to-event-BART with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use abirmondalind/lc-to-event-BART with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("abirmondalind/lc-to-event-BART") model = AutoModelForSeq2SeqLM.from_pretrained("abirmondalind/lc-to-event-BART") - Notebooks
- Google Colab
- Kaggle
End of training
Browse files- README.md +23 -13
- model.safetensors +1 -1
- tokenizer.json +2 -2
README.md
CHANGED
|
@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 19 |
|
| 20 |
This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
|
| 21 |
It achieves the following results on the evaluation set:
|
| 22 |
-
- Loss:
|
| 23 |
-
- Rouge1: 0.
|
| 24 |
-
- Rouge2: 0.
|
| 25 |
-
- Rougel: 0.
|
| 26 |
-
- Bleu: 0.
|
| 27 |
-
- Gen Len:
|
| 28 |
|
| 29 |
## Model description
|
| 30 |
|
|
@@ -49,17 +49,27 @@ The following hyperparameters were used during training:
|
|
| 49 |
- seed: 42
|
| 50 |
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 51 |
- lr_scheduler_type: linear
|
| 52 |
-
- num_epochs:
|
| 53 |
- mixed_precision_training: Native AMP
|
|
|
|
| 54 |
|
| 55 |
### Training results
|
| 56 |
|
| 57 |
-
| Training Loss | Epoch | Step
|
| 58 |
-
|:-------------:|:------:|:----:|:---------------:|:------:|:------:|:------:|:------:|:-------:|
|
| 59 |
-
|
|
| 60 |
-
|
|
| 61 |
-
|
|
| 62 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 63 |
|
| 64 |
|
| 65 |
### Framework versions
|
|
|
|
| 19 |
|
| 20 |
This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
|
| 21 |
It achieves the following results on the evaluation set:
|
| 22 |
+
- Loss: 1.7576
|
| 23 |
+
- Rouge1: 0.2004
|
| 24 |
+
- Rouge2: 0.0429
|
| 25 |
+
- Rougel: 0.1999
|
| 26 |
+
- Bleu: 0.0340
|
| 27 |
+
- Gen Len: 8.0173
|
| 28 |
|
| 29 |
## Model description
|
| 30 |
|
|
|
|
| 49 |
- seed: 42
|
| 50 |
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 51 |
- lr_scheduler_type: linear
|
| 52 |
+
- num_epochs: 5
|
| 53 |
- mixed_precision_training: Native AMP
|
| 54 |
+
- label_smoothing_factor: 0.1
|
| 55 |
|
| 56 |
### Training results
|
| 57 |
|
| 58 |
+
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Bleu | Gen Len |
|
| 59 |
+
|:-------------:|:------:|:-----:|:---------------:|:------:|:------:|:------:|:------:|:-------:|
|
| 60 |
+
| 1.8178 | 0.3631 | 1000 | 1.7899 | 0.1979 | 0.0316 | 0.1975 | 0.0200 | 7.7348 |
|
| 61 |
+
| 1.8001 | 0.7262 | 2000 | 1.7773 | 0.1931 | 0.0360 | 0.1926 | 0.0274 | 7.9388 |
|
| 62 |
+
| 1.777 | 1.0893 | 3000 | 1.7707 | 0.1995 | 0.0384 | 0.1991 | 0.0306 | 7.8880 |
|
| 63 |
+
| 1.7732 | 1.4524 | 4000 | 1.7679 | 0.1978 | 0.0379 | 0.1973 | 0.0308 | 7.9067 |
|
| 64 |
+
| 1.7711 | 1.8155 | 5000 | 1.7643 | 0.1991 | 0.0405 | 0.1985 | 0.0308 | 7.9443 |
|
| 65 |
+
| 1.757 | 2.1786 | 6000 | 1.7632 | 0.2030 | 0.0409 | 0.2024 | 0.0317 | 7.9549 |
|
| 66 |
+
| 1.7564 | 2.5418 | 7000 | 1.7620 | 0.1966 | 0.0399 | 0.1959 | 0.0324 | 8.0146 |
|
| 67 |
+
| 1.7563 | 2.9049 | 8000 | 1.7598 | 0.2009 | 0.0415 | 0.2004 | 0.0333 | 7.9474 |
|
| 68 |
+
| 1.7432 | 3.2680 | 9000 | 1.7605 | 0.1951 | 0.0410 | 0.1943 | 0.0335 | 8.0106 |
|
| 69 |
+
| 1.7451 | 3.6311 | 10000 | 1.7590 | 0.2007 | 0.0413 | 0.2001 | 0.0350 | 7.9219 |
|
| 70 |
+
| 1.7447 | 3.9942 | 11000 | 1.7578 | 0.2000 | 0.0419 | 0.1996 | 0.0323 | 7.9596 |
|
| 71 |
+
| 1.7355 | 4.3573 | 12000 | 1.7585 | 0.2000 | 0.0419 | 0.1994 | 0.0339 | 7.9768 |
|
| 72 |
+
| 1.7374 | 4.7204 | 13000 | 1.7576 | 0.2004 | 0.0429 | 0.1999 | 0.0340 | 8.0173 |
|
| 73 |
|
| 74 |
|
| 75 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 557921848
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d1785c2f5ed9eb498735ec5b530e57562a92436388d728c2e3c62845c0820c9c
|
| 3 |
size 557921848
|
tokenizer.json
CHANGED
|
@@ -2,13 +2,13 @@
|
|
| 2 |
"version": "1.0",
|
| 3 |
"truncation": {
|
| 4 |
"direction": "Right",
|
| 5 |
-
"max_length":
|
| 6 |
"strategy": "LongestFirst",
|
| 7 |
"stride": 0
|
| 8 |
},
|
| 9 |
"padding": {
|
| 10 |
"strategy": {
|
| 11 |
-
"Fixed":
|
| 12 |
},
|
| 13 |
"direction": "Right",
|
| 14 |
"pad_to_multiple_of": null,
|
|
|
|
| 2 |
"version": "1.0",
|
| 3 |
"truncation": {
|
| 4 |
"direction": "Right",
|
| 5 |
+
"max_length": 32,
|
| 6 |
"strategy": "LongestFirst",
|
| 7 |
"stride": 0
|
| 8 |
},
|
| 9 |
"padding": {
|
| 10 |
"strategy": {
|
| 11 |
+
"Fixed": 32
|
| 12 |
},
|
| 13 |
"direction": "Right",
|
| 14 |
"pad_to_multiple_of": null,
|