abirmondalind
/

lc-to-event-BART

@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1944
-- Rouge1: 0.1995
-- Rouge2: 0.0404
-- Rougel: 0.1990
-- Bleu: 0.0298
-- Gen Len: 7.9850
 ## Model description
@@ -49,17 +49,27 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Bleu   | Gen Len |
-|:-------------:|:------:|:----:|:---------------:|:------:|:------:|:------:|:------:|:-------:|
-| 0.2133        | 0.7262 | 2000 | 0.2017          | 0.1922 | 0.0358 | 0.1917 | 0.0254 | 8.0175  |
-| 0.2006        | 1.4524 | 4000 | 0.1979          | 0.1978 | 0.0385 | 0.1972 | 0.0300 | 7.9790  |
-| 0.192         | 2.1786 | 6000 | 0.1958          | 0.2022 | 0.0404 | 0.2018 | 0.0321 | 7.9477  |
-| 0.1913        | 2.9049 | 8000 | 0.1944          | 0.1995 | 0.0404 | 0.1990 | 0.0298 | 7.9850  |
 ### Framework versions

 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.7576
+- Rouge1: 0.2004
+- Rouge2: 0.0429
+- Rougel: 0.1999
+- Bleu: 0.0340
+- Gen Len: 8.0173
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 5
 - mixed_precision_training: Native AMP
+- label_smoothing_factor: 0.1
 ### Training results
+| Training Loss | Epoch  | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Bleu   | Gen Len |
+|:-------------:|:------:|:-----:|:---------------:|:------:|:------:|:------:|:------:|:-------:|
+| 1.8178        | 0.3631 | 1000  | 1.7899          | 0.1979 | 0.0316 | 0.1975 | 0.0200 | 7.7348  |
+| 1.8001        | 0.7262 | 2000  | 1.7773          | 0.1931 | 0.0360 | 0.1926 | 0.0274 | 7.9388  |
+| 1.777         | 1.0893 | 3000  | 1.7707          | 0.1995 | 0.0384 | 0.1991 | 0.0306 | 7.8880  |
+| 1.7732        | 1.4524 | 4000  | 1.7679          | 0.1978 | 0.0379 | 0.1973 | 0.0308 | 7.9067  |
+| 1.7711        | 1.8155 | 5000  | 1.7643          | 0.1991 | 0.0405 | 0.1985 | 0.0308 | 7.9443  |
+| 1.757         | 2.1786 | 6000  | 1.7632          | 0.2030 | 0.0409 | 0.2024 | 0.0317 | 7.9549  |
+| 1.7564        | 2.5418 | 7000  | 1.7620          | 0.1966 | 0.0399 | 0.1959 | 0.0324 | 8.0146  |
+| 1.7563        | 2.9049 | 8000  | 1.7598          | 0.2009 | 0.0415 | 0.2004 | 0.0333 | 7.9474  |
+| 1.7432        | 3.2680 | 9000  | 1.7605          | 0.1951 | 0.0410 | 0.1943 | 0.0335 | 8.0106  |
+| 1.7451        | 3.6311 | 10000 | 1.7590          | 0.2007 | 0.0413 | 0.2001 | 0.0350 | 7.9219  |
+| 1.7447        | 3.9942 | 11000 | 1.7578          | 0.2000 | 0.0419 | 0.1996 | 0.0323 | 7.9596  |
+| 1.7355        | 4.3573 | 12000 | 1.7585          | 0.2000 | 0.0419 | 0.1994 | 0.0339 | 7.9768  |
+| 1.7374        | 4.7204 | 13000 | 1.7576          | 0.2004 | 0.0429 | 0.1999 | 0.0340 | 8.0173  |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:92129c78a016bd01eb9933b55ea6000b5c5298e471e1e74cfaee4ec4d4c70eeb
 size 557921848

 version https://git-lfs.github.com/spec/v1
+oid sha256:d1785c2f5ed9eb498735ec5b530e57562a92436388d728c2e3c62845c0820c9c
 size 557921848

tokenizer.json CHANGED Viewed

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 64,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 64
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 32,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 32
     },
     "direction": "Right",
     "pad_to_multiple_of": null,