Instructions to use taiypeo/bart-large-reddit_tifu-sentence-paraphrased-100-cnt-supervised-basic with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use taiypeo/bart-large-reddit_tifu-sentence-paraphrased-100-cnt-supervised-basic with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("taiypeo/bart-large-reddit_tifu-sentence-paraphrased-100-cnt-supervised-basic") model = AutoModelForSeq2SeqLM.from_pretrained("taiypeo/bart-large-reddit_tifu-sentence-paraphrased-100-cnt-supervised-basic") - Notebooks
- Google Colab
- Kaggle
bart-large-reddit_tifu-sentence-paraphrased-100-cnt-supervised-basic
This model is a fine-tuned version of facebook/bart-large on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 3.1211
- Rouge1: 0.2428
- Rouge2: 0.0719
- Rougel: 0.1938
- Rougelsum: 0.1938
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 1e-05
- train_batch_size: 4
- eval_batch_size: 4
- seed: 42
- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_ratio: 0.06
- num_epochs: 10
Training results
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|---|---|---|---|---|---|---|---|
| 4.028 | 0.4 | 10 | 3.4731 | 0.1186 | 0.0155 | 0.0951 | 0.0952 |
| 3.7319 | 0.8 | 20 | 3.0012 | 0.1742 | 0.0417 | 0.1401 | 0.1402 |
| 2.9968 | 1.2 | 30 | 2.9157 | 0.2093 | 0.0613 | 0.1719 | 0.1721 |
| 2.8508 | 1.6 | 40 | 2.8839 | 0.245 | 0.0717 | 0.1967 | 0.1968 |
| 2.9668 | 2.0 | 50 | 2.8702 | 0.2372 | 0.0692 | 0.1902 | 0.1903 |
| 2.6049 | 2.4 | 60 | 2.8676 | 0.2371 | 0.0704 | 0.1892 | 0.1894 |
| 2.3309 | 2.8 | 70 | 2.8970 | 0.2435 | 0.0729 | 0.1937 | 0.1938 |
| 2.5178 | 3.2 | 80 | 2.9207 | 0.2473 | 0.0761 | 0.1989 | 0.1988 |
| 2.2429 | 3.6 | 90 | 2.9262 | 0.2414 | 0.0712 | 0.1941 | 0.1941 |
| 2.2766 | 4.0 | 100 | 2.9126 | 0.2341 | 0.0706 | 0.1873 | 0.1874 |
| 2.1137 | 4.4 | 110 | 2.9393 | 0.2379 | 0.0695 | 0.1907 | 0.1908 |
| 2.0369 | 4.8 | 120 | 2.9756 | 0.2475 | 0.0742 | 0.1983 | 0.1984 |
| 1.7971 | 5.2 | 130 | 2.9907 | 0.254 | 0.0772 | 0.204 | 0.2041 |
| 2.0246 | 5.6 | 140 | 2.9957 | 0.2478 | 0.0734 | 0.1991 | 0.1991 |
| 1.901 | 6.0 | 150 | 3.0116 | 0.2436 | 0.072 | 0.1943 | 0.1944 |
| 1.7319 | 6.4 | 160 | 3.0312 | 0.2369 | 0.0696 | 0.1881 | 0.1879 |
| 1.6978 | 6.8 | 170 | 3.0353 | 0.2421 | 0.0718 | 0.1922 | 0.1923 |
| 1.6149 | 7.2 | 180 | 3.0481 | 0.241 | 0.0717 | 0.1925 | 0.1925 |
| 1.6909 | 7.6 | 190 | 3.0738 | 0.2422 | 0.0715 | 0.1931 | 0.193 |
| 1.8133 | 8.0 | 200 | 3.0847 | 0.2442 | 0.0722 | 0.1942 | 0.1944 |
| 1.4801 | 8.4 | 210 | 3.0897 | 0.2416 | 0.0717 | 0.1939 | 0.194 |
| 1.5867 | 8.8 | 220 | 3.0997 | 0.2406 | 0.072 | 0.1933 | 0.1932 |
| 1.651 | 9.2 | 230 | 3.1093 | 0.2413 | 0.0716 | 0.1933 | 0.1932 |
| 1.4213 | 9.6 | 240 | 3.1188 | 0.243 | 0.0722 | 0.1938 | 0.1938 |
| 1.4747 | 10.0 | 250 | 3.1211 | 0.2428 | 0.0719 | 0.1938 | 0.1938 |
Framework versions
- Transformers 4.57.3
- Pytorch 2.9.1+cu128
- Datasets 3.6.0
- Tokenizers 0.22.1
- Downloads last month
- 2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for taiypeo/bart-large-reddit_tifu-sentence-paraphrased-100-cnt-supervised-basic
Base model
facebook/bart-large