Update README
README.md
CHANGED
@@ -1,27 +1,25 @@
1     ---
2     license: gemma
3     ---
4   - *Gemma-SEA-LION-v4-27B (Base Model) Last updated: 2025-08-21*
5   -
6   - ---
7   -
8     # Model Card for Gemma-SEA-LION-v4-27B
9
10    <!-- Provide a quick summary of what the model is/does. -->
11
12  - Last updated: 2025-08-
13
14
15    Gemma-SEA-LION-v4-27B is based on Gemma 3 (which supports over 100 languages)
16  - and is a multilingual model which has undergone continued pre-training on approximately **500B** tokens
17  -
18
19  -
20  - for the Southeast Asia (SEA) region
21
22  -
23  -
24  - Advanced function calling and structured
25
26
27    ## Model Details
@@ -176,6 +174,7 @@ The cutoff date of this version is September 2020.
176
177   - We utilized 0.5% of synthetically generated datasets for the low-resource language, Khmer.
178
179   ### Training Procedure
180
181   <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
@@ -254,10 +253,6 @@ The following metrics were used:
254   Coming soon.
255
256
257 - #### Summary
258 -
259 - TBC
260 -
261
262   ## Environmental Impact
263
@@ -283,16 +278,14 @@ For more info, please contact us at sealion@aisingapore.org
283
284   ## Team
285
286 -
287 -
288 -
289 -
290 -
291 -
292 -
293 -
294 - Liew Rachel, Liu Bing Jie Darius, Teo Wei Yi, Lin Zhou, Roshan Gopalakrishnan, Cuahtemoc Anda,
295 - Sri Devi Wijaya and Partha Nandi
296
297
298   ## Contact

1     ---
2     license: gemma
3     ---
4     # Model Card for Gemma-SEA-LION-v4-27B
5
6     <!-- Provide a quick summary of what the model is/does. -->
7
8   + Last updated: 2025-08-22
9
10
11    Gemma-SEA-LION-v4-27B is based on Gemma 3 (which supports over 100 languages)
12  + and is a multilingual model which has undergone continued pre-training on approximately **500B** tokens
13  + sampled from a bucket of over one trillion tokens across 11 SEA languages: Bahasa Indonesia, Burmese, Chinese,
14  + English, Khmer, Lao, Malay, Tagalog, Tamil, Thai and Vietnamese.
15  +
16  + Gemma-SEA-LION-v4-27B inherits the following features from Gemma 3:
17
18  + - A large 128K context length,
19
20  + - Image and text understanding capabilities, including document comprehension, visual question answering, and image-grounded reasoning,
21  +
22  + - Advanced function calling and structured output capabilities to facilitate seamless integration into larger systems.
23
24
25    ## Model Details

174
175   - We utilized 0.5% of synthetically generated datasets for the low-resource language, Khmer.
176
177 +
178   ### Training Procedure
179
180   <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->

253   Coming soon.
254
255
256
257   ## Environmental Impact
258

278
279   ## Team
280
281 + Antonyrex Sajeban, Chan Hok Teng Adwin, Cheng Zi Yi Nicholas, Choa Hsueh Mei Esther, Heng Jonathan, Huang Yuli, Hulagadri Adithya Venkatadri,
282 + Jann Railey Estrada Montalan, Kang Siow Wei Bryan, Lau Wayne, Lee Chwan Ren, Leong Wai Yi, Leong Wei Qi,
283 + Limkonchotiwat Peerat, Muhammad Ridzuan Bin Mokhtar, Nagarajan Karthik, Ng Boon Cheong Raymond, Ngee Chia Tai,
284 + Ngui Jian Gang, Nguyen Thanh Ngan, Ong Jin Jie Brandon, Ong Tat-Wee David, Ong Zhi Hao, Pereira Mark,
285 + Rengarajan Hamsawardhini, Susanto Yosephine, Sutaveephamochanon Anocha, Tan Choon Meng, Tan Chor Phin Evelyn,
286 + Tan Siao Wei Jessica, Teng Kok Wai Walter, Teo Eng Sipp Leslie, Tjhi William, Yeo Yeow Tong, Yong Xianbin,
287 + Liew Rachel, Liu Bing Jie Darius, Teo Wei Yi, Lin Zhou (NCS), Roshan Gopalakrishnan (NCS), Cuahtemoc Anda (NCS),
288 + Sri Devi Wijaya (NCS), Partha Nandi (NCS)
289
290
291   ## Contact