Safetensors
gemma3
SAnocha commited on
Commit
02ae669
·
verified ·
1 Parent(s): e4c5094

Update REAME

Browse files
Files changed (1) hide show
  1. README.md +19 -26
README.md CHANGED
@@ -1,27 +1,25 @@
1
  ---
2
  license: gemma
3
  ---
4
- *Gemma-SEA-LION-v4-27B (Base Model) Last updated: 2025-08-21*
5
-
6
- ---
7
-
8
  # Model Card for Gemma-SEA-LION-v4-27B
9
 
10
  <!-- Provide a quick summary of what the model is/does. -->
11
 
12
- Last updated: 2025-08-21
13
 
14
 
15
  Gemma-SEA-LION-v4-27B is based on Gemma 3 (which supports over 100 languages)
16
- and is a multilingual model which has undergone continued pre-training on approximately **500B** tokens across 11 SEA languages:
17
- Bahasa Indonesia, Burmese, Chinese, English, Khmer, Lao, Malay, Tagalog, Tamil, Thai and Vietnamese.
 
 
 
18
 
19
- **SEA-LION** is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned
20
- for the Southeast Asia (SEA) region
21
 
22
- Gemma-SEA-LION-v4-27B inherits image and text understanding capabilities, including document comprehension,
23
- visual Q&A, and image-grounded reasoning. It supports a large 128K context length.
24
- Advanced function calling and structured outputs allow for seamless integration into larger systems.
25
 
26
 
27
  ## Model Details
@@ -176,6 +174,7 @@ The cutoff date of this version is September 2020.
176
 
177
  - We utilized 0.5% of synthetically generated datasets for the low-resource language, Khmer.
178
 
 
179
  ### Training Procedure
180
 
181
  <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
@@ -254,10 +253,6 @@ The following metrics were used:
254
  Coming soon.
255
 
256
 
257
- #### Summary
258
-
259
- TBC
260
-
261
 
262
  ## Environmental Impact
263
 
@@ -283,16 +278,14 @@ For more info, please contact us at sealion@aisingapore.org
283
 
284
  ## Team
285
 
286
- Adithya Venkatadri Hulagadri, Adwin Chan Hok Teng, Anocha Sutaveephamochanon,
287
- Brandon Ong Jin Jie, Bryan Siow Wei Kang, David Ong Tat-Wee, Esther Choa Hsueh Mei,
288
- Evelyn Tan Chor Phin, Hamsawardhini Rengarajan, Huang Yuli, Jann Railey Estrada Montalan,
289
- Jessica Tan Siao Wei, Jonathan Heng, Karthik Nagarajan, Lee Chwan Ren, Leong Wai Yi,
290
- Leong Wei Qi, Leslie Teo, Mark Pereira, Muhammad Ridzuan Bin Mokhtar, Ngee Chia Tai,
291
- Ngui Jian Gang, Nguyen Thanh Ngan, Nicholas Cheng Zi Yi, Ong Zhi Hao, Peerat Limkonchotiwat,
292
- Raymond Ng Boon Cheong, Sajeban Antonyrex, Susanto Yosephine, Tan Choon Meng,
293
- Walter Teng Kok Wai, Wayne Lau, William Tjhi Chandra, Yeo Yeow Tong, Yong Xianbin,
294
- Liew Rachel, Liu Bing Jie Darius, Teo Wei Yi, Lin Zhou, Roshan Gopalakrishnan, Cuahtemoc Anda,
295
- Sri Devi Wijaya and Partha Nandi
296
 
297
 
298
  ## Contact
 
1
  ---
2
  license: gemma
3
  ---
 
 
 
 
4
  # Model Card for Gemma-SEA-LION-v4-27B
5
 
6
  <!-- Provide a quick summary of what the model is/does. -->
7
 
8
+ Last updated: 2025-08-22
9
 
10
 
11
  Gemma-SEA-LION-v4-27B is based on Gemma 3 (which supports over 100 languages)
12
+ and is a multilingual model which has undergone continued pre-training on approximately **500B** tokens
13
+ sampled from a bucket of over one trillion tokens across 11 SEA languages: Bahasa Indonesia, Burmese, Chinese,
14
+ English, Khmer, Lao, Malay, Tagalog, Tamil, Thai and Vietnamese.
15
+
16
+ Gemma-SEA-LION-v4-27B inherits the following features from Gemma 3:
17
 
18
+ - A large 128K context length,
 
19
 
20
+ - Image and text understanding capabilities, including document comprehension, visual question answering, and image-grounded reasoning,​
21
+
22
+ - Advanced function calling and structured output capabilities to facilitate seamless integration into larger systems.
23
 
24
 
25
  ## Model Details
 
174
 
175
  - We utilized 0.5% of synthetically generated datasets for the low-resource language, Khmer.
176
 
177
+
178
  ### Training Procedure
179
 
180
  <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
 
253
  Coming soon.
254
 
255
 
 
 
 
 
256
 
257
  ## Environmental Impact
258
 
 
278
 
279
  ## Team
280
 
281
+ Antonyrex Sajeban, Chan Hok Teng Adwin, Cheng Zi Yi Nicholas, Choa Hsueh Mei Esther, Heng Jonathan, Huang Yuli, Hulagadri Adithya Venkatadri,
282
+ Jann Railey Estrada Montalan, Kang Siow Wei Bryan, Lau Wayne, Lee Chwan Ren, Leong Wai Yi, Leong Wei Qi,
283
+ Limkonchotiwat Peerat, Muhammad Ridzuan Bin Mokhtar, Nagarajan Karthik, Ng Boon Cheong Raymond, Ngee Chia Tai,
284
+ Ngui Jian Gang, Nguyen Thanh Ngan, Ong Jin Jie Brandon, Ong Tat-Wee David, Ong Zhi Hao, Pereira Mark,
285
+ Rengarajan Hamsawardhini, Susanto Yosephine, Sutaveephamochanon Anocha, Tan Choon Meng, Tan Chor Phin Evelyn,
286
+ Tan Siao Wei Jessica, Teng Kok Wai Walter, Teo Eng Sipp Leslie, Tjhi William, Yeo Yeow Tong, Yong Xianbin,
287
+ Liew Rachel, Liu Bing Jie Darius, Teo Wei Yi, Lin Zhou (NCS), Roshan Gopalakrishnan (NCS), Cuahtemoc Anda (NCS),
288
+ Sri Devi Wijaya (NCS), Partha Nandi (NCS)
 
 
289
 
290
 
291
  ## Contact