Qwen and Llama SEA-GUARD

SEA-Guard is a collection of safety-focused Large Language Models (LLMs) built upon the SEA-LION family, designed specifically for the Southeast Asia (SEA) region.

Model Details

Model Description

SEA-LION stands for Southeast Asian Languages In One Network and is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.

This model is a fine-tuned version of aisingapore/Qwen-SEA-LION-v4-4B-VL, aisingapore/Qwen-SEA-LION-v4-8B-VL and aisingapore/Llama-SEA-LION-v3-8B-IT on 1M instruction-following pairs. For more details on training data, please refer to the paper SEA-Guard.

For tokenization, the model employs the default tokenizer used in Qwen3-VL.

Developed by: AI Products Pillar, AI Singapore
Funded by: Singapore NRF
Shared by: AI Products Pillar, AI Singapore
Model type: Decoder
Context length: 128k tokens
Language(s) (text): Burmese, English, Indonesian, Malay, Tagalog, Tamil, Thai, and Vietnamese
License:
- Qwen : Apache-2.0
- Llama : Llama 3.1 Community License
Finetuned from model: aisingapore/Qwen-SEA-LION-v4-4B-VL, aisingapore/Qwen-SEA-LION-v4-8B-VL and aisingapore/Llama-SEA-LION-v3-8B-IT

Download the Models

Qwen and Llama SEA-Guard models are available for download via the following channels:

Repository: 🤗HuggingFace SEA-Guard Collection

Intended Uses and Limitations

This model is optimized to return a binary classification in text form: ["safe", "unsafe"]. However, users must be aware that the model is subject to the limitations common to generative AI, including the potential to hallucinate or generate ungrounded, irrelevant text. Due to these inherent risks, human oversight is advised, and the model’s outputs should not be treated as absolute determinations without secondary verification.

Uses

Direct Use

The output of the model is only "safe" or "unsafe". Users can directly use it without any finetune or in-context learning since it is already trained with cultural safety for SEA contexts.

Downstream Use

Users can also continue training this model further on the target tasks, e.g., vision-text safety datasets. Also, this model is supported by vLLM for fast inference.

Training and evaluation data

For more details on training data, please refer to the paper SEA-Guard.

Training procedure

We employ a supervised-finetuning technique (SFT) on Llama-factory with the following hyperparameters.

Testing Data, Factors & Metrics

We use SEA-SafeguardBench to evaluate our SEA-Guard. Note that we also evaluated the vision-text safety classification in our research paper

Citation

BibTeX:

@misc{tasawong2026seaguardculturallygroundedmultilingual,
      title={SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia}, 
      author={Panuthep Tasawong and Jian Gang Ngui and Alham Fikri Aji and Trevor Cohn and Peerat Limkonchotiwat},
      year={2026},
      eprint={2602.01618},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2602.01618}, 
}

More Information

This is the repository for the commercial instruction-tuned model. Notwithstanding the model's safety-aligned training, developers and users are advised to conduct their own safety fine-tuning and implement appropriate security measures. In no event shall the authors be held liable for any claims, damages, or other liabilities arising from the use of the released weights and codes.

AI Singapore is a national programme supported by the National Research Foundation, Singapore and hosted by the National University of Singapore. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of the National Research Foundation or the National University of Singapore.

Contact

[email protected]

PreviousGemma-SEA-GUARD-12B NextGetting the models

Last updated 1 day ago

hashtagModel Details

hashtagModel Description

hashtagDownload the Models

hashtagIntended Uses and Limitations

hashtagUses

hashtagDirect Use

hashtagDownstream Use

hashtagTraining and evaluation data

hashtagTraining procedure

hashtagTesting Data, Factors & Metrics

hashtagCitation

hashtagMore Information

hashtagContact