Qwen-SEA-LION-v4.5

Last update: 2026-05-19

SEA-LION is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.

The Qwen-SEA-LION-v4.5-27B-IT sub-collection — comprising the standard high-fidelity model and its speed-optimized companion, the 27B-IT-SpecDecoder — is built upon the Qwen3.6-27B dense architecture, a 27-billion parameter model featuring a hybrid Linear and Full Attention design. To ensure deep domain adaptation, both models underwent extensive distillation from Qwen/Qwen3.5-397B-A17B on the updated aisingapore/SEA-Instruct-2602 dataset. This instills native multilingual and multicultural fluency across English and key SEA languages (Burmese, Indonesian, Filipino, Malay, Tamil, Thai, and Vietnamese), with the SpecDecoder variant specifically engineered to maximize throughput and minimize inference latency in production environments.

Qwen-SEA-LION-v4.5-27B-IT-SpecDecoder is a draft model using speculative decoding method to employ a lightweight block diffusion model to draft multiple tokens in parallel trained from Qwen-SEA-LION-v4.5-27B-IT. This is the drafter model, which must be paired with aisingapore/Qwen-SEA-LION-v4.5-27B-IT.

Qwen-SEA-LION-v4.5-27B-IT inherits the following features from Qwen3.6:

  • Context Window (262K): Reduce if facing OOM errors but keep ≥128K to preserve full reasoning capabilities.

  • Unified Vision-Language: Early fusion training delivers good performance across multimodal reasoning, coding, and visual tasks.

  • Scalable RL: Trained in million-agent environments for robust, real-world SEA adaptability.

  • Broad Linguistic Coverage: Deeply specialized in SEA cultural nuances while supporting 201 languages globally.

  • Advanced Infrastructure: Utilizes highly efficient multimodal training and asynchronous RL frameworks.

  • Agentic Coding: High-precision handling of repository-level reasoning and frontend workflows.

  • Thinking Preservation: Retains historical reasoning context to streamline iterative development and reduce compute overhead.

Model Details

Model Description

SEA-LION stands for Southeast Asian Languages In One Network.

We performed post-training in English and SEA languages on Qwen3.6-27B, a multimodal learning model using the Qwen3.6 architecture, to create Qwen-SEA-LION-v4.5.

For tokenization, the model employs the default tokenizer used in Qwen3.6.

  • Developed by: AI Products Pillar, AI Singapore

  • Funded by: Singapore NRF

  • Shared by: AI Products Pillar, AI Singapore

  • Model type: Causal Language Model with Vision Encoder

  • Training Stage: Post-Training (Logit Distillation & Model Merging))

  • Context length: 262k

  • Language(s): fine-tuned on Burmese, Indonesian, Filipino, Malay, Tamil, Thai, and Vietnamese

  • License: Apache-2.0

SpecDecoder was Finedtuned from z-lab/Qwen3.6-27B-DFlash targeted to Qwen-SEA-LION-v4.5-27B-IT

Model Sources

Qwen-SEA-LION-v4.5-27B-IT models are available for download via the following channels:

HuggingFace SEA-LION v4.5 Collection

Model
Download

Qwen-SEA-LION-v4.5-27B-IT

Qwen-SEA-LION-v4.5-27B-IT-SpecDecoder

How to Get Started with the Model

Use the code below to get started with the model with 🤗 Transformers libraries.

Tool Calling example

Agentic Example:

Output

Use the code below to get aisingapore/Qwen-SEA-LION-v4.527B-IT-SpecDecoder booster with vLLM.

Training Details

Training Data

🤗aisingapore/SEA-Instruct-2602

Training Regime

Our post-training workflow consists solely of distillation and model merging.

Evaluation

Testing Data, Factors & Metrics

We evaluated Qwen-SEA-LION-v4.5 on general language, multi-turn chat and instruction-following capabilities.

Results

For details on Qwen-SEA-LION-v4.5-27B-IT performance, please refer to the SEA-LION Leaderboard.

Technical Specifications

Model Architecture

The architecture is based on the highly efficient Qwen3.6 foundation. The detailed architecture can be found at https://huggingface.co/Qwen/Qwen3.6-27B#model-overview.

Uses

Out-of-Scope Use

The model has not been aligned for safety. Developers and users should perform their own safety fine-tuning and related security measures. In no event shall the authors be held liable for any claims, damages, or other liabilities arising from the use of the released weights and codes.

Bias, Risks, and Limitations

The model was not tested for robustness against adversarial prompting. It is important for users to be aware that our model exhibits certain limitations that warrant consideration. Like many LLMs, the model can hallucinate and occasionally generates irrelevant content, introducing fictional elements that are not grounded in the provided context. Users should also exercise caution in interpreting and validating the model's responses due to the potential inconsistencies.

Last updated