Qwen-SEA-LION-v4.5
Last update: 2026-05-19
SEA-LION is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
The Qwen-SEA-LION-v4.5-27B-IT sub-collection — comprising the standard high-fidelity model and its speed-optimized companion, the 27B-IT-SpecDecoder — is built upon the Qwen3.6-27B dense architecture, a 27-billion parameter model featuring a hybrid Linear and Full Attention design. To ensure deep domain adaptation, both models underwent extensive distillation from Qwen/Qwen3.5-397B-A17B on the updated aisingapore/SEA-Instruct-2602 dataset. This instills native multilingual and multicultural fluency across English and key SEA languages (Burmese, Indonesian, Filipino, Malay, Tamil, Thai, and Vietnamese), with the SpecDecoder variant specifically engineered to maximize throughput and minimize inference latency in production environments.
Qwen-SEA-LION-v4.5-27B-IT-SpecDecoder is a draft model using speculative decoding method to employ a lightweight block diffusion model to draft multiple tokens in parallel trained from Qwen-SEA-LION-v4.5-27B-IT. This is the drafter model, which must be paired with aisingapore/Qwen-SEA-LION-v4.5-27B-IT.
Qwen-SEA-LION-v4.5-27B-IT inherits the following features from Qwen3.6:
Context Window (262K): Reduce if facing OOM errors but keep ≥128K to preserve full reasoning capabilities.
Unified Vision-Language: Early fusion training delivers good performance across multimodal reasoning, coding, and visual tasks.
Scalable RL: Trained in million-agent environments for robust, real-world SEA adaptability.
Broad Linguistic Coverage: Deeply specialized in SEA cultural nuances while supporting 201 languages globally.
Advanced Infrastructure: Utilizes highly efficient multimodal training and asynchronous RL frameworks.
Agentic Coding: High-precision handling of repository-level reasoning and frontend workflows.
Thinking Preservation: Retains historical reasoning context to streamline iterative development and reduce compute overhead.
Model Details
Model Description
SEA-LION stands for Southeast Asian Languages In One Network.
We performed post-training in English and SEA languages on Qwen3.6-27B, a multimodal learning model using the Qwen3.6 architecture, to create Qwen-SEA-LION-v4.5.
For tokenization, the model employs the default tokenizer used in Qwen3.6.
Developed by: AI Products Pillar, AI Singapore
Funded by: Singapore NRF
Shared by: AI Products Pillar, AI Singapore
Model type: Causal Language Model with Vision Encoder
Training Stage: Post-Training (Logit Distillation & Model Merging))
Context length: 262k
Language(s): fine-tuned on Burmese, Indonesian, Filipino, Malay, Tamil, Thai, and Vietnamese
License: Apache-2.0
Finetuned from model: https://huggingface.co/Qwen/Qwen3.6-27B
SpecDecoder was Finedtuned from z-lab/Qwen3.6-27B-DFlash targeted to Qwen-SEA-LION-v4.5-27B-IT
Model Sources
Qwen-SEA-LION-v4.5-27B-IT models are available for download via the following channels:
HuggingFace SEA-LION v4.5 Collection
Qwen-SEA-LION-v4.5-27B-IT
Qwen-SEA-LION-v4.5-27B-IT-SpecDecoder
How to Get Started with the Model
Use the code below to get started with the model with 🤗 Transformers libraries.
Tool Calling example
Agentic Example:
Output
Use the code below to get aisingapore/Qwen-SEA-LION-v4.527B-IT-SpecDecoder booster with vLLM.
Training Details
Training Data
🤗aisingapore/SEA-Instruct-2602
Training Regime
Our post-training workflow consists solely of distillation and model merging.
Evaluation
Testing Data, Factors & Metrics
We evaluated Qwen-SEA-LION-v4.5 on general language, multi-turn chat and instruction-following capabilities.
Results
For details on Qwen-SEA-LION-v4.5-27B-IT performance, please refer to the SEA-LION Leaderboard.
Technical Specifications
Model Architecture
The architecture is based on the highly efficient Qwen3.6 foundation. The detailed architecture can be found at https://huggingface.co/Qwen/Qwen3.6-27B#model-overview.
Uses
Out-of-Scope Use
The model has not been aligned for safety. Developers and users should perform their own safety fine-tuning and related security measures. In no event shall the authors be held liable for any claims, damages, or other liabilities arising from the use of the released weights and codes.
Bias, Risks, and Limitations
The model was not tested for robustness against adversarial prompting. It is important for users to be aware that our model exhibits certain limitations that warrant consideration. Like many LLMs, the model can hallucinate and occasionally generates irrelevant content, introducing fictional elements that are not grounded in the provided context. Users should also exercise caution in interpreting and validating the model's responses due to the potential inconsistencies.
Last updated