Thailand's Frontier AI Research Lab
Advancing open-source LLMs and multimodal Thai language technologies with global-level research and local community engagement
Try our Models
Typhoon: the #1 open-source Thai large language models (LLMs), with the most downloads and usage by the community. Experience the power of AI across multiple modalities.
Explore Our AI Models
Discover a wide range of powerful AI models designed to enhance your applications and workflows.
Latest Publications
Explore our latest research publications covering Thai language models, multimodal systems, and evaluation frameworks.
Typhoon 2: A Family of Open Text and Multimodal Thai Large Language Models
This paper presents Typhoon 2, Thai-optimized models for text, vision, and audio. It outlines methods like continual pre-training and post-training to enhance Thai performance, with evaluation across tasks. The series includes models from 1 to 70 billion parameters, safety tools, and advances in document understanding and speech processing.
Typhoon: Thai Large Language Models
The Typhoon series introduces Thai LLMs optimized for low-resource challenges, using continual training and ThaiExam for evaluation. Fine-tuned for Thai tasks, Typhoon outperforms open-source models and rivals GPT-3.5 in Thai, with greater efficiency.
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models
CrossCheckGPT introduces a reference-free method for ranking hallucinations in multimodal foundation models, leveraging cross-system consistency as a measure of robustness. Applicable across domains and tasks, it uses explicit and implicit consistency metrics to assess hallucination levels. The method demonstrates high correlation with human judgments and supports new benchmarks, including the first audio-visual hallucination benchmark, AVHalluBench. In collaboration with the University of Cambridge and Tsinghua University.
Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models
This paper evaluates audio language models in low-resource languages, using Thai as an example, revealing their limitations despite multilingual pretraining. It explores data mixtures to optimize models for both a target language and English, integrating audio comprehension and speech instruction-following into a unified framework. The proposed model, Typhoon-Audio, significantly outperforms open-source models and rivals state-of-the-art systems like Gemini-1.5-Pro in both English and Thai.
Our Events
Driving Innovation Across Industries
Discover how organizations and researchers are leveraging Typhoon AI to transform their work.
"Typhoon is an LLM built specifically for the Thai language. The model understands instructions written in Thai very well. In addition, its processing speed is on par with or faster than that of many commercial LLMs, which lets our team apply Typhoon to tasks such as Text2SQL and RAG conveniently and efficiently."
Data Analytics Team
SCB
Join the Typhoon Community
Connect with developers, researchers, and AI enthusiasts building the future of Thai language technology. Get support, share ideas, and stay updated with the latest models and features.
Trusted By