Typhoon Logo
TYPHOON
Typhoon ASR Real-Time

Typhoon ASR Real-Time

General
114M

Real-time Thai speech-to-text, fast, accurate, and affordable—from laptops to enterprise.

Released
Sep 8, 2025
Context
Input
Audio
Output
Text
Typhoon ASR Real-Time
About this Model

A streaming-first, open-source Thai speech recognition model designed for real-world applications, Typhoon ASR Real-Time delivers fast, accurate, and low-latency transcriptions while running efficiently on CPUs and compact GPUs. Built on NVIDIA NeMo’s fastConformer-transducer-large architecture, it overcomes the limitations of existing ASR systems like Whisper by enabling true streaming without costly hardware or cloud dependencies. Optimized for accessibility, privacy, and affordability, Typhoon ASR Real-Time democratizes real-time Thai speech-to-text—from classrooms and small businesses to enterprise-scale deployments.

Key Features
⚡ True streaming capability
Delivers near-instant Thai transcriptions as audio arrives
💻 CPU-optimized performance
Runs efficiently on standard hardware without relying on costly GPUs
🔒 Privacy-first design
Supports full on-premises deployment, ensuring sensitive audio stays under your control.
🎯 Fine-tuning accessibility
Compact enough to be customized with minimal resources, even on Google Colab
💰 Low-cost deployment
Cheap to host and run, making real-time Thai ASR truly democratized
Release History
Version 1
Sep 8, 2025
Initial release