Next-generation bilingual vision-language model for document parsing with superior Thai document understanding.
A next-generation, bilingual vision-language document parsing model built for real-world use cases, TYPHOON OCR delivers structured, layout-aware, and semantically rich outputs. It outperforms both GPT-4o and Gemini 2.5 Flash in Thai document understanding, particularly on documents with complex layouts and mixed-language content.