Hot Topic

Market News

Events & Promo

Career Tips

Education News

Health & Life

PRNewswire

Unisound U1-OCR: The First Industrial-Grade Document Intelligence Foundation Model Ushering in the OCR 3.0 Era

Publish date: 26 Feb 2026

Stay updated on the job market

Popular Articles

【最新失業率】本港失業率維持3.7% 失業人數升至139,200人

【AI大軍來襲】機械人搶人類飯碗打工仔如何自保？

【打工仔必備Social技巧】4招教你打破 Dead Air

【Fresh Grad求生指南】初入職場唔知點算好?4招助你成功融入職場

私人駕駛教師執照2026 332個「師傅牌」5月11日起接受申請

Unisound Unveils U1-OCR: The First Industrial-Grade Document Intelligence Model, Ushering in OCR 3.0 Era

BEIJING, Feb. 26, 2026 /PRNewswire/ -- Unisound has officially launched its Unisound U1-OCR, the world's first industrial-grade foundation model for document intelligence, a groundbreaking release that ushers in the OCR 3.0 era and sets a new industry standard with five core strengths: SOTA performance, verifiable results, out-of-the-box functionality, efficient deployment, and robust adaptability.

Document intelligence leverages AI to automatically read, understand, classify digitized documents and extract key information. OCR 1.0 only enabled basic text recognition, while OCR 2.0 added preliminary layout understanding capabilities. U1-OCR takes a quantum leap to OCR 3.0, moving far beyond layout recognition to deliver deep semantic insight, automatic document classification and business-level information extraction—marking a transformative shift from "character perception" to "document cognition".

As a SOTA-level document intelligence model, U1-OCR resolves the longstanding bottleneck of traditional models that "recognize text but fail to grasp layout", enabling it to interpret complex documents like human experts. It pioneers a "semantic-driven + dynamic focus" strategy, first mapping a document's hierarchical structure of headings and structural metadata before extracting content on demand, and builds a semantic map to identify the relationship between titles, charts and text—even in disorganized layouts. Its enhanced spatial alignment module leverages positional data to accurately restore document structure for dense tables and mixed text-image content, effectively mitigating spatial recognition errors. Equipped with Multi-Token Prediction technology and full-task reinforcement learning, it boosts reasoning efficiency by over 80%, ensuring logical coherence for long documents.

Trained with multi-task collaborative reinforcement learning and optimized for both semantics and coordinates, U1-OCR suppresses spatial hallucinations for reliable outputs, and achieves SOTA results across major authoritative benchmarks: scoring 95.1 in OmniDocBench V1.5, outperforming leading models like GLM-OCR and Gemini-3-Pro; hitting an F1 score of 90.8 in D4LA and 95.9 in DocLayNet, excelling in table recognition and cross-page association; and outperforming models such as Gemini-2.5-Flash and Qwen-2.5-VL in internal business tests, with standout performance in medical document processing such as admission and discharge records.

Figure：Comparison of Unisound U1-OCR Evaluation Scores on OmniDocBench V1.5

Built for real-world industrial applications, U1-OCR features four key capabilities that bridge the gap between document understanding and business action. Its proprietary "coordinate-text-semantics" architecture enables pixel-level positioning and full evidence traceability, making audit processes transparent and efficient. Integrated with Unisound's industry expertise in healthcare and finance, it achieves over 99% classification accuracy for more than 50 common business documents, supporting cross-field logical verification with zero-shot capabilities. It supports private on-premise and offline deployment while delivering highly efficient document processing, meeting strict data privacy requirements for government, healthcare, and finance sectors while lowering hardware costs. Most notably, it delivers stable, high-precision performance in extreme scenarios—including non-standard photos, blurred documents, complex formatting and multilingual text—freeing businesses from reliance on standardized document formats.

Validated in real-world use cases, U1-OCR enables visual traceability of extracted information, automatic classification of mixed documents, performing intelligent image purification for cluttered layouts, and accurate recognition of complex nested tables with full structural retention.

The launch of U1-OCR marks AI's evolution from simple text recognition to business logic comprehension, a key step for Unisound toward AGI. By taking multimodal documents as a knowledge entry point, Unisound is empowering machines with autonomous reasoning and evidence traceability capabilities, driving AI from perceptual intelligence to cognitive intelligence—with the vision to build a general intelligent agent that reads, thinks and solves complex problems like humans, turning every document into a stepping stone to AGI.

Stay updated on the job market

Hottest Tags

#AI自動化

#裁員潮

#政府統計處

#失業率

#本港失業率

#就業不足率

#工程業

#銀行業

#機械人搶飯碗

#德意志銀行

#牛津大學

#Google

Jobs you may interested

分店經理(薪金面議) RX Beauty

14 days ago

Management Trainee (Welcome fresh grad) Sunlife HK 永明金融

4 days ago

AEON百貨(收銀售貨員 - 家電) AEON Stores (Hong Kong) Co., Ltd

8 days ago

全職冷氣技工招聘 Good Yield Property Management Limited (Waldorf Garden) 高耀物業管理有限公司 - 華都花園

10 days ago

經驗美容顧問(月入可達$60,000或以上) RX Beauty

14 days ago

Concierge Supervisor Hopewell Hotel 合和酒店

14 days ago

籌募大使【薪金可達$20,000 - $25,000+】 國際培幼會 Plan International Hong Kong

14 days ago

麵包學徒(固定更返早收早) simplylife

14 days ago

Café Attendant Starbucks

14 days ago

💎招聘見習營業員👑 香港置業(地產代理)有限公司

14 days ago

Recommeneded Jobs

Come and Join Us 誠邀您成為SSP一份子！ Select Service Partner Hong Kong Limited

Negotiable

New Territories- Chek Lap Kok,HK International Airport,Tung Chung, Hong Kong Island- The Peak

14 days ago

Full time

分店經理(薪金面議) RX Beauty

面議

Anywhere in Hong Kong

14 days ago

Full time

【新分店: 東涌/尖東】BARISTA / Shift Supervisor Starbucks

Negotiable

Kowloon- Tsim Sha Tsui, New Territories- Tung Chung

14 days ago

Full time Part time

LOG-ON招聘日(4月) LOG-ON

$13,800 per Month

Anywhere in Hong Kong

14 days ago

Full time Part time

Articles you may be interested in

【最新失業率】本港失業率維持3.7% 失業人數升至139,200人

Market News

【AI大軍來襲】機械人搶人類飯碗打工仔如何自保？

Hot Topic

【打工仔必備Social技巧】4招教你打破 Dead Air

Career Tips

【Fresh Grad求生指南】初入職場唔知點算好?4招助你成功融入職場

Career Tips

私人駕駛教師執照2026 332個「師傅牌」5月11日起接受申請

Market News

公司立場同你唔同打工仔如何自保？

Hot Topic

香港中醫醫院舉辦「招聘日」醫・教・研並重招募中醫師、西醫等支援及行政人才加入中西醫協作平台

Events & Promo

【報稅2026】「綠色炸彈」來襲！填表流程、時間表+扣稅全攻略

Market News

「先問AI再問老闆」成職場新文化

Hot Topic

公司要你借400萬？警惕求職借貸騙案

Hot Topic

科大醫學大樓動土 2028年落成培育新一代醫學人才

Education News

香港都會大學李嘉誠專業進修學院舉辦非洲文化之夜啟動2026中非人文交流年

Education News

運輸署擬增發332張「教車師傅牌」 5月起接受申請最快7月考試

Market News

【網民熱話】錢大媽8萬高薪招聘「豬肉分割師」網民: 仲高人工過3大畢業生

Market News

旅行唔止放鬆更能提升工作效率

Hot Topic

研究指工作電郵亂加Emoji反而令你失去專業感

Career Tips

對工作愈不滿 40歲後健康出現警號

Health & Life

7種最難有拖拍職業

Hot Topic

平等只係口號？職場歧視無處不在

Hot Topic

打工仔6種最常見職業病

Health & Life

【網路廣告術語知多D】2026數位廣告趨勢最新廣告KPI大洗牌

Hot Topic

女性不宜返夜班？研究指工作表現遜男性

Health & Life

【打工仔注意】MPF強積金供款下限擬升至1.05萬供款上限每月增至2000元

Market News

【綜合招聘考試CRE】公務員考試新一輪綜合招聘考試 3.28接受報名大學3年級生可投考

Market News

More Job Categories: Manufacturing Jobs Government Jobs Publish Jobs Catering Jobs

Find Jobs

Recruitment Day

Career News

Learning

Contact us

Find Jobs

Recruitment Day

Career News

Learning

Contact us

Unisound U1-OCR: The First Industrial-Grade Document Intelligence Foundation Model Ushering in the OCR 3.0 Era

Follow us

Popular Articles

Follow us

Popular Articles

Hottest Tags

Jobs you may interested

You may also like

Find Jobs

Recruitment Day

Career News

Learning

Recommeneded Jobs

Articles you may be interested in