Current SME MS900611
A New Segmentation Method For Japanese Printed Documents Using
Publication date: 1990-06-01
This paper describes a new segmentation method for Japanese printed documents using layout knowledge. Most Japanese documents use both Japanese characters and English words. It is very difficult for a Japanese document recognition machine to determine whether a character string is an English word or a Japanese word. The fixed-pitch and mixed-pitch segmentation algorithms can extract English words from Japanese sentences. The paper concludes that a mixed-pitch segmentation algorithm for character recognition is improved.
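The abstract gives no algorithmic detail, so the following is only a rough illustration of the general distinction it names, not the paper's method. It assumes a text line has already been reduced to a vertical projection profile (ink pixels per column). Fixed-pitch cutting slices the line into equal-width cells, as suits Japanese characters printed on a regular grid; gap-based cutting, one common way to handle variable-pitch text such as English words, cuts wherever the profile drops to zero. The function names `fixed_pitch_cells` and `ink_runs` and the sample profile are hypothetical.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # (start_column, end_column), end exclusive


def fixed_pitch_cells(width: int, pitch: int) -> List[Span]:
    """Cut a line of `width` columns into equal cells of `pitch` columns.

    Assumption: the character pitch is already known.
    """
    if pitch <= 0:
        raise ValueError("pitch must be positive")
    return [(x, min(x + pitch, width)) for x in range(0, width, pitch)]


def ink_runs(profile: List[int]) -> List[Span]:
    """Return spans of consecutive columns whose projection value is non-zero.

    Blank columns (value 0) are treated as gaps between characters.
    """
    runs: List[Span] = []
    start = None
    for x, value in enumerate(profile):
        if value > 0 and start is None:
            start = x  # a run of ink begins
        elif value == 0 and start is not None:
            runs.append((start, x))  # the run ends at the first blank column
            start = None
    if start is not None:
        runs.append((start, len(profile)))  # run reaches the end of the line
    return runs


# Hypothetical 12-column profile: three ink runs of different widths.
profile = [3, 3, 3, 0, 2, 2, 0, 0, 4, 4, 4, 4]
grid_cells = fixed_pitch_cells(len(profile), pitch=4)
gap_cells = ink_runs(profile)
```

On this sample the grid cells are all four columns wide, while the gap-based spans follow the actual ink. A mismatch of that kind is one hint a recognizer could use to suspect proportional (for example English) text inside a fixed-pitch line.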
Classification information
Issuing body or category: Japan - 日本船用装置工业会 (Japan Ship Equipment Industry Association)