글로버메뉴 바로가기 본문 바로가기 하단메뉴 바로가기

논문검색은 역시 페이퍼서치

> 한국언어정보학회 > 국제 워크샵 > 1995권 0호

The Postprocessing of Optical Character Recognition based on Statistical Noisy channel and language model

The Postprocessing of Optical Character Recognition based on Statistical Noisy channel and language model

( Jason J. S. Chang ) , ( Shun Der Chen )

- 발행기관 : 한국언어정보학회

- 발행년도 : 1995

- 간행물 : 국제 워크샵, 1995권 0호

- 페이지 : pp.127-131 ( 총 5 페이지 )


학술발표대회집, 워크숍 자료집 중 1,2 페이지 논문은 ‘요약’만 제공되는 경우가 있으니,

구매 전에 간행물명, 페이지 수 확인 부탁 드립니다.

4,500
논문제목
초록(외국어)
The techniques of image processing have been used in optical character recognition (OCR) for a long time. The recognition method evolved from early "pattern recognition" to "feature extraction" recently. The recognition rate is raised from 70% to 90%. But the character by character recognition technique has its limitation. Using language models to assist the OCR system in improving recognition rate is the topic of many recent researches. Recently, the related research on Chinese nature language processing has improved rapidly. These improvement include the Chinese word segmentation, syntax analysis, semantic analysis, collocation analysis, statistical language models. In this paper, we will propose a new techniques for Chinese OCR postprocessing and postediting. We combine noisy channel model and the technique of natural language processing to implement an OCR postprocessing system. From the result of experiments, we found noisy channel model very effective for postprocessing. Under the approach, it is possible to recover the correct character, even when it is not in the candidate list produced by the OCR system.

논문정보
  • - 주제 : 어문학분야 > 언어학
  • - 발행기관 : 한국언어정보학회
  • - 간행물 : 국제 워크샵, 1995권 0호
  • - 발행년도 : 1995
  • - 페이지 : pp.127-131 ( 총 5 페이지 )
  • - UCI(KEPA) : I410-ECN-0102-2015-700-001901808
저널정보
  • - 주제 : 어문학분야 > 언어학
  • - 성격 : 학술지
  • - 간기 : 기타
  • - 국내 등재 : -
  • - 해외 등재 : -
  • - ISSN :
  • - 수록범위 : 1983–2002
  • - 수록 논문수 : 265