

기계학습을 이용한 역사 텍스트의 저자판별 : 1920년대 『개벽』 잡지의 논설 텍스트

Machine Learning-Based Authorship Attribution for Historical Texts - a case study of GaeByeok magazine in the 1920s

Ji-myoung Choi (최지명)

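- Publisher : Korean Society for Language and Information (한국언어정보학회)

- Year of publication : 2018

- Journal : Language and Information (언어와 정보), Vol. 22, No. 1

- Pages : pp. 91-122 (32 pages in total)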


Abstract
This study demonstrates how authorship attribution techniques can be applied to historical texts. It explores the potential of authorship attribution as a solution to real-world authorship disputes and the possibility of multidisciplinary research that combines the humanities with quantitative text analytics. History and literary studies have traditionally addressed authorship problems by judging the similarity of topics and subject matter or by relying on extra-textual information. This subjective and anecdotal approach needs to be complemented by an objective, quantitative methodology that examines intra-textual clues. As a first case study, we performed a machine learning-based authorship attribution analysis of 164 opinion texts of unknown authorship from GaeByeok magazine of the 1920s. To enhance the accuracy and reliability of the analysis, we devised an improved machine learning algorithm based on SVM by incorporating three parameters, α, β, and θ, into the prediction model. The study also shows how to perform authorship attribution in an open setting rather than a closed setting. We hope that the prediction results will encourage and facilitate more productive discussion among related disciplines on the identification and verification of the authorship of real historical texts.
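
To make the closed-setting versus open-setting distinction concrete, the following is a minimal sketch, not the paper's algorithm. In a closed setting the classifier must pick one of the candidate authors; in an open setting the true author may be absent from the candidate list, so the classifier also needs a way to answer "none of them". The abstract does not define α, β, or θ, so the sketch makes no attempt to reproduce them. It assumes character bigram features, a plain linear SVM trained one-vs-rest with Pegasos-style subgradient steps, and a single hypothetical cutoff, `reject_below`. All function names, parameter values, and the toy English sentences are invented for illustration and are not taken from GaeByeok.

```python
from collections import Counter
import math

def char_ngrams(text, n=2):
    """L2-normalised character n-gram counts as a sparse dict."""
    counts = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {gram: c / norm for gram, c in counts.items()}

def score(weights, features):
    """Linear decision value w . x for sparse dicts."""
    return sum(weights.get(gram, 0.0) * v for gram, v in features.items())

def train_binary_svm(samples, targets, lam=0.01, epochs=300):
    """Linear SVM (hinge loss, L2 penalty, no bias term) fitted with
    Pegasos-style subgradient steps, cycling through the data in order."""
    weights, step = {}, 0
    for _ in range(epochs):
        for x, y in zip(samples, targets):
            step += 1
            eta = 1.0 / (lam * step)
            violated = y * score(weights, x) < 1.0
            for gram in weights:          # shrink step from the L2 penalty
                weights[gram] *= 1.0 - eta * lam
            if violated:                  # subgradient step from the hinge loss
                for gram, v in x.items():
                    weights[gram] = weights.get(gram, 0.0) + eta * y * v
    return weights

def train_one_vs_rest(texts, authors):
    """One binary SVM per candidate author (that author versus all others)."""
    samples = [char_ngrams(t) for t in texts]
    return {a: train_binary_svm(samples, [1 if b == a else -1 for b in authors])
            for a in sorted(set(authors))}

def attribute(models, text, reject_below=0.3):
    """Open-set decision: return the best-scoring candidate author, or None
    when even the best score falls below the hypothetical cutoff."""
    x = char_ngrams(text)
    best_author, best_score = max(
        ((a, score(w, x)) for a, w in models.items()), key=lambda pair: pair[1])
    return best_author if best_score >= reject_below else None

# Invented toy corpus with placeholder author labels (illustration only).
TOY_TEXTS = [
    "the nation must awaken through education and new culture",
    "new culture and education will awaken the spirit of the nation",
    "farmers suffer under heavy rent while landlords grow rich",
    "heavy rent and hunger drive poor farmers from their land",
]
TOY_AUTHORS = ["author_A", "author_A", "author_B", "author_B"]

models = train_one_vs_rest(TOY_TEXTS, TOY_AUTHORS)
print(attribute(models, TOY_TEXTS[0]))           # a text by a known candidate
print(attribute(models, "1920 1921 1922 1923"))  # shares no bigrams with the corpus
```

A closed-set version would return `best_author` unconditionally. The only change needed for the open setting in this sketch is the comparison against `reject_below`, whose value would have to be tuned on held-out data in any real analysis.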

Paper information
  • - Subject : Language and Literature > Linguistics
  • - UCI(KEPA) : I410-ECN-0102-2018-700-004278577
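Journal information
  • - Subject : Language and Literature > Linguistics
  • - Type : Academic journal
  • - Frequency : Semiannual
  • - Domestic indexing : KCI-listed
  • - International indexing : -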
  • - ISSN : 1226-7430
  • - Coverage : 1997–2022
  • - Number of articles : 328