글로버메뉴 바로가기 본문 바로가기 하단메뉴 바로가기

논문검색은 역시 페이퍼서치

> 범태평양 응용언어학회 > Journal of Pan-Pacific Association of Applied Linguistics (Journal of PAAL ) > 20권 1호

Automated Scoring of L2 Spoken English with Random Forests

Automated Scoring of L2 Spoken English with Random Forests

( Yuichiro Kobayashi ) , ( Mariko Abe )

- 발행기관 : 범태평양 응용언어학회

- 발행년도 : 2016

- 간행물 : Journal of Pan-Pacific Association of Applied Linguistics (Journal of PAAL ), 20권 1호

- 페이지 : pp.55-73 ( 총 19 페이지 )


학술발표대회집, 워크숍 자료집 중 1,2 페이지 논문은 ‘요약’만 제공되는 경우가 있으니,

구매 전에 간행물명, 페이지 수 확인 부탁 드립니다.

5,900
논문제목
초록(외국어)
The purpose of the present study is to assess second language (L2) spoken English using automated scoring techniques. Automated scoring aims to classify a large set of learners` oral performance data into a small number of discrete oral proficiency levels. In automated scoring, objectively measurable features such as the frequencies of lexical and grammatical items are generally used as "exploratory variables" to predict oral proficiency levels, any of which can be used as a "criterion variable" in this study. We have chosen the NICT JLE Corpus, a corpus of 1,281 Japanese EFL learners` speech productions coded into nine oral proficiency levels (Izumi, Uchimoto, & Isahara, 2004). The nine oral proficiency levels were used as the criterion variables and linguistic features analyzed in Biber (1988) as explanatory variables. We employed random forests (Breiman, 2001), a powerful method for text classification and feature extraction, to predict oral proficiency. As a result of random forests with the out-of-bag error estimate, 60.11% of the productions were correctly classified. Compared to the baseline accuracy of the simplest possible algorithm of always choosing the most frequent level (37.63%), our random forests model improved prediction by 22.48 points. The Pearson product-moment correlation coefficient with human scoring was 0.85. Predictors that showed a clear discrimination of oral proficiency levels were tokens, types, and the frequency of nouns in the order of strength.

논문정보
  • - 주제 : 어문학분야 > 언어학
  • - 발행기관 : 범태평양 응용언어학회
  • - 간행물 : Journal of Pan-Pacific Association of Applied Linguistics (Journal of PAAL ), 20권 1호
  • - 발행년도 : 2016
  • - 페이지 : pp.55-73 ( 총 19 페이지 )
  • - UCI(KEPA) : I410-ECN-0102-2017-700-000553791
저널정보
  • - 주제 : 어문학분야 > 언어학
  • - 성격 : 학술지
  • - 간기 : 반년간
  • - 국내 등재 : KCI 등재
  • - 해외 등재 : -
  • - ISSN : 1345-8353
  • - 수록범위 : 1998–2021
  • - 수록 논문수 : 384