• DocumentCode
    186231
  • Title

    Computational model for syntactic development: Identifying how children learn to generalize nouns and verbs for different languages

  • Author

    Kawai, Yusuke ; Oshima, Yoshiaki ; Sasamoto, Yuki ; Nagai, Yukie ; Asada, Minoru

  • Author_Institution
    Grad. Sch. of Eng., Osaka Univ., Suita, Japan
  • fYear
    2014
  • fDate
    13-16 Oct. 2014
  • Firstpage
    117
  • Lastpage
    123
  • Abstract
    By three years of age, children are supposed to start learning to understand syntactic structures, and at around five years of age, they are reported to be able to infer a syntactic category, such as a noun or a verb, for a novel word. Finding the syntactic cue enables them to infer a target directed by a novel word in visual stimuli. The study also found that their inference performances depended on their native languages. In this article, we propose a model to explain how children learn to generalize novel nouns and verbs in the Japanese, English, and Chinese languages. We use a Bayesian hidden Markov model (BHMM) to learn syntactic categories represented as hidden states in a BHMM. Here, an increase in the number of hidden states indicates the children´s syntactic development. A model with a larger number of hidden states is able to infer a clearer syntactic category of a novel word, resulting in the correct choice of a category for the visual target. Syntactic categories that depend on input languages are acquired by BHMMs, and therefore result in different performances among the languages. We entered English-, Japanese-, or Chinese-corpus into the model and examined how the model inferred a correct target indicated by a novel word through the acquired syntactic categories. The results showed that the performances by our model are very similar to the children´s performances. Further analysis of representations of hidden states clarified that the model acquires syntactic categories reflecting orders of words in English, suffixes in Japanese, and adverbs in Chinese.
  • Keywords
    computer aided instruction; hidden Markov models; natural language processing; BHMM; Bayesian hidden Markov model; Chinese language; English language; Japanese language; children learning; syntactic category; syntactic cue; syntactic development; syntactic structures; visual stimuli; Computational modeling; Estimation; Hidden Markov models; Speech; Standards; Syntactics; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2014 Joint IEEE International Conferences on
  • Conference_Location
    Genoa
  • Type

    conf

  • DOI
    10.1109/DEVLRN.2014.6982965
  • Filename
    6982965