• DocumentCode
    3681402
  • Title

    A learning-based approach for Romanian syllabification and stress assignment

  • Author

    Diana Balc;Anamaria Beleiu;Rodica Potolea;Camelia Lemnaru

  • Author_Institution
    Computer Science Department, Technical University of Cluj-Napoca, Romania
  • fYear
    2015
  • Firstpage
    37
  • Lastpage
    42
  • Abstract
    This paper tackles the Romanian syllabification and stress assignment problems, and proposes an efficient machine learning based solution. We show that by designing the appropriate feature sets for each specific problem, learning algorithms achieve satisfactory accuracy rates for both problems (~92% for syllabification, ~85% for stress assignment), even for relatively small training set sizes. We have found that unigram-based features are powerful enough to characterize these problems, and therefore the introduction of bi-gram or tri-gram features (often utilized in syllabification problems for other languages) is unnecessary.
  • Keywords
    "Stress","Accuracy","Training","Radio frequency","Dictionaries","Support vector machines","Speech"
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computer Communication and Processing (ICCP), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/ICCP.2015.7312603
  • Filename
    7312603