A Variable Break Prediction Method Using CART in a Japanese Text-to-Speech System
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Na, Deok-Su | - |
dc.contributor.author | Bae, Myung-Jin | - |
dc.date.available | 2018-05-10T15:21:43Z | - |
dc.date.created | 2018-04-17 | - |
dc.date.issued | 2009-02 | - |
dc.identifier.issn | 0916-8532 | - |
dc.identifier.uri | http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/15883 | - |
dc.description.abstract | Break prediction is an important step in text-to-speech systems, as break indices (BIs) have a great influence on how correctly prosodic phrase boundaries are represented. However, accurate prediction is difficult since BIs are often chosen according to the meaning of a sentence or the reading style of the speaker. In Japanese, the prediction of an accentual phrase boundary (APB) and a major phrase boundary (MPB) is particularly difficult. Thus, this paper presents a method to compensate for the prediction errors of an APB and an MPB. First, we define a subtle BI, for which it is difficult to decide clearly between an APB and an MPB, as a variable break (VB), and an explicit BI as a fixed break (FB). The VB is chosen using a classification and regression tree (CART), and multiple prosodic targets in relation to the pitch and duration are then generated. Finally, unit selection is conducted using the multiple prosodic targets. The experimental results show that the proposed method improves the naturalness of synthesized speech. | - |
dc.publisher | IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG | - |
dc.relation.isPartOf | IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | - |
dc.title | A Variable Break Prediction Method Using CART in a Japanese Text-to-Speech System | - |
dc.type | Article | - |
dc.identifier.doi | 10.1587/transinf.E92.D.349 | - |
dc.type.rims | ART | - |
dc.identifier.bibliographicCitation | IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, v.E92D, no.2, pp.349 - 352 | - |
dc.description.journalClass | 1 | - |
dc.identifier.wosid | 000265700800030 | - |
dc.identifier.scopusid | 2-s2.0-77950344445 | - |
dc.citation.endPage | 352 | - |
dc.citation.number | 2 | - |
dc.citation.startPage | 349 | - |
dc.citation.title | IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | - |
dc.citation.volume | E92D | - |
dc.contributor.affiliatedAuthor | Bae, Myung-Jin | - |
dc.type.docType | Article | - |
dc.description.oadoiVersion | published | - |
dc.subject.keywordAuthor | text-to-speech system | - |
dc.subject.keywordAuthor | break prediction | - |
dc.subject.keywordAuthor | variable break | - |
dc.description.journalRegisteredClass | scopus | - |
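
The last two steps of the abstract (generating multiple prosodic targets for a variable break, then running unit selection against them) can be illustrated with a minimal sketch. It is not the authors' implementation: the `Prosody` type, the target values in `TARGETS`, the cost function, and the candidate units are all hypothetical, the CART step is assumed to have already produced the label `"VB"`, `"APB"`, or `"MPB"`, and concatenation costs, which a real unit-selection system would also consider, are left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prosody:
    pitch_hz: float
    duration_ms: float


# Hypothetical target values per break type; not taken from the paper.
TARGETS = {
    "APB": Prosody(pitch_hz=180.0, duration_ms=90.0),
    "MPB": Prosody(pitch_hz=150.0, duration_ms=140.0),
}


def prosodic_targets(break_label):
    """A fixed break yields one target; a variable break (VB) yields both."""
    if break_label == "VB":
        return [TARGETS["APB"], TARGETS["MPB"]]
    return [TARGETS[break_label]]


def target_cost(unit, target):
    # Assumed cost: sum of relative differences in pitch and duration.
    return (abs(unit.pitch_hz - target.pitch_hz) / target.pitch_hz
            + abs(unit.duration_ms - target.duration_ms) / target.duration_ms)


def select_unit(candidates, break_label):
    """Pick the candidate whose cost to its closest target is lowest."""
    targets = prosodic_targets(break_label)
    return min(candidates,
               key=lambda u: min(target_cost(u, t) for t in targets))


# Hypothetical candidate units from a speech database.
candidates = [
    Prosody(175.0, 100.0),
    Prosody(152.0, 138.0),
    Prosody(165.0, 115.0),
]

print(select_unit(candidates, "VB"))
print(select_unit(candidates, "APB"))
```

With a fixed break, only one target constrains the search; with a variable break, a unit that fits either the APB or the MPB target well can be chosen, which is one plausible reading of how multiple targets soften a prediction error.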