Facebook Instagram Twitter RSS Feed PodBean Back to top on side

Development of the Slovak HMM-Based TTS System and Evaluation of Voices in Respect to the Used Vocoding Techniques

In: Computing and Informatics, vol. 35, no. 6
Martin Sulír - Jozef Juhár - M. Rusko

Details:

Year, pages: 2016, 1467 - 1490
Keywords:
Hidden Markov models, Slovak language, statistical parametric speech synthesis, text to speech
About article:
This paper describes the development of a Slovak text-to-speech system which applies a technique wherein speech is directly synthesized from hidden Markov models. Statistical models for Slovak speech units are trained by using the newly created female and male phonetically balanced speech corpora. In addition, contextual informations about phonemes, syllables, words, phrases, and utterances were determined, as well as questions for decision tree-based context clustering algorithms. In this paper, recent statistical parametric speech synthesis methods including the conventional, STRAIGHT and AHOcoder speech synthesis systems are implemented and evaluated. Objective evaluation methods (mel-cepstral distortion and fundamental frequency comparison) and subjective ones (mean opinion score and semantically unpredictable sentences test) are carried out to compare these systems with each other and evaluation of their overall quality. The result of this work is a set of text to speech systems for Slovak language which are characterized by very good intelligibility and quite good naturalness of utterances at the output of these systems. In the subjective tests of intelligibility the STRAIGHT based female voice and AHOcoder based male voice reached the highest scores.
How to cite:
ISO 690:
Sulír, M., Juhár, J., Rusko, M. 2016. Development of the Slovak HMM-Based TTS System and Evaluation of Voices in Respect to the Used Vocoding Techniques. In Computing and Informatics, vol. 35, no.6, pp. 1467-1490. 1335-9150.

APA:
Sulír, M., Juhár, J., Rusko, M. (2016). Development of the Slovak HMM-Based TTS System and Evaluation of Voices in Respect to the Used Vocoding Techniques. Computing and Informatics, 35(6), 1467-1490. 1335-9150.