Normal view MARC view ISBD view

Quality of Synthetic Speech : Perceptual Dimensions, Influencing Factors, and Instrumental Assessment.

By: Hinterleitner, Florian.
Material type: TextTextSeries: eBooks on Demand.T-Labs Series in Telecommunication Services: Publisher: Singapore : Springer Singapore, 2017Copyright date: ©2017Description: 1 online resource (170 pages).Content type: text Media type: computer Carrier type: online resourceISBN: 9789811037344.Subject(s): Computer scienceGenre/Form: Electronic books.Additional physical formats: Print version:: Quality of Synthetic Speech : Perceptual Dimensions, Influencing Factors, and Instrumental AssessmentDDC classification: 005.437 Online resources: Click here to view this ebook.
Contents:
Acknowledgements -- Contents -- Acronyms -- Abstract -- 1 Introduction -- 1.1 Motivation -- 1.2 Outline -- References -- 2 Speech Synthesis -- 2.1 Setup of a Speech Synthesizer -- 2.1.1 Natural Language Processing (NLP) -- 2.1.2 Prosody Generation -- 2.1.3 Concatenation and Generation of Speech-Signal Parameters -- 2.1.4 Speech Signal Generation -- 2.2 The Mary Text-to-Speech System (MaryTTS) -- References -- 3 Auditory and Instrumental Quality Evaluation Metrics -- 3.1 What Is Perceptual Quality? -- 3.2 Taxonomy for the Quality Assessment of Synthetic Speech -- 3.2.1 Glass Box Versus Black Box -- 3.2.2 Laboratory Versus Field Studies -- 3.2.3 Linguistic Versus Acoustic -- 3.2.4 Auditory Versus Instrumental -- 3.3 Auditory Quality Evaluation Metrics -- 3.3.1 Functional TestsThe content of this section has previously been published in a slightly different version in [6]. -- 3.3.2 Judgment TestsParts of the content of this section have previously been published in a slightly different version in [13] and [6]. -- 3.4 Instrumental Quality Evaluation Metrics -- 3.4.1 Reference-Based MeasuresParts of the content of this section have previously been published in a slightly different version in [21]. -- 3.4.2 Reference-Free Measures -- References -- 4 Perceptual Quality Dimensions -- 4.1 State-of-the-Art Perceptual Quality DimensionsParts of the content of this section have previously been published in a slightly different version in [1]. -- 4.1.1 Study: Kraft and Portele (Kraft1995) -- 4.1.2 Study: Mayo et al. I (Mayo2005) -- 4.1.3 Study: Viswanathan and Viswanathan (Vis2005) -- 4.1.4 Study: Seget (Seget2007) -- 4.1.5 Study: Hinterleitner (Hint2010) -- 4.1.6 Study: Mayo et al. II (Mayo2011) -- 4.1.7 Restrictions of Discussed Studies.
4.2 Semantic Differential and Factor AnalysisParts of the content of this section have previously been published in a slightly different version in [13]. -- 4.2.1 Experimental Setup -- 4.2.2 Statistical Analysis -- 4.3 Sorting Task and Multidimensional ScalingParts of the content of this section have previously been published in a slightly different version in [16]. -- 4.3.1 Experimental Setup -- 4.3.2 Statistical Analysis -- 4.4 Summary of the SD/FA and ST/MDS StudiesParts of the content of this section have previously been published in a slightly different version in [16]. -- 4.5 4.5 Universal Perceptual Quality Dimensions -- 4.5.1 Naturalness of Voice -- 4.5.2 Prosodic Quality -- 4.5.3 Fluency and Intelligibility -- 4.5.4 Absence of Disturbances -- 4.5.5 Calmness -- 4.5.6 Instructions for TTS Quality Assessment -- 4.6 Summary -- References -- 5 Influencing Factors on Perceptual Quality -- 5.1 Influence of the ApplicationParts of the content of this section have previously been published in a slightly different version in [1]. -- 5.1.1 Pretest -- 5.1.2 Main TestThe content of this section has previously been published in a slightly different version in [10]. -- 5.1.3 Conclusions -- 5.2 Influence of a Speakers VoiceThe content of this section has previously been published in a slightly different version in [12]. -- 5.2.1 Experimental Setup -- 5.2.2 Statistical Analysis -- 5.2.3 Conclusions -- 5.3 Influence of Corpus Size and UtteranceParts of the content of this section have been submitted for publishing in a slightly different version [16]. -- 5.3.1 Experimental Setup -- 5.3.2 Statistical Analysis -- 5.3.3 Conclusions -- 5.4 Summary -- References -- 6 Instrumental Quality Assessment -- 6.1 Reference-Based Measures -- 6.1.1 State of the Art.
6.1.2 Quality PredictionThe content of this section has previously been published in a slightly different version in [6]. -- 6.1.3 Conclusions -- 6.2 Reference-Free Measures -- 6.2.1 State of the Art -- 6.2.2 Linear Regression Models -- 6.2.3 Conclusions -- 6.3 Summary -- References -- 7 Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System -- 7.1 Regular Perception Model (RPM) -- 7.1.1 Model Training -- 7.1.2 Results -- 7.2 Unit Selection Voice Creation in MaryTTS -- 7.2.1 Test Database -- 7.2.2 Generation of Alternative Versions -- 7.3 Experimental Setup -- 7.3.1 Pairwise Comparison (PC) -- 7.3.2 Semantic Differential (SD) -- 7.3.3 Test Procedure -- 7.4 Statistical Analysis -- 7.4.1 PC Data -- 7.4.2 SD Data -- 7.4.3 Discussion -- 7.5 Quality Prediction -- 7.5.1 RPM1 Performance -- 7.5.2 RPM2 Training -- 7.5.3 RPM2 Performance -- 7.6 Automatic Selection of Alternative Versions -- 7.7 Potential Improvements -- 7.8 Summary -- References -- 8 Conclusions and Future Work -- 8.1 Summary -- 8.2 Conclusions -- 8.3 Future Work -- 8.3.1 Perceptual Quality Dimensions -- 8.3.2 Influencing Factors -- 8.3.3 Instrumental Quality Measurement -- Appendix Statistical Analysis of Chap.7 -- A.1 Results of the PC Test -- A.2 Results of the SD.
Tags from this library: No tags from this library for this title. Log in to add tags.
Item type Current location Call number URL Status Date due Barcode
Electronic Book UT Tyler Online
Online
TA1-2040 (Browse shelf) http://ebookcentral.proquest.com/lib/uttyler/detail.action?docID=4838376 Available EBC4838376

Acknowledgements -- Contents -- Acronyms -- Abstract -- 1 Introduction -- 1.1 Motivation -- 1.2 Outline -- References -- 2 Speech Synthesis -- 2.1 Setup of a Speech Synthesizer -- 2.1.1 Natural Language Processing (NLP) -- 2.1.2 Prosody Generation -- 2.1.3 Concatenation and Generation of Speech-Signal Parameters -- 2.1.4 Speech Signal Generation -- 2.2 The Mary Text-to-Speech System (MaryTTS) -- References -- 3 Auditory and Instrumental Quality Evaluation Metrics -- 3.1 What Is Perceptual Quality? -- 3.2 Taxonomy for the Quality Assessment of Synthetic Speech -- 3.2.1 Glass Box Versus Black Box -- 3.2.2 Laboratory Versus Field Studies -- 3.2.3 Linguistic Versus Acoustic -- 3.2.4 Auditory Versus Instrumental -- 3.3 Auditory Quality Evaluation Metrics -- 3.3.1 Functional TestsThe content of this section has previously been published in a slightly different version in [6]. -- 3.3.2 Judgment TestsParts of the content of this section have previously been published in a slightly different version in [13] and [6]. -- 3.4 Instrumental Quality Evaluation Metrics -- 3.4.1 Reference-Based MeasuresParts of the content of this section have previously been published in a slightly different version in [21]. -- 3.4.2 Reference-Free Measures -- References -- 4 Perceptual Quality Dimensions -- 4.1 State-of-the-Art Perceptual Quality DimensionsParts of the content of this section have previously been published in a slightly different version in [1]. -- 4.1.1 Study: Kraft and Portele (Kraft1995) -- 4.1.2 Study: Mayo et al. I (Mayo2005) -- 4.1.3 Study: Viswanathan and Viswanathan (Vis2005) -- 4.1.4 Study: Seget (Seget2007) -- 4.1.5 Study: Hinterleitner (Hint2010) -- 4.1.6 Study: Mayo et al. II (Mayo2011) -- 4.1.7 Restrictions of Discussed Studies.

4.2 Semantic Differential and Factor AnalysisParts of the content of this section have previously been published in a slightly different version in [13]. -- 4.2.1 Experimental Setup -- 4.2.2 Statistical Analysis -- 4.3 Sorting Task and Multidimensional ScalingParts of the content of this section have previously been published in a slightly different version in [16]. -- 4.3.1 Experimental Setup -- 4.3.2 Statistical Analysis -- 4.4 Summary of the SD/FA and ST/MDS StudiesParts of the content of this section have previously been published in a slightly different version in [16]. -- 4.5 4.5 Universal Perceptual Quality Dimensions -- 4.5.1 Naturalness of Voice -- 4.5.2 Prosodic Quality -- 4.5.3 Fluency and Intelligibility -- 4.5.4 Absence of Disturbances -- 4.5.5 Calmness -- 4.5.6 Instructions for TTS Quality Assessment -- 4.6 Summary -- References -- 5 Influencing Factors on Perceptual Quality -- 5.1 Influence of the ApplicationParts of the content of this section have previously been published in a slightly different version in [1]. -- 5.1.1 Pretest -- 5.1.2 Main TestThe content of this section has previously been published in a slightly different version in [10]. -- 5.1.3 Conclusions -- 5.2 Influence of a Speakers VoiceThe content of this section has previously been published in a slightly different version in [12]. -- 5.2.1 Experimental Setup -- 5.2.2 Statistical Analysis -- 5.2.3 Conclusions -- 5.3 Influence of Corpus Size and UtteranceParts of the content of this section have been submitted for publishing in a slightly different version [16]. -- 5.3.1 Experimental Setup -- 5.3.2 Statistical Analysis -- 5.3.3 Conclusions -- 5.4 Summary -- References -- 6 Instrumental Quality Assessment -- 6.1 Reference-Based Measures -- 6.1.1 State of the Art.

6.1.2 Quality PredictionThe content of this section has previously been published in a slightly different version in [6]. -- 6.1.3 Conclusions -- 6.2 Reference-Free Measures -- 6.2.1 State of the Art -- 6.2.2 Linear Regression Models -- 6.2.3 Conclusions -- 6.3 Summary -- References -- 7 Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System -- 7.1 Regular Perception Model (RPM) -- 7.1.1 Model Training -- 7.1.2 Results -- 7.2 Unit Selection Voice Creation in MaryTTS -- 7.2.1 Test Database -- 7.2.2 Generation of Alternative Versions -- 7.3 Experimental Setup -- 7.3.1 Pairwise Comparison (PC) -- 7.3.2 Semantic Differential (SD) -- 7.3.3 Test Procedure -- 7.4 Statistical Analysis -- 7.4.1 PC Data -- 7.4.2 SD Data -- 7.4.3 Discussion -- 7.5 Quality Prediction -- 7.5.1 RPM1 Performance -- 7.5.2 RPM2 Training -- 7.5.3 RPM2 Performance -- 7.6 Automatic Selection of Alternative Versions -- 7.7 Potential Improvements -- 7.8 Summary -- References -- 8 Conclusions and Future Work -- 8.1 Summary -- 8.2 Conclusions -- 8.3 Future Work -- 8.3.1 Perceptual Quality Dimensions -- 8.3.2 Influencing Factors -- 8.3.3 Instrumental Quality Measurement -- Appendix Statistical Analysis of Chap.7 -- A.1 Results of the PC Test -- A.2 Results of the SD.

Description based on publisher supplied metadata and other sources.

There are no comments for this item.

Log in to your account to post a comment.