Abstract:
A text-to-speech (TTS) synthesizer is a computer-based system that reads arbitrary text and converts it into speech resembling, as closely as possible, that of a native speaker of the language. This thesis describes the
first TTS system for the Tigrigna language, built on a speech synthesis architecture implemented in Matlab.
The TTS system is based on concatenative synthesis and applies the linear predictive coding (LPC) technique.
The conversion from input text to an acoustic waveform is performed in a sequence of steps, each handled by a
functional component. The performance of the system is measured, and the quality of the synthesized speech is
assessed in terms of intelligibility and naturalness. The synthesizer is evaluated at two levels: the word level
and the sentence level. At the word level, the output is evaluated with the online NeoSpeech tool; most of the
words are recognizable, and the overall performance of the system is found to be 78%. At the sentence level,
intelligibility and naturalness are rated on the mean opinion score (MOS) scale, and the overall scores are
found to be 3.28 and 3.27, respectively. These values of performance, intelligibility, and naturalness are
encouraging and show that diphone speech units are good candidates for developing a fully functional speech
synthesizer. There is still room for improvement: a text analyzer that handles the zonal dialects of the
language and a prosody generator both need further investigation.
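
As a rough illustration of how a MOS figure of the kind reported in this abstract is typically obtained, the sketch below averages listener ratings given on the usual 1 to 5 scale. The function name `mean_opinion_score` and the ratings are hypothetical, chosen only for illustration; they are not the data collected in this thesis, and the thesis implementation itself is in Matlab.

```python
from statistics import mean


def mean_opinion_score(ratings):
    """Average listener ratings on the 1-5 MOS scale, rounded to two decimals.

    Hypothetical helper for illustration; not taken from the thesis.
    """
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r < 1 or r > 5 for r in ratings):
        raise ValueError("MOS ratings must lie between 1 and 5")
    return round(mean(ratings), 2)


# Made-up ratings from five imaginary listeners (assumption, not thesis data).
intelligibility_ratings = [3, 4, 3, 3, 4]
naturalness_ratings = [3, 3, 4, 3, 3]

print("intelligibility MOS:", mean_opinion_score(intelligibility_ratings))
print("naturalness MOS:", mean_opinion_score(naturalness_ratings))
```

Each listener scores every synthesized sentence once per criterion, and the score reported for a criterion is the mean over all listeners and sentences.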