![]() Still, this does not meet our vision of speech quality. Next was Pico2Wave, which still uses the same technique as eSpeak, but on a higher level. Unfortunately, it is not currently available from Fedora Linux’s default repositories. But it’s still far from being “good”, as it relies on eSpeak. It produces more understandable audio than eSpeak. One of the first speech engines I tried was MBROLA. It sounds great, but it requires your cellphone and a special server application. □ One outputs the text via the Samsung TTS engine on an Android device. Over the last year, we’ve experimented with several TTS systems. Others produce MP3 or WAV files which you can then play with, for example, sox. Some systems output the waveform directly. Text-To-Speech systems translate the written text to a listenable waveform. But today we can choose (even by voice command □) what speech engine we want to use. But trust me, Fedora Linux can do better. □ Naturally, eSpeak was the first choice because it was installed on the system by default. You might expect it to sound like a creepy robot voice from the 1960s when it was invented. The time of eSpeak has run outĪ voice assistant doesn’t just react to your speech. But don’t be afraid, it is all described in the READ.ME and it is pretty simple. For now, because a few dependencies are unavailable from Fedora Linux’s default repositories, you will need to install the system manually. There are videos out about Carola which show these kinds of interactions in reality. I will use rough translations here which should work once someone helps with translating the config. Note: This PVA has no English translations yet because it was developed in German. But back to why I chose “Carola” - it was the most recognizable by the STT system I was using at the time. □ If you have a smart-home (which I haven’t □) you can even switch on the light in the kitchen. Today, you can tell him/her/it to start apps, redirect audio streams, control audio and video playback, call someone, handle incoming calls, and more. After a few more days of work, it was capable of executing more complex commands. A few hours and a hundred lines of Java code later, I could give my PC simple commands. But there was no use-case except for writing what one said down to the screen. In 2021 I read an article about the Speech-To-Text (STT) system Vosk and started to play a bit with it. the name, the default TTS engine, and the default apps). Some of them can even be changed by voice command (e.g. These settings can be personalized for each user. ![]() You can also configure other things such as your location, which applications to use by default when opening media files, what CardDAV server to use when looking up contact information, etc. It is similar to “Alexa” or “OK, Google” for those who are familiar with those products. It’s the keyword the PVA reacts to by default. The first thing people often ask is, “Why did you name it Carola?” □ This is a common misconception. For Linux users, this is now possible by installing “Carola” as your Personal Voice Assistant (PVA). What sounded like Sci-Fi twenty years ago is now a reality for many PC users. “Carola, I wanna hear hurricanes.” My HD LED starts to blink for a second and I hear Carola saying “I found one match” and the music starts. While we are waiting on a Titan to bridge us to the fighting area, a tune comes to my mind. We want to brawl with Red Alliance over some systems they attacked the usual stuff on a Friday evening in EVE. I sit down at my Fedora Linux PC and start a MMORPG.
0 Comments
Leave a Reply. |