Fl4m3Ph03n1x

Fl4m3Ph03n1x

Are there any text-to-speech ai tools available using elixir?

Background

As I often do, I read books to learn and improve myself. I also enjoy teaching and helping others when I can, so this is something I try to do regularly.

However, as time goes by (and so do the years) I have slowly come to realize it is becoming increasingly harder not only to maintain focus when reading/studying a new topic, I also forget things faster.

It is not an uncommon occurrence to re-read an “old” article or book, and to become surprised with things I had learned before. I have come to the inevitable conclusion this will only happen more frequently with time.

Although I do enjoy reading and learning, if I keep reading and the learning the same things over and over again, I will not evolve. Eventually I will gradually lose my ability to help others in topics of relevance, something I cannot live with.

So I decided to take a new approach. Re-learning old concepts is now something I have accepted, but I do get to choose how I re-learn them. I have decided to make a series of instructional videos (10-15) minutes long, freely available on youtube, free lectures for all to see.

Not only does this achieve my end goal of helping others more efficiently, it is also a quick and convenient way of re-learning things for myself. It also allows me to add my opinions, experiences and additional context, which is something I value immensely.

Problem

I have several scripts done, and I have power-point like presentations done as well for them. The issue now is that in order to make videos, I need to record sound. I have it from several sources, that my voice is … not ideal. Not only that, should my initiative succeed and my channel gather traction, I will be barred from certain jobs in the country I live because it is a common occurrence for managers and bosses to “avoid hiring known people”, for company political reasons.

In order to protect myself and to create videos that are somewhat acceptable I have decided to use text-to-speech audio generation.

However, all the tools I have found so far are in Python. Many of them are abandoned and even when they are not, there are always dependency issues with the likes of PyTorch and Tensor and so on.

I have not yet seen a Python project where things just work because there is always a dependency issue somewhere. Even the creators of some of those tools acknowledge this. To make things even more complex, some of these tools require specifically NVIDIA GPUs, which I don’t have.

Question

So I was wondering if there are out there any Elixir tts generation tools that one could use. Because Elixir uses BEAM, I assume requiring an NVIDIA GPU is not a necessity. Furthermore, I also assume I wouldn’t have to constantly fight with Python dependencies.

Are there any tools in Elixir that follow this requirements? Or is it too early in the Elixir ecosystem for such tools to exist yet?

Marked As Solved

Fl4m3Ph03n1x

Fl4m3Ph03n1x

As discussed in the Elixir forums, at this point in time the ecosystem is not yet mature enough and such tools do not yet exist.

However, I was able to bypass this by using Google Colab with kokoro-tts: hexgrad/Kokoro-82M · Hugging Face

Source:

This solves the issue and I don’t need to run anything locally!

Also Liked

Fl4m3Ph03n1x

Fl4m3Ph03n1x

Do you mean this Piper ? I didn’t find any mention of Rust in the README, and the files are mostly written in C++ and Python. What am I missing here?

As for Voxygen, I was not able to find anything except for an SaaS service which I have to pay to use. Can you link the project?

Thanks for the reply!

Scarlet

Scarlet

Elixir’s ecosystem is still young when it comes to TTS, and I’m not aware of any mature, high-quality TTS tools built natively for BEAM. You might have better luck interfacing with external libraries via Ports or NIFs, or using HTTP APIs to call TTS services from within an Elixir app. If avoiding Python entirely is the goal, you could check out Rust-based TTS engines like Voxygen or Piper and see if they can be wrapped into an Elixir-friendly format. Otherwise, the Elixir community might need more time before native TTS solutions emerge.

Where Next?

Popular Backend topics Top

chasekaylee
Hi there everyone! Recently, I have fallen in love with programming with Elixir and have been having so much fun with it. I have been do...
New
Rainer
Is there somewhere a good introduction to rust for experienced programmers (with years of C++/C#/Java experience)? Wanted to give it a t...
New
jaimeiniesta
I maintain a project that lists hundreds of thousands of web pages, and I’d like to show a screenshot for each web page. There are alread...
New
Fl4m3Ph03n1x
Background I am now trying Gradual type checking, as a consequence I am giving a shot to Gradient. As I see it, this is an alternative to...
New
Fl4m3Ph03n1x
Background I am trying out polymorphic typing with dialyzer. As an example I am using the famous Option type (aka, Maybe Monad) that is n...
New
JimmyCarterSon
Hello, I am working on a new application with Elixir, Dish_out. I want to see Data I follow this tutorial with Elixir Casts. However, I ...
New
sona11
I’m having a difficulty. I want to modify an attribute’s data type from String to Array. { “id”: “trn:tarb:tradingpartner:uuid:00000...
New
sona11
If isReachable throws an IOException in Java, what is the right step to do and why? The application, I believe, should halt the process ...
New
Fl4m3Ph03n1x
Background I have a release file inside a tarball. However I want the final release to have some additional files and to move things aro...
New
Fl4m3Ph03n1x
Background As I often do, I read books to learn and improve myself. I also enjoy teaching and helping others when I can, so this is somet...
New

Other popular topics Top

AstonJ
SpaceVim seems to be gaining in features and popularity and I just wondered how it compares with SpaceMacs in 2020 - anyone have any thou...
New
Rainer
My first contact with Erlang was about 2 years ago when I used RabbitMQ, which is written in Erlang, for my job. This made me curious and...
New
gagan7995
API 4 Path: /user/following/ Method: GET Description: Returns the list of all names of people whom the user follows Response [ { ...
New
AstonJ
Was just curious to see if any were around, found this one: I got 51/100: Not sure if it was meant to buy I am sure at times the b...
New
PragmaticBookshelf
Author Spotlight Jamis Buck @jamis This month, we have the pleasure of spotlighting author Jamis Buck, who has written Mazes for Prog...
New
New
PragmaticBookshelf
Develop, deploy, and debug BEAM applications using BEAMOps: a new paradigm that focuses on scalability, fault tolerance, and owning each ...
New
AstonJ
If you’re getting errors like this: psql: error: connection to server on socket “/tmp/.s.PGSQL.5432” failed: No such file or directory ...
New
PragmaticBookshelf
Get the comprehensive, insider information you need for Rails 8 with the new edition of this award-winning classic. Sam Ruby @rubys ...
New
AstonJ
Curious what kind of results others are getting, I think actually prefer the 7B model to the 32B model, not only is it faster but the qua...
New