Fl4m3Ph03n1x

Fl4m3Ph03n1x

Are there any text-to-speech ai tools available using elixir?

Background

As I often do, I read books to learn and improve myself. I also enjoy teaching and helping others when I can, so this is something I try to do regularly.

However, as time goes by (and so do the years) I have slowly come to realize it is becoming increasingly harder not only to maintain focus when reading/studying a new topic, I also forget things faster.

It is not an uncommon occurrence to re-read an “old” article or book, and to become surprised with things I had learned before. I have come to the inevitable conclusion this will only happen more frequently with time.

Although I do enjoy reading and learning, if I keep reading and the learning the same things over and over again, I will not evolve. Eventually I will gradually lose my ability to help others in topics of relevance, something I cannot live with.

So I decided to take a new approach. Re-learning old concepts is now something I have accepted, but I do get to choose how I re-learn them. I have decided to make a series of instructional videos (10-15) minutes long, freely available on youtube, free lectures for all to see.

Not only does this achieve my end goal of helping others more efficiently, it is also a quick and convenient way of re-learning things for myself. It also allows me to add my opinions, experiences and additional context, which is something I value immensely.

Problem

I have several scripts done, and I have power-point like presentations done as well for them. The issue now is that in order to make videos, I need to record sound. I have it from several sources, that my voice is … not ideal. Not only that, should my initiative succeed and my channel gather traction, I will be barred from certain jobs in the country I live because it is a common occurrence for managers and bosses to “avoid hiring known people”, for company political reasons.

In order to protect myself and to create videos that are somewhat acceptable I have decided to use text-to-speech audio generation.

However, all the tools I have found so far are in Python. Many of them are abandoned and even when they are not, there are always dependency issues with the likes of PyTorch and Tensor and so on.

I have not yet seen a Python project where things just work because there is always a dependency issue somewhere. Even the creators of some of those tools acknowledge this. To make things even more complex, some of these tools require specifically NVIDIA GPUs, which I don’t have.

Question

So I was wondering if there are out there any Elixir tts generation tools that one could use. Because Elixir uses BEAM, I assume requiring an NVIDIA GPU is not a necessity. Furthermore, I also assume I wouldn’t have to constantly fight with Python dependencies.

Are there any tools in Elixir that follow this requirements? Or is it too early in the Elixir ecosystem for such tools to exist yet?

Popular Backend topics Top

PragmaticBookshelf
Dig under the surface and explore Ruby’s most advanced feature: a collection of techniques and tricks known as metaprogramming. Pa...
New
pillaiindu
Cross posting from HashNode. A friend of mine is creating Uber-like app for a small company with 200 to 1000 cars. The app will operate ...
New
ErlangSolutions
Hi this week it’s Code Mesh V when London’s leading alternative tech conference goes virtual for the first time. Talks are set for both U...
New
First poster: bot
Node.js v12.20.0, v15.3.0 and v14.15.1 has been released. Link: Release 2020-11-24, Version 12.20.0 'Erbium' (LTS), @mylesborins · node...
New
PragmaticBookshelf
Build highly interactive applications without ever leaving Elixir, the way the experts do. Let LiveView take care of performance, scalabi...
New
AstonJ
More cool tech inspired by <span class="hashtag-icon-placeholder"></span>erlang :nerd_face: By combining the fault-tolerance and massiv...
New
New
PragmaticBookshelf
Leverage Elixir and the Nx ecosystem to build intelligent applications that solve real-world problems in computer vision, natural languag...
New
PragmaticBookshelf
Develop, deploy, and debug BEAM applications using BEAMOps: a new paradigm that focuses on scalability, fault tolerance, and owning each ...
New
AstonJ
If you’re getting errors like this: psql: error: connection to server on socket “/tmp/.s.PGSQL.5432” failed: No such file or directory ...
New

Other popular topics Top

Exadra37
I am thinking in building or buy a desktop computer for programing, both professionally and on my free time, and my choice of OS is Linux...
New
AstonJ
Thanks to @foxtrottwist’s and @Tomas’s posts in this thread: Poll: Which code editor do you use? I bought Onivim! :nerd_face: https://on...
New
AstonJ
If you are experiencing Rails console using 100% CPU on your dev machine, then updating your development and test gems might fix the issu...
New
New
PragmaticBookshelf
A Hero’s Journey with Chris Pine @chrispine Chris Pine, author of Learn to Program, Third Edition, discusses his journey to beco...
New
PragmaticBookshelf
Author Spotlight Rebecca Skinner @RebeccaSkinner Welcome to our latest author spotlight, where we sit down with Rebecca Skinner, auth...
New
AstonJ
Chris Seaton, the creator of TruffleRuby has died. It appears from suicide :cry: He left this note on Twitter on the weekend: And one...
New
DevotionGeo
I have always used antique keyboards like Cherry MX 1800 or Cherry MX 8100 and almost always have modified the switches in some way, like...
New
PragmaticBookshelf
A Ruby-Centric Chat with Noel Rappin @noelrappin Once you start noodling around with Ruby you quickly figure out, as Noel Rappi...
New
AstonJ
This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...
New