this post was submitted on 13 May 2025
258 points (100.0% liked)

Technology

69999 readers
4475 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 26 points 3 hours ago (3 children)

trained on stolen books? then I guess I can download these from anywhere I may find for free as well, right?

[–] [email protected] 1 points 6 minutes ago

Well, yeah, you can. Whoever told you that you can't, don't believe them, they are probably being payed to say it. You could also pay for the book to support the author but most likely your money will not go to the author so don't bother.

[–] [email protected] 2 points 2 hours ago

I like your way of thinking

[–] [email protected] 10 points 3 hours ago

It's Amazon, what did you expect? Enshittification and monopoly abuse, no surprise.

[–] [email protected] 1 points 3 hours ago

So you can take the square root of that:

5x+7integral from 5z to 9x derivative of deltaT minus minus multiply times 3. Figure 1

Figure 1 shows a typical lizard living in a square root.

[–] [email protected] 57 points 10 hours ago (4 children)

I can get that for free. There are apps that will read an ebook to you already. The whole point of paying the premium on audible is the superior reading/acting. Not put up with mispronounced words, weird cadence and an inability to handle acronyms

[–] [email protected] 1 points 2 hours ago

I've tried one that works surprisingly well. Each sentence had great pacing, cadence, and correct enunciation- even had tone right when someone was shouting or angry or sad.

I wouldn't really recommend it, though. While I couldn't pick any single thing out that was wrong, overall it just didn't quite flow. It's like watching someone try to act that is technically doing everything right, but it just isn't good. It basically didn't understand the greater context of the story and was saying lines.

It was uncanny valley, but exclusively with voice.

[–] [email protected] 1 points 3 hours ago* (last edited 3 hours ago) (2 children)

Is there an offline tool that generates realistic audio for epubs as Mp3 ? Something like the free Ai tool, Vibe which is for transcription. Is there something similar for TTS, runs locally without complicated setup ( most are complicated using python and etc just for installation)

[–] [email protected] 1 points 2 hours ago* (last edited 2 hours ago) (1 children)

I've loaded epubs into the app ReadEra, which lets you read it like any other novel app or will, in real time, read it to you. It's not the most natural of speech, but was good enough for my commute when I was in the midst of a compelling book.

[–] [email protected] 1 points 1 hour ago (1 children)

Download TTS Server, and change the engine in Readera to use it. Use the Microsoft Azure settings in TTS, much more realistic. Little slow though is my only complaint as it sends/receives a paragraph at time, resulting in a pause now and again.

[–] [email protected] 1 points 31 minutes ago

How do I do that? Have both readera and tts server on a Samsung Galaxy

[–] [email protected] 1 points 2 hours ago

Great question! I need to come back to this thread to see if something is suggested.

[–] [email protected] 1 points 3 hours ago (1 children)

Looking for iOS recommendations, preferably without a subscription that can read epub/pdf

[–] [email protected] 2 points 2 hours ago

I'm an android user, so not sure if it's on iOS but I've used ReadEra

[–] [email protected] 1 points 3 hours ago

I thought people mainly paid for the large library

[–] [email protected] 19 points 12 hours ago* (last edited 12 hours ago) (3 children)

This is clearly the future despite the outrage here.

There are at least 389 living languages with over 1M speakers. That alone means it's impossible to reach some people and they get left out. Most of these languages dont even have enough professional voice actors to cover the bandwidth.

There are thousands of books released every year. That's impossible to cover even in English alone.

Its an objective net good to have more accessible audio books and the privileged people who do care about this stuff can very much afford to vote with their wallets for non-ai voices.

In fact since AI moat is so minimal this will very quickly be adapted by open source solution providing audio book access to millions if not billions of people to whom this was not an option. Its amazing.

[–] [email protected] 2 points 3 hours ago

dont even have enough professional voice actors to cover the bandwidth

I'm pretty sure they'd be a lot more people ready to do that job if there was a good remuneration. Heck that sounds a lot more fun that a LOT of jobs out there!

[–] [email protected] 12 points 7 hours ago (2 children)

Most of these languages dont even have enough professional voice actors to cover the bandwidth.

And you think anyone is training AI voice models for those languages? Have you even seen how long it takes even large companies like Google to support the languages with hundreds of millions of speakers?

[–] [email protected] 2 points 3 hours ago* (last edited 3 hours ago) (1 children)

That's the benefit of using AI and machine learning - once you have enough source material, you can throw it all in and it'll eventually spit out a model.
Which is exactly what Meta did with their Massively Multilingual Speech project which supports text-to-speech and speech-to-text for 1107 different languages.

Is it actually any good in 99% of them, I don't have a clue, but it exists.

[–] [email protected] 1 points 3 hours ago

Seems more like a proof of concept project for that paper than something they are pursuing seriously judging by the GitHub location in some example folder that hasn't seen any significant updates in over a year. If it is so great I would assume they would pursue it more actively and replace existing models with it two years later.

[–] [email protected] 3 points 6 hours ago (2 children)

It becomes easier and cheaper every day. Today's open source LLMs are better than last year's best model.

[–] [email protected] 4 points 3 hours ago

Is it? I just tried again yesterday for a simple script since coding is the one thing apparently AI will replace people like me and it could not put together a working JavaScript script.

I have yet to see tangible results not announced by the people with sunken cost exploding their balls.

[–] [email protected] 5 points 5 hours ago

You're fundamentally misunderstanding the comment you replied to, they are not saying that voice AI are bad, they are saying there is not enough training data to improve the AI for these languages. How will it improve without good training data?

[–] [email protected] 1 points 7 hours ago

but for a service like audible.

[–] [email protected] 9 points 11 hours ago

Is voice AI trained on stolen data? I was under the impression that was LLMs.

[–] [email protected] 16 points 16 hours ago (2 children)
[–] [email protected] 15 points 11 hours ago

YouTube is crawling with it. It's unlistenable shit. The prosody is badly implemented, pronunciation is infuriatingly bad, and a lot of the text that these TTS are reading appears to be AI-generated. Otherwise, already dire standards of literacy are getting worse at an accelerating rate.

[–] [email protected] 26 points 15 hours ago

And it's shit

[–] [email protected] 116 points 22 hours ago (31 children)

No publisher is going to pay a professional to narrate their audiobooks when they can have AI do a shitty job for much less.

A shitty narrator can get me to hate a book I like. A great narrator can bring the characters to life, enhance the experience, and turn me from a listener to a fan. I've searched for books by narrators like Nick Podehl and Jeff Hayes and bought audiobooks I wouldn't have otherwise.

[–] [email protected] 1 points 4 hours ago (1 children)

I'm not sure why AI would automatically mean it's doing a shitty job.

[–] [email protected] 4 points 3 hours ago

Because... the tool has no understanding of anything? It reads written words, yes, but no intention, no cultural context, no intonation. Unless everything is spelled out like a script, then it will not sound great, would it?

[–] [email protected] 3 points 9 hours ago

Nick Podehl is such an amazing narrator. The voices and performance are amazing.

I've been slowly getting through the Kel Kade books and the narration just makes it for me

load more comments (29 replies)
[–] Big_Boss_77 64 points 21 hours ago (6 children)

This is dumb as hell... if I wanted AI to read a book poorly to me, I'd just use screen reading accessibility features.

load more comments (6 replies)
[–] [email protected] 2 points 11 hours ago

Beautiful, it works. Why not.

[–] [email protected] 28 points 20 hours ago

Meanwhile I unveil a plan to continue not giving a goddamn cent to J Bozo. Ever.

load more comments
view more: next ›