One of the currently rather strong signals for AI-voiceovers on #YouTube videos is unusual pacing and pauses in narrations.
Post
One of the currently rather strong signals for AI-voiceovers on #YouTube videos is unusual pacing and pauses in narrations.
@lauren - A Google Gemini produced conversation I heard had one 'commentator' respond to another with the words, "polite chuckle" before continuing.
@lauren also, mispronounced words and names accompanying the dead delivery and everything else mentioned here.
@ianto_jones You can't necessarily depend on dead delivery. Stress intonations are getting much more realistic. It doesn't take all that much. Hell, I could do it with the Votrax speech synth I used at UCLA for Touch-Tone UNIX about 1000 years ago.
@lauren I listened to a couple when looking for reviews on air fryers and the pronunciation of Fahrenheit was incredibly wrong but consistent.
@lauren And completely dead delivery.
I was watching a video by Adam Rieman the other day. Australian bloke talking in his helmet, grunting heavy breathing while riding way too big a motorcycle way too hardcore offroad.
That was the original. The AI translation, 100% dead delivery, as if it was sitting in a comfy chair reading out the phone book.
Yeah nah, that does not work.
@Tubemeister I'm not really critical that way of auto-translations. That's a whole 'nuther situation and they can be extremely handy even in their most basic forms. I'm really talking about the primary audio track in the original language as used by the video creator.
@lauren As a tool, those translations can be useful even if they're imperfect.
As the default audiotrack for where I am it was rather hopeless.
But hard agree, there's a ton of videos out there that are just AI narration, and they have that weird halting yet flat, dead narration.
Always takes me a minute to catch on somehow, and always feel sort of betrayed.
@Tubemeister I find the auto-translations (captioned and/or voice) to be useful with many technical videos that would otherwise be useless to me. I don't care if these read or sound natural.