16 comments

  • xnx 16 hours ago
    What model is this using? I've had good results with e2-ft-tts running locally via Pinokio. You can also run it online for free https://huggingface.co/spaces/mrfakename/E2-F5-TTS
    • porjo 9 hours ago
      Thanks, I got a better result with this than Anyvoice.
  • esperent 14 hours ago
    We've been advertising to get someone to take over the lease on a commercial building. Surprisingly, we've had several of what seem like very obvious scam attempts - people stringing us along, not trying to bargain (we are in a haggling country, people always try to bargain), asking us to wait unreasonable amounts of time, and finally when pressed breaking down into logical inconsistencies. So, not even good scam attempts.

    I was wondering, what's the point? I mean, it's a building. You pay money, you sign the lease (in person), you get the use of the building. No money, no building. Where's the scam opportunity?

    The only thing I can think of is that they're trying to get enough data and personal info to clone our voices and use that to try and gain access to bank accounts or to scam our relatives. Even if I'm wrong in this case, this seems like a major new vulnerability in society. I mean, if someone who sounded (and with video AI, perhaps even looked) exactly like me called up my mother and pretended I'd been violently robbed or had an accident, she'd transfer money in a heartbeat.

    I'm considering that I should set up some kind of code system with my family for this. As in, if I ever end up in a situation where I need help, I'll use a particular code phrase. If I don't know it, assume it's an AI clone.

    • 0x20cowboy 14 hours ago
      > I'm considering that I should set up some kind of code system with my family for this. As in, if I ever end up in a situation where I need help, I'll use a particular code phrase.

      You absolutely should. And include something for videos (like FaceTime calls). Especially if these members of your family are boomers+

      • esperent 11 hours ago
        Have you done it yourself? What did you use, a phrase or code, something like that?
  • delgaudm 12 hours ago
    Hey there /u/blacktechnology, could you email me a few seconds of your voice so I can upload it to this site and see how the cloning goes? I'd love to see what I could do with a copy of your voice. Kthxbye.
  • ge96 17 hours ago
    3 seconds? That's crazy

    "Huuhhhhhhhhhhh"

    I wonder what their "fox jump" sentence is

    • mk_stjames 15 hours ago
      A "Panphonic Poem" is what may do well here. As in...

        The pleasure of Shawn’s company
        Is what I most enjoy.
        He put a tack on Ms. Yancey’s chair
        When she called him a horrible boy.
        At the end of the month he was flinging two kittens
        Across the width of the room.
        I count on his schemes to show me a way now
        Of getting away from my gloom.
      
      As discussed here:

      https://literalminded.wordpress.com/2006/05/05/a-panphonic-p...

      And recited very famously, in part and slightly modified, here:

      https://www.youtube.com/watch?v=CgX4uJSj00Y

    • sailfast 16 hours ago
      Default for me was: “What a beautiful day it is today, with bright sunshine and gentle breeze. Let's talk about the future of artificial intelligence.”

      That said, I'm not going to be submitting a sample because [reasons]

  • croemer 15 hours ago
    Getting error: Failed to generate voice
    • HeatrayEnjoyer 15 hours ago
      I am hitting this error as well. I was additionally unable to create an account. Seems beta?
      • blacktechnology 14 hours ago
        fixed
        • windsignaling 6 hours ago
          Still getting the same error I was (and the same error mentioned by parent commenter) when this was first posted.
        • croemer 14 hours ago
          No, still doesn't work. Progress bar now stuck at 95% for dozens of seconds after initial progress to 95% takes only 5 seconds. Broken.
  • 0_____0 8 hours ago
    I'm surprised you were able to repost this so quickly.

    To reiterate, among my friends, if you use a tool like this to clone my voice for any reason, you are dead to me.

  • bugglebeetle 17 hours ago
    Sure, just let me submit my voice for cloning to a closed sourced, online service of unknown provenance. What could ever go wrong?
    • dvh 16 hours ago
      That's why you submit politician's voice instead
      • dunham 14 hours ago
        It would be fun to have a clone of Majel Barrett's voice for something like Siri or Alexa.
      • HanClinto 16 hours ago
        Yeah, but they have you read a specific text, so not as much of an option if you use the primary demo.

        Seems like a heck of a nice way to gather a training set! :)

        • unsnap_biceps 16 hours ago
          The "upload audio" feature doesn't require any specific text.
        • lubujackson 15 hours ago
          Cue reference to "Sneakers"...
  • croemer 14 hours ago
    The title is editorialized, it should be something like: "Anyvoice - AI Voice Cloning"
  • superkuh 16 hours ago
    I submitted an 8 second clip of speech and the resulting synthesized speech did not sound like the same voice. Too bad.
    • infogulch 16 hours ago
      I hope you have a nice voice, I'll be listening to it try to sell me an extended car warranty for the next 3 months.
  • croemer 14 hours ago
    This is almost definitely against GDPR, there's no indication whatsoever of which legal entity is holding the data and how long it is stored on which servers where.
  • krainboltgreene 16 hours ago
    Getting a 500 from the HTTP API and also there's an `debugger` in the javascript.
  • clueless 15 hours ago
    anybody try this and have a good result?
  • gamblor956 15 hours ago
    This was a great way for them to collect a lot of free voice data to train their model.
    • inerte 14 hours ago
      Every time there's a voice recognition post here someone comments about acquiring data. Why is this method better than having access to all of the video and podcasts sites on the internet?
      • rahimnathwani 14 hours ago
        You can get people to utter the same sentence.
        • gamblor956 13 hours ago
          Righto. Everyone is saying the same thing so it's the cleanest data set you can get.
  • xqcgrek2 15 hours ago
    Has anyone tried multiple iterations? That is, upload a real voice, get its synthesized version, upload synthesized version 1 to get synthesized version 2, rinse and repeat...
    • abeppu 15 hours ago
      Perhaps Alvin Lucifer reading his "I am sitting in a room" text would be ideal.
  • mxuribe 16 hours ago
    Immediately, i thought that cybersecurity is now ruined for the distant future. Imagine if you will, a starship captain ready with a plot to overcome the evil plaguing their crew...and all they need to do is over-ride the starship computer's safety controls with the captain';s own voice override authorization...but, alas, early in 2025 a tech company developed the means by which said evil entity could re-override the captain's voice auth....and block the captain's plan...thereby dooming the entire crew of the starship.

    This is why we can not have nice things; not now nor in the far off future! All of our uniqueness will be more easily duplicated. Thankfully, i won';t upload any of my voice recordings, and i will continue to walk around in my faraday cage suit. /s

    • montag 15 hours ago
      Yep, this is a real Star Trek TNG episode, S4 E3 “Brothers”
      • mxuribe 12 hours ago
        Lol, yep, good one!