• @[email protected]
    link
    fedilink
    36
    edit-2
    1 year ago

    Google: “I’m looking to make an AI that’s incredibly opinionated, confidently incorrect, and prone to circlejerk behaviors.”

    Spez: “I mean OpenAI pretty much did that already, but if you want to pay me to recreate that then I got you.”

    • @[email protected]
      link
      fedilink
      61 year ago

      One thing that doesn’t seem to get brought up is the AI porn angle. Gonewild is pretty big on reddit still. A lot of OnlyFans creators, and general perverts (respect). Reddit wants to commodify this content, so those people are selling the images of their bodies so AI can make porn of random strangers. Kinda fucked when you think about it.

    • @[email protected]
      link
      fedilink
      English
      61 year ago

      How quickly you forget that half of it is just “I also choose this guy’s wife” and “the narwhal bacon’s at midnight”

    • @[email protected]
      link
      fedilink
      English
      41 year ago

      I’m personally curious whether Reddit actually has any ability to protect that database. I don’t remember Reddit TOS, but usually those things give them license to use and copy the data, maybe even to sell it, but not actually the copyright on it. So if someone made a Reddit scraper and copied the comments, wouldn’t only the actual commenter be able to sue?

      $60M may be reflecting that, in that it’s more a convenience fee to shield Google against individual Redditors going after them than something that Reddit itself could actually sue over.

    • @[email protected]
      link
      fedilink
      121 year ago

      I wonder if Google’s unlimited legal budget plays a role. Not a lawyer, so probably way off here…

      But, for example, reddit’s success in part depends on Google ingesting their data — reddit shows up in Google searches all the time, which can only happen if Google uses reddit’s content. So reddit telling Google “you can’t use our content” doesn’t work, and they need to say something like, “you can use our content for search results but you can’t consume it as training data.”

      This is a pretty straightforward statement/request/demand, but one could imagine Google lawyers maliciously complying and throwing their hands up dramatically, claiming “well we use some amount of AI in our search results, so if we can’t use your content for AI training then we can’t risk using it for search results.” Which would, I imagine, really, really hurt reddit (no Google results would be catastrophic I suspect).

      So, perhaps the “low” 60M figure is just Google using their leverage.

      Or not. As a random person on the Internet, I can say I’m probably not contributing anything meaningful here…

    • bobburger
      link
      fedilink
      691 year ago

      To be fair it’s a pretty terrible dataset. The AI is just going to say “this” to every question you ask

    • Ebby
      link
      fedilink
      171 year ago

      Perhaps, but not worth buying if you can’t make profit or keep it from your competition.

      60M is for over almost 20 years of data, but once it’s ingested, google will only want new content. Next year, it’ll be more like 3M if the dataset isn’t poisoned by bots or the AI fad hasn’t collapsed. Reddit will struggle with finances again and users will suffer. At least that’s my prediction.

      • AwkwardLookMonkeyPuppet
        link
        fedilink
        English
        31 year ago

        the AI fad

        LOL. Do you realize that makes you sound like Boomers talking about the internet in the late 90’s and early 00’s?

        • Ebby
          link
          fedilink
          71 year ago

          Haha! Wow I guess so. I’ll keep some shelf space available in the geezer museum next to 3D TV’s, deep fakes, fidget spinners, and my pogs. :D

        • @[email protected]
          link
          fedilink
          71 year ago

          It currently looks very much like a bubble. After the dot com bubble, the internet didn’t go away, but most companies died off and all the stupid monetisation went bankrupt.

          We may be seeing something similar

      • @[email protected]
        link
        fedilink
        71 year ago

        Spez has already grifted his money out of the initial stock pump so it literally doesn’t matter. Reddit could shut down tomorrow and he’d be happy as a clam.

        • Ebby
          link
          fedilink
          2
          edit-2
          1 year ago

          Yeah, what a load. Though now they can boot his arse and save.

          Edited to remove number.

          • AwkwardLookMonkeyPuppet
            link
            fedilink
            English
            11 year ago

            I doubt he’s getting 120M per year. I think that big compensation package was a 1 time deal. That’s more than Satya Nadella makes.

            • Ebby
              link
              fedilink
              1
              edit-2
              1 year ago

              You’re right. Total compensation was $193M for 2023 but that was a lot of stock too. It may have been one time like you said now that they went public. Hopefully enough to retire haha.

    • @[email protected]
      link
      fedilink
      31 year ago

      Considering it’s all full of Nazis and bots, and if you get to filter all of them out you’re left with reposts and low quality memes followed by comments that represent the hostile side of each of us… I’d say anything over $5 is a good deal for spez.

      Now, I hope Google uses this data exclusively for detecting inappropriate answers. Can you imagine it giving answers based on the endless threads i of " I’m not your mate, bro; I’m not your bro, dude…".

  • Endorkend
    link
    fedilink
    181 year ago

    And this is how Skynet was born.

    That one Microsoft Twitter bot turned into a full blown Nazi in just one day.

    I can’t even imagine how fucked up and depraved one trained on Reddit data will get.

  • @[email protected]
    link
    fedilink
    171 year ago

    Can someone point me the way of that bot or whatever that changes all your old Reddit posts before deleting them? I thought I had it saved somewhere but I can’t find it now and have no idea what it’s called.

    • @[email protected]
      link
      fedilink
      151 year ago

      They keep copies of posts because people who mass edited their posts saw them reverted or have people reply still as if they were not edited.

      • @[email protected]
        link
        fedilink
        21 year ago

        I had read that with some people, is was a delay from their server instance between read/write and in the end the changes did end up sticking, but I don’t know if that was true. A lot of people were mass editing at the same time, and since editing isn’t something that happens super frequently, it might have less priority in the stack and caused backups.

        • @[email protected]
          link
          fedilink
          41 year ago

          They change it on their website but the data that’s collected and sold isn’t changed.

          It still devalues their google search though but also makes it harder to scrap data for free and ups the value of what they are selling.

          • @[email protected]
            link
            fedilink
            11 year ago

            For sure. They definitely have change records for everything. It would be borderline negligent if they didn’t.

  • @[email protected]
    link
    fedilink
    3
    edit-2
    1 year ago

    It’s self hosted plus open source, or barbarism.

    You think… “Y sucks, I’m switching to Z”.

    But Z just is biding its time till it can be Y. The most principled owners in existence have a finite life span. It is only a matter of time till the vultures put their claws into anything.

    Gabe is gonna die, steam is gonna be fucked. We should have all started with Good Old Games. But we didn’t. But it’s not to late to switch. To be free with your software. Don’t be like me, wishing you had been smarter before.

    It’s okay, better now then never.

    • yeehaw
      link
      fedilink
      31 year ago

      In the case of steam and gog, steam was around a lot longer before gog. I only purchase what I must on steam, and if it’s available on both I always purchase from gog. They more align with my values. But valve from my point of view is such a prosumer business I don’t mind. But you’re right. When Gabe goes, I’m willing to bet some cock sucker ceo from oracle or ibm will swoop in and fuck the whole thing up.

  • @[email protected]
    link
    fedilink
    61 year ago

    is there a way to mass delete my old content? the service i used in the past doesn’t seem to have worked. i recently got a reply from a 6 year-old post from someone saying they got there on google.

    • yeehaw
      link
      fedilink
      21 year ago

      Yes but they just reverse it. That ship has sailed.

    • @[email protected]
      link
      fedilink
      31 year ago

      My understanding is that the mass delete you did probably had worked, but reddit rolled back your deletions. I heard it happened to a lot of mass deleters after the lemmy exodus.

      • @[email protected]
        link
        fedilink
        51 year ago

        Can we still mass edit our previous comments with random stuff, a little bit at a time to avoid detection? Poison the data, yada yada.

      • @[email protected]
        link
        fedilink
        41 year ago

        Is worse nothing gotten really deleted admins admitted in like 2018 that they can see deleted posts. I think even some mods can. The access they give to Google is to the backend they can see EVERYTHING.

      • @[email protected]
        link
        fedilink
        11 year ago

        I think I’m gonna be sick. so all the stuff I wrote, it’s just THERE? what the fuck do i do? what about private info that I dont want on a public fucking search engine?? I’ve had that account since I was a kid, there’s a lot of shit I regret posting, what the FUCK!