I know there are other plausible reasons, but I thought I’d use this juicy title.

What does everyone think? As someone who works outside of tech I’m curious to hear the collective thoughts of the tech minds on Lemmy.

  • alternative_factor@kbin.social
    8 months ago

    I think he was probably lying about where he got all the data used to train the model. I’m guessing that training a model on tons of copyrighted material and stolen user data won’t be legal in the near future.

      • alternative_factor@kbin.social
        8 months ago

        Yeah, I’ve done a tiny bit of AI stuff for what I do (biology), and I think it’s very sus that they can build such a strong model out of data which costs lots of money. The reason the algos in my field of biology are so strong is that the NCBI has the genomes of everything that’s been sequenced FOR FREE, because obviously you don’t want people patenting genomes and it should all be free for science, etc.

        Which raises the question: how does a startup that started out as a non-profit get that much user data and keep costs low? I know you can buy user data, and I’m not sure how much it costs to buy a bunch of Google Docs from a data broker, but if you buy from hackers who just pulled off a data breach or used some illegal crawler, you can probably cut that to prices a non-profit could afford.

      • alternative_factor@kbin.social
        8 months ago

        Very true, but they don’t always win, and besides, there are other lobbyists out there batting for Disney. If there is one hint of Mickey Mouse™ in their data set, they might as well just dissolve the company now.