• 4 Posts
  • 205 Comments
Joined 1 year ago
cake
Cake day: June 7th, 2023

help-circle









  • Scraping is legal

    Have you been following any of the court battles involving LLMs lately?

    The New York Times suing OpenAI. Getty Images suing Stability AI. Sarah Silverman and George R.R. Martin suing OpenAI.

    All of those cases involve data that has been scraped. (In the latter two cases, the memoir/novels were scraped from excerpts and archives found online).

    It’s too late to say with complete certainty that it’s all legal (the appeal processes haven’t all been finished yet), but at this point it looks like using scraped and copyrighted data in training LLMs is legal. Even if it’s going to turn out not to be legal, it’s very clear that nobody’s shying away from doing it, because we have the courts showing as a statement of fact that it’s been happening for years.

    Everything you’ve written is just fantasy. We have a lot of reality which contradicts it. Every LLM company has been primarily relying upon scraping data (which we know to completely legal) and has been incorporated copyrighted and scraped data in its data sets (which is still legally a grey area, but is happening anyway).






  • In a certain light, you could argue that Linus doesn’t really have any control at all. He doesn’t write any code for Linux (hasn’t in many years), doesn’t do any real planning or commanding or managing. “All” he does is coordinate merges and maintain his own personal git branch. (And he’s not alone in that: a lot of people maintain their own Linux branches). He has literally no formal authority at all in Linux development.

    It just so happens that, by a very large margin, his own personal git branch is the most popular and trusted in the world. People trust his judgment for what goes in and doesn’t go in.

    It’s not like Linux development is stopped because Linus goes offline (or goes on vacation or whatever). People keep writing code and discussing and testing and whatnot. It’s just that without Linus’s discerning eye casting judgment on their work, it doesn’t enter the mainstream.

    Nothing will really get slowed down. Whether something officially gets labelled by Linus as “6.8” or “6.whatever” doesn’t really matter in the big picture of Linux development.



  • I’m going to reframe the question as “Are computers good for someone tech illiterate?”

    I think the answer is “yes, if you have someone that can help you”.

    The problem with proprietary systems like Windows or OS X is that that “someone” is a large corporation. And, in fairness, they generally do a good job of looking after tech illiterate people. They ensure that their users don’t have to worry about how to do updates, or figure out what browser they should be using, or what have you.

    But (and it’s a big but) they don’t actually care about you. Their interest making sure you have a good experience ends at a dollar sign. If they think what’s best for you is to show you ads and spy on you, that’s what they’ll do. And you’re in a tricky position with them because you kind of have to trust them.

    So with Linux you don’t have a corporation looking after you. You do have a community (like this one) to some degree, but there’s a limit to how much we can help you. We’re not there on your computer with you (thankfully, for your privacy’s sake), so to a large degree, you are kind of on your own.

    But Linux actually works very well if you have a trusted friend/partner/child/sibling/whoever who can help you out now and then. If you’ve got someone to help you out with it, Linux can actually work very very well for tech illiterate people. The general experience of browsing around, editing documents, editing photos, etc., works very much the same way as it does on Windows or OS X. You will probably be able to do all that without help.

    But you might not know which software is best for editing photos. Or you might need help with a specific task (like getting a printer set up) and having someone to fall back on will give you much better experience.