this post was submitted on 12 Jun 2025
You did not read your source. Some quotes you apparently missed:
Please read your source before posting it and claiming it says something it doesn't actually say.
Now why does Doctorow distinguish between good scraping and bad scraping, and even between good LLM training and bad LLM training, in his post?
Because the good applications are actually covered by fair use while the bad ones aren't.
Because fair use isn't actually about what is done (scraping, LLM training, ...) but about who does it (researchers, non-profit vs. companies, for-profit) and for what purpose (research, critique, teaching, news reporting vs. making a profit by putting original copyright owners out of work).
That's the whole point of fair use. It's even in the name. It's about the use, and the use needs to be fair. It's not called "Allowed techniques, don't care if it's fair".