Joke's on you for crawling mostly synthetic text?
I hope they lose this case badly.
For all the concerns I have about AI and stealing others' work, I want to see Reddit burn for pretending that they are all about community and connection, while actively harming their users’ experience on the platform and attempting to profit off their content.
Yeah, something about a company making billions of dollars off completely user generated content and moderation just rubs me the wrong way. As much as I hate Facebook, they at least pay people to do moderation there, and regularly update their site (as shitty as it is). I don't use either anymore, and I hope they die in a pit of flames owing billions to their shareholders.
As much as I hate Facebook, they at least pay people to do moderation there, and regularly update their site
Facebook pays content creators too (https://creators.facebook.com/earn-money), including for things other than videos (like photo/image posts). Platforms like YouTube do too, but as far as I know, Reddit doesn't.
No matter who wins, everyone loses.
I can't believe you beat me to this. Well done.
You've fallen for one of the classic blunders!
In the filing, Reddit calls Anthropic a “late-blooming artificial intelligence (‘AI’) company that bills itself as the white knight of the AI industry,” alleging that “it is anything but.”
“This case is about the two faces of Anthropic: the public face that attempts to ingratiate itself into the consumer’s consciousness with claims of righteousness and respect for boundaries and the law, and the private face that ignores any rules that interfere with its attempts to further line its pockets.”
I mean, Reddit's objection is that they want to sell the same data to Google to do the same training.
I dunno, it just reads like a reddit comment to me. 🤣
They actually wrote that in a real legal filing?
Jesus.
Did they ask /r/pettyrevenge to write that?
So if reddit wins, that means the content is theirs. So if the content is theirs, they are liable for any content that is illegal. Is that true?
yes to both regardless of this lawsuit
The wiggle room for large businesses is that they remove content that violates local laws when notified of it
This is like one of those cases where I'm kind of hoping they both lose somehow. Neither party is right in this case: Reddit is trying to claim copyright over content they have no rights to, and Anthropic shouldn't be violating copyright without a licence.
But apparently you are actually allowed to violate copyright without a licence if you're an AI company, because LLMs are the future? So I guess Reddit is going to lose, which will be funny.
I am squarely on the "Reddit should lose this" side.
Anthropic may be breaking copyright, but not Reddit's copyright. Sure maybe Anthropic should be sued, but not by Reddit.
Actually this case could be a good thing. The whole question of who owns user generated content needs hashing out, because no one seems to actually know.
Obviously the logical answer would be that the people who created it own the content, but that's never been officially decided.
Because that's the only common sense conclusion to make, but that doesn't make rich fucks more money
Yeah maybe we shouldn't have the case in the US where money rules everything.
EU get on it.
Judge finds that Anthropic has to pay restitution to the Reddit users. Affirms that posts belong to users.
Well, I can dream.
You mean Reddit, the company that would be very happy if Anthropic did the exact same thing, but paid Reddit first?
“Reddit’s humanity is uniquely valuable in a world flattened by AI,” Lee said. “Now more than ever, people are seeking authentic human-to-human conversation. Reddit hosts nearly 20 years of rich, human discussion on virtually every topic imaginable. These conversations don’t happen anywhere else—and they’re central to training language models like Claude.”
LMAO, reddit's days of genuine conversations between humans are long gone.
Only 100,000 times? Shit, do I need to be worried about getting sued too?
All porn subreddits are exempted
Reddit is just mad that Anthropic didn't pay them
... Yes?
Yes that is how capitalism works
The users posted for free. They didn't get paid. The posts should be publicly available for scraping
Long story short: They are not combatting bots on their platform. They sold training data to google and these guys aren't paying, that's why they're suing.
Spez can forever get fucked
Suck shit reddit.
while half of reddit is infested with propaganda bots from russia.
Not just Russia.
Israel, US, China, North Korea, India and other countries... Nuclear Lobby, Fossil Fuel Lobby and countless other industry lobbyists... Private companies advertising their products...
I hope they both choke on their own bots.
100.000 accesses isn't that much, right?
100,000 requests in 11 months? That's about 12.5 requests an hour
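For anyone who wants to check that figure, here is a rough sketch of the arithmetic. The average month length (365.25 / 12 days) and the function name are my own assumptions, not something from the filing, so treat the result as a ballpark.

```python
HOURS_PER_DAY = 24
# Assumption: an "average" month of 365.25 / 12 days (about 30.44 days).
AVG_DAYS_PER_MONTH = 365.25 / 12


def requests_per_hour(total_requests: int, months: float) -> float:
    """Average hourly request rate over a span given in months."""
    hours = months * AVG_DAYS_PER_MONTH * HOURS_PER_DAY
    return total_requests / hours


rate = requests_per_hour(100_000, 11)
print(f"{rate:.1f} requests per hour")
print(f"{rate / 60:.2f} requests per minute")
```

That lands between 12 and 13 requests an hour, so the estimate above holds up, and it works out to roughly one request every five minutes.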
That's hardly anything. Facebook has a bot accessing my server's robots.txt multiple times a second. (My robots.txt used to say "Facebook bot go away" but now I just respond 404 to any requests from the Facebook bot. Pretend I said that all technical and stuff, it's 2 am and I ought to go to sleep.)
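Said a bit more technically, the "just respond 404" approach could look something like the sketch below. This is not the commenter's actual setup: the `facebookexternalhit` token and the `status_for` helper are assumptions for illustration, and a real server would more likely do this in its web server or proxy config.

```python
# Assumption: the crawler identifies itself with this User-Agent token.
# Adjust the tuple to whatever actually shows up in your access logs.
BLOCKED_AGENT_TOKENS = ("facebookexternalhit",)


def status_for(user_agent: str) -> int:
    """Return the HTTP status to send back for a given User-Agent header."""
    ua = user_agent.lower()
    if any(token in ua for token in BLOCKED_AGENT_TOKENS):
        # Pretend nothing exists for this bot, robots.txt included.
        return 404
    return 200


print(status_for("facebookexternalhit/1.1"))
print(status_for("Mozilla/5.0 (X11; Linux x86_64)"))
```

Matching on the User-Agent only stops bots that identify themselves honestly, so it is a courtesy filter rather than real protection.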
Obviously Reddit isn't averse to bots scraping the site for data, just ones that aren't paying them. I'm regretting not going through and systematically deleting all my posts and comments before deleting my account, but I thought that happened automatically.
I don't regret not deleting all my comments. For me, it's a mishmash of helpful/comedic/observational comments, and I don't care that they have been sold off for use as training data.
But, I just got shadowbanned, because of my VPN or something, so they aren't getting any more!
Edit: I guess great minds think alike.