2 authors say OpenAI 'ingested' their books to train ChatGPT. Now they're suing, and a 'wave' of similar court cases may follow. (www.businessinsider.com)

submitted 2 years ago by L4s@lemmy.world to c/technology@lemmy.world

Two authors sued OpenAI, accusing the company of violating copyright law. They say OpenAI used their work to train ChatGPT without their consent.

[–] ulu_mulu@lemmy.world 1 points 2 years ago (1 children)

I'm not against artificial intelligence; it could be a very valuable tool. But that's nowhere near a valid reason to break laws as OpenAI has done, which is why I too hope the authors win.

[–] bioemerl@kbin.social 12 points 2 years ago (1 children)

What laws are you saying they've broken?

[–] ulu_mulu@lemmy.world 0 points 2 years ago* (last edited 2 years ago) (1 children)

[–] bioemerl@kbin.social 9 points 2 years ago (1 children)

Scraping the web is legal and training AI on data is also legal.

[–] ulu_mulu@lemmy.world 3 points 2 years ago* (last edited 2 years ago) (2 children)

Reusing the content you scraped, if copyright protected, is not.

Edit: unless you get the authorization of the original authors, but OpenAI didn't even ask. That's why it's a crime.

[–] GnothiSeauton@lemmy.world 12 points 2 years ago (1 children)

Sounds like fair use to me.

[–] LegendofDragoon@kbin.social 1 points 2 years ago

That really will be the question at hand. Is the AI producing work that could be considered transformative, educational, or parody? The answer is of course yes: it is capable of doing all three of those things, but it's also capable of being coaxed into reproducing things exactly.

I don't know if current copyright laws are capable of dealing with the AI renaissance.

[–] bioemerl@kbin.social 3 points 2 years ago

Yeah, it is. The only protection in copyright is called derivative works, and an AI is not a derivative of a book, no more than your brain is after you've read one.

The only exception would be if you manage to overtrain and encode the contents of the book inside of the model file. That's not what happened here, because all ChatGPT output was a summary.

The only valid claim here is the fact that the books were not supposed to be on the public internet, and it's likely that the way OpenAI got the books in the first place was through some piracy website while scraping the web.

At that point you just have to hold them liable for that act of piracy, not claim that the model release itself was an act of copyright violation.