Technology

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.


OpenAI says it’s “impossible” to create useful AI models without copyrighted material (arstechnica.com)

submitted 1 year ago by sculd@beehaw.org to c/technology@beehaw.org


Apparently, stealing other people's work to create a product for money is now "fair use," according to OpenAI, because they are "innovating" (stealing). Yeah. Move fast and break things, huh?

"Because copyright today covers virtually every sort of human expression—including blogposts, photographs, forum posts, scraps of software code, and government documents—it would be impossible to train today’s leading AI models without using copyrighted materials," wrote OpenAI in the House of Lords submission.

OpenAI claimed that the authors in that lawsuit "misconceive[d] the scope of copyright, failing to take into account the limitations and exceptions (including fair use) that properly leave room for innovations like the large language models now at the forefront of artificial intelligence."


[–] lemmyvore@feddit.nl 17 points 1 year ago* (last edited 1 year ago) (1 children)

This isn't about scraping the internet. The internet is full of crap and the LLMs will add even more crap to it. It will shortly become exponentially harder to find the meaningful content on the internet.

No, this is about dipping into high quality, curated content. OpenAI wants to be able to use all existing human artwork without paying anything for it, and then flood the world with cheap knockoff copies. It's that simple.

[–] towerful@programming.dev 10 points 1 year ago (1 children)

Shortly? It's happening already. I notice it when using Google and DuckDuckGo. There are always a few hits that are AI-written blog-spam word soup.

[–] lemmyvore@feddit.nl 8 points 1 year ago (2 children)

Unfortunately, you haven't seen the full impact of LLMs yet. What you're seeing now is stuff that's already been going on for a decade. SEO content generators have been a thing for many years and are used by everybody from small business owners to site chains pinching ad pennies.

When the LLM crap kicks in, you won't see anything except its links. I wouldn't be surprised if we have to go back to 90s tech and use human-curated webrings and directories.

[–] emptiestplace@lemmy.ml 2 points 1 year ago

It's especially amusing when you consider that it's not even fully autonomous yet; we're actively doing this to ourselves.

[–] prex@aussie.zone 2 points 1 year ago

I wonder how many comments in this thread are AI-generated. I wonder how many comments on Lemmy will be in five years' time.