this post was submitted on 29 Jan 2025
1122 points (99.0% liked)

Not The Onion

13002 readers
2072 users here now

Welcome

We're not The Onion! Not affiliated with them in any way! Not operated by them in any way! All the news here is real!

The Rules

Posts must be:

  1. Links to news stories from...
  2. ...credible sources, with...
  3. ...their original headlines, that...
  4. ...would make people who see the headline think, “That has got to be a story from The Onion, America’s Finest News Source.”

Comments must abide by the server rules for Lemmy.world and generally abstain from trollish, bigoted, or otherwise disruptive behavior that makes this community less fun for everyone.

And that’s basically it!

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] cm0002@lemmy.world 1 points 2 days ago (1 children)

That's not how training works with LLMs at all

[–] nickwitha_k@lemmy.sdf.org -2 points 2 days ago (1 children)

I can’t recall a time when I downloaded an album, took samples of the entire thing, pretended I made it without making any actual alterations, and tried to sell access to it.

[–] cm0002@lemmy.world 4 points 2 days ago (1 children)

It does make alterations of it, it's completely shredded up as part of the training process and turned into numbers and statistics mushed with a bunch of other numbers and statistics.

It's like baking a cake, you mix in flour, butter, eggs, and bake it. Once mixed and baked you can't get the flour, butter and eggs back to their original form and the final product is completely different

If it wasn't you'd be able to pull full unaltered copies directly from the model files, but that hasn't been accomplished. The best that people have been able to do is get the AI to recreate something pretty close to the original with very careful and specific prompts. But it's still a recreation, based on what it "learned".

[–] nickwitha_k@lemmy.sdf.org 4 points 1 day ago

Yes, my edit was a bit hyperbolic. The point being that current AI/LLM companies have been, at best, encoding data that they do not have permission to use into their models.

It's like baking a cake, you mix in flour, butter, eggs, and bake it. Once mixed and baked you can't get the flour, butter and eggs back to their original form and the final product is completely different

It's more like baking a cake with flour, butter, and eggs that you snagged from other people's grocery baskets after they paid for them. Then, started selling the cakes made from said ingredients.

Ideally, none of that would matter because knowledge and data want to be free and everyone would benefit. However, we don't live in such a world. Instead, the technology is being used almost exclusively to extract wealth from people and make the average human being's life worse, both in the short-term by reducing their ability to support themselves and in the long-term by drastically increasing consumption of fossil fuels and potable water, putting more pressure on the biosphere.