this post was submitted on 28 Jan 2025
876 points (94.5% liked)
memes
11300 readers
3812 users here now
Community rules
1. Be civil
No trolling, bigotry or other insulting / annoying behaviour
2. No politics
This is non-politics community. For political memes please go to !politicalmemes@lemmy.world
3. No recent reposts
Check for reposts when posting a meme, you can only repost after 1 month
4. No bots
No bots without the express approval of the mods or the admins
5. No Spam/Ads
No advertisements or spam. This is an instance rule and the only way to live.
A collection of some classic Lemmy memes for your enjoyment
Sister communities
- !tenforward@lemmy.world : Star Trek memes, chat and shitposts
- !lemmyshitpost@lemmy.world : Lemmy Shitposts, anything and everything goes.
- !linuxmemes@lemmy.world : Linux themed memes
- !comicstrips@lemmy.world : for those who love comic stories.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
The source OP is referring to is the training data what they used to compute those weights. Meaning, petabytes of text. Without that we don't know which content theynused for training the model.
The running/training engines might be open source, the pretrained model isn't and claiming otherwise is wrong.
Nothing wrong with it being this way, most commercial models operate the same way obviously. Just don't claim that themselves is open source because a big part of it is that people can reproduce your training to verify that there's no fowl play in the input data. We literally can't. That's it.