this post was submitted on 30 Sep 2023
553 points (97.8% liked)

World News


A community for discussing events around the World

Rules:

Similarly, if you see posts along these lines, do not engage. Report them, block them, and live a happier life than they do. We see too many slapfights that boil down to "Mom! He's bugging me!" and "I'm not touching you!" Going forward, slapfights will result in removed comments and temp bans to cool off.

We ask users to report any comment or post that violates the rules, and to use critical thinking when reading, posting, or commenting. Users who post off-topic spam, advocate violence, have multiple comments or posts removed, weaponize reports, or violate the code of conduct will be banned.

All posts and comments will be reviewed on a case-by-case basis. This means that some content that violates the rules may be allowed, while other content that does not violate the rules may be removed. The moderators retain the right to remove any content and ban users.


Lemmy World Partners

News !news@lemmy.world

Politics !politics@lemmy.world

World Politics !globalpolitics@lemmy.world


Recommendations

For Firefox users, there is a media bias / propaganda / fact-check plugin.

https://addons.mozilla.org/en-US/firefox/addon/media-bias-fact-check/

founded 1 year ago
[–] 0ddysseus@lemmy.world -2 points 1 year ago* (last edited 10 months ago) (1 children)

Editing this reply to say that I was in fact right and I did not have any fundamental misunderstanding of anything. The database in question here is called LAION and contains 6 billion images scraped from the web, including CSAM images.

Thanks for that. As I said, I'm not big into how AI works, so not surprised I got that wrong. The databases of everything that has come across the clear web are still there though and are available for use by people with access.

[–] BreakDecks@lemmy.ml 5 points 1 year ago (2 children)

What are you referring to by "the database of everything that has come across the clear web"?

[–] 0ddysseus@lemmy.world 1 points 10 months ago

See this new article. The image database they looked into is called LAION. There are others, though, of course. I don't mean Google crawlers, I mean image databases for training image generators.

https://www.independent.co.uk/news/ap-study-developers-thorn-canada-b2467386.html

[–] NightAuthor@lemmy.world 1 points 1 year ago

NSA servers? jkjk, kinda

I think they mean Google's web-crawler index, but I don't think the index works that way... well, on the other hand, they do cache some stuff.