this post was submitted on 07 Jul 2025
553 points (98.1% liked)

Open Source

38876 readers
88 users here now

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] bdonvr@thelemmy.club 29 points 6 days ago (6 children)

Ooh can this work with Lemmy without affecting federation?

[–] beyond@linkage.ds8.zone 30 points 6 days ago (1 children)

Yes.

Source: I use it on my instance and federation works fine

[–] bdonvr@thelemmy.club 16 points 6 days ago (1 children)

Thanks. Anything special configuring it?

[–] beyond@linkage.ds8.zone 20 points 6 days ago* (last edited 6 days ago)

I keep my server config in a public git repo, but I don't think you have to do anything really special to make it work with lemmy. Since I use Traefik I followed the guide for setting up Anubis with Traefik.

I don't expect to run into issues as Anubis specifically looks for user-agent strings that appear like human users (i.e. they contain the word "Mozilla" as most graphical web browsers do) any request clearly coming from a bot that identifies itself is left alone, and lemmy identifies itself as "Lemmy/{version} +{hostname}" in requests.

[–] deadcade@lemmy.deadca.de 11 points 5 days ago (1 children)

"Yes", for any bits the user sees. The frontend UI can be behind Anubis without issues. The API, including both user and federation, cannot. We expect "bots" to use an API, so you can't put human verification in front of it. These "bots* also include applications that aren't aware of Anubis, or unable to pass it, like all third party Lemmy apps.

That does stop almost all generic AI scraping, though it does not prevent targeted abuse.

[–] beyond@linkage.ds8.zone 3 points 5 days ago

The API, including both user and federation, cannot.

This is theoretically an issue however in practice Anubis only weighs requests that appear to come from a browser: https://anubis.techaro.lol/docs/design/how-anubis-works

I just tested my instance with Jerboa and it seems to work just fine.

[–] interdimensionalmeme@lemmy.ml 8 points 5 days ago (1 children)

Yes, it would make lemmy as unsearchable as discord. Instead of unsearchable as pinterest.

[–] bdonvr@thelemmy.club 3 points 5 days ago (1 children)

That's not true, search indexer bots should be allowed through from what I read here.

[–] interdimensionalmeme@lemmy.ml 9 points 5 days ago

If you allow my searchxng search scraper then an AI scraper is indistinguishable.

If you mean, "google and duckduckgo are whitelisted" then lemmy will only be searchable there, those specific whitelisted hosts. And google search index is also an AI scraper bot.

[–] infinitesunrise@slrpnk.net 5 points 6 days ago

Yeah, it's already deployed on slrpnk.net. I see it momentarily every time I load the site.

[–] Resonosity@lemmy.dbzer0.com 3 points 5 days ago (1 children)

To be honest, I need to ask my admin about that!

[–] fxomt@lemmy.dbzer0.com 3 points 5 days ago* (last edited 5 days ago)

We don't use anubis but we use iocaine (?), see /0 for the announcement post

[–] seang96@spgrn.com 4 points 6 days ago

As long as its not configured improperly. When forgejo devs added it it broke downloading images with Kubernetes for a moment. Basically would need to make sure user agent header for federation is allowed.