this post was submitted on 05 Jul 2023
768 points (98.2% liked)

World News

32288 readers
695 users here now

News from around the world!

Rules:

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] eek2121@lemmy.world 251 points 1 year ago (38 children)

They actually likely did this due to SEO. Google was allegedly in the process of removing tweets from the search index because they weren’t accessible. This happens automatically for most sites.

[–] Veltoss@lemmy.world 75 points 1 year ago (11 children)

How does Pinterest get around this then? They pollute image searches like crazy, and require you to login to see anything. At least they did, I blocked them from searches so maybe it's different now.

[–] gressen@lemmy.world 14 points 1 year ago (1 children)

Easy - detect if you're getting accessed by a search crawler or a human. Serve a full page or just a login request.

[–] RGB3x3@lemmy.world 11 points 1 year ago (3 children)

So how can a user pretend to be a web crawler?

[–] theMightyMoonWorm@lemmy.ml 22 points 1 year ago* (last edited 1 year ago)

This browser addon can spoof useragents:https://add0n.com/useragent-switcher.html

[–] SketchySeaBeast@lemmy.ca 19 points 1 year ago (1 children)

You're going to need a special hat.

[–] dangrousperson@vlemmy.net 7 points 1 year ago

Ever heard of https://12ft.io/ ? It allows you to bypass alot of pay walls by basically pretending to be a search engine trying to index a website. For SEO reasons a lot of pay walled sites allow search engines to access the whole article to index. 12ft.io leverages this to show you whole articles behind paywalls. This is something you could also achieve by spoofing the User-Agent. It would probably work for things like Pinterest without an account as well, but that's something I have never tried (since I have no interest in the cancer that is Pinterest).

load more comments (9 replies)
load more comments (35 replies)