this post was submitted on 21 Jan 2024
602 points (97.6% liked)
Technology
59223 readers
3713 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
It's not about security through obscurity but "if a measurement becomes a goal then it ceases to be a good measurement" - so keeping the measurements hidden in order to make it harder for them to become a goal is a decent way to go on about it.
How would you measure "without ads"? That would just be the same cat and mouse game that adblockers have to deal with for decades.
I'm not sure it's possible to find a good completely open source solution that's not either giving bad results by down rating good results for the wrong reasons or that's open to misuse by SEO.
That might work if it's a small project where noone cares about fixing the results but if something like that becomes mainstream it's going to happen
The measure, from the perspective of Clickbaiters, is purely their own income stream. And there's no way to hide that from the guy generating the clickbait.
We have a well-defined set of sites and services that embed content within a website in exchange for payment. An easy place to start is to look for these embeds on a website and downgrade the results in your query as a result. We can also see, from redirects and ajax calls off a visited website, when lots of other information is being drawn in from third-party sites. That's a very big red flag on a site that's doing ad pop-ups/pop-overs and other gimmicks.
I would put more faith in an open-source solution than a private model, purely due to the financial incentives involved in their respective creations. The challenge with an open model is in getting the space and processing power to do all the web-crawling.
After that, it wouldn't be crazy to go in the Wikipedia/Reddit direction and have user-input to grade your query results, assuming a certain core pool of reliable users could be established.