this post was submitted on 15 Jun 2025
28 points (81.8% liked)

Fediverse

34476 readers
339 users here now

A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).

If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!

Rules

Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration)

founded 2 years ago
MODERATORS
 

You might've heard that search sucks on software X... maybe software Y... definitely on software Z. The default one kind of sucks on NodeBB too, admittedly.

But why? It's because search is really frickin' hard to get right, and expensive to get good at.

Remember that Google started as a search company, and they became king because they got really good at it, and it was their only product (at the time, anyway!)

The easiest type of search is "full text" search. It matches words exactly based on what you type in. For example if you search lemmy it would match posts that include the word lemmy but depending on how the content was indexed, might not match lemmy.world, lemmy.ca, lemmyverse, etc.

From there you start adding complexity like supporting AND and OR. You support partial matches (lem returns posts containing lemmy and lemmings).

Add more logic to remove stop words and articles like a, the, etc.

Put in some sorting logic to rank stuff higher (what's your algo? Recency? Votes? etc.)

That's just the tip of the iceberg... this problem domain is so vast that entire companies have been built around just providing searching as a service (e.g. Algolia), and it isn't cheap!

top 12 comments
sorted by: hot top controversial new old
[–] MimicJar@lemmy.world 3 points 1 hour ago (1 children)

Search also sucks because people suck.

If I post a picture of a flower with the caption "Look what grew in my garden!", that's a terrible post from a search point of view.

Later on someone will search for "flower" but I didn't use the word "flower" so now search sucks.

Of course a much more common post is someone posting a picture of text, from Twitter, Tumblr, etc. with, once again, a vague caption. You remember the picture, but not what the poster actually said.

Searching comments will sometimes help, but that depends on the comments being related.

[–] julian@community.nodebb.org 2 points 22 minutes ago (1 children)

Does anyone remember way before Google had image recognition technology, the time they built a game that paired up random people on the internet, showed them each an image, and waited for them to both guess the same keyword?

It was gamified human powered taxonomy for meaningless internet points and it was hilarious (at the time.)

[–] MimicJar@lemmy.world 1 points 4 minutes ago

Google Image Labeler apparently, but I don't actually just remember the game. Looks like it's called Crowdsource now, and you can get points, but it isn't a competition.

[–] yessikg@lemmy.blahaj.zone 1 points 1 hour ago

I wish more fedi software had advanced search options like Peertube

[–] jjmoldy@lemmy.world 5 points 8 hours ago* (last edited 8 hours ago)

What gets me is when I set to sort results by top, 1 month (for example) and it shows posts from 2 years ago

[–] LandedGentry@lemmy.zip 8 points 9 hours ago

Don’t forget to mention that Google is no longer that good at it lol

[–] iso@lemy.lol 8 points 13 hours ago* (last edited 13 hours ago) (1 children)

I think Meilisearch is pretty good. I throw it wherever possible and it works :) Implementing a custom solution is not worth it.

[–] mbirth@lemmy.ml 3 points 10 hours ago

There's also Marginalia if you're looking for some rather traditional web search.

[–] rglullis@communick.news 8 points 15 hours ago (1 children)

Is this rant Fediverse-specific?

[–] julian@community.nodebb.org 6 points 14 hours ago (1 children)

rglullis@communick.news A little bit, yes! There was a recent thread in the community I posted to where a discussion about the rather lacklustre search of various software took place.

[–] rglullis@communick.news 6 points 14 hours ago

I understand where you are coming from: search is not easy, but at the same time I think we already have solutions that are "good enough" and doesn't require a ton of work from the developers. PostgreSQL FTS works well enough to power the search system for Lemmy and it works out-of-the-box, for example.

[–] hendrik@palaver.p3x.de 4 points 13 hours ago* (last edited 13 hours ago)

I think I'm fine. I'll just search for some words in the title and that usually returns the correct post. And as long as it's the Fediverse and not a closed forum with login or Discord, I can use Google, since it's on the open internet. At least for Lemmy. Other than that it's really hard. I don't think any search engine can find me the article that I skimmed by Friday evening where I just vaguely remember on how it was about some Youtuber that I know, and I have no other information. I sometimes want to find stuff and it's impossible. With any search engine/method. Sometimes my browser history helps me with that. Or homing in on a timeframe and a rough place and then scrolling through things. But a least for me it tends to be one of the two extremes. Either the rudimentary tools are fine. Or it's really hard but a "better" search wouldn't cut it either.