this post was submitted on 09 Jul 2023

As I was browsing lemmy and the fediverse at large, this question kept popping into my head.

Since multimedia files have a much bigger footprint than raw text, I worry that, as time goes on, massive resources will be needed to keep up with all the data coming in.

I do wonder whether the instances have gone the cloud route and just put all of it in something like AWS S3. Or maybe they use self-hosted object storage with something like MinIO?

[–] rcmaehl@lemmy.world 14 points 1 year ago* (last edited 1 year ago) (14 children)

Edit: I am partially wrong. (See below)

They're stored on their host Instance. Only text is copied across instances.

[–] laenurd@lemmy.lemist.de 20 points 1 year ago* (last edited 1 year ago) (10 children)

That is not true. As long as a user on your instance is subscribed to a community, the media content of posts [Edit: only posts linking to outside sources, e.g. Imgur] of that community is stored locally on your instance as well.

This, of course, only applies to media which is uploaded to Lemmy; links to media hosted externally are not downloaded.

See this issue for more context.

Edit: I want to clarify that I was partially wrong - Lemmy only locally caches content which is hosted on outside sites. It does (should?) not cache content that was directly uploaded to a Lemmy instance and just embeds the source media.

[–] Trifictional@lemmy.ca 25 points 1 year ago (5 children)

I think this could be a ticking DoS time bomb.

If someone manages to spam-upload massive files to the largest Lemmy instances, it could wipe out a ton of smaller ones.

Not to mention that, scalability-wise, this seems like a nightmare… eventually the largest Lemmy instances will have petabytes of media data with hundreds of GB coming in per day, giving other instances no chance to sync with them.

I think the system architecture needs a significant review. This won’t scale.
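
For a rough sense of the growth rates mentioned above, here is a back-of-the-envelope sketch. The ingest rates are illustrative assumptions, not measurements from any real instance, and `days_to_reach` is a hypothetical helper written for this example.

```python
# Back-of-the-envelope estimate: how long does it take to accumulate
# 1 PB of media at a constant daily ingest rate?
# All rates below are assumed values for illustration only.

PETABYTE_GB = 1_000_000  # 1 PB in GB, using decimal (SI) units


def days_to_reach(total_gb: float, ingest_gb_per_day: float) -> float:
    """Days needed to store total_gb at a constant ingest rate."""
    if ingest_gb_per_day <= 0:
        raise ValueError("ingest rate must be positive")
    return total_gb / ingest_gb_per_day


if __name__ == "__main__":
    for rate in (100, 500, 1000):  # assumed GB/day
        days = days_to_reach(PETABYTE_GB, rate)
        print(f"{rate} GB/day: {days:.0f} days (~{days / 365:.1f} years) to reach 1 PB")
```

The point is only that storage grows linearly with ingest, so every instance that mirrors the same media pays that cost again.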

[–] Booty@lemmy.world 5 points 1 year ago (1 children)

I agree. It's also a tremendous waste of resources. I'm all for redundancy (like CDNs), but this seems incredibly poorly thought out. If Lemmy (as a whole) ever scales to the size of other social media, the space requirements will start to become unreasonable.

Why wouldn't something like symlinks be implemented? Not saying specifically use symlinks, but there has to be a similar, better way.

[–] laenurd@lemmy.lemist.de 6 points 1 year ago* (last edited 1 year ago)

The obvious way would be to just not cache content locally and always link to the source instance. While this would concentrate the strain immensely, it would also greatly decrease the storage space used by all other instances.

There might also be other viable alternatives such as using a CDN and having it selectively cache content which is requested often etc.
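
The "selectively cache content which is requested often" idea can be sketched roughly as follows. This is not how Lemmy or any particular CDN works; `SelectiveMediaCache`, `resolve`, and the `min_requests` threshold are all hypothetical names chosen for illustration, and the `fetch` callable stands in for whatever would download the media from the source instance.

```python
from collections import Counter
from typing import Callable, Dict, Tuple, Union


class SelectiveMediaCache:
    """Link to the source until a URL has been requested often enough,
    then fetch it once and serve the local copy from then on."""

    def __init__(self, fetch: Callable[[str], bytes], min_requests: int = 3):
        self._fetch = fetch                # hypothetical downloader
        self._min_requests = min_requests  # assumed popularity threshold
        self._hits: Counter = Counter()    # request count per URL
        self._store: Dict[str, bytes] = {} # locally cached media

    def resolve(self, url: str) -> Tuple[str, Union[str, bytes]]:
        """Return ("local", data) if cached, otherwise ("remote", url)."""
        if url in self._store:
            return ("local", self._store[url])
        self._hits[url] += 1
        if self._hits[url] >= self._min_requests:
            # Popular enough: download once and keep a local copy.
            self._store[url] = self._fetch(url)
            return ("local", self._store[url])
        # Not popular yet: just point the client at the source instance.
        return ("remote", url)
```

With `min_requests=2`, the first request for a URL returns a link to the source and the second triggers a single download; rarely viewed media never takes up local storage. A real setup would also need eviction and size limits, which are left out here.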

~~As of now, Lemmy does not support either, though.~~

Edit: I want to clarify that I was partially wrong - Lemmy only locally caches content which is hosted on outside sites. It does (should?) not cache content that was directly uploaded to a Lemmy instance and just embeds the source media.
