this post was submitted on 12 Mar 2025

LocalLLaMA

GGUF quants are already up, and llama.cpp was updated today to support the model.

brucethemoose@lemmy.world 1 point 2 days ago

I tested these out and found they are really bad at longer context... at least in settings that can sanely fit on most GPUs.

Seems the Gemma family is mostly for short-context work, still.