LocalLLaMA

2735 readers

17 users here now

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

founded 2 years ago

MODERATORS

submitted 1 week ago* (last edited 1 week ago) by Lantier@jlai.lu to c/localllama@sh.itjust.works

3 comments fedilink hide all child comments

GGUF quants are already up and llama.cpp was updated today to support it.

you are viewing a single comment's thread
view the rest of the comments

[–] brucethemoose@lemmy.world 1 points 3 days ago

I tested these out and found they are really bad at longer context... at least in settings that can sanely fit on most GPUs.

Seems the Gemma family is mostly for short-context work, still.