ChatGPT

8912 readers

1 users here now

Unofficial ChatGPT community to discuss anything ChatGPT

founded 1 year ago

MODERATORS

148

submitted 7 months ago by ooli@lemmy.world to c/chatgpt@lemmy.world

43 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] huginn@feddit.it 2 points 7 months ago (1 children)

I think increasingly specialized models and analog systems that run them will be increasingly prevalent.

LLMs at their current scales don't do enough to be worth their enormous cost... And adding more data is increasingly difficult.

That said: the gains on LLMs have always been linear based on recent research. Emergence was always illusory.

[–] ericjmorey@discuss.online 1 points 7 months ago (1 children)

I'd like to read the research you alluded to. What research specifically did you have in mind?

[–] huginn@feddit.it 2 points 7 months ago

Sure: here's the article.

The basics are that:

LLM "emergent behavior" has never been consistent, it has always been specific to some types of testing. Like taking the SAT saw emergent behavior when it got above a certain number of parameters because it went from missing most questions to missing fewer.
They looked at the emergent behavior of the LLM compared to all the other ways it only grew linearly and found a pattern: emergence was only being displayed in nonlinear metrics. If your metric didn't have a smooth t transition between wrong, less wrong, sorta right, and right then the LLM would appear emergent without actually being so.