Have we really watered down the definition of AGI that much?
LLMs aren't really capable of "learning" anything outside their training data. Which I feel is a very basic and fundamental capability of humans.
Every new request thread is a blank slate that uses whatever context you provide for the specific task, and after the thread is done (or the context limit runs out) it's like it never happened. Sure, you can use databases, do web queries, etc., but these are inflexible band-aid solutions, far from what's needed for AGI.
> LLMs aren't really capable of "learning" anything outside their training data.
ChatGPT has for some time had a feature for storing memories from its conversations with users. And you can use function calling to make this more generic.
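A minimal sketch of what "make this more generic with function calling" could look like. Everything here (`save_memory`, the in-process `MEMORY` dict, the tool schema) is hypothetical illustration, not a real API; a real setup would persist to disk or a database and pass the schema to a chat-completions endpoint.

```python
import json

# Hypothetical in-process memory store; a real system would persist this.
MEMORY: dict[str, str] = {}

# Tool schema you would hand to the model so it can decide
# when to store a durable fact about the user.
SAVE_MEMORY_TOOL = {
    "type": "function",
    "function": {
        "name": "save_memory",
        "description": "Store a durable fact about the user.",
        "parameters": {
            "type": "object",
            "properties": {
                "key": {"type": "string"},
                "value": {"type": "string"},
            },
            "required": ["key", "value"],
        },
    },
}

def save_memory(key: str, value: str) -> str:
    """Executed locally when the model emits a save_memory tool call."""
    MEMORY[key] = value
    return json.dumps({"stored": key})

def recall_all() -> str:
    """Prepended to the system prompt of every new conversation."""
    return "\n".join(f"{k}: {v}" for k, v in MEMORY.items())

# Simulate the model deciding mid-conversation to remember something:
save_memory("favorite_language", "Rust")
print(recall_all())
```

Note this still fits the parent's objection: the "memory" lives outside the weights and only affects the model through the context window.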
I think drawing the boundary at “model + scaffolding” is more interesting.
Isn't that the whole point of LlamaIndex? I can connect my LLM to any node or context I want. Sync it to a real-time data flow like an API and it can learn...? How is that different from a human?
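The pattern being described (which frameworks like LlamaIndex automate) can be sketched in a few lines of plain Python. `fetch_live_data` and its return values are made-up placeholders for a real API call:

```python
def fetch_live_data() -> dict:
    """Placeholder for a real-time source, e.g. requests.get(...) on an API."""
    return {"btc_usd": 97250.0}

def build_prompt(question: str) -> str:
    """Inject fresh data into the context window at query time.

    Note this is retrieval/augmentation, not weight updates: the
    'learning' lasts only as long as the context does.
    """
    data = fetch_live_data()
    facts = "\n".join(f"{k} = {v}" for k, v in data.items())
    return f"Current data:\n{facts}\n\nQuestion: {question}"

print(build_prompt("What is the BTC price right now?"))
```

Whether stuffing fresh facts into the context counts as "learning" in the human sense is exactly what's in dispute upthread.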
Once Optimus is up and working by the 100k+, the spatial problems will be solved. We just don't have enough spatial-awareness data, or a way for the LLM to learn about the physical world.
That's true for vanilla LLMs, but also keep in mind that there are no details about o3's architecture at the moment. Clearly they are doing something different given the huge performance jump on a lot of benchmarks, and it may well involve in-context learning.
My point was to caution against being too confident about the underlying architecture, not to argue for any particular alternative.
Your statement is false: a lot changed under the hood between GPT-4 and o1, but notably not a larger model size. In fact, o1 is several orders of magnitude smaller than GPT-4! Improvements are being made in other ways.