LLMs are gigantic curves fitted to civilizational-scale datasets; their predictions follow from that fitting. A language model is a mathematical construct, and can only be as intelligent as that algebra book sitting on your shelf.
An algebra book is a collection of paper pages with ink on them. An LLM is... nothing like that at all. LLMs are complex machines that operate on data and produce data. Books are completely static. They don't do anything.
Do you have a better analogy? I'd like to hear more about how ML models can't be intelligent, if you don't mind.
I'm pretty skeptical of the idea that we know enough at this point to make that claim definitively.
>LLMs are gigantic curves fitted to civilizational scale datasets
>A language model is a mathematical construct
That is like telling someone from the Middle Ages that a gun is merely an assemblage of metal parts, not too different from the horseshoes and cast-iron nails produced by your village blacksmith, and that consequently it is safe to give a child a loaded gun.
ADDED. Actually, a better response (because it does not rely on an analogy) is to point out that none of the people who are upset over the possibility that most of the benefits of AI might accrue to a few tech titans and billionaires would be in the least bit reassured by being told that an AI model is just a mathematical construct.
A pure LLM-based approach will not lead to AGI, I'm 100% sure. A recent research paper [0] has shown that no matter which LLM is used, it exhibits diminishing returns, whereas you would want at least linear scaling if you're looking for AGI.
Obviously feeding more data won't do anything besides increase the knowledge available.
Next steps would be in totally different fields, like implementing actual reasoning, global outline planning, and the capacity to keep evolving after training is done.
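The diminishing-returns point can be sketched with a toy power-law loss curve (the constants below are made up for illustration and are not from the cited paper): each 10x jump in model size buys a smaller absolute improvement than the one before it.

```python
# Toy illustration of diminishing returns under a power-law scaling curve.
# The constants a and alpha are invented for this sketch, NOT taken from [0].
def loss(n_params: float, a: float = 10.0, alpha: float = 0.076) -> float:
    """Power-law loss: scaling parameters up shrinks loss by ever-smaller amounts."""
    return a * n_params ** -alpha

# Absolute improvement from each successive 10x increase in parameters.
sizes = [1e8, 1e9, 1e10, 1e11]
gains = [loss(s) - loss(s * 10) for s in sizes]

# Loss keeps falling, but each 10x step helps less than the previous one.
assert all(g_next < g_prev for g_prev, g_next in zip(gains, gains[1:]))
```

Under a curve like this, no amount of scale alone produces the linear (or better) improvement the argument above asks for.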
I'm 100% certain that I need to do more than just predict the next token to be considered intelligent. Also call me when ChatGPT can manipulate matter.
A human brain certainly does make predictions, which is very useful to the part that makes decisions. But how does a pure prediction engine make decisions? Make a judgement call? Analyze inconsistencies? Theorize? The best it can do is blindly follow the mob, a behavior we consider unintelligent even when done by human brains.
> But how does a pure prediction engine make decisions? Make a judgement call? Analyze inconsistencies? Theorize?
My intuition leads me to believe that these are emergent properties of large, complex prediction engines. A sufficiently good prediction/optimization engine can act in an agentic way without ever having had that explicit goal.
I'm of the belief that the entire conscious experience is a side effect of the need for us to make rapid predictions when time is of the essence, such as when hunting or fleeing. Otherwise, our subconscious could probably handle most of the work just fine.