Well, the first 90% is easy, the hard part is the second 90%. Case in point: Sel...

throwthrowuknow · 2026-01-07T13:14:46 1767791686

Even if Opus 4.5 is the limit it’s still a massively useful tool. I don’t believe it’s the limit though for the simple fact that a lot could be done by creating more specialized models for each subdomain i.e. they’ve focused mostly on web based development but could do the same for any other paradigm.

emodendroket · 2026-01-07T14:23:00 1767795780

That's a massive shift in the claim though... I don't think anyone is disputing that it's a useful tool; just the implication that because it's a useful tool and has seen rapid improvement that implies they're going to "get all the way there," so to speak.

bayindirh · 2026-01-07T13:18:08 1767791888

Personally I'm not against LLMs or AI itself, but considering how these models are built and trained, I personally refuse to use tools built on others' work without or against their consent (esp. GPL/LGPL/AGPL, Non Commercial / No Derivatives CC licenses and Source Available licenses).

Of course the tech will be useful and ethical if these problems are solved or decided to be solved the right way.

ForHackernews · 2026-01-07T13:25:49 1767792349

We just need to tax the hell out of the AI companies (assuming they are ever profitable) since all their gains are built on plundering the collective wisdom of humanity.

thfuran · 2026-01-07T13:38:47 1767793127

I don’t think waiting for profitability makes sense. They can be massively disruptive without much profit as long as they spend enough money.

encyclopedism · 2026-01-07T15:15:25 1767798925

AI companies and corporations in general control your politicians so taxing isn't going to happen.

literalAardvark · 2026-01-07T13:26:44 1767792404

They're not blenders.

This is clear from the fact that you can distill the logic ability from a 700b parameter model into a 14b model and maintain almost all of it.

You just lose knowledge, which can be provided externally, and which is the actual "pirated" part.

The logic is _learned_

encyclopedism · 2026-01-07T15:17:06 1767799026

It hasn't learned any LOGIC. It has 'learned' patterns from the input.

theshrike79 · 2026-01-07T21:49:45 1767822585

What is logic other than applying patterns?

encyclopedism · 2026-01-07T22:48:52 1767826132

The definition is broad for now this will do: Logic is the study of correct reasoning.

vidarh · 2026-01-08T14:27:16 1767882436

How is that different from applying patterns?

bayindirh · 2026-01-07T13:31:24 1767792684

Are there any recent publications about it so I can refresh myself on the matter?

D-Machine · 2026-01-07T15:21:46 1767799306

You won't find any trustworthy papers on the topic because GP is simply wrong here.

That models can be distilled has no bearing whatsoever on whether a model has learned actual knowledge or understanding ("logic"). Models have always learned sparse/approximately-sparse and/or redundant weights, but they are still all doing manifold-fitting.

The resulting embeddings from such fitting reflect semantics and semantic patterns. For LLMs trained on the internet, the semantic patterns learned are linguistic, which are not just strictly logical, but also reflect emotional, connotational, conventional, and frequent patterns, all of which can be illogical or just wrong. While linguistic semantic patterns are correlated with logical patterns in some cases, this is simply not true in general.

mcfedr · 2026-01-07T14:22:29 1767795749

i like to think of LLMs as random number generators with a filter

rat9988 · 2026-01-07T12:53:18 1767790398

> Well, the first 90% is easy, the hard part is the second 90%.

You'd need to prove that this assertion applies here. I understand that you can't deduce the future gains rate from the past, but you also can't state this as universal truth.

bayindirh · 2026-01-07T13:30:13 1767792613

No, I don't need to. Self driving cars is the most recent and biggest example sans LLMs. The saying I have quoted (which has different forms) is valid for programming, construction and even cooking. So it's a simple, well understood baseline.

Knowledge engineering has a notion called "covered/invisible knowledge" which points to the small things we do unknowingly but changes the whole outcome. None of the models (even AI in general) can capture this. We can say it's the essence of being human or the tribal knowledge which makes experienced worker who they are or makes mom's rice taste that good.

Considering these are highly individualized and unique behaviors, a model based on averaging everything can't capture this essence easily if it can ever without extensive fine-tuning for/with that particular person.

enraged_camel · 2026-01-07T14:08:21 1767794901

>> No, I don't need to. Self driving cars is the most recent and biggest example sans LLMs.

Self-driving cars don't use LLMs, so I don't know how any rational analysis can claim that the analogy is valid.

>> The saying I have quoted (which has different forms) is valid for programming, construction and even cooking. So it's a simple, well understood baseline.

Sure, but the question is not "how long does it take for LLMs to get to 100%". The question is, how long does it take for them to become as good as, or better than, humans. And that threshold happens way before 100%.

bayindirh · 2026-01-07T14:40:59 1767796859

>> Self-driving cars don't use LLMs, so I don't know how any rational analysis can claim that the analogy is valid.

Doesn't matter, because if we're talking about AI models, no (type of) model reaches 100% linearly, or 100% ever. For example, recognition models run with probabilities. Like Tesla's Autopilot (TM), which loves to hit rolled-over vehicles because it has not seen enough vehicle underbodies to classify it.

Same for scientific classification models. They emit probabilities, not certain results.

>> Sure, but the question is not "how long does it take for LLMs to get to 100%"

I never claimed that a model needs to reach a proverbial 100%.

>> The question is, how long does it take for them to become as good as, or better than, humans.

They can be better than humans for certain tasks. They are actually better than humans in some tasks since 70s, but we like to disregard them to romanticize current improvements, but I don't believe current or any generation of AIs can be better than humans in anything and everything, at once.

Remember: No machine can construct something more complex than itself.

>> And that threshold happens way before 100%.

Yes, and I consider that "treshold" as "complete", if they can ever reach it for certain tasks, not "any" task.

rat9988 · 2026-01-07T17:59:48 1767808788

Self driving cars is not a proof. It only proves that having quick gains doesn't mean necessarily you'll get a 100% fast. It doesn't prove it will necessarily happen.

damethos · 2026-01-07T19:34:49 1767814489

"covered/invisible knowledge" aka tacit knowledge

bayindirh · 2026-01-07T19:40:55 1767814855

Yeah, I failed to remember the term while writing the comment. Thanks!

thfuran · 2026-01-07T13:41:20 1767793280

>None of the models (even AI in general) can capture this

None of the current models maybe, but not AI in general? There’s nothing magical about brains. In fact, they’re pretty shit in many ways.

bayindirh · 2026-01-07T13:47:43 1767793663

A model trained on a very large corpus can't, because these behaviors are different or specialized enough they cancel each other most of the cases. You can forcefully fine-tune a model with a singular person's behavior up to a certain point, but I'm not sure that even that can capture the subtlest of behaviors or decision mechanisms which are generally the most important ones (the ones we call gut feeling or instinct).

OTOH, while I won't call human brain perfect, the things we label "shit" generally turn out to be very clever and useful optimizations to workaround its own limitations, so I regard human brain higher than most AI proponents do. Also we shouldn't forget that we don't know much about how that thing works. We only guess and try to model it.

Lastly, searching perfection in numbers and charts or in engineering sense is misunderstanding nature and doing a great disservice to it, but this is a subject for another day.

emodendroket · 2026-01-07T14:27:40 1767796060

The understanding of the brain is far from complete whether they're "magical" or "shit."

D-Machine · 2026-01-07T15:32:37 1767799957

Also obviously brains are both!

sanderjd · 2026-01-07T13:07:17 1767791237

I read the comment more as "based on past experience, it is usually the case that the first 90% is easier than the last 10%", which is the right base case expectation, I think. That doesn't mean it will definitely play out that way, but you don't have to "prove" things like this. You can just say that they tend to be true, so it's a good expectation to think it will probably be true again.

rybosworld · 2026-01-07T14:04:10 1767794650

The saying is more or less treated as a truism at this point. OP isn't claiming something original and the onus of proving it isn't on them imo.

I've heard this same thing repeated dozens of times, and for different domains/industries.

It's really just a variation of the 80/20 rule.