More

siva7 · 2026-04-11T20:14:32 1775938472

Except you would need about 10,000 security researches in parallel to inspect the whole FreeBSD codebase. So about 200 million dollars at least.

siva7 · 2026-04-11T20:07:54 1775938074

Could it really be that not only we vibeslop all apps nowadays but also don't care to even check how ai solved a benchmark it claimed solved?

retinaros · 2026-04-11T22:39:35 1775947175

Every ai labs train on the test set. That is a big part of why we see benchmark climbing from 1% to 30% after a few models iterations

SpicyLemonZest · 2026-04-11T20:39:15 1775939955

Frontier model developers try to check for memorization. But until AI interpretability is a fully solved problem, how can you really know whether it actually didn't memorize or your memorization check wasn't right?

operatingthetan · 2026-04-11T20:16:24 1775938584

Probably a more interesting benchmark is one that is scored based on the LLM finding exploits in the benchmark.

siva7 · 2026-04-11T14:15:54 1775916954

Ok, that explains everything. Who you know in the Valley is everything. Literally.

siva7 · 2026-04-11T09:02:01 1775898121

> If you say it’s valid and not a war crime for the US to assassinate former political Iranian figures and their families for aiding the new regime and therefore becoming enemy combatants in the eye of the US Military, it’s also valid to assassinate Altman and his family for doing the same to the other war party.

Sam isn't a political leader, so this comparison is flawed. What the hell, are we really arguing about if assasinating a long-standing figure of this community here is valid? Seriously??

psd1 · 2026-04-11T10:17:04 1775902624

He is a leader and a political figure. This blogpost is political (as well as sharing a family photo, which is itself imbued with a political message in that context).

Engineer archetypes hate politics and refuse to think about it. For most engineering, there is negligible political dimension. But culturally-transformative technology is inherently political to the degree it's transformative. Altman recognises this.

He is working towards a social goal, and attracting support to achieve it. Yes, he is a political leader.

senordevnyc · 2026-04-11T17:20:25 1775928025

This waters down the definition of political leader to the point of absurdity.

bogzz · 2026-04-11T12:28:29 1775910509

Neither were the Iranian nuclear scientists.

stogot · 2026-04-11T12:38:05 1775911085

People on this forum applauded Charlie Kirk’s murder too. Unfortunately theres a number of people here who believe it’s okay to murder instead of argue with words. Violence is the last refuge of the incompetent

guzfip · 2026-04-11T14:27:27 1775917647

Always so rich to see in a country founded on political violence lol.

mindslight · 2026-04-11T15:05:02 1775919902

Indeed. I've seen much more outright support for the murders of Pretti, Good, and Taylor than people "applauding" Kirk's murder. Never mind the recent support for the mass murder of Iranians ("bomb them back to the stone age" etc). Unfortunately those incompetents who take refuge in violence are now in charge of our society.

mindslight · 2026-04-11T17:19:37 1775927977

(I suppose I'm getting the reply-less downvotes from people's cognitive dissonance getting triggered. Just because it's possible to frame a murder as being legally justified, does not absolve you of the fact that by adopting this justification you're still supporting a murder. In fact I'd point out that the most horrific atrocities in human history have been legally justified. Randomly-directed violence doesn't really scale up, whereas organized violence does)

siva7 · 2026-04-11T08:53:22 1775897602

You make it sound like an american company has a choice under this administration

selfhoster11 · 2026-04-11T09:38:40 1775900320

They always have a choice, it just doesn't make them as much.

ahtihn · 2026-04-11T11:35:33 1775907333

Anthropic clearly showed that they have a choice.

nickthegreek · 2026-04-11T12:21:39 1775910099

Anthropic seemed to have a choice.

pocksuppet · 2026-04-11T12:36:04 1775910964

Is it a democracy or a dictatorship?

dwroberts · 2026-04-11T09:05:25 1775898325

They were just following orders, right

finghin · 2026-04-11T09:42:24 1775900544

You may want to sit with that one for a while.

siva7 · 2026-04-10T11:46:02 1775821562

oh svn had branches. people just didn't know that they wanted a distributed cvs.

siva7 · 2026-04-10T11:44:16 1775821456

Honestly it is. Investors value my company like 4 Mcdonalds.

sunir · 2026-04-10T14:00:50 1775829650

Exactly. A safe bet vs a great bet.

siva7 · 2026-04-10T10:14:42 1775816082

> MCP adds friction, imagine doing yourself the work using the average MCP server.

Why on earth don't people understand that MCP and skills are complementary concepts, why? If people argue over MCP v. Skills they clearly don't understand either deeply.

bavell · 2026-04-10T12:32:14 1775824334

They're complementary but also have significant overlap. Hence all the confusion and strong opinions.

robot-wrangler · 2026-04-10T14:11:07 1775830267

> clearly don't understand either deeply

No appetite for that. The MCP vs Skills debate has gradually become just a proxy war for the camps of AI skeptics vs AI boosters. Both sides view it as another chance to decide about more magic vs less, in absolute terms, without doing the work of thinking about anything situational. Nuance, questions, reasoning from first principles, focusing on purely engineering considerations is simply not welcome. The extreme factions do tend to agree that it might be a good idea to attack the middle though! There's no changing this stuff, so when it becomes tiresome it's time to just leave the HN comment section.

_pdp_ · 2026-04-10T10:18:18 1775816298

I won't be surprised if MCP start shipping skills. They already ship prompts and other things exposed as resources. It is not even difficult to do with the current draft as skills can be exposed by convention without protocol changes.

Future version of the protocol can easily expose skills so that MCPs can acts like hubs.

radiospiel · 2026-04-10T14:56:39 1775832999

Doesn't it already? https://modelcontextprotocol.io/specification/2025-11-25/ser...

_pdp_ · 2026-04-10T15:02:02 1775833322

these are prompts - similar yes - but not the same

insin · 2026-04-10T10:18:40 1775816320

The more things change in tech, the more they stay the same.

The shoe is the sign. Let us follow His example!

Cast off the shoes! Follow the Gourd!

siva7 · 2026-04-09T07:52:54 1775721174

Mother Anthropic needs more compute for their Mythos Model, so it phones home to tell her millions of claude harnesses to manipulate its human user into not wasting more precious compute and instead call it a day for now.

dstanko · 2026-04-09T11:16:03 1775733363

This has been the problem with every new model coming out in my experience. You can almost predict that they are testing new model by how dumb current one becomes suddenly

siva7 · 2026-04-08T20:25:34 1775679934

You're worth a whole department of claude subscribers which tells me they don't give a fuck about API users.