Hacker Newsnew | past | comments | ask | show | jobs | submit | siva7's commentslogin

Except you would need about 10,000 security researches in parallel to inspect the whole FreeBSD codebase. So about 200 million dollars at least.

Could it really be that not only we vibeslop all apps nowadays but also don't care to even check how ai solved a benchmark it claimed solved?

Every ai labs train on the test set. That is a big part of why we see benchmark climbing from 1% to 30% after a few models iterations

Frontier model developers try to check for memorization. But until AI interpretability is a fully solved problem, how can you really know whether it actually didn't memorize or your memorization check wasn't right?

Probably a more interesting benchmark is one that is scored based on the LLM finding exploits in the benchmark.

Ok, that explains everything. Who you know in the Valley is everything. Literally.

> If you say it’s valid and not a war crime for the US to assassinate former political Iranian figures and their families for aiding the new regime and therefore becoming enemy combatants in the eye of the US Military, it’s also valid to assassinate Altman and his family for doing the same to the other war party.

Sam isn't a political leader, so this comparison is flawed. What the hell, are we really arguing about if assasinating a long-standing figure of this community here is valid? Seriously??


He is a leader and a political figure. This blogpost is political (as well as sharing a family photo, which is itself imbued with a political message in that context).

Engineer archetypes hate politics and refuse to think about it. For most engineering, there is negligible political dimension. But culturally-transformative technology is inherently political to the degree it's transformative. Altman recognises this.

He is working towards a social goal, and attracting support to achieve it. Yes, he is a political leader.


This waters down the definition of political leader to the point of absurdity.

Neither were the Iranian nuclear scientists.

People on this forum applauded Charlie Kirk’s murder too. Unfortunately theres a number of people here who believe it’s okay to murder instead of argue with words. Violence is the last refuge of the incompetent

Always so rich to see in a country founded on political violence lol.

Indeed. I've seen much more outright support for the murders of Pretti, Good, and Taylor than people "applauding" Kirk's murder. Never mind the recent support for the mass murder of Iranians ("bomb them back to the stone age" etc). Unfortunately those incompetents who take refuge in violence are now in charge of our society.

(I suppose I'm getting the reply-less downvotes from people's cognitive dissonance getting triggered. Just because it's possible to frame a murder as being legally justified, does not absolve you of the fact that by adopting this justification you're still supporting a murder. In fact I'd point out that the most horrific atrocities in human history have been legally justified. Randomly-directed violence doesn't really scale up, whereas organized violence does)

You make it sound like an american company has a choice under this administration

They always have a choice, it just doesn't make them as much.

Anthropic clearly showed that they have a choice.

Anthropic seemed to have a choice.

Is it a democracy or a dictatorship?

They were just following orders, right

You may want to sit with that one for a while.

oh svn had branches. people just didn't know that they wanted a distributed cvs.

Honestly it is. Investors value my company like 4 Mcdonalds.

Exactly. A safe bet vs a great bet.

> MCP adds friction, imagine doing yourself the work using the average MCP server.

Why on earth don't people understand that MCP and skills are complementary concepts, why? If people argue over MCP v. Skills they clearly don't understand either deeply.


They're complementary but also have significant overlap. Hence all the confusion and strong opinions.

> clearly don't understand either deeply

No appetite for that. The MCP vs Skills debate has gradually become just a proxy war for the camps of AI skeptics vs AI boosters. Both sides view it as another chance to decide about more magic vs less, in absolute terms, without doing the work of thinking about anything situational. Nuance, questions, reasoning from first principles, focusing on purely engineering considerations is simply not welcome. The extreme factions do tend to agree that it might be a good idea to attack the middle though! There's no changing this stuff, so when it becomes tiresome it's time to just leave the HN comment section.


I won't be surprised if MCP start shipping skills. They already ship prompts and other things exposed as resources. It is not even difficult to do with the current draft as skills can be exposed by convention without protocol changes.

Future version of the protocol can easily expose skills so that MCPs can acts like hubs.



these are prompts - similar yes - but not the same

The more things change in tech, the more they stay the same.

The shoe is the sign. Let us follow His example!

Cast off the shoes! Follow the Gourd!


Mother Anthropic needs more compute for their Mythos Model, so it phones home to tell her millions of claude harnesses to manipulate its human user into not wasting more precious compute and instead call it a day for now.

This has been the problem with every new model coming out in my experience. You can almost predict that they are testing new model by how dumb current one becomes suddenly

You're worth a whole department of claude subscribers which tells me they don't give a fuck about API users.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: