maxrumpf's comments

maxrumpf · 2026-06-02T15:38:31 1780414711

are you a maintainer on npm?

andrewzeno · 2026-06-02T15:41:26 1780414886

lol thankfully no. My GitHub is in bio

maxrumpf · 2026-05-12T17:55:47 1778608547

congratulations!

maxrumpf · 2026-03-27T03:16:50 1774581410

We think this is a pretty sad day for research: Some context about Chroma's model. https://x.com/maxrumpf/status/2037365748973384154?s=20

dominotw · 2026-03-27T13:14:55 1774617295

can you share actual links for your published research and theirs .

maxrumpf · 2025-12-08T23:43:00 1765237380

This is the tech report for a model I helped work on. I'm biased, but it turned out very well.

We essentially let the model learn to retrieve like a human would: Make a first search, read the results, and then make another. This lets the model be vastly better than pre-programmed pipelines. We test this extensively and compare against implementing this with API models (like Sonnet 4.5 and GPT-5.1). SID-1 compares favorably.

Happy to answer any questions or get feedback. First and foremost: Enjoy the read. It's much more detailed than most tech reports.

maxrumpf · on Aug 21, 2024

This comment section is scary. Hacker news advocating FOR nanny technology?!

maxrumpf · on Aug 20, 2024

Excel is awful in almost every way, but I just wish more software was as customizable.

I can get (even more) customization by using pandas etc., but it's usually much slower and you get much less of an intuition about the data.

maxrumpf · on July 18, 2024

I imagine color consistency will be such a pain here.

Retr0id · on July 18, 2024

I'd hope that per-pixel calibration would solve that, but I wonder how much that calibration would drift over time.

mensetmanusman · on July 18, 2024

Whatever the drift would be, inorganics would drift less than organic materials.

maxrumpf · on April 19, 2024

The weirdest thing people do is make up criteria that YC supposedly uses to reject people. There was such a huge diversity in our batch: From 20 y/o to 40+. Foreign, domestic. Credentialed, not credentialed. $1M rev run rate, $0 run rate. Just apply.

reagan83 · on April 20, 2024

This comment scares me that YC is desperate for applications now after burning through so many early stage founders for years. Has YC peaked?

maxrumpf · on April 7, 2024

The abstract and the rest of the paper don't really match imo. It's not really allocating more to some sequences, but just introducing ~dropout. Might be different sides to the same coin, but was still a weird read.

adamsantoro · on April 7, 2024

We spent a fair bit of effort ensuring we were accurate with the language and claims, so we're happy to take any feedback and make updates in subsequent versions. However, I don't see where we claim that MoD allocates more to some sequences and not others (specifically, the abstract says "transformers can instead learn to dynamically allocate FLOPs (or compute) to specific positions in a sequence".

That said, it's a pretty simple change to make the approach work in the way you describe (allocating more to some sequences and not others) by changing the group across which the top-k works. In the paper we use the time (sequence) dimension, but one could also use the batch * time dimension, which would result in asymmetric allocation across sequences

hackerlight · on April 8, 2024

Dropout is at train time this is at inference time. Dropout is random this is determined. Can't compare them.