More

rohitpaulk · 2025-12-17T17:09:40 1765991380

Wild how this beats 2.5 Pro in every single benchmark. Don't think this was true for Haiku 4.5 vs Sonnet 3.5.

FergusArgyll · 2025-12-17T17:28:37 1765992517

Sonnet 3.5 might have been better than opus 3. That's my recollection anyhow

rohitpaulk · 2025-11-27T09:17:14 1764235034

For those who haven't seen the actual issue that links to, I'd take a closer look. Pretty insane: https://github.com/actions/runner/issues/3792#issuecomment-3...

creato · 2025-11-27T09:48:43 1764236923

Yeah, calling the authors of this code losers and monkeys is being kind. There is zero excuse for ever writing code like this, the incompetence is staggering.

rohitpaulk · 2025-10-01T17:34:20 1759340060

Always felt dependency updates are a perfect fit for AI agents:

(a) they’re broadly similar across companies,

(b) they aren’t time-sensitive, so the agent can take hours without anyone noticing, and

(c) customers are already accustomed to using bots here, just bad ones

XiZhao · 2025-10-01T17:40:39 1759340439

One would imagine they are broadly similar; but that's off the assumption that codebases are similar as well.

Migrations between versions can have big variance largely as a function of the parent codebase and not the dependency change. A simple example of this would be a supported node version bump. It's common to lose support for older node runtimes with new dependency versions, but migrating the parent codebase may require large custom efforts like changing module systems.

rohitpaulk · 2025-07-22T21:02:42 1753218162

I noticed this with Harvey / Legal.

rohitpaulk · 2025-06-10T19:40:59 1749584459

Where do I buy one?

timmg · 2025-06-10T19:45:48 1749584748

https://vassarrobotics.com/

lucidrains · 2025-06-10T19:43:14 1749584594

Love to know as well!

dang · 2025-06-10T19:46:20 1749584780

Link added above :)

rohitpaulk · on Dec 2, 2024

- Contribute to OSS - Do CodeCrafters

rohitpaulk · on July 30, 2024

Great content! Sidenote for the Fly team: on mobile, the “sidenote” cards appear in the wrong order - they appear before the content instead of after.

tptacek · on July 30, 2024

Yeah, we need to tweak them; they were designed for a wider browser window than a lot of people use. If they're not rendered as actual sidenotes, I think we should make them footnotes or popups.

mwcampbell · on July 31, 2024

The sidenotes also come first when reading with a screen reader, and there's no indication that they're sidenotes.

On the other hand, nice job with the alt text on the packet diagram. Maybe it sucks that the diagram itself can't be accessible, but I think you did the right thing in this case.

tptacek · on Aug 1, 2024

This has been a peeve of mine for awhile; we're going to "demote" them to pop-up footnotes when there isn't enough screen width to make them sidenotes. (In either case, you're better off than when they're in the main flow of the text).

Thanks!

rohitpaulk · on July 6, 2024

Not exactly what you described, but https://labs.iximiuz.com/ is similar

dbglog · on July 6, 2024

thanks! I'll check it out

fefferkorn · on July 6, 2024

interessting, thanks!

rohitpaulk · on June 10, 2024

Still involves friction. A more "seamless" way for Apple to do this would've been to license GPT-4's weights from OpenAI and run it on Apple Intelligence servers.

asadm · on June 10, 2024

but that restricts it to just openai then.

I want to use perplexity from siri too!

rohitpaulk · on June 6, 2024

This is a great idea. We’ve used Storybook & Chromatic in the past to cook up something similar for our front-end apps. There was quite a bit of setup work involved though.

How does integrating this into a web app work? Do you hook into an existing test suite? Record sessions on staging?

irrationaljared · on June 6, 2024

This is all automated. It's a github app that tracks changes as they come in, analyzes and simulates them and creates simulations that can be accessed via the CodeYam website, integrated into a CI workflow, or, hopefully soon, leveraged in developer workflows.

In this way there's no set up or maintenance required (or very minimal). It updates simulations as changes are made so everything is always up-to-date.

It automatically generates a number of scenarios (data that is fed into the simulation), but you can define new scenarios through the website as well (or describe a scenario for the AI to generate).

rohitpaulk · on June 6, 2024

Definitely curious about how this works - signed up!

irrationaljared · on June 6, 2024

Awesome, we'll reach out soon!