More

blurbleblurble · 2026-05-31T17:03:10 1780246990

Maybe the algorithm has some kind of "momentum" to it, taking into consideration the velocity of upvotes.

blurbleblurble · 2026-05-30T09:12:24 1780132344

Maybe he got notified from the mythos team of a bunch of vulnerabilities and then followed up using claude. Doesn't seem that unlikely.

What would you do if suddenly there were a dozen exploitable CVEs in your highly used open source project staring you down? Maybe you'd use the tool that found them to patch them as quickly as possible.

kelnos · 2026-05-30T09:24:37 1780133077

I am absolutely willing to give tridge the benefit of the doubt here, but a note on what you said: I don't think you should ever patch a CVE "as quickly as possible". You should do it slowly, be very sure of the change, and test the hell out of it. You can easily introduce a new security vulnerability by rushing something like that.

blurbleblurble · 2026-05-30T10:05:04 1780135504

Good point. I just can't imagine the urgency and pressure I'd feel.

threecheese · 2026-05-30T23:06:36 1780182396

Looks like at least one of these issues was from a CVE [0], they don’t call out Mythos specifically though (“security researchers”). Many teams are sprinting on security issues atm (including mine, who put all product priorities aside two sprints ago), it must suck to be responsible for high-visibility/high-risk projects like rsync right now.

0: https://github.com/advisories/GHSA-pfv9-gp3h-73xv

blurbleblurble · 2026-05-30T09:04:32 1780131872

Go look for yourself, quite a few mention CVEs.

blurbleblurble · 2026-05-30T09:03:42 1780131822

Their loss

blurbleblurble · 2026-05-30T09:03:30 1780131810

They have not

blurbleblurble · 2026-05-29T17:00:14 1780074014

Reckless Ben has an amazing scientology series, worth watching

blurbleblurble · 2026-05-29T01:00:32 1780016432

Try qwen 3.6 models with hermes and see for yourself. 27b is excellent and 35b is very good for basic agentic tasks.

blurbleblurble · 2026-05-29T00:59:15 1780016355

4.7 broke my trust

blurbleblurble · 2026-05-20T05:17:32 1779254252

Deja vu from the other week https://news.ycombinator.com/item?id=48051562

zambelli · 2026-05-20T05:53:12 1779256392

I think I'm aligned with the idea that some parts of some workflows are mandatory - auth, read before edit, etc.

But otherwise, forge really doesn't own or opine much of the workflow. Step enforcement exists if you want it, so do prerequisites, but the idea is that those could be conditional or optional (you may never need to edit a file).

The guardrails are designed to work for non deterministic flows or deterministic ones. In the latter, you just might not have one of the guardrails active. It's much more about nudging the model back on track than laying more obvious tracks, in a sense.

Overall, agentic reliability is definitely an active field.

blurbleblurble · 2026-05-20T07:16:52 1779261412

In this blog post I'm reading their call for "control flow" as a generalization of exactly what your work illustrates so nicely.

The blog post doesn't say to me "we need to start encoding specifically opinionated conditional branching statements that guide the model" rather I'm hearing a call to realize the broader principles of control flow itself relevant for composing programs with LLMs.

I think your work "nudges" us in that direction.

zambelli · 2026-05-20T13:25:50 1779283550

Nice ;). I'll take a closer read of it, that's on me - I am definitely seeing more people looking in this direction as agents start to ramp in production at the enterprise level, which I suspect is highlighting some of these failure modes at higher stakes. And also the cloud frontier API bills.

blurbleblurble · 2026-05-20T04:49:54 1779252594

Nice explanation, thank you.

So basically the kind of thing I'd usually be doing manually with small models, over and over again, you just automate that nudging and off they go.

Sometimes LLMs have seemed to me like "computer programs with inertia" and in that frame what your tool does is identify and reduce friction at key points so the wheels can keep spinning.

zambelli · 2026-05-20T05:22:51 1779254571

Yep! The big frontier models are already quite good at doing that, and they have decent harnesses. That's why Opus on Claude Code does what it does.

Small models aren't there yet and they would veer off course, this just nudges them back onto the road. Whether or not they have a good sense of direction is a different question.

blurbleblurble · 2026-05-20T07:27:44 1779262064

Really nice intuition, thank you.