More

runeblaze · 2026-06-22T15:51:35 1782143495

tbh the summarized thinking with encrypted raw thinking is there for many purposes; it is there to:

1. make distillation much harder

2. safety: prevent modifications to the thinking leading to injection attacks.

3. also honestly sometimes the model raw thoughts can be deranged and is not a good user experience (consider the varied audience in the market, etc.)

also often the mass underestimate/the model makers over-estimate how people love distilling models

soisses · 2026-06-25T14:14:57 1782396897

The reasoning blocks are only temporarily part of the context. They aren't part of the context in the next turn anymore so (2) wouldn't really be an issue.

runeblaze · 2026-06-22T04:07:11 1782101231

links to two papers with at least enough apparent quality and novelty to get into ICLR 2026

> So basically... openrouter

:skull:

i now really wonder how many people of the public understood my thesis defense lol

runeblaze · 2026-04-10T00:02:43 1775779363

> visual similarity

> SigLIP 2

Maybe visual-semantic similarity is more appropriate? Nonetheless the design is fantastic

meodai · 2026-04-10T03:14:09 1775790849

True, thanks for the feedback

ghywertelling · 2026-04-10T11:05:03 1775819103

One future project idea suggestion. Can we combine these characters to create new ones just like Gboard allows us to intelligently combine emojis to create new complex emojis.

runeblaze · 2026-04-08T07:38:52 1775633932

I mean I used to work on model reliability with my little PhD degree and the models i manage go down all the time.

Some profs have a team of PhDs and things go to shit all the time. I don’t know why we expect $FRONTIER_LLM to do better

al_borland · 2026-04-08T13:40:53 1775655653

We accept human errors, limitations, and failures. We can empathize with team of humans doing the best they can, and we know any failure is a chance for them to learn and grow.

The sales pitch of AI is that it’s better than humans and has no real limits; it will make us all obsolete. This framing they created means I expect it not to make errors, not to have limits, and not to fail. I expect it to be able to learn and adapt at the speed of light and solve complex problems beyond what a PhD could do. This is what we’ve been told with the narratives around future jobs, AI performance on PhD level tests, how coding is a solved problem, and pictures painted of what a future with AI will look like. While we may know this isn’t true, this is what they are selling, and that’s the standard I’m going to hold them to.

I don’t blame the customer for being upset the snake oil didn’t live up to its promises, I blame the snake oil salesman. We have every right to be upset with the snake oil salesman and ridicule him when his product doesn’t work. Maybe we don’t need better more reliable snake oil, maybe we need real medicine. If real medicine don’t exist, its better to be honest than to mislead people and say it does.

This isn’t to say AI is completely useless, but it’s not what’s being sold. The downtime just proves that, unless they aren’t using their own product. If that’s the case, why not?

runeblaze · 2026-02-27T17:21:26 1772212886

1. openrouter is API usage. There is obviously consumer side

2. people often use openrouter for the sole purpose of using a unified chat completions API

3. OpenAI invented chat completions; if you use openrouter for chat completions often you can just switch your endpoint URL to point to the OAI endpoint to avoid the openrouter surcharge!

4. Hence anyone with large enough volume will very likely not use openrouter for OpenAI; there is an active incentive to take the easy route of changing the endpoint URL to OAI’s

runeblaze · 2026-01-17T06:12:53 1768630373

Schemas can get pretty complex (and LLMs might not be the best at counting). Also schemas are sometimes the first way to guard against the stochasticity of LLMs.

With that said, the model is pretty good at it.

runeblaze · 2026-01-11T02:09:47 1768097387

Is it though? There is a reason gpt has codex variants. RL on a specific task raises the performance on that task

jjmarr · 2026-01-11T02:30:42 1768098642

Post-training doesn't transfer over when a new base model arrives so anyone who adopted a task-specific LLM gets burned when a new generational advance comes out.

runeblaze · 2026-01-11T17:35:59 1768152959

Resouce-affording, if you are chasing the frontier of some more niche task you redo your training regime on the new-gen LLMs

runeblaze · 2026-01-02T22:05:51 1767391551

I think radioactive is a strong word here… I have talked to a lot of people in tech

lol768 · 2026-01-02T22:08:19 1767391699

I don't. YouGov's data suggests 77% of the UK populace has a negative view of the brand. Musk has destroyed its credibility.

Analemma_ · 2026-01-02T22:32:11 1767393131

Obviously we're just dueling anecdotes here, but FWIW, I'm a US tech worker who bought a Tesla in 2022 and certainly never will again. I have four friends with Teslas in tech and all of them say the same thing: never again. Replacement cycles for cars are so long that this will take a while to fully show up in the data, but I don't see growth anywhere in their future, especially when BYD is eating their lunch in seemingly every non-US market.

runeblaze · 2026-01-03T01:09:55 1767402595

Sure never again is totally fair and I am sure a lot of people hate it. I was mostly objecting to the radioactivity of it. Your friends will be more like “I am looking to sell my Tesla in 3 months” if it is truly radioactive.

Let’s be realistic in our portrayal here.

keeda · 2026-01-03T02:07:03 1767406023

Unfortunately, Tesla resale values have also plummeted, so even if people wanted to sell them desperately it may not be a financially sensible decision.

Personally, as a Tesla owner I'm concerned that if my car gets totalled I'll get pretty lowballed on the insurance settlement.

Marsymars · 2026-01-05T02:22:09 1767579729

> Personally, as a Tesla owner I'm concerned that if my car gets totalled I'll get pretty lowballed on the insurance settlement.

The kinda obvious answer there is to use your insurance settlement to buy another highly-depreciated Tesla. Insurance settlements are intended to let you get a comparable replacement as determined by market value. (The alternative is that if your Tesla gets totalled, it's a get-out-of-jail-free card to get a non-Tesla.)

bpt3 · 2026-01-02T22:54:28 1767394468

Tech workers weren't their core market, upper-middle to upper class liberals in major metro areas were.

Sales to that demographic are approximately zero and will remain there until every shred of Elon is removed from the company's fabric.

runeblaze · 2025-12-31T19:42:38 1767210158

Reading what you wrote scares me

runeblaze · 2025-12-26T09:58:02 1766743082

> And if for some ungodly reason you had to do it in Python

I literally invoke sglang and vllm in Python. You are supposed to (if not using them over-the-network) use the two fastest inference engines there is via Python.