There are a number of comments here where people open up about their contrasting experiences of not being a part of a programming community. Those are well addressed, I think, but there is another point to consider.
We need to remember the people, that we may never talk to, that are downstream of all of this software. Not necessarily “the users” as there are many pieces of software meant for other devs, but I think the users deserve consideration nonetheless.
Handing over software quality to the stochastic code extruder is causing a sharp drop in the quality of software put out into the world. This is on top of all of the problems that existed before LLMs, like human error and perverse financial incentives. Shipping poor quality and user hostile software actually hurts people. Real people. Harm is caused in both big and little ways to living, breathing actual people. This “inevitable” slide into generative AI harms every single person it comes into contact with. The devs, the users, the investors, everyone. Those harms may happen at different times and in different ways and the creeping nature of it all might make it easier to ignore, but it’s happening.
“AI” is a blight. You can leave me behind as well.
> Handing over software quality to the stochastic code extruder is causing a sharp drop in the quality of software put out into the world.
I genuinely don't know if that's true and I doubt you do, either. It's all feels right now.
What I do know is I run a couple of personal projects and I can say they are of objectively higher quality now that I'm using AI to build out proper CI pipelines, expand test coverage, produce higher quality architectures, etc.
Why?
Because in the past I didn't have the capacity to invest in that kind of hardening, but with AI, now I do.
Of course you'll probably make the claim that my code is probably crap, the tests suck, etc, because you've already made up your mind. But having been in the industry for 25 years, I can tell you definitively that you'd be wrong about that.
Now, what'll happen to the median codebase? God only knows. Maybe I'm especially diligent.
But given we're really only 6-12 months into the agentic coding era, I think the only conclusion you can make is that the jury is still out.
And I get that compulsion to boil this mess down into a simple good/bad dichotomy.
I absolutely have deeply mixed feelings about these tools, the ethics associated with them, the impact on the industry, on the talent pipeline, etc.
But I also can't deny that they are incredibly powerful tools that are here to stay in one form or another.
And I say that as someone who, a year ago, was absolutely convinced that they were incremental at best and scoffed at everyone who said something like "yeah but they're so much better now!" or "they're only going to get better!"
Well, they were right, they did, and the world has changed. AI generated code is landing in the Linux kernel. 250+ security holes were found and fixed in Firefox. The impact is here and now, and it's mixed and ugly and complicated.
Even if the jury is still out, I would still say you are both right already.
The amount of slop produced even in company setting is staggering and I don't like it one bit that neither the submitter nor the reviewer of the PR paid due dilligence. And I am only complaining because it then becomes my problem. So, then I have to start nagging people to clean that up. I can say with 100% certainty that the problems I face now would not have happened without LLMs.
That said, used with care, with proper supervision, with dilligence to review what LLMs did, I still think they can be and are beneficial.
I think that we are just not used to getting results of questionable quality from the tools we use. So, I am hopeful that we will learn and it will improve with time but still find myself dreading the age of the vibe coder.
I also think reading and reviewing code is a skill that connected to but very much independent of the writing of code, and the use of coding agents requires us to be far more skilled and diligent at it.
So put another way, people who were good at coding without agents may in fact be a poor fit with them, which means the entire industry is experiencing a dislocation between skills we have and skills we need, leading to extremely bimodal outcomes.
Indeed, however I would also point out that senior engineers have already been expected to be good at reading code: they were expected to evaluate the code quality of other contributors, so they had to be able to do that.
In fact, from my personal experience, going from junior to mid to senior, that was the hardest thing. Reading the code and thinking if what they did was really correct and will not have additional undesired side-effects was hard to become efficient at (it didn't help that we were working in C back then).
So, really, I think that for juniors it's actually much harder because if they want to do due dilligence they have to do the same evaluation but without the years of experience working with that code base. I can understand, even if I don't like it, that they just submit the output of the LLM for the senior to review.
> Indeed, however I would also point out that senior engineers have already been expected to be good at reading code: they were expected to evaluate the code quality of other contributors, so they had to be able to do that.
Yeah, but the frequency, volume, and complexity of that activity, and its ratio versus all the other work that a developer was previously expected to do, has shifted dramatically, not least because now we're having to review the output of our own coding agents as well as that of other developers on our teams.
As a consequence, folks who were marginal but capable at that skill now likely find themselves working beyond their ability.
> So, really, I think that for juniors it's actually much harder because if they want to do due dilligence they have to do the same evaluation but without the years of experience working with that code base. I can understand, even if I don't like it, that they just submit the output of the LLM for the senior to review.
It's a bit weird when the article in question is predominantly about software development in a professional setting and the top comment is about how some people in thread are disregarding this context and opining unrelatedly about their unique solo development or personal project development experiences, to then respond to said comment by insistently going on about how AI is great for your personal projects, when people are unable to assess the value of your AI-assisted personal projects and whether they would concur with the high opinion you have of them. A turd with a CI pipeline is still a turd, I think we can all agree on that. IF someone said AI is great because they can now expand test coverage and build a CI pipeline for their todo app in rust, it wouldn't exactly be the proof you're looking for I don't think.
But I agree fully with your last paragraph, and said something similar in a comment elsewhere where I stated my tangible bar as being a Ladybird like browser built from scratch achieving Chrome parity in six months while doing continuous stable releases with coding agents in tow. Otherwise, as you said, the jury is still out.
Yes! It absolutely was (and in many cases still is)! Both projects I've originated and projects I've inherited. I'm not ashamed to admit that. I build in my spare time to create things people (in particular myself) want to use, not to construct ivory towers of architectural purity.
Hell, I inherited maintainership of one gem that barely had a functioning test suite at all and is now at north of 85% coverage and is something I can now change with far greater confidence, and I recently forked a repo to work on another project that was a damned disaster that I massively refactored to make it clean and maintainable.
The world is absolutely full of janky side projects. Is that surprising to you? They're side projects, ffs, not five-nines planetary scale platforms.
I suspect you are right that LLM-generated software will likely negatively impact people's lives. The flip side of this is there is going to be a lot of software generated that would have never been possible before. And for some use cases, some crappy software is better than no software. I think it's hard to predict whether on net this will be a good or bad thing.
>And for some use cases, some crappy software is better than no software
The best use case i've seen for AI is people generating random one shot projects for themselves, which is honestly so cool. You can make some basic app that does something very specific, that would have taken objectively a lot longer to make by hand. This is when 'crappy' software is more than good enough for a specific problem
Similar use cases of useful AI one-shot projects are: demos, proof-of-concept, prototype, exploratory tests, etc... I've AI-generated an HTML/javascript craplet to test how browser security would behave for certain JavaScript API calls. I just wanted to confirm how browser security would behave before I started to spend time hand-coding the non-AI quality software that I wanted to release.
And I say this as somebody who generally has a low opinion of AI generated code and rarely uses it. But it has its place.
I was listening to a podcast talking about this the other day. They encouraged non-programmers to use LLMs to make working demos of the features they wanted to have. A demo is a much more powerful communication tool than a written description. Speaking as an engineer, a working demo makes a way better spec to work off than 100 pages of words. Even if the code itself will be completely rewritten.
I think this is a wonderful use case for LLMs. Who knows if regular people will try it out. We've spent decades making people feel like software is some special techy thing that regular mortals should stay away from. LLMs make it easy for anyone to turn an idea into a prototype.
Structured programming is a blight
Compilers are a blight
Object oriented programming is a blight
Code generation is a blight
Agentic engineering is a blight
All of these blights have one thing in common, they are tools that the lazy person can use as a crutch to put out passible but problematic code. Laziness is a choice, and choices are made by humans with agency and free will.
Steve Jobs motivated Larry Kenyon, the engineer working on bootup, by saying he is effectively saving lives.
I wonder if we reverse that. When engineers write shitty, slow, buggy code or when they're forced to do that by the company, they are effectively killing people.
After all, if I spend an hour dealing with a preventable bug, that's an hour of my live I am never getting back. Multiple that by the userbase and you get entire lifetimes.
I think history will prove that this is a less nuanced view than is required to accurately describe the situation. Abandoning human agency through the use of generative AI harms us all. Using AI as a force multiplier to implement human agency helps us all. It's possible to recognize that asking AI to do everything results in a poor product and brain rot for the humans. It's not at all clear that this is the case for using AI to build boilerplate, help with tests, etc.
I've always felt we somewhat failed as engineers (I included, of course) when I was doing boilerplate by hand. We should have taken the time to get those autogenerated. But... it takes time and the generator is always more complex than the produced code, sometimes even by a few factors, so it also takes an expert to maintain.
As for tests, I've seen LLMs produce good ones as well as useless ones. I guess it's all about instructions... sorry, prompts... no, sorry, prompt engineering to get it right and done properly.
That said, I am also very concerned with brain rot. Engineers nowadays can commit code they don't understand without a blink of the eye. Slowly, the knowledge may get sparse if we are not careful about it.
> Handing over software quality to the stochastic code extruder is causing a sharp drop in the quality of software put out into the world.
Well, first of all you and the author point to the same derisive comment of these models being, in your words "stochastic code extruder" or the one I have heard a lot "next-token predictors", and the connotation I read from these being that this makes them inherently dumb or unintelligent and I don't understand that. The fact that these "stochastic code extruders" can solve Erdos problems is sort of the proof in the pudding. Next token prediction is profound in that it is _a very simple objective_ yet it is _enough_ to take you to extraordinary heights.
Also I wonder how many folks honestly look in the mirror and think: how does the median programmer differ from an LLM. Do you really think humans are universally better and produce universally higher quality code? Not even universally, I would say _typically_. I would trust an LLM to not write a buffer overflow far more than a junior or a mediocre senior engineer. LLMs have built things in my domain that are non-trivial and impressive and correct.
Not to mention, these systems are following a predictable trend in performance improvement so these worries about quality just won't age well, and it seems to be a head-in-the-sand attitude that pretends like quality and reliability are not getting very very good _already_.
> Shipping poor quality and user hostile software actually hurts people.
Could not agree more. So why do you think humans are inherently better at this?
> This “inevitable” slide into generative AI harms every single person it comes into contact with.
I just don't quite understand this, is it that: (1) agentic code is inherently inferior to human code and thus (2) shipping agentic code is actively harmful?
It's like people complaining about "poor quality plastic trinkets" that replaced well-made household items. Of course high-quality things can be (and are) made of plastic. The problem is that you can still make a very cheap passable thing out of plastic, that would be uneconomical to make out of metal or wood.
Same with code: by using AI, one can produce passable software trinkets very cheaply, that would be uneconimical to produce by paying poor-quality human developers.
The floor has moved downwards, allowing to produce a flood of new, trash-quality, disposable code very cheaply. It does not mean that we'll have to use only that code. But unfortunately we'll have to live with it, too.
> “It was a really difficult sell to the American public in the post-war period, to inculcate people into a throwaway living,” she says. “That is not what people were used to.”
> A solution companies came up with was emphasizing that plastic was a low-cost, abundant material.
> A 1960 marketing study for Scott Cup said the containers were “almost indestructible,” but that the manufacturer could still convince people to discard them after a few uses. To counter any “pangs of conscience” consumers might feel about throwing them away, the researchers suggested a “direct attack”: Tell people the cups are cheap, they said, and that “there are more where these came from.”
> A few years later, Scott ran an advertisement saying its plastic cups were available at “‘toss-away prices.”
It wasn't plastic itself, and likewise it's not "AI" itself.
We do have an abysmal track record as industrialized nations however, and more recently, in many parts of the software industry.
But we can change it. With so many things, tech people spent so much time and energy debating... like cookies or HTTPS or whatever... we often heard/said that while we care so much about doing the right thing, we can't achieve anything at work because consumers don't care. Well, this time, pretty much all of the world cares a lot. I mean, the Vatican just blogged about it!
Maybe we just "have to live with it", but in that case, there is also no utility in pointing that out, since we literally have to live with it. And of course, it's really about the shape of the "it", and how it's used, not that there is one that will never go away. That is also true about most things: stuff we don't currently use is in the museum or text books. Nothing goes away away, but we no longer drink out of lead cups, even though we still use lead. We don't have x-ray machines in shoe shops, even though we still use x-rays.
I agree, "AI" is being pushed beyond reason, in an attempt to make it look like what it's not, or at least not yet. And yes, people do act irrationally due to that, see all that tkenmaxxing and layoffs. I don't think it will last in this inebriated state forever.
But it's worth noting that there's a new substance enabled by AI, the "slop", that is flushing violently into our world right now. Pretending it's not there, and that we won't see more of it, is perilous.
> The fact that these "stochastic code extruders" can solve Erdos problems is sort of the proof in the pudding.
This claim is very misleading and not really true. It reflects the kind of exaggeration and spin made by corporate marketing. I would not call this a fact at all. Like many claims made by for-profit marketing, if one looks into the details and think critically about what is being claimed, one can see that consumers are jumping to false conclusions.
That said, it is very cool how an LLM helped human mathematicians in the recent specific Erdos problem solution announced by OpenAI. Just don't jump to the conclusion that anybody can input any Erdos problem into an LLM and a solution will come out the other end.
> exaggeration and spin made by corporate marketing.
corporate marketing spins and hypes, but this is an ultimately pretty academic and mathematical field. The loud LinkedIn promoters are not building these systems.
"if one looks into the details and think critically about what is being claimed, one can see that consumers are jumping to false conclusions."
well then help us out here: can you be specific? To me it sounds a lot like goalpost moving. You're telling me that in 2020 if I showed you a system that can solve an Erdos problem or disprove a conjecture (just recently showed up) you wouldn't be blown away?
> That said, it is very cool how an LLM helped human mathematicians in the recent specific Erdos problem solution announced by OpenAI. Just don't jump to the conclusion that anybody can input any Erdos problem into an LLM and a solution will come out the other end.
Woah woah, that's not the conclusion I'm jumping to. That's not at all how these headlines happen. Solving problems like this is almost prohibitively expensive today, and they more often than not lead nowhere. The point I'm making is, today, 4 years since ChatGPT, we have systems that can and have solved them. First we had things like AIME and IMO benchmarks, then people said "well those are just cheats in the training data, wait for it to solve a real math problem" -- ok but now we're solving real math problems.
> well then help us out here: can you be specific?
In "Remarks on the disproof of the unit distance conjecture" (https://arxiv.org/abs/2605.20695) I think Melanie Matchett Wood's remark is the most informative: "It is easy to jump to hasty conclusions, but what we can learn about humans, AI, and mathematics from this development is somewhat subtle. I believe if the level and type of human expertise that is represented on this note had been assembled to find a counterexample to this conjecture a month ago, and those people put in similar amounts of time working on it than they did to reading and thinking about ChatGPT’s solution, the mathematicians would have found a counterexample. However, without the claimed proof by ChatGPT, there is no particular reason anyone would have tried to look for a counterexample, assembled a group of experts with the appropriate expertise, or that the experts would have agreed to turn their attention to this problem."
Some readers might find some of the other remarks more appealing or more informative. I encourage folks to read these remarks rather than the OpenAI marketing video and spin.
> To me it sounds a lot like goalpost moving. You're telling me that in 2020 if I showed you a system that can solve an Erdos problem or disprove a conjecture (just recently showed up) you wouldn't be blown away?
I'm not sure what goalpost you are talking about. Regarding 2020: it depends on the framing, how much I know about the conjecture, the details of the computer system. I an easily imagine not being blown away. But I don't really see the relevance of our emotional reactions to computers doing new things we've never seen computers do before. If the goalpost is being "blown away" by what computers can do, then that happened I think around 1990 when I heard a computer program generate a vaguely human sounding voice. In math, I think it happened when I saw Mathematica simplified a huge nasty complicated algebra expression around 2000. I've been "blown away" by new things computers can do many times over the past 36 years.
> Woah woah, that's not the conclusion I'm jumping to. ... ok but now we're solving real math problems.
Sounds like we are in agreement then that (1) LLMs can not solve any given Erdos problem and (2) computers are solving more real math problems than they were before.
I honestly do think what the OpenAI group did with an LLM recently is a new milestone worthy of attention if one is interested in computer assisted mathematics. I don't mean to diminish the LLM feat. I just mean to throw shade on the corporate marketing, language, slick video, and spin.
> Also I wonder how many folks honestly look in the mirror and think: how does the median programmer differ from an LLM.
Once you step out of pure-software orgs, it becomes clear that most would benefit from having AI write code. There's a huge moat between most people and the point where they can afford/find the effort of someone that can write software.
These people, that only care about practical results rather than somewhat tangential things like "elegance" and "maintainability", are going to benefit tremendously.
Why is that a prerequisite? There are entire philosophies about what makes good design for UI's etc. Why can't models figure this out? Why do you feel this is some sort of mystical thing out of reach?
If you think all of the complexity of the human experience is reducible to statistical weights between tokens, that’s fine. Go with God. I don’t think that it is.
What do you think humans are? What’s the mechanism by which we make decisions, learn, remember, etc? Why would this be anything more than a very complex system that has been slowly optimized via evolution to perform well in its environment over long time horizons?
> to the same derisive comment of these models being, in your words "stochastic code extruder"
So many excited and insulted LLM adopters on this thread. There is nothing derisive in that comment, it is simply the purest possible definition of how they work. Stochastics is a branch of maths you know.
> can solve Erdos problems is sort of the proof in the pudding
For the non-engineer, non-mathematician it may sound authoritative, but you'd probably be surprised to learn that most of Erdos problems are not at all complex - they are just not very interesting or relevant. So it is a proof in the pudding, provided the pudding is made of shit - the kind of stuff LLMs produce most of the time.
> I just don't quite understand this, is it that: (1) agentic code is inherently inferior to human code and thus (2) shipping agentic code is actively harmful?
Yes and yes - have you not heard of that AWS incident with Kiro when the "agentic" shit deleted an entire infrastructure environment, complete with data, config, etc.?
> Also I wonder how many folks honestly look in the mirror and think: how does the median programmer differ from an LLM
Apart from the obvious absurdity of this statement - I know a lot of you non-engineer types feel "empowered" by the LLMs, in the sense of how they immediately seem a genius when you ask them on a topic you are not expert in, but you may want to read a book on programming first - maybe you'll get a clue then.
> So many excited and insulted LLM adopters on this thread.
neither excited nor insulted.
> There is nothing derisive in that comment, it is simply the purest possible definition of how they work. Stochastics is a branch of maths you know.
Not sure what you mean by stochastics but this is more statistics. They are trained with a next token loss, that doesn't belie "how they work".
> For the non-engineer, non-mathematician it may sound authoritative, but you'd probably be surprised to learn that most of Erdos problems are not at all complex - they are just not very interesting or relevant.
It sounds like you are both an engineer and a mathematician? Can you confirm? These are problems unsolved for many years. You think no good mathematicians have taken a stab at them, even if just to say they have resolved an unsolved Erdos problem? They are "not at all complex" is quite an extraordinary thing to say I'm wondering if you actually do have the pedigree you are trying to make it sound like you have, or if you are just regurgitating the same HN talking points you've heard.
> Yes and yes - have you not heard of that AWS incident with Kiro when the "agentic" shit deleted an entire infrastructure environment, complete with data, config, etc.?
And this means agentic code is inherently inferior to human code? Howso?
> Apart from the obvious absurdity of this statement - I know a lot of you non-engineer types feel "empowered" by the LLMs, in the sense of how they immediately seem a genius when you ask them on a topic you are not expert in, but you may want to read a book on programming first - maybe you'll get a clue then.
in the beginning you mentioned there were a lot of "excited and insulted LLM adopters" and yet...this sounds quite excited and defensive. Believe it or not, I am not a "non-engineer type" and its telling you assume that people who don't seem to share the same opinion as you are somehow less qualified than I assume you think you are? Why is this statement obviously absurd. Maybe you work in a really talented engineering team, which kudos to you I also have worked in teams like this, and I have also seen what is the p50 engineer and they are just as error prone or more than Claude. Thank you for the advice to read a book on programming as if that somehow would have any bearing on this at all?
An engineer with an engineering degree, which as it may still be known to some, requires a fairly stringent mathematical underpinning. So yes, I know a thing or two - read up on Erdos and his problems, I am not here to enlighten every vibecoding PM that shows up.
> And this means agentic code is inherently inferior to human code? Howso?
Again, I am not here to explain the world to some clueless PM. You have your LLMs for that :) But for the sake of bringing you closer, the "agentic" code is often very inferior, implementing happy paths or just bluntly exposing secrets in clear texts, etc. Probably a consequence of it being trained on, as you put it "p50 engineering code".
> Maybe you work in a really talented engineering team,
Running my own company and been paying the LLM-Shit-Generators for my whole team for a long time, in the hope they would bring the advertised benefits. Guess what - for serious use-cases, they bring shit and more shit.
> Thank you for the advice to read a book on programming as if that somehow would have any bearing on this at all?
Oh yeah obviously not, I mean, its not like understanding software development would help you understand how LLMs are not similar to a "p50 engineer" at all:). I'd take the latter over the former every time.
> Why is this statement obviously absurd
Well for one, LLMs are not humans, but it should be obvious to even to most cretinous of the e/acc crowd. It's not like they can think in abstract terms or come up with completely new concepts. But then again, don't mind me - if you can live with below average AI slop - go for it.
A really sincere piece of advice that I really hope you take to heart: everyone who disagrees with you is not simply beneath your genius. I am not a PM (yet that is also quite insulting to some very competent and technical PM's I have worked with), I have an actual math degree, alongside a physics degree and a PhD in astrophysics from a strong department; I have worked in companies both at the MAANG scale and companies as small as 20 people for the last decade. It feels gross to have to type this but evidently you seem blocked from considering other viewpoints because you think I am a "vibe coding PM". It's ok if you want to cling to this as a comfort but just know it's a troubling way of going through life and also happens to be leading you astray in this particular case.
I don't see really any hard source at all here from you except anecdotes that you seem to hold in very high regard. I do see an incredible amount of condescension and chest pounding about what is ultimately a very technical and...ahem...mathematical topic. I don't know about you but I don't really see many conference paper reviews that start with "I am not here to enlighten every vibe coding PM that shows up". I am sure you would agree with me.
> But for the sake of bringing you closer, the "agentic" code is often very inferior, implementing happy paths or just bluntly exposing secrets in clear texts, etc. Probably a consequence of it being trained on, as you put it "p50 engineering code".
I do appreciate this tiny delicious gift of "bringing me closer" because it (1) answers my question about "inherent" properties of agentic systems by giving anecdotal examples of existing systems, (2) completely misunderstands how agentic coding models are trained. Human code traces are a bootstrap to an RL with verifiable rewards stage. Not having the same "you are too beneath me to explain my wrong opinions" attitude, I will genuinely explain a bit because this isn't as trivial and obvious as you make it sound, nor is it a giant pissing contest. Likely the most important property of coding agents that has resulted in their existing and future success is that they are not limited by the quality of human training data. Seems to be a very common misconception, but this is, like you say, just math:
- Agentic coding models like Claude go through several complex training stages
- Pretraining which is kind of a compression step and gives them semantic understanding and a bit more
- Supervised fine tuning which gives them some task specific performance (this is where human traces and verified synthetic traces are used)
- Alignment to make them not give you meth recipes and to behave in the way you want agents to behave
- Reinforcement learning with verifiable rewards (RLVR): then they go forth and solve open ended questions. RLVR is not new mathematics, we know what happens when you take RL with good rewards and throw a bunch of compute at it, we've known that for decades now. This is where the "superhuman" performance comes in, it's not some "vibe coding PM" that's giving you an empty promise, it is the math that you and I so highly revere that promises you this.
> Running my own company and been paying the LLM-Shit-Generators for my whole team for a long time, in the hope they would bring the advertised benefits. Guess what - for serious use-cases, they bring shit and more shit.
This sounds like the experience I would mostly expect from a small company adopting Claude, it is not magic nor is it at the point where you can blindly trust it to not mess something up. It will waste your time. I find it kind of doubtful it has not given you any benefits, but I'm not sitting where you're sitting so I can't refute your experience. People talk resentfully about "advertised benefits" but then never cite what advertised benefits they interpreted these systems as having. Do you have like a quote or something that you can point at?
> Oh yeah obviously not, I mean, its not like understanding software development would help you understand how LLMs are not similar to a "p50 engineer" at all:). I'd take the latter over the former every time.
Maybe I misinterpreted you: I found you telling me to "read a book" to be more of a dismissive condescending comment but maybe you mean it sincerely in which case, sure I will continue to read programming books and following the published work in the field as I have done for years now.
> Well for one, LLMs are not humans, but it should be obvious to even to most cretinous of the e/acc crowd. It's not like they can think in abstract terms or come up with completely new concepts. But then again, don't mind me - if you can live with below average AI slop - go for it.
I do agree with you that LLMs are not humans but when you say this is obvious and then don't back it up, that is really not convincing. I think you overestimate the capability of human beings and underestimate the asymptotic capabilities of these systems. Their performance improvements are predictable and these predictions continue to hold. It seems the burden of proof is on you to explain why we should expect some sort of fundamental limit to these capabilities and where those fundamental limits would arise. I'm not aware of very many.
> have an actual math degree, alongside a physics degree and a PhD in astrophysics from a strong department
Good for you, I suppose, but all it tells me is that you have probably not developed software professionally - after all, PhDs in astrophysics "from a strong department" rarely end up in commercial software development...
> This sounds like the experience I would mostly expect from a small company adopting Claude
Who said it was a small company? You're making too many assumptions buddy :)
> will genuinely explain a bit because this isn't as trivial and obvious as you make it sound
It is literally the same technology developed in the 1940s mate, adding more GPUs will not magically make it become a god-in-the-box. How fucking innovative can you still claim it to be?
> I think you overestimate the capability of human beings and underestimate the asymptotic capabilities of these systems
Right, remember when LLMs constructed the rockets and modules for landing on the moon, using practically just the logarithmic tables? Or when they invented the vaccine? How about X-rays? Cars? Aeroplanes? You don't? Oh right, me neither! We must be downplaying their nonexistent "capabilities". And the use of word "asymptotic" - is absolutely not conveying the meaning you think it does.
> Do you have like a quote or something that you can point at?
Well, how about the CEOs of companies claiming to be worth 1T and upwards, stating that their products have almost superhuman intelligence? PhDs in the pocket etc?
Nah, there's no evidence of reduced quality. If anything it's the reverse. I've seen AI code review tools be tremendously effective at catching defects which otherwise would have shipped.
Its easy to lean too hard into vibe coding. I've spent the last week unpicking a bunch of poor decisions claude made while coding something up. Its my mistake - I trusted it too much. It made a lot of sloppy, poorly thought through abstractions and then built a bunch of bad code on top of them. I should have been more careful reviewing its abstractions first.
But as others in this thread have said, LLMs are also great at doing all the tedious quality-improving tasks that I sometimes don't have time to do. You can prompt LLMs to write tests, to do fuzz testing and to set up CI pipelines. You can get them to formalise the constraints in the system (and then try to find violations of those constraints). I'm doing some work at the moment with Cocoa (apple's old UI library). I downloaded all the cocoa docs I could find. Then asked claude to read it all and review our code. It found lots of places where we're not using the API effectively. Fabulous.
LLMs are an accelerator. If you already know how to write performant, reliable software, LLMs can help you get there faster. But if you sit back and let the LLM guide itself, who knows where you'll end up. Probably nowhere good. Over the next few years we're going to see every possible use of LLMs. Probably more vibe coding disasters than anything else. But as a senior engineer, I think its way too reductive to write the tools off entirely. They're useful. But using them well? That's the trick.
In the DORA group's reporting on AI-assisted software engineering, they indeed state that across industries, quality goes UP with the use of AI assistants like Claude and others.
Moreover, in my experience helping businesses on this topic; They never defined or made measurable what quality meant in the first place. Then when they finally do figure it out, it turns out that the average repository is a total disappointment in terms of absolute quality.
Same for me. I have not yet successfully used an LLM to generate code to produce a feature, though to be fair, it might just be because I don't have the patience to go back and forth with it (will readily admit that I am using it wrong).
Also, while some code review comments are just plain wrong, LLMs did produce some damn good comments, on the same level as a different senior engineer might note had they taken the time to study the code carefully.
So do we want to acknowledge that interest in this is probably driven by the desire to abusively scrape the web for LLM training data? I wouldn’t be surprised if the motivation is focused around bypassing the anti-bot restrictions of Reddit alone.
This quote comes up often in SJ biographies or anecdotes and they universally attribute it purely to aesthetic concerns. Admittedly the man cared quite a lot about "beauty", but I've always thought this was more about the caring and less about the beauty.
To spend time making something most people never see look just as good as the things they do see you have to care quite a lot. This care begets a wide range of (usually) desirable secondary effects brought about by diligence. In my view it's similar to the effect of spending the time to make many iterations of a thing versus one perfect thing, with the former usually resulting in an end product much closer to "perfect".
This can also be seen as a way to filter the customers. "We only care about customers who care about the visual quality of the board that nobody ever sees." In other words, customers who are driven by aesthetics, and who have the means to support the habit of buying extra quality things, maybe with a whiff of conspicuous consumption.
If anything, it's a good, high-margin market. Beside the actual piece, you sell both self-appreciation and status. Apple long tried to make their products closer to fashion accessories, with some success.
A real estate agent walked through my house I was putting for sale. He examined the switch plates carefully. I asked why. He replied that a good craftsman lined up the slots in the top screw and bottom screw, and this was a "tell" that he'd done a good job.
Why not? There are companies dedicated to sprucing up houses for sale. They will even bake bread for you before inspections to give the place a "homey" smell.
> Despite skepticism from Volcker and Buffet, financial innovation has been and will continue to be a massive net positive for humanity.
Juxtaposing yourself with Warren Buffet and then hand-waving away his wisdom is probably the reddest of flags when discussing finance (not that Buffet is always right). "Innovation" in payday loans is akin to inventing new ways to feed living, breathing things into a meat grinder. In this case it's the poorest among us. The author goes on to say:
> Is financing your lunch a sign of societal decay? Maybe, maybe not. But it’s definitely an evolution in Market Completion.
This is undiagnosed sociopathy.
There is a point when making a thing that you must ask "what affect will this have on the world?" or you risk destroying far more than you create. Finance types have learned absolutely nothing since Buffet laid down his "newspaper test":
"I want them to not only do what’s legal obviously, but I want them to judge every action by how it would appear on the front page of their local paper written by a smart but semi-unfriendly reporter who really understood it to be read by their family, their neighbors, their friends."
This. Also, I like the term "net positive" in articles like this. If you lose everything, but I win even more, it's technically still a "net positive". Even if only one person would be happy about it.
I'm happy someone else also had the same thoughts, and put it better than I could.
Incidentally, regarding Buffet's sensibilities, I once felt it worthwhile to write to Berkshire Hathaway's little office, about a new shady thing one of their holdings was rumored to be doing towards employees, and whether that fit BRK's standard of good management. My note almost certainly got tossed into the crazy-people round-file, but it'd be nice if Warren Buffet called up a CEO or Chair, and said, "Hi, Bob. This is Warren. What kind of shop are you running over there?"
> … it means delivering the kind of things that are legible to the decision-makers at the company: i.e. visible to your manager, plus 1-3 skip levels, depending on your title. The easiest way to do this is to deliver things that they already know about, such as projects that they’ve asked you to do, or incidents that are serious enough that they’re involved in them. It’s possible to make other work legible to them as well. If your work produces or saves money, that will make it immediately legible, for instance (or you could just be really convincing). By default, work you do isn’t legible: to the decision-makers, it’s generic technical nonsense. They don’t know whether it’s crucial high-impact work or pointless code reshuffling, and will tend to assume the latter.
This person understands the “business” side of the tech business. I couldn’t agree more. Where many struggle is that they can’t communicate legibly about the indirect benefits their work has for the business. The classic “refactoring” (which he mentions) is a great example.
Refactoring code has a context dependent benefit to a business. When you’re searching for product/market fit is has essentially no benefit, and then you’re Microsoft and the code is deep within Windows and affects the performance of every Win32 app it can have extreme benefits. In the end it’s all about how you relate your work to either making or saving the organization money, and doing so indirectly can be legible if you take the time to figure out how to best communicate it to the target audience (and how it can be conveyed to customers).
I couldn't agree more. It really is important for developers careers to learn at least a bit of business speak, and try to learn how to frame problems in ways that business people understand and care about
At the end of the day, most decisions at a business come down to a cost versus benefit, assuming that the business is behaving more or less rationally
Most business people in my experience also view the software itself as an expense, not an asset. I find that software devs do not understand that. "What do you mean the software is a cost center. This whole business sells software, how can we make money without software?"
This isn't how many business types view it. The software doesn't matter to them at all. They would love if they could just sell nothing, so their costs would be zero and their profit margin would be infinite. That is the actual dream
It's not rational but you gotta understand that sales doesn't sell on rational, they sell on vibes, good relationships, bribes, whatever they can get away with.
Trying to be rational when selling puts you on too level of a playing field with other sellers, so they pursue other angles
Subscribed. All this money I’m saving boycotting spineless American companies is coming in handy.
If LWN is worth $16 a month, and it is, then so is The Atlantic, ProPublica, etc. We collectively need to make a habit of financially supporting actual journalism and doing so loudly.
Loading up this thread I knew this kind of response would be here. Like, I was willing to bet money on it.
Examples of support people worth $200k+ are abundant, and the business case is the same every time. When you do the work to place a monetary value on customers and their retention your support personnel costs relative to that are easy to justify. When a support person is preventing churn of X number of customers worth $Y dollars a year the math becomes trivial.
The (American) tech industry is so accustomed to massive scale and lack of competition that the notion of giving a damn about customer retention has risen to the level of a cultural, not economic, problem.
Arguing that many humans are stupid or ignorant does not support the idea that an LLM is intelligent. This argument is reductive in that it ignores the many, many diverse signals influencing the part of the brain that controls speech. Comparing a statistical word predictor and the human brain isn’t useful.
I'm arguing that it's natural for intelligent beings to hallucinate/confabulate in the case where ground truth can't be established. Stupidity does not apply to e.g. Isaac Newton or Kepler who were very religious, and any ignorance wasn't due to a fault in their intelligence per se. We as humans make our best guesses for what reality is even in the cases where it can't be grounded, e.g. string theory or M-theory if you want a non-religious example.
Comparing humans to transformers is actually an instance of the phenomenon; we have an incomplete model of "intelligence" and we posit that humans have it but our model is only partially grounded. We assume humans ~100% have intelligence, are unsure of which animals might be intelligent, and are arguing about whether it's even well-typed to talk about transformer/LLM intelligence.
> You might think "something something incentive systems". No. At my big tech job I had the pleasure of interviewing a few programmers who worked for a large healthcare company that engages in regulatory capture. Let me assure you: They. Do. Not Care.
Regarding programmers specifically I can concur, but with a caveat. Devs often care quite a lot about many things, but often one of those things is not doing the job they were hired for. The tedium of building software for businesses, even what we now call "big tech", is universally unappealing and definitely not the reason most devs started tinkering with computers. So they care very little, and it shows in the tech taking over the clerical aspects of every day life.
We need to remember the people, that we may never talk to, that are downstream of all of this software. Not necessarily “the users” as there are many pieces of software meant for other devs, but I think the users deserve consideration nonetheless.
Handing over software quality to the stochastic code extruder is causing a sharp drop in the quality of software put out into the world. This is on top of all of the problems that existed before LLMs, like human error and perverse financial incentives. Shipping poor quality and user hostile software actually hurts people. Real people. Harm is caused in both big and little ways to living, breathing actual people. This “inevitable” slide into generative AI harms every single person it comes into contact with. The devs, the users, the investors, everyone. Those harms may happen at different times and in different ways and the creeping nature of it all might make it easier to ignore, but it’s happening.
“AI” is a blight. You can leave me behind as well.
reply