I use Playwright to intercept all requests and responses and have Claude Code navigate to a website like YouTube and click and interact with all the elements and inputs while recording all the requests and responses associated with each interaction. Then it creates a detailed strongly typed API to interact with any website using the underlying API.
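The capture itself would come from Playwright's `page.on("request")` / `page.on("response")` listeners; here is a minimal, hypothetical sketch (all names are mine, not the actual tool's) of the step that collapses observed traffic into unique endpoint shapes ready for typed-client codegen:

```typescript
// Minimal sketch (illustrative names only): collapse observed traffic into
// unique endpoint shapes. In real use, `calls` would be filled from
// Playwright's page.on("request") and page.on("response") events.
type Call = { method: string; url: string; status: number };

function groupEndpoints(calls: Call[]): Map<string, Call[]> {
  const groups = new Map<string, Call[]>();
  for (const call of calls) {
    // Normalize numeric path segments so /videos/123 and /videos/456
    // collapse into a single endpoint, /videos/:id.
    const path = new URL(call.url).pathname.replace(/\/\d+(?=\/|$)/g, "/:id");
    const key = `${call.method} ${path}`;
    groups.set(key, [...(groups.get(key) ?? []), call]);
  }
  return groups;
}
```

Each group's recorded request/response bodies could then be sampled to infer the types for the generated API.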
Yes, I know it likely breaks everybody's terms of service, but at the same time I'm not loading gigabytes of ads, images, and markup to accomplish things.
If anyone is interested I can take some time and publish it this week.
I also do this. My primary use case is reproducing page layout and styling at any given subtree in the DOM: capturing various states of a component, etc.
I also use it to automatically capture page responsiveness behavior in complex web apps. It uses Playwright to adjust the viewport width and monitor entire trees for exact changes, then writes structured data that includes the complete cascade of relevant styles, with screenshots to support the snapshots.
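A hypothetical sketch of the diff step (names are my own, not the parent's tool): given computed-style snapshots taken at two viewport widths, captured in practice via `page.setViewportSize` and `getComputedStyle`, report which properties changed per node:

```typescript
// Illustrative sketch: selector -> property -> computed value, snapshotted
// before and after a viewport resize.
type Snapshot = Record<string, Record<string, string>>;

function diffSnapshots(before: Snapshot, after: Snapshot): Record<string, string[]> {
  const changed: Record<string, string[]> = {};
  for (const selector of Object.keys(before)) {
    // A property "changed" if it differs or disappeared at the new width.
    const props = Object.keys(before[selector]).filter(
      (p) => after[selector]?.[p] !== before[selector][p],
    );
    if (props.length > 0) changed[selector] = props;
  }
  return changed;
}
```

Only the selectors listed in the result need screenshots and a full cascade dump, which keeps the structured output small.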
There are tools you can buy that let you do this kind of inspection manually, but they are designed for humans. So, lots of clickety-clackety and human-speed results.
---
My first reaction to seeing this on the FP was: why are people still releasing MCPs? So far I've managed to completely avoid that hype loop and went straight to building custom CLIs, even before skills were a thing.
I think people still don't realize the power and efficiency of direct access to the things you want, plus skills to guide the AI in using that access effectively.
Maybe I'm missing something in this particular use case?
> There are tools you can buy that let you do this kind of inspection manually, but they are designed for humans.
You should try my SnipCSS Claude Code plugin. It still uses MCP as a skill (I haven't converted it to a CLI yet), but it does exactly what you want for reproducing designs in Tailwind/CSS at AI speed.
> My first reaction to seeing this FP was why are people still releasing MCPs?
MCPs are more difficult to use. You need an agent to use the tools; you can't easily do it manually. I wonder if some people see that friction as a feature.
Yes please, maybe there will be some solution that will fit the problem better! I recently released something similar, and because of the small API, I'm more comfortable using it.
>I'm in so deep that Claude Code can predict the stock market.
“What?”, more polite than “yeah right” :)
(oh I guess obviously it would have a chance at nailing it for weeks in a row, and have more good years than bad—since actively managed funds can pull that off until, universally, they can’t [beat the market])
I'm curious, have you developed your own reasoning system for how Claude can predict the stock market? Or have you trained it on past data combined with news sources?
Yes, it's pretty good! I've also written API harnesses for bot-based browser automation so that you can detect fields to fill in, remember where they are for the next time you need them, and then, if the webpage changes, re-explore and rewrite the remembered tags for the new form fields.
Spoiler: this is to automate ticket submission to my landlord's half-baked web portal, not some kind of nefarious captcha breaking thing.
yt-dlp is relatively stable, but still occasionally breaks for long periods. I get the sense YouTube is becoming increasingly adversarial to yt-dlp as well.
I don't know the details, but it doesn't seem like yt-dlp is running the entire YouTube JS+DOM environment. Something like a real headless browser seems like it would break less often, but be much heavier weight. And Youtube might have all sorts of other mitigations against this approach.
> yt-dlp is running the entire YouTube JS+DOM environment
IIRC they maintain a minimal execution environment that can run just the JS needed to pass a few checks, but this breaks often enough that they're planning to make Node.js or another JS interpreter a hard requirement (this may have already happened).
It would probably give people who want to go to a concert a chance against the scalpers who corner the market on an event within 30 seconds of it going on sale, hitting the marketplace services with 20,000 requests.
I can try to see if it can bypass what blocks yt-dlp. But that is always a cat-and-mouse game.
To clarify - yt-dlp is a command line tool for downloading youtube videos, but it's in a constant arms race with the youtube website because they are constantly changing things in a way that blocks yt-dlp.
Exactly, it is an agent skill that interacts with a webpage, pressing buttons and so on, while capturing and documenting all the API requests the page makes using Playwright's request/response interception methods. It creates a strongly typed, well-documented API at the end.
Sounds awesome. I've been using mitmproxy's --mode local to intercept with a separate skill to read flow files dumped from it, but interactive is even better.
It turns any authenticated browser session into a fully typed REST API proxy — exposing discovered endpoints as local Hono routes that relay requests through the browser, so cookies and auth are automatic.
The point is that it creates an API proxy in code that a TypeScript server calls directly. The AI runs for about 10 minutes doing codegen; the rest of the time it is just API calls to a service. Remove the endpoint for "Delete Account" and that API endpoint never gets called.
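The "remove the endpoint and it can never be called" property amounts to a codegen-time filter. A hedged sketch, with illustrative names that are not the actual tool's API: endpoints stripped before generation simply never exist in the emitted client:

```typescript
// Illustrative sketch: endpoints removed at codegen time never appear in the
// generated client source, so "Delete Account" can't be called by accident.
type Endpoint = { name: string; method: string; path: string };

function generateClient(endpoints: Endpoint[], deny: string[]): string {
  const kept = endpoints.filter((e) => !deny.includes(e.name));
  const methods = kept.map(
    (e) =>
      `  async ${e.name}(init?: RequestInit) { return this.relay("${e.method}", "${e.path}", init); }`,
  );
  // BaseClient.relay is assumed to forward through the authenticated session.
  return `class GeneratedClient extends BaseClient {\n${methods.join("\n")}\n}`;
}
```

Because the guard lives in the generated source rather than a runtime check, there's no code path for the removed endpoint at all.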
This 100% breaks everyone's terms of service. I would not recommend or encourage using it.
100%. I'll respond to this by Friday with a link to GitHub.
I use Patchright + Ghostery, and I have a clever tool that uses WebSockets to pass one-second-interval screenshots to a dashboard and pointer/keyboard events back to the server. This allows interacting with websites so that a user can create authentication that is stored in the Chrome user profile, with all the cookies, history, local storage, etc., in the cloud on a server.
Can you list some websites that don't require a subscription that you would like me to test against? I used this for Robinhood, and I think LinkedIn would be a good example for people to use.
I love how HN is loving this idea when it's the exact same thing Anthropic and OpenAI (and every other LLM maker) did.
It's God's gift to them when it lets them bypass ads and download copyrighted material. But it's Satan's curse on humanity when the Zuck does it to train his LLM and download copyrighted material.
So you’re that Hal Jordan then? Why would a Green Lantern feel the need to defend either? I feel like the Guardians would not accept your arguments as soon as you got to Oa, poozer. I guess what I am saying is don’t have a famous name. Seems obvious.
You conflate web crawling for inference with web crawling for training.
Web crawling for training is when you ingest content on a mass scale, usually indiscriminately, usually with a dumb crawler for scale's sake, for the purposes of training an LLM. You don't really care whether one particular website is in the dataset (unless it's the size of Reddit), you just want a large, diverse, high-quality data mix.
Web crawling for inference is when a user asks a targeted question, you do a web search, and fetch exactly those resources that are likely to be relevant to that search. Nothing ends up in the training data, it's just context enrichment.
People have a much larger issue with crawling for training than for inference (though I personally think both are equally ok).