Codex for almost everything

mikeevans 786 points 393 comments April 16, 2026
openai.com · View on Hacker News

Discussion Highlights (20 comments)

kelsey98765431

it it doesn't complain about everything being malware maybe i will come back to openai from my adventures with anthropic

croemer

What does "major update to codex" mean? New model? Or just new desktop app? The announcement is vague.

sidgtm

They felt the pressure of posting something after Claude 4.7

hyperionultra

Tool for everything does nothing really good.

bughunter3000

First use case I'm putting to work is testing web apps as a user. Although it seems like this could be a token burner. Saving and mostly replaying might be nice to have.

uberduper

Do people really want codex to have control over their computer and apps? I'm still paranoid about keeping things securely sandboxed.

daviding

There seems a fair enthusiasm in the UI of these to hide code from coders. Like the prompt interaction is the true source and the actual code is some sort of annoying intermediate runtime inconvenience to cover up. I get that productivity can be improved with a lot of this for non developers, just not sure using 'code' as the term is the right one or not.

tvmalsv

My monthly subscription for Claude is up in a week, is there any compelling reason to switch to Codex (for coding/bug fixing of low/medium difficulty apps)? Or is it pretty much a wash at this point?

cjbarber

My current expectation is that the Cowork/Codex set of "professional agents" for non-technical users will be one of the most important and fastest growing product categories of all time, so far. i.e. agents for knowledge workers who are not software engineers A few thoughts and questions: 1. I expect that this set of products will be extremely disruptive to many software businesses. It's like when a new VP joins a company, they often rip and replace some of the software vendors with their personal favorites. Well, most software was designed for human users. Now, peoples' agents will use software for them. Agents have different needs for software than humans do. Some they'll need more of, much they'll no longer need at all. What will this result in? It feels like a much swifter and more significant version of Google taking excerpts/summaries from webpages and putting it at the top of search results and taking away visits and ad revenue from sites. 2. I've tried dozens of products in this space. For most, onboarding is confusing, then the user gets dropped into a blank space, usage limits are uncompetitive compared to the subsidized tokens offered by OpenAI/Anthropic, etc. It's a tough space to compete in, but also clearly going to be a massive market. I'm expecting big investment from Microsoft, Google etc in this segment. 3. How will startups in this space compete against labs who can train models to fit their products? 4. Eventually will the UI/interface be generated/personalized for the user, by the model? Presumably. Harnesses get eaten by model-generated harnesses? A few more thoughts collected here: https://chrisbarber.co/professional-agents/ Products I've tried: ai browsers like dia, comet, claude for chrome, atlas, and dex; claw products like openclaw, kimi claw, klaus, viktor, duet, atris; automation things like tasklet and lindy; code agents like devin, claude code, cursor, codex; desktop automation tools like vercept, nox, liminary, logical, and raycast; and email products like shortwave, cora and jace. And of course, Claude Cowork, Codex cli and app, and Claude Code cli and app. Edit: Notes on trying the new Codex update 1. The permissions workflow is very slick 2. Background browser testing is nice and the shadow cursor is an interesting UI element. It did do some things in the foreground for me / take control of focus, a few times, though. 3. It would be nice if the apps had quick ways to demo their new features. My workflow was to ask an LLM to read the update page and ask it what new things I could test, and then to take those things and ask Codex to demo them to me, but it doesn't quite understand it's own new features well enough to invoke them (without quite a bit of steering) 4. I cannot get it to show me the in app browser 5. Generating image mockups of websites and then building them is nice

armcat

Is it OpenAI Cowork?

thomas34298

Does that version of Codex still read sensitive data on your file system without even asking? Just curious. https://github.com/openai/codex/issues/2847

tommy_axle

OpenClaw acquisition at work.

VadimPR

Only on macOS though? This doesn't seem to work on Linux. Neither does Claude Cowork, not officially.

OsrsNeedsf2P

> Computer use is initially available on macOS, Does anyone know of a good option that works on Wayland Linux?

mrtksn

Codex is my favorite UX for anything as it edits the files and I can use the proper tooling to adjust and test stuff, so in my experience it was already able to do everything. However lately the limits seem to have got extremely tight, I keep spending out the daily limits way too quickly. The weekly limits are also often spent out early so I switch to Claude or Gemini or something.

postalcoder

I wish Codex App was open source. I like it, but there are always a bunch of little paper cuts that, if you were using codex cli, you could have easily diagnosed and filed an issue. Now, the issues in the codex repo is slowly becoming claude codish – ie a drawer for people's feelings with nothing concrete to point to.

hmokiguess

I can't help but see some things as a solution in search of a problem every time I see these examples illustrating toy projects. Cloud Tic Tac Toe? Seriously?

incognito124

<tin foil hat> I swear OpenAI has 2-3 unannounced releases ready to go at any time just so they can steal some thunder from their competitors when they announce something </tin foil hat>

bobkb

Using Claude and Codex side by side now . Would love to just use one eventually

lucrbvi

Is there anyone that feels that LLMs are wrong for computer use? It's like robotic, if find LLMs alone are really slow for this task

Semantic search powered by Rivestack pgvector
4,783 stories · 45,112 chunks indexed