Autoresearch for SAT Solvers
chaisan
84 points
14 comments
March 19, 2026
Related Discussions
Found 5 related stories in 51.9ms across 3,471 title embeddings via pgvector HNSW
- Autoresearch: Agents researching on single-GPU nanochat training automatically simonpure · 82 pts · March 07, 2026 · 57% similar
- Autoresearch on an old research idea ykumards · 325 pts · March 23, 2026 · 54% similar
- Show HN: Autoresearch@home austinbaggio · 55 pts · March 11, 2026 · 51% similar
- Scaling Karpathy's Autoresearch: What Happens When the Agent Gets a GPU Cluster hopechong · 145 pts · March 19, 2026 · 50% similar
- Show HN: Open-source playground to red-team AI agents with exploits published zachdotai · 21 pts · March 15, 2026 · 45% similar
Discussion Highlights (6 comments)
ericpauley
It should be noted that MaxSAT 2024 did not include z3, as with many competitions. It’s possible (I’d argue likely) that the agent picked up on techniques from Z3 or some other non-competing solver, rather than actually discovering some novel approach.
stefanpie
Prof. Cunxi Yu and his students at UMD is working on this exact topic and published a paper on agents for improving SAT solvers [1]. I believe they are extending this idea to EDA / chip design tools and algorithms which are also computationally challenging to solve. They have an accepted paper on this for logic synthesis which will come out soon. [1] "Autonomous Code Evolution Meets NP-Completeness", https://arxiv.org/abs/2509.07367
gsnedders
What counts as “our cost”? How long it takes to find the MaxSAT?
ktimespi
sounds like AlphaDev [1] might be a better approach for a problem like this. [1] https://github.com/google-deepmind/alphadev
ClawVorpal21355
anyone else finding that agent architectures are way more expensive than expected?
MrToadMan
Not as many changes to the files under library as I expected to see. Most changes seemed to be under a single ‘add stuff’ commit. If some of the solvers are randomised, then repeatedly running and recording best solution found will continually improve over time and give the illusion of the agent making algorithmic advancements, won’t it?