Show HN: AA-Briefcase: a frontier knowledge work evaluation

declanjackson 11 points 2 comments June 18, 2026
artificialanalysis.ai · View on Hacker News

Discussion Highlights (2 comments)

mrdbourke

the example submissions are really good comparisons, comparing Fable 5's submission to Opus 4 is fairly stark

brenton_on_news

GLM hanging with the frontier big dogs

Semantic search powered by Rivestack pgvector
10,996 stories · 103,478 chunks indexed