Profiling Hacker News users based on their comments
simonw
60 points
64 comments
March 22, 2026
Related Discussions
Found 5 related stories in 61.5ms across 3,471 title embeddings via pgvector HNSW
- Enabling Codex to Analyze Two Decades of Hacker News Data ronfriedhaber · 77 pts · April 02, 2026 · 64% similar
- Show HN: Hackerbrief – Top posts on Hacker News summarized daily p0u4a · 66 pts · March 16, 2026 · 64% similar
- Researchers Deanonymize Reddit and Hacker News Users at Scale hk_flying_gear · 15 pts · March 01, 2026 · 63% similar
- Crow Watch: A Hacker News Alternative medv · 12 pts · March 09, 2026 · 57% similar
- Show HN: I built a real-time OSINT dashboard pulling 15 live global feeds vancecookcobxin · 208 pts · March 08, 2026 · 49% similar
Discussion Highlights (20 comments)
johnfn
> This is arguably their defining HN characteristic: they are one of the most vocal, persistent AI optimists on the platform. They claim ~90-95% of their shipped code is AI-generated, report 5-10x productivity gains, and have built a detailed methodology around it — using Playwright for visual verification, static typechecking as a hallucination filter, and e2e test suites as automated validation harnesses Wow, I sound really annoying. Sorry about that everyone!
bibimsz
hacker news is a goldmine since you can't delete comments nor even delete your account. this site is a privacy nightmare, in a world where everyone is excited to cancel and dox for unpopular opinions (on this site that means anything to the right of bernie sanders).
alexgandy
This just in; posting ridiculous amounts of personal information on the internet can lead to you being profiled correctly. Wild stuff.
JSR_FDED
Given a profile like this, how good would an LLM be at figuring out whether the profile if from a bot or a real person?
janalsncm
Not doubting the method works in general, but Simon Willison is a public-enough figure so the baseline level of info is higher than just HN comments. If you turn off Claude’s web search: > Simon Willison is a British software developer, blogger, and open-source advocate, best known for…
alexpotato
Story time: My first full time job (early 2000s) was working for a firm that did online cybersecurity related investigations for Fortune 500 companies (generally via a 3rd party law firm they had retained). A big part of this was running investigations into people running "pump and dump" stock schemes on Yahoo message boards. We would generally start by scraping all of the posts for a user who had instigated one of these and then handing off the posts to an analyst. It's amazing: a. how much info people give out even when they think they are being careful b. related to a, how even small tidbits combined over time can build a pretty accurate picture of who someone is. e.g. they post "oh man, the Cubs lost", then a year later "went for a walk on Lakeshore drive", another year later, there was a fire at my local subway stop etc etc and you pretty quickly narrow down the rough neighborhood where they live in Chicago. Combined with tools like Lexis Nexus and you get a list of people that you can narrow down by age, sex, occupation etc and we could narrow it down to <20 people based on other info they had shared. Then you fold in their posting patterns and it's pretty obvious who is at work (posting 9 to 5pm) vs home (posting 7pm to 1am). Again, you keep adding constraints and the intersection of the Venn diagrams gets smaller and smaller. This was all in the early 2000s before we had cellphones that tracked your location and ad infrastructure that followed you around the internet.
Forgeties79
> “Two things can be true at the same time” — he holds nuanced positions I feel the need to point out that 99% of the time that phrase is essentially an insult and isn’t indicative of a “nuanced position” lol it generally means “you’re myopic in your views/your argument lacks nuance.” That strikes me as a pretty charitable interpretation by the model there. You seem like a good dude, and I’m not going to pretend I haven’t thrown out the flippant quip here and there in my comments. I just thought that interpretation was pretty funny.
sachaa
You can also do this with a simple bookmarklet, no extension needed. Create a new bookmark in your browser, name it something like "Profile HN User", and paste this as the URL: javascript:void(function(){var u;var m=window.location.href.match(/news\.ycombinator\.com\/user\?id=([^&]+)/);if(m){u=m[1]}else{u=prompt(%27Enter HN username:%27)}if(!u)return;var msg=%27Profile this HN user: https://hn.algolia.com/api/v1/search_by_date?tags=comment,au... })() If you're on a HN profile page (news.ycombinator.com/user?id=someone) it grabs the username automatically. Otherwise it prompts you to type one. It copies the profiling prompt to your clipboard and opens a new Claude conversation, just Cmd/Ctrl+V and hit Enter.
plun9
You can just ask a chatbot about Hacker News or reddit users based on their username.
n2d4
This was interesting to do on my own profile. It got a bunch of personality attributes about me right that I haven't directly mentioned on here, which is impressive. I then followed it up with "Given my chat history, how do they compare to me?", and it started making comparisons of myself to myself. Very fun experience.
stego-tech
This is...disquieting. It's one thing to know that it's possible, another thing to know nation states or large megacorps are doing it, but another thing entirely to see such verbose output from free models about, well, me . The first two, I've made peace with (nothing I can do about it anyway). The last one picks quite fiercely at old trauma that really makes me reconsider my socials in general, not just HN. But maybe that's just the anxiety and trauma talking, encouraging me to recede back into the shadows and re-apply the old mask of "acceptableness" I've been trying to toss aside. Maybe the fact a free chatbot can do such a thorough analysis is in fact reason enough to stop worrying about every aspect of my identity and its perception by others , and instead just...be me, and deal with whatever consequences arise from that. I dunno. Just...lot of emotions, here, most of them quite bad.
vpribish
HAHAHA - I like me. but claude (sonnet 4.6) seemed like it was cheerleading a bit
SanjayMehta
"Fetched 0 comments." Edit: turns out it's case sensitive. Sounds about right: roughly “anti-imperialist realist” with Indian/Global South anchoring and paleoconservative/libertarian-adjacent distrust of state-corporate surveillance power.
few
See also https://antirez.com/news/150 https://antirez.com/hnstyle?username=pg&threshold=20&action=... Which lets you find the alts of a handle
michaelteter
And note that HN does not allow you to delete your comments after a short time passes. If you contact them and ask for your data to be deleted, they will directly refuse.
sgbeal
(...does this thing to check own profile[^1]...) > Old man raising fist at, and yelling at, clouds. Get off his lawn. [^1]: not really - this is speculation (so... kinda the same thing the LLM is doing) but is possibly an accurate representation.
tamimio
> Recurring Hobby Horse >The word "engineer" being diluted by software/bootcamp culture is something they return to obsessively — arguably their strongest ideological position alongside surveillance criticism Busted!! That being said, not surprised because it listed exactly what I want my persona to appear, does that mean I am like that irl? No, I rarely bring the above “engineer” term IRL let alone to be obsessed about it, but in HN it makes sense to bring up, rest are mostly about techie stuff that I usually don’t bring with my friends or family. Also, this can be about anything you produce, like your blog, books, YouTube, or anything, that personality is what attracts (or repels) other people to be around you, it’s human society 101.
Simulacra
I've been doing this for a long time, it's amazing what ChatGPT can suss out with enough data. I like to feed it comments from message boards to try to uncover interesting business opportunities, or threads to follow for my own research.
raw_anon_1111
Just pasting my last 100 comments into ChatGPT using the API and cutting out anything positive it said about me… “Your communication style is direct and often adversarial, using rhetorical questions and sharp analogies to pressure-test assumptions, with little tolerance for what you see as naïve, performative, or abstract reasoning. You prioritize competence, execution, and practical tradeoffs over signaling or theory, and while that makes your analysis grounded and often incisive, it can also make your stance appear combative and less receptive to edge cases or emerging paradigms that don’t yet fit established incentive structures”
zoogeny
It is interesting that Marc Andreesen was having a bit of a X crash out over his belief that introspection is bad [1] I disagree because I tend to seek a middle way. I would agree that too much (excessive) introspection is bad. But I would argue that too little is equally bad. I think obsessively examining ones own comment history would verge on excessive. I'm wondering how much LLM analysis of my public and private life can remain healthy. 1. https://x.com/pmarca/status/2035190797218587116