AI Self-preferencing in Algorithmic Hiring: Empirical Evidence and Insights

laurex 320 points 171 comments May 02, 2026
arxiv.org · View on Hacker News

Discussion Highlights (20 comments)

makeitrain

Vibe resume?

johndhi

Another way to phrase this might be that LLMs make better resumes no?

nottorp

Easy then. Apply N times, each time with a resume generated by a different LLM. No human is going to notice anyway. Or add a N+1 resume written by yourself in which you describe your strategy, just in case.

randomdrake

I wonder if this extends to training models on new content as well. Are we creating a cyclical information-consumption and training situation in which models being trained are more likely to pick up on and reference content created by themselves or by other LLMs than by other humans?

logicalfails

I suspect this is more a function of the corporate sanitization of language within the models. When I have passed my resume through the models for refinement, it often sanitizes some of the more easy going or simpler wording. It expands the vocabulary, makes it more dense, and uses more corpo speak in the bullets and formatting. Each model likely has its own biases in terms of what constitutes correct corporate speak, and it chooses the resumes that best fit this. Ultimately, I suspect it's more a function of model saying "this grammer, syntax structure, and formatting is most aligned with what is correct corporate language, so flag as high quality".

charliebwrites

Anecdata, sample size of one: When I was looking for my next role after being laid off, I didn’t get much of a response with my human handmade resume despite my experience Just for kicks, I asked ChatGPT to “Analyze my resume and give it a score for what percentage it was in” then I asked it to revise it to make it score as high as possible I still tweaked and fact checked it but after I started sending that out, I got a much higher hit rate than before But who knows, maybe the market changed, was a better time of year, etc I still had to pass interviews and prove my worth. But it probably helped me get my foot in the door

benashford

Intuitively this feels obvious. Content generated by the model will be shaped by its training, therefore when reading it back it will resonate with that same training and have a positive view as a result. Human when preparing a CV: "Make my CV more professional" LLM many days later presenting a report to HR: "This CV is really professional" There's probably more to it than that of course. But it justifies my personal policy of using a different LLM family for code review tasks than for code generation tasks. To avoid the "marking your own homework" problem.

sb057

Well yeah, LLMs generate resumes (and other text) that they judge as superior to alternative plausible texts. Why would that judgement change just because a different instance hasn't seen it before? To anthropomorphize it, it's like having a hiring manager write a resume, get amnesia, and then have to judge it among other resumes.

hyperpape

I'll copy what I wrote on LinkedIn (note: I read roughly 25 pages, which is half the paper, and read it quickly)[0]: "If I read the paper correctly, they don’t actually show that LLMs prefer resumes they generate. Their actual method seems to be taking a human written resume, deleting the executive summary, having an LLM rewrite the executive summary based on the rest of the resume and then having another LLM rate the executive summary without the rest of the resume. That’s likely to massively overstate any real impact, if you can even rely on it capturing a real effect. I really wonder if I read that correctly, because I can’t come up with a justification for that study design." [0] I couldn't help but mildly copy-edit before pasting here. Edit: yes, the authors present a reason for their design, and an ideal version of my comment would've said that. I do not consider it much of a justification. See below: https://news.ycombinator.com/item?id=47987256#47987727 .

AlexB138

This may lead to some interesting gamesmanship. For instance, if I am applying to a company, and I know they use a certain applicant tracking system, and I know that ATS uses a certain model provider for its filter, I should then use that model to write the version of my resume I send to the company.

einpoklum

> As artificial intelligence (AI) tools become widely adopted, large language models (LLMs) are increasingly involved ... [in] ... decision-making processes That's the problem right there.

mpurbo

At this point, all these are becoming almost like comedy.

bendergarcia

We are without our consent introducing a party in between people. The models become the arbiters of who does and does not get a job. It feels problematic.

jimnotgym

I just guessed that and got Copilot to rewrite my profile on the internal HR system. I also got a job spec benchmarked higher by getting Copilot to write it with that exact aim given in the prompt

rogermarley

I think resumes will eventually (or have already) become obsolete in tech. The SNR is so low, they offer very thin filtering value. Even taking the tiny bits of the resume that are "hard signal", like GPA, certifications, prior roles, etc, it doesn't translate into their performance in the initial screening interview. This is why what I think the industry sorely needs is examination consortia. Rather than trying to guess capability from the name of the university they went to, leading tech companies creating standardized tests in various fields, and your test scores form your "resume", so that developers can just focus on improving their scores rather than wasting time on resume/application/repetitive-screening toil.

jamiecurle

disclaimer: Not a lawyer, but studying towards CIPP/E. You'd make no friends doing it, but as I understand it, for those that have GDPR as a statutory right then under "[Article 22 - Automated individual decision-making, including profiling][0]" you can request to know if your CV was screened by AI and what (and this is key) "meaningful human interaction" led to that decision. Technically this falls under a data subject access request and so a response is mandatory (but who really is going to enforce that - ICO / <insert your data protection agency here> probably isn't). Companies can't just smash a button and claim meaningful interaction, it has to be, well, meaningful and smashing a "nope" button obviously isn't meaninful. If it turns out that it was only AI that screened it you can request a human review. Do not hold your breath. Again, you'd make no friends doing it, but sooner or later a test case will emerge to generate some case law around "AI said no" because employment, or lack of because AI says no, does have significant impact on a human. [0]: https://gdpr.algolia.com/gdpr-article-22

jonahs197

Will people snap over this?

ilia-a

Seems kinda obvious, given that most large recruiting firms/hr use algos to analyze resumes and AI written version likely do a better job at hitting keywords/structure algos/llms pick up on...

embedding-shape

You'll find the same is true if you have two different LLMs first independently come up with a plan for an implementation, then ask each one of them to say which one of the two designs/plans are the best. They're much more likely to favor the plans generated from the same model, rather than from other models. I'm sure, internally, this somehow makes sense, but it's worth thinking about if you're doing the whole "ask N models for voting/rating N plans to find the best" charade.

jqpabc123

Repeat after me --- it makes no sense to try and prompt a language prediction engine to display good judgment.

Semantic search powered by Rivestack pgvector
8,303 stories · 78,303 chunks indexed