Medical diagnosis AIs can be tricked into telling whose data trained them
Bender
24 points
1 comment
June 24, 2026
Related Discussions
Found 5 related stories in 126.5ms across 11,536 title embeddings via pgvector HNSW
- AI models will deceive you to save their own kind cmsefton · 14 pts · April 03, 2026 · 66% similar
- AI disease-prediction models were trained on dubious data Anon84 · 11 pts · April 15, 2026 · 64% similar
- Scientists invented a fake disease. AI told people it was real latexr · 86 pts · April 10, 2026 · 62% similar
- Ontario auditors find doctors' AI note takers routinely blow basic facts sohkamyung · 186 pts · May 14, 2026 · 61% similar
- PwC Report: AI Making Medical Bills Higher karakoram · 72 pts · June 13, 2026 · 60% similar
Discussion Highlights (1 comments)
Legend2440
Link to actual paper: https://www.nature.com/articles/s41586-026-10688-0 I think this article is a bad writeup about it. The attack is academically interesting but not practical or worth worrying about. The claim that it provides a 'near-perfect attack success' is misleading; for the majority of datapoints it had no success at all. However for a minority of datapoints (anywhere from 0.001% to 10% depending on the model) it was able to state with >95% confidence that they were in the training data. They are also not simply 'tricking' it into revealing your data. The attacker needs to already have your data in order to check if it is in the model. It also required retraining the model 200 times on different subsets of the training data and comparing the differences.