Atlassian enables default data collection to train AI

kevcampb 546 points 124 comments April 20, 2026
letsdatascience.com · View on Hacker News

Discussion Highlights (20 comments)

kevcampb

I really wish I could find a better source to link to for this. By default, all free and paid customers are being opted-in to their data being used for AI training. All your Confluence pages, Jira tickets, etc. https://support.atlassian.com/security-and-access-policies/d... describes how to disable this, but it also appears that the setting to disable this doesn't exist (it's not visible on any of our instances).

martinald

Atlassian just goes from misstep to misstep. I still use their products quite often. The amount of P0 bugs I experience is absolutely crazy: - Bitbucket workers are hopelessly out of date (self hosted). We've had to put so many random workarounds in especially for Docker, as they don't keep them up to date enough - I have had a bug in JIRA for years where I can't reorder a new ticket unless I refresh the page - Every new feature they introduce into JIRA/Bitbucket over the past couple of years just doesn't work. - I tried their AI stuff on the free trial, didn't work at all, tried to cancel, can't cancel the free trial online and had to write a load of support tickets (of which the support ticket contact form bugged out multiple times). Anyone have any insight into why things have got so so dysfunctional? Tech debt? Talent leaving? Both? Even 'bad' enterprise software tends to be able to keep the most basic features running, but Atlassian is a whole new category. If you check their 'community' it is just hundreds/thousands of bugs with workarounds.

huwsername

If the rumours of an Anthropic acquisition are true, this makes a lot of sense. Anthropic are probably looking for a clean, high-signal dataset of metadata around business tasks that they can buy.

oliver236

genius move.

tqwhite

I don't see it as a misstep at all. The purpose of StackOVerflow is to share expertise. I am 100% supportive of it being used for training... AI, you, everyone.

reeseparker63

Worth noting that Atlassian's data residency options don't exempt you from this—your data can still be used for training even if you've pinned it to a specific region.

kepano

The official Atlassian FAQ on this change: https://www.atlassian.com/trust/ai/data-contribution/faqs

dreknows

The opt-out-by-default pattern has been gradually normalizing in enterprise SaaS, but what makes this particularly egregious is the combination of two things: the data scope (not just metadata, but all in-app content per kevcampb's link) and the broken opt-out (the disabling setting not rendering on any instance). One is a policy decision you can argue about. Both together suggest the friction is intentional. The data residency point is worth flagging separately - a lot of enterprise buyers treat region-pinning as a privacy guarantee for everything in their contract. It was never that. Residency tells you where data is stored at rest, not who can access it for what purpose.

pkilgore

Does this apply to Loom?

willis936

Presumably the government and HIPAA carveouts are for legal obligations. Trade secret theft is illegal so I wonder why they're not considering this.

firesteelrain

No wonder they wanted to stop supporting the Data Center versions for on prem.

jerhewet

Will Atlassian be harvesting code and content from private Bitbucket repositories? The wording in their policies and FAQ's is vague, so I'd like to get a definitive (Yes / No) answer.

microflash

I read this as "Stop using this product" toggle every time a company does this without consent. It has done a good amount of mental and financial improvements to me.

Bnjoroge

Plenty of other companies enable this by default too, such as Github, Figma, Adobe, Vercel. I think it's fair to assume that if you ahve data stored within any company, they'll by default use it for training.

an0malous

We need to kill SaaS. Apps should be local-first and have peer-to-peer data sync. These companies won't stop until they use your data to replace you and enrich their owners.

rsynnott

Imagine an AI based on jira tickets. _That's_ the torment nexus.

qsera

I am wondering why not just rsyncrypt the source code before pushing to the repo? >rsyncrypto is a utility that encrypts a file (or a directory structure) in a way that ensures that local changes to the plain text file will result in local changes to the cipher text file. This, in turn, ensures that doing rsync to synchronize the encrypted files to another machine will have only a small impact on rsync's wire efficiency. https://manpages.ubuntu.com/manpages/focal/man1/rsyncrypto.1...

titzer

AI contributing to rising natural stupidity.

jason_s

I'm really tired of JIRA, to the point where I have expressed it publicly: https://www.embeddedrelated.com/showarticle/1772.php

shadowgovt

The only silver lining I can see in this is that if they replace their existing tooling with AI integration, we might actually get search and confluence that works. I've lost count of how many times I search for a keyword and get no relevant results, but the document I'm looking for, which contains the keyword, is in my automatic pop-up of recent documents visited.

Semantic search powered by Rivestack pgvector
5,126 stories · 48,318 chunks indexed