What they're not telling you: # The Semantic Shell Game: How AI Companies Hide Data Sharing Behind "We Don't Sell" Promises In 2026, the person whose data is being harvested by artificial intelligence companies is increasingly the company itself—not you, if you believe their privacy policies. A Reddit user recently discovered that Perplexity AI had retained their first name despite no explicit account creation or data submission. This seemingly small incident opened a door into privacy policy language that has become industry standard: the distinction between "selling" data and "sharing" it with third parties.
What the Documents Show
Perplexity's privacy documentation claims the company does not sell personal information, a statement technically true while obscuring a far broader practice of data distribution. The user's experience suggests this semantic distinction matters far less than the actual collection and retention occurring behind closed doors. When an AI system knows your name without you providing it directly, the question of whether that violates a "no sales" policy becomes almost philosophical rather than practical. The mainstream tech press typically frames privacy concerns around explicit data sales—the Facebook-Cambridge Analytica scandal model where information is monetized directly. This framing allows companies to craft policies that prohibit "selling" while permitting "sharing" with service providers, analytics partners, and business associates.
Follow the Money
Perplexity's approach mirrors this pattern: the company can simultaneously advertise privacy protection while distributing user information across its operational infrastructure. The user who discovered their name in the system found themselves navigating language designed to be technically defensible rather than transparent. No mainstream outlet has deeply examined how AI companies accumulate baseline user data before any transaction occurs, focusing instead on whether that data is subsequently monetized. The infrastructure supporting this data architecture remains largely invisible to users. When Perplexity identifies a person by first name, it has already connected multiple data points—conversation patterns, IP addresses, query history, temporal metadata. Whether this integration originated from third-party data sources, browser cookies, or connection patterns within Perplexity's own systems becomes unclear when privacy policies use broad categories like "service providers" and "business partners." The user's discovery suggests companies are collecting identifiable information at a baseline level before any explicit consent occurs.
What Else We Know
Regulators have not meaningfully addressed whether identifying users constitutes "collection" under privacy frameworks designed before AI companies could infer identity from interaction patterns alone. The broader implication is that privacy policy language has evolved to be simultaneously accurate and misleading. A company can honestly state it does not sell data while retaining, analyzing, and distributing that information to dozens of operational partners. For ordinary people, this means the choice to use free AI services implicitly means agreeing to participate in data infrastructure whose true scope remains deliberately obscured by terminology. The user who questioned Perplexity's knowledge of their name asked a question regulators have not yet answered: if a company knows your identity but your privacy policy only restricts "sales," what protection do you actually have?
Primary Sources
- Source: r/privacy
- Category: Tech & Privacy
- Cross-reference independently — don't take our word for it.
Disclosure: NewsAnarchist aggregates from public records, API feeds (Federal Register, CourtListener, MuckRock, Hacker News), and independent media. AI-assisted synthesis. Always verify primary sources linked above.

