Policy Snapshot

Data Compensation

Payments to individuals for the use of their data in training AI models.


Compensating individuals and creators for the use of their personal data and intellectual property in training AI models, treating data contributions as a form of labor or capital.

What it is:

Data compensation policies seek to ensure that individuals, creators, and communities are paid when their data, content, or intellectual property is used to train AI systems. Most AI models today are built on vast datasets scraped from the internet — text, images, code, music, personal behavior patterns — without compensating the people who created that material. Data compensation formalizes this contribution as something with economic value that deserves remuneration. Proposed mechanisms range from direct royalties (micropayments to creators whenever their work is used in AI training or output), to data dividends (taxes or fees on companies that monetize user data, redistributed to citizens as a collective return), to data trusts (intermediary organizations that negotiate licensing terms and compensation on behalf of data contributors, analogous to collecting societies in the music industry).

The case for data compensation grows stronger as AI systems become more capable and commercially valuable. If AI increasingly displaces the very workers whose output trained it, data compensation provides a mechanism for those workers to retain an economic connection to the value chain even after their jobs are automated. Unlike taxes or transfers, which require political decisions about redistribution, data compensation has a direct causal logic: the people being paid are the people whose contributions made the system possible.

The challenge:

Measuring individual data contributions to a model trained on billions of data points is technically difficult; the marginal value of any single person's data is vanishingly small, even if the aggregate value of all training data is enormous. This creates a tension between the moral case for compensation (which feels intuitive) and the economic mechanics (which make per-person payments tiny unless concentrated on high-value contributors like professional authors or artists). Enforcement is another challenge: data is easily copied, aggregated, and transformed, making it difficult to track provenance or verify that compensation obligations have been met. There is also a risk that data compensation regimes primarily benefit platforms and intermediaries rather than individual creators, as illustrated by content licensing deals where platforms sell access to user-generated content and retain the proceeds. And overly restrictive data compensation requirements could slow AI development by making training data prohibitively expensive or legally uncertain, potentially concentrating advantage among incumbents that have already trained their models on freely available data.
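The gap between the aggregate and the per-person value of training data can be made concrete with a rough calculation. The sketch below uses entirely hypothetical figures (the pool size and contributor counts are illustrative assumptions, not real market data):

```python
# Back-of-envelope: why flat per-person data payments end up tiny.
# All figures are illustrative assumptions, not real market data.

def per_contributor_payment(total_pool_usd: float, contributors: int) -> float:
    """Evenly split an annual licensing pool across all contributors."""
    return total_pool_usd / contributors

# Hypothetical: a $500M annual licensing pool, split across 1 billion
# people whose data appears somewhere in the training set.
flat = per_contributor_payment(500_000_000, 1_000_000_000)
print(f"${flat:.2f} per person per year")          # → $0.50 per person per year

# The same pool concentrated on 100,000 professional creators instead:
creator = per_contributor_payment(500_000_000, 100_000)
print(f"${creator:,.0f} per creator per year")     # → $5,000 per creator per year
```

The arithmetic illustrates the design choice policymakers face: broad dividends spread value so thin that individual payments are symbolic, while concentrated royalties produce meaningful income only for a narrow class of high-value contributors.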

Real-world precedents:
  • In 2024, HarperCollins struck a pioneering deal with a major tech firm (reportedly Microsoft) to pay authors $2,500 per book for permission to use their work in AI training.

  • Adobe and Shutterstock have implemented "creator funds" that pay annual bonuses to artists whose images train their models.

  • Reddit signed licensing deals worth over $200 million with Google and OpenAI to monetize user-generated content, though the proceeds currently flow to the platform rather than individual users.

  • On the technical side, several projects are building infrastructure for decentralized data monetization. Ocean Protocol has built a blockchain-based marketplace for trading tokenized datasets, while Tim Berners-Lee’s Solid project is developing "personal data pods" that allow individuals to store their data securely and license it to third parties on their own terms.



Securing humanity's AI future

© 2026 Windfall Trust. All rights reserved.
