| California Assembly Bill 2013 (2024) | |
|---|---|
| | |
| California State Legislature | |
| Full name | AB 2013: Generative artificial intelligence: training data transparency |
| Introduced | January 31, 2024 |
| Assembly voted | May 20, 2024 |
| Senate voted | August 26, 2024 |
| Sponsor | Jacqui Irwin |
| Governor | Gavin Newsom |
| Bill | AB-2013 |
| Website | Bill Text |
Status: Current legislation | |
California Assembly Bill 2013 (AB 2013) is a California law requiring developers of generative artificial intelligence systems to publicly disclose information about the data used to train their models. The law was authored by Assemblymember Jacqui Irwin (D-Thousand Oaks), signed by Governor Gavin Newsom on September 28, 2024, and took effect on January 1, 2026. [1] [2] It passed both chambers of the legislature unanimously (38–0 in the Senate, 75–0 in the Assembly). [3]
AB 2013 was among 18 AI-related bills enacted by California in 2024, a period in which the state's regulation of artificial intelligence drew national attention, particularly around the vetoed SB 1047. [4]
The law applies to developers who make generative AI systems or services publicly available to Californians, whether for free or for compensation. The statute defines "developer" broadly to cover anyone who designs, codes, or produces an AI system, as well as anyone who creates a new version or update that materially changes its functionality, including through retraining or fine-tuning. [1] [5] Exemptions exist for AI systems used solely for security and integrity purposes, for aircraft operations, or for national security and defense purposes made available only to federal entities. [1]
The law applies retroactively to any generative AI system released on or after January 1, 2022, and to any substantial modification made after that date. [1]
Developers must post documentation on their websites describing the data used to train their generative AI systems. The law requires this documentation to include a "high-level summary" of the datasets, covering twelve categories of information: [1] [6]
This documentation must be posted before the system is made publicly available and updated before each substantial modification. [1]
The law does not establish a specific penalty or enforcement mechanism for noncompliance, nor does it include a trade secret exemption for disclosures. [4] The absence of a trade secret provision has been a point of concern among legal commentators, who have noted that forced disclosure could reduce the value of proprietary information about training datasets. [7]
When the law took effect on January 1, 2026, OpenAI and Anthropic were among the first companies to publish the required documentation. [8] Both companies addressed each of the twelve statutory categories but did not name any specific datasets, instead characterizing their training data at a general level by referring to categories such as web content, licensed material, user contributions, and AI-generated data. [8]
In its disclosure, Anthropic said that personal information appears in its training data as a byproduct of collecting publicly available web content, and described the use of technical measures to reduce the presence of such information in the model's responses. [8] Both companies stated that their training data may include copyrighted material. [8]
As of early 2026, several other major AI developers had not yet published disclosures. [8] [9]
On December 29, 2025, two days before the law took effect, xAI, the developer of the Grok chatbot, filed a federal lawsuit in the United States District Court for the Central District of California against California Attorney General Rob Bonta, seeking to block enforcement of AB 2013. [10] [11] xAI is represented by the firm of Paul Clement and Erin Murphy. [11]
The complaint raises four constitutional claims: [12]
Legal commentators at the Institute for Law & AI have observed that the strength of xAI's trade secret argument is weakened by the fact that OpenAI and Anthropic chose to comply voluntarily, suggesting the statute can be satisfied without disclosing competitively sensitive details. [12] The California Department of Justice said it would defend the law. [10]