Bill
Bill > A06578
NY A06578
NY A06578Establishes the artificial intelligence training data transparency act requiring developers of generative artificial intelligence models or services to post on the developer's website information regarding the data used by the developer to train the generative artificial intelligence model or service, including a high-level summary of the datasets used in the development of such system or service.
summary
Introduced
03/06/2025
03/06/2025
In Committee
05/05/2026
05/05/2026
Crossed Over
05/05/2026
05/05/2026
Passed
Dead
Introduced Session
2025-2026 General Assembly
Bill Summary
AN ACT to amend the general business law, in relation to establishing the artificial intelligence training data transparency act
AI Summary
This bill, titled the "artificial intelligence training data transparency act," mandates that developers of generative artificial intelligence (AI) models or services must disclose information about the data used to train these systems on their websites. Generative AI refers to AI that can create new content like text, images, or audio. Developers are defined as entities that design, code, produce, or significantly alter AI models or services for public use, with "substantially modifies" meaning updates that materially change functionality. The required disclosures, to be posted by January 1, 2027, and before any subsequent public release or significant modification of an AI model or service released after January 1, 2022, must include a high-level summary of the training datasets, detailing their sources, how they support the AI's purpose, the number of data points (which can be in general ranges), the types of data points, whether they contain copyrighted or trademarked material, if they were purchased or licensed, if they include personal or aggregate consumer information, any cleaning or modifications made, the period of data collection, and when the datasets were first used in development. Developers may also describe the purpose of synthetic data generation, which is the creation of artificial data with similar statistical characteristics to original data. Exceptions to these disclosure requirements exist for AI models solely for aircraft operation or those developed for national security, military, or defense purposes and only made available to federal entities.
Committee Categories
Business and Industry
Sponsors (12)
Alex Bores (D)*,
Monique Chandler-Waterman (D),
Brian Cunningham (D),
Judy Griffin (D),
Jonathan Jacobson (D),
Anna Kelles (D),
Dana Levenberg (D),
Steve Otis (D),
Nader Sayegh (D),
Phara Souffrant Forrest (D),
Emerita Torres (D),
Jordan Wright (D),
Last Action
REFERRED TO INTERNET AND TECHNOLOGY (on 05/05/2026)
Official Document
bill text
bill summary
Loading...
bill summary
Loading...
bill summary
Loading...