YouTuber Marques Brownlee discusses iOS 18 in a new video. This specific video wasn't part of the large dataset that was used to train AI models, but many of his others were. (credit: Marques Brownlee)

AI models at Apple, Salesforce, Anthropic, and other major technology players were trained on tens of thousands of YouTube videos without the creators' consent and potentially in violation of YouTube's terms, according to a new report appearing in both Proof News and Wired.

The companies trained their models in part by using "the Pile," a collection assembled by the nonprofit EleutherAI to offer a useful dataset to individuals or companies that don't have the resources to compete with Big Tech, though those bigger companies have since used it as well. The Pile includes books, Wikipedia articles, and much more.