A new tool, Data Provenance Explorer, lets users pick through the questionable provenance of many large data sets used for AI training. A new online tool allows users to identify, track and learn ...
Personally identifiable information has been found in DataComp CommonPool, one of the largest open-source data sets used to train image generation models. Millions of images of passports, credit cards ...
Katharine Kemp is a Member of the Expert Panel of the Consumer Policy Research Centre, and the Australian Privacy Foundation. Photos of Australian children have been ...
A team has developed a new method that facilitates and improves predictions of tabular data, especially for small data sets with fewer than 10,000 data points. The new AI model TabPFN is trained on ...