Customize Consent Preferences

We use cookies to help you navigate efficiently and perform certain functions. You will find detailed information about all cookies under each consent category below.

The cookies that are categorized as "Necessary" are stored on your browser as they are essential for enabling the basic functionalities of the site. ... 

Always Active

Necessary cookies are required to enable the basic features of this site, such as providing secure log-in or adjusting your consent preferences. These cookies do not store any personally identifiable data.

No cookies to display.

Functional cookies help perform certain functionalities like sharing the content of the website on social media platforms, collecting feedback, and other third-party features.

No cookies to display.

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, bounce rate, traffic source, etc.

No cookies to display.

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.

No cookies to display.

Advertisement cookies are used to provide visitors with customized advertisements based on the pages you visited previously and to analyze the effectiveness of the ad campaigns.

No cookies to display.

Categories
Articles

AI Training: Adequate, Relevant and not Excessive!

It’s still common to hear stories where privacy is supposedly in conflict with other objectives. I’ve been writing for years about how that’s not the case in security or access management. This morning’s ICO webinar on Security and Data Minimisation in Artificial Intelligence came up with a counter-example in that field, too…

You might think that the more training data you give to a Machine Learning algorithm, the better it will get. Humans get better with more practice, after all. But that’s not typically how ML works. For a while, more training datapoints will indeed improve the algorithm’s ability to derive general patterns. But if you give it too much training data, then it will tend to simply memorise datapoints. This is called “over-fitting”, where the algorithm gets so good at explaining the points it has already seen that it actually becomes worse at explaining any new ones.

So it turns out that the GDPR’s data minimisation principle – that data shall be adequate, relevant and limited to what is necessary – is the right way to train a Machine Learning algorithm, too!

A recording of the webinar should be in the ICO’s YouTube channel soon.

By Andrew Cormack

I'm Chief Regulatory Advisor at Jisc, responsible for keeping an eye out for places where our ideas, services and products might raise regulatory issues. My aim is to fix either the product or service, or the regulation, before there's a painful bump!

Leave a Reply

Your email address will not be published. Required fields are marked *