Machine Unlearning Can Remove Traces Of Data Points

August 23, 2021

2 min.

An evolving field of computer science known as machine unlearning finds ways to induce selective amnesia in artificial intelligence software, with the goal of removing all traces of a specific person or data point from a machine learning system without affecting its performance.

If this breakthrough is put into practice, it could give individuals greater control over their data and value. Machine unlearning could allow someone to take back both their data and a company’s ability to profit from it.

Technology companies spend millions of dollars training machine learning algorithms to recognize faces or arranging social posts because algorithms can often solve a problem faster than human programmers. But, once trained, machine learning is not so easy to change. The traditional way to eliminate the influence of a particular data point is to rebuild a system from the start, a potentially costly endeavour.

Studies on machine unlearning are partly due to the growing attention paid to the impact of artificial intelligence on privacy. Data regulators around the world have long had the power to force companies to delete information they have illegally obtained. Recently, the U.S. and European regulators have said that owners of AI systems sometimes have to go so far as to delete a system that has been trained with sensitive data.

The small field of machine-unlearning research deals with various practical and mathematical questions raised by these regulatory shifts. Researchers have shown that they can make machine learning algorithms forget under certain conditions, but the technology is not yet ripe for wider use.

One promising method proposed by researchers at the University of Toronto and the University of Wisconsin-Madison in 2019 is to split the source data for a new machine learning project into several parts, which are then processed separately before the results are incorporated into the final machine learning model. If a data point needs to be deleted later, only a fraction of the original input data needs to be reprocessed.

Roth and collaborators at Penn, Harvard, and Stanford have recently demonstrated a flaw in such an approach, showing that the unlearning system would malfunction if deletion requests were made in a particular order, either accidentally or by a malicious actor. They have also shown how the problem could be solved.

It will take superb engineering work for technology companies to finally implement machine unlearning as a way to give people more control over the algorithmic fate of their personal data, and even then, technology may not significantly change the privacy risks of the AI age.

For more information, read the original story in Ars Technica.

TND News Desk

SUBSCRIBE NOW

Become a member

New, Relevant Tech Stories. Our article selection is done by industry professionals. Our writers summarize them to give you the key takeaways

Subscribe Now

North Korean hacker infiltrates US security vendor, loads malware

CrowdStrike releases an update from initial Post Incident Review: Hashtag Trending Special Edition for Thursday July 25, 2024

Security vendor CrowdStrike issues an update from their initial Post Incident Review

CrowdStrike CEO summoned by Homeland Security committee over software disaster

Canadian schools sue social media giants over alleged harm to children

ChatGPT mobile mania: Why users are flocking to ChatGPT Plus

iOS update brings back photos users thought were permanently deleted

Microsoft reveals critical security flaw affecting Android apps

CrowdStrike faces backlash over $10 “apology” voucher

North Korean hacker infiltrates US security vendor, loads malware

Security company accidentally hires a North Korean state hacker: Cybersecurity Today for Friday, July 26, 2024

Security vendor CrowdStrike issues an update from their initial Post Incident Review

Machine Unlearning Can Remove Traces Of Data Points

North Korean hacker infiltrates US security vendor, loads malware

Security company accidentally hires a North Korean state hacker: Cybersecurity Today for Friday, July 26, 2024

CrowdStrike releases an update from initial Post Incident Review: Hashtag Trending Special Edition for Thursday July 25, 2024

Security vendor CrowdStrike issues an update from their initial Post Incident Review

Homeland Security committee demands appearance by CrowdStrike CEO

SUBSCRIBE NOW

Related articles

Target’s new AI is aimed at employees

The good and the bad of AI generated code

Microsoft’s AI success may spell defeat for it’s climate goals

OpenAI’s Chief Scientist Ilya Sutskever Departs Company

Become a member