What Is Federated Machine Learning?
Federated learning is a decentralized yet collaborative approach to ML model training, according to Kathy Lange, research director for IDC’s AI, Data and Automation Software practice.
“It enables multiple parties to jointly train a model without exchanging or exposing potentially sensitive data,” she says. “It is maturing as a critical architecture for enterprise AI, especially where privacy, compliance and cross-organization collaboration are paramount.”
Across a range of industries — finance, healthcare life sciences and manufacturing — and use cases such as disease research or fraud detection, centralized ML training may not be the most efficient way for organizations to train models.
That’s because, Lange notes, “no single organization may have enough data to build robust, generalizable AI models.”
However, through federated learning, by pooling data across institutions, “organizations can overcome sample size limitations, capture greater diversity and improve the accuracy and reliability of their insights” while still preserving privacy, she says.
READ MORE: Learn how NVIDIA’s CEO, Jensen Huang, envisions the future of enterprise AI.
How Federated Learning Differs From Traditional Centralized AI Training
With traditional centralized AI, all the data is combined into a single data set for training the AI model. In contrast, with federated learning, the data never leaves its original location.
“Only model updates are transmitted to other participants; typically, model parameter updates or gradient changes, not the data itself,” Lange says. “Often, you hear the phrase, ‘The model goes to the data,’ instead of the data going to where the model is being created. Each participant trains the model locally on its own data.”
Privacy-Preserving AI: Why Regulated Industries Are Adopting Federated Learning First
Regulated industries like healthcare and finance have been early adopters of federated ML precisely because of their need for privacy and compliance.
Organizations in these industries can’t simply share sensitive data due to regulations like the Health Insurance Portability and Accountability Act (HIPAA) and General Data Protection Regulation (GDPR).
“Federated learning gives these industries a way to learn from multiple organizations’ data while still keeping tight controls and limiting exposure,” Lange notes. “It’s the best of both worlds: better model accuracy and governance.”
