Decentralized Machine Learning with Centralized Performance Guarantees via Gibbs Algorithms

Yaiza Bermudez; Samir Perlaza; Iñaki Esnaola

✨ TL;DR

This paper shows that decentralized machine learning can achieve the same performance as centralized learning by having clients share Gibbs measures (probability distributions over models) instead of raw data. The key innovation is using each client's Gibbs measure as a reference measure for the next client, effectively encoding prior information while preserving privacy.

01 · Problem

Decentralized machine learning typically requires either sharing local datasets (compromising privacy) or accepting degraded performance compared to centralized learning. The challenge is to design a decentralized learning framework that achieves centralized performance guarantees without requiring clients to share their private data.

02 · Approach

Clients adopt an empirical risk minimization with relative-entropy regularization (ERM-RER) framework and establish forward-backward communication. The key mechanism is that each client k produces a Gibbs measure (a probability distribution over models weighted by their empirical risk), which is then used as the reference measure by client k+1. This creates a chain where local inductive biases are propagated through reference measures rather than raw data. The regularization factors must be scaled appropriately with local sample sizes to achieve centralized performance.

03 · Key insights

What the paper shows.

01Gibbs measures can serve as a privacy-preserving alternative to sharing raw data while maintaining centralized performance guarantees

02Using the previous client's Gibbs measure as a reference measure for the next client effectively encodes prior information in a principled way

03Proper scaling of regularization factors with local sample sizes is critical for achieving centralized performance in the decentralized setting

04This approach shifts collaboration strategy from data sharing to sharing local inductive bias through reference measures over the model space

04 · Results

The paper demonstrates that when clients follow the ERM-RER framework with forward-backward communication and share Gibbs measures, they achieve the same performance as centralized ERM-RER that has access to all datasets. The specific scaling requirement for regularization factors with local sample sizes is identified as necessary for this equivalence.

05 · Limitations

The paper does not provide explicit numerical experiments or empirical validation of the theoretical results. The forward-backward communication structure may impose constraints on the network topology and communication patterns. The practical computational cost of computing and sharing Gibbs measures compared to other decentralized approaches is not discussed. The scalability to very large numbers of clients or high-dimensional model spaces is not addressed.

✨ Generated by Claude · Apr 25, 2026 · Read the PDF for authoritative content.

What the paper shows.

↘ Related papers