Academia.eduAcademia.edu

Distributed Learning

description1,087 papers
group1,397 followers
lightbulbAbout this topic
Distributed Learning is an educational approach that utilizes multiple locations and diverse resources to facilitate learning, often leveraging technology to connect learners and instructors across geographical distances. It emphasizes collaboration, flexibility, and accessibility, allowing for personalized learning experiences and the integration of various instructional methods.
lightbulbAbout this topic
Distributed Learning is an educational approach that utilizes multiple locations and diverse resources to facilitate learning, often leveraging technology to connect learners and instructors across geographical distances. It emphasizes collaboration, flexibility, and accessibility, allowing for personalized learning experiences and the integration of various instructional methods.

Key research themes

1. How can communication-efficient strategies mitigate bandwidth and latency constraints in distributed and federated learning systems?

This research area focuses on developing algorithms and frameworks that reduce communication overhead in distributed learning settings, such as federated learning, where bandwidth limitations and communication latency pose critical bottlenecks. Efficient communication protocols aim to compress, sparsify, or coordinate information exchange to maintain model performance while minimizing the amount and frequency of data transmitted between clients and servers or peers. This is crucial for scalability, privacy preservation, and operational feasibility in heterogeneous and resource-constrained environments.

Key finding: This work generalizes the DRIVE algorithm to support adjustable bandwidth constraints for gradient compression, utilizing random rotations and Kashin's representation to achieve low normalized mean squared error (NMSE) in... Read more
Key finding: The authors propose Sufficient Factor Broadcasting (SFB), which transmits low-rank updates (outer products of two vectors) rather than full parameter matrices during distributed model training. This reduces communication cost... Read more
Key finding: This paper introduces a distributed event-triggered stochastic gradient descent (DETSGRAD) algorithm that leverages a communication-triggering mechanism allowing agents to update model parameters aperiodically, rather than at... Read more
Key finding: The authors present Federated Stochastic Block Coordinate Descent (FedBCD), a collaborative learning framework for vertically partitioned (feature-distributed) data that enables multiple local updates before communication... Read more
Key finding: Through an extensive simulation campaign comparing centralized federated learning (CFL), gossip federated learning (GFL), and blockchain-enabled federated learning, this study quantifies communication overhead, convergence... Read more

2. What algorithmic frameworks and optimization techniques enable scalable, provably convergent distributed training of nonconvex models such as neural networks?

Training neural networks in distributed environments inherently involves solving nonconvex optimization problems with data partitioned across agents connected by possibly dynamic network topologies. Designing algorithms that guarantee convergence to stationary points, handle nonconvexity, and efficiently utilize computation and communication resources is nontrivial. Research in this area develops frameworks based on successive convex approximation, primal convexification, dynamic consensus, and gradient coding to provide scalable, general solutions with convergence guarantees and support for parallelism and robustness to stragglers.

Key finding: This work proposes a general distributed framework for training neural networks where data is partitioned among agents connected over sparse, time-varying networks, formulating the learning as a nonconvex social cost... Read more
Key finding: The authors develop a theoretical framework that leverages successive convex approximation and dynamic consensus to train neural networks in distributed settings without centralized coordination. The approach handles general... Read more
Key finding: To mitigate stragglers in synchronous distributed gradient descent, this paper develops a dynamic gradient coding (GC) approach with dynamic clustering (GC-DC), where redundant data and computations are assigned adaptively... Read more
Key finding: The study introduces an age-aware dynamic encoding strategy for coded computation with partial recovery in distributed gradient descent, which reorders codewords and computation priorities over time based on the age of... Read more

3. How can distributed learning be designed to effectively handle data heterogeneity, non-IIDness, and decentralized data partitions in federated and collaborative learning?

Data heterogeneity and non-identically distributed (non-IID) characteristics of clients' local datasets pose significant challenges to federated and collaborative learning. Approaches addressing vertical (feature-partitioned) and horizontal (sample-partitioned) data distributions focus on enabling collaborative training without sharing raw data or model parameters, preserving privacy, and ensuring effective convergence. Research targets clustering clients by data similarity, multiple local update methods, multi-agent systems for dynamic node management, and ensemble architectures for continual learning on streaming nonstationary data—seeking personalized and robust global models in realistic distributed environments.

Key finding: This paper tackles collaborative learning with vertically partitioned features across parties, proposing Federated Stochastic Block Coordinate Descent (FedBCD) that allows multiple local updates before communication. Parties... Read more
Key finding: Proposing a distributed training algorithm for Random Vector Functional-Link (RVFL) networks in vertically partitioned (feature-distributed) settings, this work uses the Alternating Direction Method of Multipliers (ADMM) and... Read more
Key finding: The authors design a federated learning framework implemented through a multi-agent system (MAS) using the SPADE platform, where agents dynamically join or leave as training nodes. Model training occurs locally at each agent... Read more
Key finding: FLIS dynamically clusters clients without accessing private data by measuring inference similarity of local models, enabling grouped training on similar data distributions under non-IID settings. Unlike prior approaches... Read more
Key finding: Introducing Ensemble and Continual Federated Learning (ECFL), this approach leverages ensemble techniques to aggregate diverse local models trained on data streams subject to concept drift, enabling continual adaptation in... Read more

All papers in Distributed Learning

The global artificial intelligence landscape is undergoing a structural shift. As global AI investments scale toward $2 trillion, machine learning has transitioned from an experimental feature into a primary driver of enterprise... more
The rapid evolution of web technologies, coupled with the widespread adoption of microservices architectures, has introduced unprecedented complexity into modern software systems. As applications become increasingly distributed, modular,... more
There is a critical need for seeking alternative, learner-centred, and more supportive design solutions for the MOOC learning environment since that currently existing design models do not seem to take into consideration the diversity of... more
MOOCs are particular learning environments due to their massiveness and openness. Creating learning environments for masses is extremely complex and raises a lot of design questions because of the large variety of participants in terms of... more
We propose a Hybrid System for dynamic environments, where a "Multiple Neural Networks" system works with Bayes Rule to solve a face recognition problem. One or more neural nets may no longer be able to properly operate, due to partial... more
Online learners can study at convenient times and collaborate with others online, but usually can not come to physical labs. We investigated the use of simulated electronics laboratories to increase access and decrease trips to a physical... more
This project develops an AI system to predict feeder-level outage risk across power grids without sharing raw utility data. It leverages federated learning to enable cross-utility collaboration, combined with differential privacy and... more
The vast scale at which the Internet of Things (IoT) is deployed generates an unprecedented volume of data, from tiny sensor readings in chips hosted in remote or hostile environments to high-resolution images helping to... more
Accurately predicting which students are best suited for graduate programs is beneficial to both students and colleges. In this paper, we propose a quantitative machine learning approach to predict an applicant's potential performance in... more
We study anonymous posted price mechanisms for combinatorial auctions in a Bayesian framework. In a posted price mechanism, item prices are posted, then the consumers approach the seller sequentially in an arbitrary order, each purchasing... more
We study anonymous posted price mechanisms for combinatorial auctions in a Bayesian framework. In a posted price mechanism, item prices are posted, then the consumers approach the seller sequentially in an arbitrary order, each purchasing... more
In federated learning, a large number of users are involved in a global learning task, in a collaborative way. They alternate local computations and two-way communication with a distant orchestrating server. Communication, which can be... more
Recently, there has been a rapid global increase in the provision of massive open online courses (MOOCs). Renowned universities are now providing content in this way, and thousands of people complete these freely available courses. The... more
Recently, there has been a rapid global increase in the provision of massive open online courses (MOOCs). Renowned universities are now providing content in this way, and thousands of people complete these freely available courses. The... more
Distributed training techniques have been widely deployed in large-scale deep models training on dense-GPU clusters. However, on public cloud clusters, due to the moderate inter-connection bandwidth between instances, traditional... more
Distributed learning techniques such as federated learning have enabled multiple workers to train machine learning models together to reduce the overall training time. However, current distributed training algorithms (centralized or... more
Nowadays, large and complex deep learning (DL) models are increasingly trained in a distributed manner across multiple worker machines, in which extensive communications between workers pose serious scaling problems. In this article, we... more
Distributed deep learning (DL) has become prevalent in recent years to reduce training time by leveraging multiple computing devices (e.g., GPUs/TPUs) due to larger models and datasets. However, system scalability is limited by... more
To reduce the long training time of large deep neural network (DNN) models, distributed synchronous stochastic gradient descent (S-SGD) is commonly used on a cluster of workers. However, the speedup brought by multiple workers is limited... more
Machine Learning has proven useful in the recent years as a way to achieve failure prediction for industrial systems. However, the high computational resources necessary to run learning algorithms are an obstacle to its widespread... more
A flexible approach for structuring and merging distributed learning object is presented. At the basis of this approach there is a formal representation of a learning object, called attribute structure. Attribute structures are labeled... more
The training of large-scale neural networks increasingly suffers from communication bottlenecks across distributed accelerators, with gradient synchronization dominating both bandwidth consumption and energy costs. Conventional parameter... more
In this paper, a novel algorithm for bandwidth reduction in adaptive distributed learning is introduced. We deal with diffusion networks, in which the nodes cooperate with each other, by exchanging information, in order to estimate an... more
The rapid growth of social media streams has intensified the need for scalable, low-latency sentiment analysis pipelines that can operate under high-volume, real-time constraints. This paper proposes a distributed framework built on... more
Distributed learning across coalitions is becoming popular for multi-centric implementation of deep learning models. However, the level of trust between the members of a coalition can vary and requires different security architectures.... more
Distributed learning across coalitions is becoming popular for multi-centric implementation of deep learning models. However, the level of trust between the members of a coalition can vary and requires different security architectures.... more
Distributed learning across coalitions is becoming popular for multi-centric implementation of deep learning models. However, the level of trust between the members of a coalition can vary and requires different security architectures.... more
Lorsqu'un enseignant veut évaluer le savoir-faire des apprenants à l'aide d'un logiciel, il utilise souvent les systèmes Tutoriels Intelligents (STI). Or, les STI sont difficiles à développer et destinés à un domaine... more
Je tiens à remercier M. Mustapha Bennouna, président de l'Université Abdelmalak Essaâdi Tétouan-Tanger au Maroc, pour m'avoir permis de commencer cette thèse et aussi pour m'avoir fait l'honneur de la présider.
Download research papers for free!