In a significant stride towards enhancing data privacy and efficiency in machine learning, a team of researchers from China has developed a robust Semi-Supervised Federated Learning (SSFL) system. This innovation, led by Dr. Wang Shufen from the School of Information Engineering at Harbin Institute of Petroleum, along with colleagues from Heilongjiang University and Guangzhou Maritime University, addresses critical challenges in federated learning (FL), particularly in scenarios where client-side data are unlabeled.
Federated learning allows multiple devices or clients to collaboratively train a shared global model while keeping data localized, thereby preserving privacy. However, traditional FL systems assume that client-side data are labeled, which is often not the case in real-world applications. The team’s SSFL system is designed to handle unlabeled data, making it more practical for diverse applications, including those in the maritime sector.
The researchers employed two key methods to improve the robustness and efficiency of their system. The FedMix method analyzes implicit relationships between global model iterations, enabling the system to learn from both labeled and unlabeled data effectively. Meanwhile, the FedLoss aggregation method mitigates the impact of non-Identically and Independently Distributed (non-IID) data among clients, ensuring faster convergence and stability of the global model. As Dr. Wang Shufen explained, “The FedLoss method dynamically adjusts the weight of the local model in the global model based on the loss function value of the client model, which significantly improves the system’s robustness.”
The commercial implications of this research are substantial. In the maritime industry, where data privacy and security are paramount, the SSFL system can facilitate collaborative learning across vessels and ports without compromising sensitive information. This can lead to improved predictive maintenance, route optimization, and safety protocols, ultimately enhancing operational efficiency and reducing costs.
Moreover, the system’s ability to handle non-IID data is particularly valuable in the maritime sector, where data distribution can vary widely due to different vessel types, operating conditions, and environmental factors. As Dr. Zhang Zhe from Heilongjiang University noted, “Our system’s robustness to different non-IID levels makes it highly adaptable to the diverse and dynamic conditions encountered in maritime operations.”
The researchers validated their system using the CIFAR-10 dataset, demonstrating a classification accuracy approximately 3 percentage points higher than mainstream FL systems. This improvement highlights the potential of the SSFL system to outperform traditional methods in real-world applications.
Published in the journal ‘Jisuanji gongcheng’, which translates to ‘Computer Engineering’, this research represents a significant advancement in the field of federated learning. As the maritime industry continues to embrace digital transformation, innovations like the SSFL system will play a crucial role in unlocking the full potential of data-driven decision-making while ensuring privacy and security.
For maritime professionals, the SSFL system offers a promising solution to the challenges of data heterogeneity and privacy, paving the way for more collaborative and efficient operations. As the industry navigates the complexities of digitalization, such advancements will be instrumental in driving progress and innovation.

