Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices

Published 14 Feb 2025 in cs.LG and cs.AI | (2502.10239v1)

Abstract: Federated fine-tuning offers a promising approach for tuning LLMs on edge devices while preserving data privacy. However, fine-tuning these models on edge devices remains challenging due to high memory, communication, and computational demands. Zero-order optimization with task alignment provides a potential solution, enabling fine-tuning with inference-level memory requirements but requires a longer convergence time. In this paper, we propose Federated Split-Perturbation Zero-order Optimization (FedSPZO) that divides the network into two blocks, applying a different number of perturbations per block in a computationally effective way, achieving faster convergence. Our evaluation shows a $2.5 - 7\times $ reduction in computation overhead compared to zero-order state of the art techniques in federated learning.