WhoFi: Deep Person Re-Identification via Wi-Fi Channel Signal Encoding

Published 17 Jul 2025 in cs.CV and cs.LG | (2507.12869v1)

Abstract: Person Re-Identification is a key and challenging task in video surveillance. While traditional methods rely on visual data, issues like poor lighting, occlusion, and suboptimal angles often hinder performance. To address these challenges, we introduce WhoFi, a novel pipeline that utilizes Wi-Fi signals for person re-identification. Biometric features are extracted from Channel State Information (CSI) and processed through a modular Deep Neural Network (DNN) featuring a Transformer-based encoder. The network is trained using an in-batch negative loss function to learn robust and generalizable biometric signatures. Experiments on the NTU-Fi dataset show that our approach achieves competitive results compared to state-of-the-art methods, confirming its effectiveness in identifying individuals via Wi-Fi signals.

Abstract PDF Upgrade to Chat

Summary

The paper introduces a novel deep learning pipeline, WhoFi, that encodes Wi-Fi CSI data to create robust biometric signatures for person re-identification.
The methodology employs a Transformer-based encoder and an in-batch negative loss function to optimize feature clustering and separation.
Experimental results show a Rank-1 accuracy of 95.5% and mAP of 88.4%, demonstrating the effectiveness of Wi-Fi signals as a privacy-preserving biometric modality.

WhoFi: Deep Person Re-Identification via Wi-Fi Channel Signal Encoding

The paper "WhoFi: Deep Person Re-Identification via Wi-Fi Channel Signal Encoding" (2507.12869) introduces a novel deep learning pipeline, WhoFi, for person Re-ID utilizing Wi-Fi Channel State Information (CSI). The system extracts biometric features from CSI data and processes them through a modular Deep Neural Network (DNN) featuring a Transformer-based encoder. The network is trained using an in-batch negative loss function to learn robust and generalizable biometric signatures. The study demonstrates competitive results on the NTU-Fi dataset, affirming the effectiveness of identifying individuals via Wi-Fi signals.

Background and Motivation

Traditional person Re-ID systems rely on visual data, which are susceptible to limitations such as poor lighting, occlusion, and viewpoint variations. Wi-Fi-based person Re-ID offers a non-visual alternative, leveraging the principle that Wi-Fi signals are altered by the presence and physical characteristics of individuals. CSI provides detailed measurements of how radio signals interact with the environment, capturing biometric information. The paper addresses the underexplored area of Wi-Fi-based Re-ID by developing scalable deep learning methods that generalize across individuals and sensing environments.

Methodology

The proposed WhoFi pipeline consists of data pre-processing, data augmentation, a DNN architecture, and a loss function.

Data Pre-processing and Augmentation

The pre-processing stage involves cleaning CSI data to extract meaningful biometric features. This includes amplitude filtering using a Hampel filter to remove outliers and phase sanitization to correct phase shifts caused by hardware synchronization issues.

Figure 1: The system processes input signals through an encoder, extracts latent representations, computes a signature vector, normalizes the signature, and uses the normalized signature as a unique identifier.

Deep Neural Network Architecture

The DNN architecture comprises an Encoder module ( $M_e$ ) and a Signature Module ( $M_s$ ). The Encoder module extracts relevant information from CSI measurements using LSTM, Bi-LSTM, or Transformer networks. The Signature module then generates a biometric signature, applying $l^2$ normalization to ensure uniformity and facilitate similarity computations.

Loss Function

The pipeline employs an in-batch negative loss function to train the model. This loss function encourages signatures from the same person to cluster together in the embedding space while maximizing the distance between signatures from different individuals. A custom batch sampler constructs batches with query and gallery lists, and a similarity matrix is computed using cosine similarity.

Figure 2: A similarity matrix is constructed with diagonal elements representing similarities between query signatures and corresponding positive gallery signatures.

Experimental Results

The experiments were conducted on the NTU-Fi dataset, evaluating the performance of LSTM, Bi-LSTM, and Transformer-based models. The Transformer-based model achieved superior results, with a Rank-1 accuracy of 95.5% and a mean Average Precision (mAP) of 88.4%. Ablation studies explored the impact of amplitude filtering, data augmentation, packet size, and model depth. The results indicated that the Transformer encoder benefits from longer input sequences and performs optimally with a single layer.

Discussion

The results demonstrate the effectiveness of the Transformer-based model in capturing discriminative temporal patterns within Wi-Fi amplitude sequences. The ablation studies provide insights into the importance of pre-processing steps and the impact of data augmentation on model performance. The findings suggest that Wi-Fi signals can serve as a robust and privacy-preserving biometric modality for person Re-ID.

Conclusion

The paper presents a deep learning pipeline for person Re-ID using Wi-Fi CSI, leveraging a DNN to generate biometric signatures. The study demonstrates the viability of Wi-Fi signals as a biometric modality and establishes a baseline for future research in CSI-based person re-identification.

Markdown Report Issue