- The paper presents a comprehensive survey of deep gait recognition methods, introducing a novel taxonomy based on body, temporal, feature, and neural dimensions.
- It evaluates state-of-the-art neural architectures like CNNs, GANs, and RNNs using benchmarks such as the CASIA-B and OU-MVLP datasets.
- The study highlights challenges including adversarial robustness and cross-dataset generalization, outlining promising future research directions.
Deep Gait Recognition: A Survey
The paper, "Deep Gait Recognition: A Survey" (2102.09546), provides a comprehensive review of the advancements in gait recognition using deep learning methodologies. The focus lies on vision-based systems, analyzing datasets, test protocols, state-of-the-art deep learning solutions, challenges, and potential future research directions in this domain.
Introduction to Gait Recognition
Gait recognition is a unique biometric modality relying on the distinct manner of an individual's walk. It has significant applications in security, healthcare, and more. Deep learning architectures have largely transformed gait recognition by autonomously learning discriminative features, surpassing traditional methods.
Figure 1: The number of gait recognition papers published after 2015 using non-deep and deep gait recognition methods.
The dominance of deep learning in recent research is clearly depicted in Figure 1.
Evolution and Taxonomy in Gait Recognition
Evolution of Deep Gait Recognition
The evolution of deep learning methodologies in gait recognition is graphically depicted, showcasing the transition and improvement over time.
Figure 2: The evolution of deep gait recognition methods.
Taxonomy
The authors propose a novel taxonomy for deep gait recognition, segmented into four dimensions: body representation, temporal representation, feature representation, and neural architecture (Figure 3). This taxonomy provides a structured framework for understanding the technology landscape in this field.
Figure 3: Our taxonomy consisting of 4 dimensions: body representation, temporal representation, feature representation, and neural architecture.
Core Components and Approaches
Body and Temporal Representation
Gait recognition primarily adopts silhouettes and skeleton-based representations. Silhouettes are widely used due to their ease of computation. Temporal representations involve either static templates like Gait Energy Images (GEI) or dynamic sequence volumes.
Figure 4: Overview of temporal representations in gait recognition using various stages of neural networks.
Neural Architectures
Multiple deep learning structures are utilized, including CNNs, GANs, RNNs, and hybrid models that combine these architectures. These networks are tailored for learning both spatial and temporal characteristics of gait data.
Figure 5: Different approaches for using RNNs in the context of deep gait recognition systems.
A detailed examination of test protocols across various datasets is provided (Figure 6), notably distinguishing between subject-dependent and independent evaluations.
Figure 6: An overview of test protocols in gait recognition.
Datasets
Gait datasets are critical for assessing deep learning solutions, with the most commonly used being CASIA-B and OU-MVLP. They significantly contribute to benchmarking performance and generalization.
Challenges and Future Directions
The paper highlights several challenges:
- Disentanglement and Self-supervised Learning: Effective representation requires disentangling identity from other confounding factors and leveraging unlabeled data.
- Adversarial Robustness: There is a challenge in addressing gait recognition's vulnerability to adversarial attacks.
- Synthesized Data and Multi-task Learning: Generating realistic synthetic data and employing multi-task learning paradigms offer promising avenues for enhancing robustness and applicability.
- Cross-Dataset Evaluation: Ensuring recognition systems generalize across different data domains is crucial.
Conclusion
This paper systematically explores the advancements and state-of-the-art methodologies in deep gait recognition. It categorizes existing technology, evaluates against popular datasets, and outlines unresolved challenges and innovative directions that can facilitate enhanced real-world applications and robustness in gait recognition systems.
Figure 7: Visualization and frequency of different neural architectures, loss functions, and gait datasets used in literature.