Person Re-identification: Past, Present and Future

Published 10 Oct 2016 in cs.CV | (1610.02984v1)

Abstract: Person re-identification (re-ID) has become increasingly popular in the community due to its application and research significance. It aims at spotting a person of interest in other cameras. In the early days, hand-crafted algorithms and small-scale evaluation were predominantly reported. Recent years have witnessed the emergence of large-scale datasets and deep learning systems which make use of large data volumes. Considering different tasks, we classify most current re-ID methods into two classes, i.e., image-based and video-based; in both tasks, hand-crafted and deep learning systems will be reviewed. Moreover, two new re-ID tasks which are much closer to real-world applications are described and discussed, i.e., end-to-end re-ID and fast re-ID in very large galleries. This paper: 1) introduces the history of person re-ID and its relationship with image classification and instance retrieval; 2) surveys a broad selection of the hand-crafted systems and the large-scale methods in both image- and video-based re-ID; 3) describes critical future directions in end-to-end re-ID and fast retrieval in large galleries; and 4) finally briefs some important yet under-developed issues.

Abstract PDF Upgrade to Chat

Citations (1,079)

View on Semantic Scholar

Summary

The paper presents an extensive survey of person re-identification methods ranging from traditional hand-crafted features to modern deep learning and video-based techniques.
It details how metric learning and CNN architectures enhance identity matching while addressing issues like detection errors and re-ranking challenges.
The paper highlights future directions, including large-scale dataset handling, transfer learning, and integrated detection-re-ID systems for real-world applications.

Detailed Summary of "Person Re-identification: Past, Present and Future"

The paper "Person Re-identification: Past, Present and Future" (1610.02984) offers an extensive survey of person re-identification (re-ID) systems, focusing on developments from traditional methods to modern deep learning approaches. This essay presents the core content and insights from the paper, suitable for experienced researchers.

Introduction to Person Re-identification

Person re-ID deals with recognizing the same person across different camera views. The task emerged due to the need for public safety and the increase in surveillance networks. Historically, person re-ID was tied to multi-camera tracking. However, evolving demand led to it becoming a distinct research focus. The primary challenge lies in matching images under varying conditions of lighting, pose, and viewpoint.

Image-based Re-ID Methods

The paper divides image-based re-ID methods into two categories: hand-crafted and deeply-learned systems.

Hand-crafted Systems

Hand-crafted methods primarily rely on engineered features such as color histograms, texture descriptors, and local binary patterns (LBP). Metric learning plays a crucial role, with techniques like Mahalanobis distance-based approaches (e.g., KISSME) significant in aligning feature spaces.

Deeply-learned Systems

Deep learning models, particularly CNNs, have gained traction. Early models employed Siamese networks, while newer strategies incorporate classification models, leveraging large datasets for training. Recent architectures have focused on leveraging fine-tuned features for more robust identity matching.

Figure 1

Figure 1: Percentage of person re-ID papers on top conferences over the years. Numbers above the markers indicate the number of re-ID papers.

Figure 2

Figure 2: Milestones in the person re-ID history.

Video-based Re-ID

Video-based re-ID benefits from temporal information. Multi-shot strategies aggregate multiple frames for more reliable biometric matching. While hand-crafted systems initially dominated, the integration of RNNs with CNNs for temporal encoding has improved performance substantially.

Challenges of Detection and Tracking

An emerging area is end-to-end re-ID systems integrating detection, tracking, and identity retrieval. Detection errors like misalignment and false positives significantly impact overall system performance. Future systems must synergize these components to improve accuracy and computational efficiency.

Figure 3

Figure 3: An example of re-ranking in re-ID. Given a query image, an initial rank list is obtained, illustrating corrections via detection error retrieval.

Large-scale Re-ID Systems

Handling re-ID in very large galleries is challenging, requiring robust descriptor learning and efficient indexing structures like inverted indices or hashing. As surveillance systems grow, managing large data sets efficiently will drive practical re-ID applications.

Figure 4

Figure 4: Person re-ID accuracy (mAP and rank-1) versus pedestrian detection accuracy (AP) on the PRW dataset.

Open Issues and Future Directions

The paper identifies several open challenges:

Data Volume: The need for large labeled datasets remains a bottleneck. Transfer learning and leveraging off-the-shelf detection annotations can help.
Re-ranking: Improving re-ranking strategies for refining results post-retrieval is crucial.
Open-world Re-ID: Developing systems capable of handling novel queries not present in the training data is also a future goal.

Conclusion

Person re-ID research straddles multiple disciplines, from feature engineering to learning methods. While advances have been made with deep learning techniques, future systems will need to address scalability, integration with detection and tracking, and efficient computation to be viable in real-world applications. The paper thoroughly outlines past progress and future possibilities, guiding practitioners and researchers in the field toward promising research avenues.

Markdown Report Issue