Cross-Camera Trajectories Help Person Retrieval in a Camera Network
Abstract: We are concerned with retrieving a query person from multiple videos captured by a non-overlapping camera network. Existing methods often rely on purely visual matching or consider temporal constraints but ignore the spatial information of the camera network. To address this issue, we propose a pedestrian retrieval framework based on cross-camera trajectory generation, which integrates both temporal and spatial information. To obtain pedestrian trajectories, we propose a novel cross-camera spatio-temporal model that integrates pedestrians' walking habits and the path layout between cameras to form a joint probability distribution. Such a spatio-temporal model for a camera network can be specified from sparsely sampled pedestrian data. Based on the spatio-temporal model, cross-camera trajectories are extracted by a conditional random field model and further optimized by restricted non-negative matrix factorization. Finally, a trajectory re-ranking technique is proposed to improve the pedestrian retrieval results. To evaluate our method, we construct the first cross-camera pedestrian trajectory dataset, the Person Trajectory Dataset, in real surveillance scenarios. Extensive experiments verify the effectiveness and robustness of the proposed method.
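The idea of combining visual matching with spatio-temporal information estimated from sparse samples can be illustrated with a small sketch. This is not the paper's model: it omits the conditional random field and matrix factorization stages entirely, and it replaces the joint probability distribution with a simple smoothed histogram of transit times per camera pair. The class `TransitModel`, the function `rerank`, the bin settings, and the score formula (visual similarity times transit probability raised to a weight) are all assumptions made for illustration.

```python
from collections import defaultdict


class TransitModel:
    """Smoothed histogram of transit times for each ordered camera pair.

    Hypothetical stand-in for a spatio-temporal model; not the paper's formulation.
    """

    def __init__(self, bin_seconds=10.0, num_bins=30, smoothing=1.0):
        self.bin_seconds = bin_seconds
        self.num_bins = num_bins
        self.smoothing = smoothing
        self.hist = {}

    def _bin(self, seconds):
        # Clamp the time gap into the valid bin range [0, num_bins - 1].
        index = int(seconds // self.bin_seconds)
        return max(0, min(index, self.num_bins - 1))

    def fit(self, samples):
        """samples: iterable of (cam_from, cam_to, seconds) observed transitions."""
        counts = defaultdict(lambda: [0.0] * self.num_bins)
        for cam_from, cam_to, seconds in samples:
            counts[(cam_from, cam_to)][self._bin(seconds)] += 1.0
        for pair, row in counts.items():
            # Additive smoothing keeps unseen time gaps from getting zero mass.
            total = sum(row) + self.smoothing * self.num_bins
            self.hist[pair] = [(c + self.smoothing) / total for c in row]
        return self

    def prob(self, cam_from, cam_to, seconds):
        row = self.hist.get((cam_from, cam_to))
        if row is None:
            # No samples for this camera pair: fall back to a uniform guess.
            return 1.0 / self.num_bins
        return row[self._bin(seconds)]


def rerank(query, gallery, visual_sim, model, weight=1.0):
    """Order gallery indices by visual similarity scaled by transit probability.

    query: (camera, timestamp); gallery: list of (camera, timestamp);
    visual_sim: one similarity score per gallery item.
    """
    q_cam, q_time = query
    scored = []
    for idx, (g_cam, g_time) in enumerate(gallery):
        # Direction of travel follows whichever sighting came first.
        if g_time >= q_time:
            p = model.prob(q_cam, g_cam, g_time - q_time)
        else:
            p = model.prob(g_cam, q_cam, q_time - g_time)
        scored.append((visual_sim[idx] * p ** weight, idx))
    scored.sort(key=lambda item: item[0], reverse=True)
    return [idx for _, idx in scored]


# A handful of sampled walks from camera A to camera B, about 20-30 s each.
samples = [("A", "B", 22), ("A", "B", 25), ("A", "B", 28), ("A", "B", 31)]
model = TransitModel().fit(samples)

query = ("A", 0.0)
gallery = [("B", 500.0), ("B", 25.0)]   # item 0 is far too late to be plausible
visual_sim = [0.80, 0.75]               # item 0 looks slightly more similar
ranking = rerank(query, gallery, visual_sim, model)
print(ranking)
```

In this toy setup the candidate with the slightly lower visual score moves to the top because its time gap matches the sampled transit times, which is the intuition behind using camera-network information to correct purely visual rankings.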