Activity-Biometrics: Person Identification from Daily Activities
Abstract: In this work, we study a novel problem which focuses on person identification while performing daily activities. Learning biometric features from RGB videos is challenging due to spatio-temporal complexity and presence of appearance biases such as clothing color and background. We propose ABNet, a novel framework which leverages disentanglement of biometric and non-biometric features to perform effective person identification from daily activities. ABNet relies on a bias-less teacher to learn biometric features from RGB videos and explicitly disentangle non-biometric features with the help of biometric distortion. In addition, ABNet also exploits activity prior for biometrics which is enabled by joint biometric and activity learning. We perform comprehensive evaluation of the proposed approach across five different datasets which are derived from existing activity recognition benchmarks. Furthermore, we extensively compare ABNet with existing works in person identification and demonstrate its effectiveness for activity-based biometrics across all five datasets. The code and dataset can be accessed at: \url{https://github.com/sacrcv/Activity-Biometrics/}
- Elastic transform. https://pytorch.org/vision/main/generated/torchvision.transforms.ElasticTransform.html. [Online; accessed 08-November-2023].
- Past, present, and future of face recognition: A review. Electronics, 9(8):1188, 2020.
- Vivit: A video vision transformer. In ICCV, pages 6836–6846, 2021.
- Salient-to-broad transition for video person re-identification. In CVPR, pages 7339–7348, 2022.
- Pstr: End-to-end one-step person search with transformers. In CVPR, pages 9458–9467, 2022.
- Learning recurrent 3d attention for video-based person re-identification. IEEE TIP, 29:6963–6976, 2020.
- Learning 3d shape feature for texture-insensitive person re-identification. In CVPR, pages 8146–8155, 2021.
- Masked-attention mask transformer for universal image segmentation. In CVPR, 2022.
- Pku-mmd: A large scale benchmark for continuous multi-modal human action understanding. arXiv preprint arXiv:1703.07475, 2017.
- Expanding accurate person recognition to new altitudes and ranges: The briar dataset. In WACV, pages 593–602, 2023.
- Video person re-identification by temporal residual learning. IEEE TIP, 28(3):1366–1377, 2018.
- Video-based person re-identification with spatial and temporal memory networks. In ICCV, pages 12036–12045, 2021.
- Gaitpart: Temporal part-based model for gait recognition. In CVPR, pages 14225–14233, 2020.
- Opengait: Revisiting gait recognition towards better practicality. In CVPR, pages 9707–9716, 2023.
- Appearance-preserving 3d convolution for video-based person re-identification. In ECCV, pages 228–243. Springer, 2020.
- Clothes-changing person re-identification with rgb modality only. In CVPR, pages 1060–1069, 2022.
- Semantic-aware consistency network for cloth-changing person re-identification. In ACM MM, 2023.
- Deep residual learning for image recognition. In CVPR, pages 770–778, 2016.
- Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015.
- Fine-grained shape-appearance mutual learning for cloth-changing person re-identification. In CVPR, pages 10513–10522, 2021.
- Bicnet-tks: Learning efficient spatial-temporal representation for video person re-identification. In CVPR, pages 2014–2023, 2021.
- Celebrities-reid: A benchmark for clothes variation in long-term person re-identification. In IJCNN, pages 1–8. IEEE, 2019.
- Rethinking temporal fusion for video-based person re-identification on semantic and time aspect. In AAAI, pages 11133–11140, 2020.
- Cloth-changing person re-identification from a single image with gait prediction and regularization. In CVPR, pages 14278–14287, 2022.
- Adam: A method for stochastic optimization. ICLR, 2015.
- Mvitv2: Improved multiscale vision transformers for classification and detection. In CVPR, pages 4804–4814, 2022.
- Gaitedge: Beyond plain end-to-end gait recognition for better practicality. In ECCV, pages 375–390. Springer, 2022.
- Gait recognition via effective global-local feature representation and local temporal aggregation. In ICCV, pages 14648–14656, 2021.
- Spatially and temporally efficient non-local attention network for video-based person re-identification. arXiv preprint arXiv:1908.01683, 2019a.
- Ntu rgb+ d 120: A large-scale benchmark for 3d human activity understanding. IEEE TPAMI, 42(10):2684–2701, 2019b.
- Deep learning face attributes in the wild. In ICCV, 2015.
- Swin transformer: Hierarchical vision transformer using shifted windows. In ICCV, pages 10012–10022, 2021.
- Magface: A universal representation for face recognition and quality assessment. In CVPR, pages 14225–14234, 2021.
- Accenture-mm1: A multimodal person recognition dataset. In WACVW, 2024.
- Robust re-identification by multiple views knowledge distillation. In ECCV, pages 93–110. Springer, 2020.
- Long-term cloth-changing person re-identification. In ACCV, 2020.
- Grounded sam: Assembling open-world models for diverse visual tasks, 2024.
- Hollywood in homes: Crowdsourcing data collection for activity understanding. In ECCV, pages 510–526, 2016.
- Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. NeurIPS, 30, 2017.
- Pyramid spatial-temporal aggregation for video-based person re-identification. In ICCV, pages 12026–12035, 2021.
- Distilled person re-identification: Towards a more scalable system. In CVPR, pages 1187–1196, 2019.
- Person re-identification by contour sketch under moderate clothing change. IEEE TPAMI, 43(6):2029–2046, 2019.
- Good is bad: Causality inspired cloth-debiasing for cloth-changing person re-identification. In CVPR, pages 1472–1481, 2023.
- Deep learning for person re-identification: A survey and outlook. IEEE TPAMI, 44(6):2872–2893, 2021.
- A framework for evaluating the effect of view angle, clothing and carrying condition on gait recognition. In ICPR, pages 441–444, 2006.
- Image-to-video person re-identification with temporally memorized similarity learning. IEEE TCSVT, 28(10):2622–2632, 2017.
- Gait recognition via disentangled representation learning. In CVPR, pages 4710–4719, 2019.
- Scalable person re-identification: A benchmark. In ICCV, pages 1116–1124, 2015.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.