Papers
Topics
Authors
Recent
Search
2000 character limit reached

Leveraging Ensembles and Self-Supervised Learning for Fully-Unsupervised Person Re-Identification and Text Authorship Attribution

Published 7 Feb 2022 in cs.CV | (2202.03126v4)

Abstract: Learning from fully-unlabeled data is challenging in Multimedia Forensics problems, such as Person Re-Identification and Text Authorship Attribution. Recent self-supervised learning methods have shown to be effective when dealing with fully-unlabeled data in cases where the underlying classes have significant semantic differences, as intra-class distances are substantially lower than inter-class distances. However, this is not the case for forensic applications in which classes have similar semantics and the training and test sets have disjoint identities. General self-supervised learning methods might fail to learn discriminative features in this scenario, thus requiring more robust strategies. We propose a strategy to tackle Person Re-Identification and Text Authorship Attribution by enabling learning from unlabeled data even when samples from different classes are not prominently diverse. We propose a novel ensemble-based clustering strategy whereby clusters derived from different configurations are combined to generate a better grouping for the data samples in a fully-unsupervised way. This strategy allows clusters with different densities and higher variability to emerge, reducing intra-class discrepancies without requiring the burden of finding an optimal configuration per dataset. We also consider different Convolutional Neural Networks for feature extraction and subsequent distance computations between samples. We refine these distances by incorporating context and grouping them to capture complementary information. Our method is robust across both tasks, with different data modalities, and outperforms state-of-the-art methods with a fully-unsupervised solution without any labeling or human intervention.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (72)
  1. M. Caron, I. Misra, J. Mairal, P. Goyal, P. Bojanowski, and A. Joulin, “Unsupervised learning of visual features by contrasting cluster assignments,” arXiv preprint, vol. arXiv:2006.09882, 2020.
  2. K. He, H. Fan, Y. Wu, S. Xie, and R. Girshick, “Momentum contrast for unsupervised visual representation learning,” in Conf. Comput. Vis. Pattern Recog., 2020, pp. 9729–9738.
  3. T. Chen, S. Kornblith, M. Norouzi, and G. Hinton, “A simple framework for contrastive learning of visual representations,” in Int. Conf. Mach. Learn., 2020, pp. 1597–1607.
  4. M. Caron, H. Touvron, I. Misra, H. Jégou, J. Mairal, P. Bojanowski, and A. Joulin, “Emerging properties in self-supervised vision transformers,” arXiv preprint, vol. arXiv:2104.14294, 2021.
  5. J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “Imagenet: A large-scale hierarchical image database,” in Conf. Comput. Vis. Pattern Recog., 2009, pp. 248–255.
  6. E. Ristani, F. Solera, R. Zou, R. Cucchiara, and C. Tomasi, “Performance measures and a data set for multi-target, multi-camera tracking,” in Eur. Conf. Comput. Vis., 2016, pp. 17–35.
  7. L. Zheng, L. Shen, L. Tian, S. Wang, J. Wang, and Q. Tian, “Scalable person re-identification: A benchmark,” in Int. Conf. Comput. Vis., 2015, pp. 1116–1124.
  8. L. Wei, S. Zhang, W. Gao, and Q. Tian, “Person transfer GAN to bridge domain gap for person re-identification,” in Conf. Comput. Vis. Pattern Recog., 2018, pp. 79–88.
  9. Y. Ge, D. Chen, F. Zhu, R. Zhao, and H. Li, “Self-paced contrastive learning with hybrid memory for domain adaptive object re-id,” arXiv preprint, vol. arXiv:2006.02713, 2020.
  10. X. Zhang, Y. Ge, Y. Qiao, and H. Li, “Refining pseudo labels with clustering consensus over generations for unsupervised object re-identification,” in Conf. Comput. Vis. Pattern Recog., 2021, pp. 3436–3445.
  11. H. Chen, B. Lagadec, and F. Bremond, “Enhancing diversity in teacher-student networks via asymmetric branches for unsupervised person re-identification,” in Winter Conf. Appl. Comput. Vis., 2020, pp. 1–10.
  12. G. C. Bertocco, F. Andaló, and A. Rocha, “Unsupervised and self-adaptative techniques for cross-domain person re-identification,” IEEE Trans. Inf. Forensics Security, vol. 16, pp. 4419–4434, 2021.
  13. Z. Zhong, L. Zheng, D. Cao, and S. Li, “Re-ranking person re-identification with k-reciprocal encoding,” in Conf. Comput. Vis. Pattern Recog., 2017, pp. 1318–1327.
  14. Y. Ge, D. Chen, and H. Li, “Mutual mean-teaching: Pseudo label refinery for unsupervised domain adaptation on person re-identification,” arXiv preprint, vol. arXiv:2001.01526, 2020.
  15. Y. Zhai, Q. Ye, S. Lu, M. Jia, R. Ji, and Y. Tian, “Multiple expert brainstorming for domain adaptive person re-identification,” arXiv preprint, vol. arXiv:2007.01546, 2020.
  16. Z. Zhong, L. Zheng, Z. Luo, S. Li, and Y. Yang, “Learning to adapt invariance in memory for person re-identification,” IEEE Trans. Pattern Anal. Mach. Intell., pp. 1–1, 2020.
  17. Y. Zou, X. Yang, Z. Yu, B. Kumar, and J. Kautz, “Joint disentangling and adaptation for cross-domain person re-identification,” arXiv preprint, vol. arXiv:2007.10315, 2020.
  18. S. Xuan and S. Zhang, “Intra-inter camera similarity for unsupervised person re-identification,” in Conf. Comput. Vis. Pattern Recog., 2021, pp. 11 926–11 935.
  19. M. Wang, B. Lai, J. Huang, X. Gong, and X.-S. Hua, “Camera-aware proxies for unsupervised person re-identification,” arXiv preprint, vol. arXiv:2012.10674, 2020.
  20. H. Chen, B. Lagadec, and F. Bremond, “ICE: Inter-instance contrastive encoding for unsupervised person re-identification,” in Int. Conf. Comput. Vis., 2021, pp. 14 960–14 969.
  21. Z. Wang, J. Zhang, L. Zheng, Y. Liu, Y. Sun, Y. Li, and S. Wang, “CycAs: Self-supervised cycle association for learning re-identifiable descriptions,” in Eur. Conf. Comput. Vis., 2020, pp. 72–88.
  22. J. Wu, Y. Yang, H. Liu, S. Liao, Z. Lei, and S. Z. Li, “Unsupervised graph association for person re-identification,” in Int. Conf. Comput. Vis., 2019, pp. 8321–8330.
  23. M. Li, C.-G. Li, and J. Guo, “Cluster-guided asymmetric contrastive learning for unsupervised person re-identification,” IEEE Transactions on Image Processing, 2022.
  24. X. Zhang, D. Li, Z. Wang, J. Wang, E. Ding, J. Q. Shi, Z. Zhang, and J. Wang, “Implicit sample extension for unsupervised person re-identification,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 7369–7378.
  25. Y. Cho, W. J. Kim, S. Hong, and S.-E. Yoon, “Part-based pseudo label refinement for unsupervised person re-identification,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 7308–7318.
  26. A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” in Adv. Neural Inf. Process. Syst., 2017, pp. 5998–6008.
  27. J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” arXiv preprint, vol. arXiv:1810.04805, 2018.
  28. D. Q. Nguyen, T. Vu, and A. T. Nguyen, “BERTweet: A pre-trained language model for english tweets,” arXiv preprint, vol. arXiv:2005.10200, 2020.
  29. C. Raffel, N. Shazeer, A. Roberts, K. Lee, S. Narang, M. Matena, Y. Zhou, W. Li, and P. J. Liu, “Exploring the limits of transfer learning with a unified text-to-text transformer,” arXiv preprint, vol. arXiv:1910.10683, 2019.
  30. A. Theophilo, L. A. Pereira, and A. Rocha, “A needle in a haystack? harnessing onomatopoeia and user-specific stylometrics for authorship attribution of micro-messages,” in International Conference on Acoustics, Speech and Signal Processing.   IEEE, 2019, pp. 2692–2696.
  31. D. D. Kirkpatrick, “Who is behind qanon? linguistic detectives find fingerprints,” https://www.nytimes.com/2022/02/19/technology/qanon-messages-authors.html, 2022, [Online; accessed on January 22nd, 2023].
  32. B. Boenninghoff, S. Hessler, D. Kolossa, and R. M. Nickel, “Explainable authorship verification in social media via attention-based similarity learning,” in Int. Conf. Big Data, 2019, pp. 36–45.
  33. Y. Lin, L. Xie, Y. Wu, C. Yan, and Q. Tian, “Unsupervised person re-identification via softened similarity learning,” in Conf. Comput. Vis. Pattern Recog., 2020, pp. 3390–3399.
  34. Y. Lin, Y. Wu, C. Yan, M. Xu, and Y. Yang, “Unsupervised person re-identification via cross-camera similarity exploration,” IEEE Trans. Image Process., vol. 29, pp. 5481–5490, 2020.
  35. H. Ji, L. Wang, S. Zhou, W. Tang, N. Zheng, and G. Hua, “Meta pairwise relationship distillation for unsupervised person re-identification,” in Int. Conf. Comput. Vis., 2021, pp. 3661–3670.
  36. F. Yang, Z. Zhong, Z. Luo, Y. Cai, Y. Lin, S. Li, and N. Sebe, “Joint noise-tolerant learning and meta camera shift adaptation for unsupervised person re-identification,” in Conf. Comput. Vis. Pattern Recog., 2021, pp. 4855–4864.
  37. J. Li and S. Zhang, “Joint visual and temporal consistency for unsupervised domain adaptive person re-identification,” in Eur. Conf. Comput. Vis., 2020, pp. 483–499.
  38. H. Chen, Y. Wang, B. Lagadec, A. Dantcheva, and F. Bremond, “Joint generative and contrastive learning for unsupervised person re-identification,” in Conf. Comput. Vis. Pattern Recog., 2021, pp. 2004–2013.
  39. S. R. S, M. V. Prasad, and R. Balakrishnan, “Spatio-temporal association rule based deep annotation-free clustering (STAR-DAC) for unsupervised person re-identification,” Pattern Recog., vol. 122, p. 108287, 2022.
  40. M. Li, X. Zhu, and S. Gong, “Unsupervised person re-identification by deep learning tracklet association,” in Eur. Conf. Comput. Vis., 2018, pp. 737–753.
  41. G. Wu, X. Zhu, and S. Gong, “Tracklet self-supervised learning for unsupervised person re-identification,” in Conf. Artif. Intell., vol. 34, no. 7, 2020, pp. 12 362–12 369.
  42. M. Li, X. Zhu, and S. Gong, “Unsupervised tracklet person re-identification,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 7, pp. 1770–1782, 2019.
  43. Y. Lin, X. Dong, L. Zheng, Y. Yan, and Y. Yang, “A bottom-up clustering approach to unsupervised person re-identification,” in Conf. Artif. Intell., vol. 33, no. 1, 2019, pp. 8738–8745.
  44. Z. Sun, F. Zhao, and F. Wu, “Unsupervised person re-identification via global-level and patch-level discriminative feature learning,” in IEEE Int. Conf. Image Process., 2021, pp. 2363–2367.
  45. Q. Yin, G. Wang, G. Ding, S. Gong, and Z. Tang, “Multi-view label prediction for unsupervised learning person re-identification,” IEEE Signal Process. Lett., vol. 28, pp. 1390–1394, 2021.
  46. D. Wang and S. Zhang, “Unsupervised person re-identification via multi-label classification,” in Conf. Comput. Vis. Pattern Recog., 2020, pp. 10 981–10 990.
  47. K. Zeng, M. Ning, Y. Wang, and Y. Guo, “Hierarchical clustering with hard-batch triplet loss for person re-identification,” in Conf. Comput. Vis. Pattern Recog., 2020, pp. 13 657–13 665.
  48. H. Luo, W. Jiang, Y. Gu, F. Liu, X. Liao, S. Lai, and J. Gu, “A strong baseline and batch normalization neck for deep person re-identification,” IEEE Trans. Multimedia, vol. 22, no. 10, pp. 2597–2609, 2019.
  49. A. Tarvainen and H. Valpola, “Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results,” arXiv preprint, vol. arXiv:1703.01780, 2017.
  50. M. Ester, H.-P. Kriegel, J. Sander, and X. Xu, “A density-based algorithm for discovering clusters in large spatial databases with noise,” in Int. Conf. Knowledge Discovery Data Mining, 1996, pp. 226–231.
  51. J. Hou, H. Gao, and X. Li, “DSets-DBSCAN: A parameter-free clustering algorithm,” IEEE Trans. Image Process., vol. 25, no. 7, pp. 3182–3193, 2016.
  52. L. Van der Maaten and G. Hinton, “Visualizing data using t-SNE,” Journal Mach. Lear. Res., vol. 9, no. 11, pp. 2579–2605, 2008.
  53. H. Fan, L. Zheng, C. Yan, and Y. Yang, “Unsupervised person re-identification: Clustering and fine-tuning,” ACM Trans. Multimedia Comput., Commun., Appl., vol. 14, no. 4, pp. 1–18, 2018.
  54. H. Whittingham and S. K. Ashenden, “Chapter 5 - hit discovery,” in The Era of Artificial Intelligence, Machine Learning, and Data Science in the Pharmaceutical Industry, S. K. Ashenden, Ed.   Academic Press, 2021, pp. 81–102. [Online]. Available: https://www.sciencedirect.com/science/article/pii/B9780128200452000064
  55. M. K. Pakhira, “A linear time-complexity k-means algorithm using cluster shifting,” in 2014 International Conference on Computational Intelligence and Communication Networks, 2014, pp. 1047–1051.
  56. A. Hermans, L. Beyer, and B. Leibe, “In defense of the triplet loss for person re-identification,” arXiv preprint, vol. arXiv:1703.07737, 2017.
  57. A. Theophilo, R. Giot, and A. Rocha, “Authorship attribution of social media messages,” IEEE Trans. Comput. Social Syst., 2021.
  58. K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Conf. Comput. Vis. Pattern Recog., 2016, pp. 770–778.
  59. K. Zhou, Y. Yang, A. Cavallaro, and T. Xiang, “Omni-scale feature learning for person re-identification,” in Int. Conf. Comput. Vis., 2019, pp. 3702–3712.
  60. G. Huang, Z. Liu, L. Van Der Maaten, and K. Q. Weinberger, “Densely connected convolutional networks,” in Conf. Comput. Vis. Pattern Recog., 2017, pp. 4700–4708.
  61. D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint, vol. arXiv:1412.6980, 2014.
  62. A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala, “PyTorch: An imperative style, high-performance deep learning library,” in Adv. Neural Inf. Process. Syst., 2019, pp. 8024–8035.
  63. K. Zhou and T. Xiang, “Torchreid: A library for deep learning person re-identification in Pytorch,” arXiv preprint, vol. arXiv:1910.10093, 2019.
  64. M. Barni, E. Nowroozi, and B. Tondi, “Improving the security of image manipulation detection through one-and-a-half-class multiple classification,” Multimedia Tools and Applications, vol. 79, no. 3, pp. 2383–2408, 2020.
  65. M. Barni, K. Kallas, E. Nowroozi, and B. Tondi, “Cnn detection of gan-generated face images based on cross-band co-occurrences analysis,” in 2020 IEEE international workshop on information forensics and security (WIFS).   IEEE, 2020, pp. 1–6.
  66. T. Pevny, P. Bas, and J. Fridrich, “Steganalysis by subtractive pixel adjacency matrix,” IEEE Transactions on Information Forensics and Security, vol. 5, no. 2, pp. 215–224, 2010.
  67. M. Barni, E. Nowroozi, and B. Tondi, “Detection of adaptive histogram equalization robust against jpeg compression,” in 2018 International Workshop on Biometrics and Forensics (IWBF).   IEEE, 2018, pp. 1–8.
  68. F. Yang, K. Li, Z. Zhong, Z. Luo, X. Sun, H. Cheng, X. Guo, F. Huang, R. Ji, and S. Li, “Asymmetric co-teaching for unsupervised cross-domain person re-identification,” in Conf. Artif. Intell., 2020, pp. 12 597–12 604.
  69. Z. Dai, G. Wang, W. Yuan, S. Zhu, and P. Tan, “Cluster contrast for unsupervised person re-identification,” arXiv preprint arXiv:2103.11568, 2021.
  70. G. Hinton, O. Vinyals, and J. Dean, “Distilling the knowledge in a neural network,” arXiv preprint arXiv:1503.02531, 2015.
  71. S. D. Khan and H. Ullah, “A survey of advances in vision-based vehicle re-identification,” Comput. Vis. Image Understanding, vol. 182, pp. 50–63, 2019.
  72. H. Wang, C. Wang, and L. Xie, “Online visual place recognition via saliency re-identification,” in Int. Conf. Intell. Robots Syst., 2020, pp. 5030–5036.
Citations (8)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.