Papers
Topics
Authors
Recent
Search
2000 character limit reached

Maximizing Information in Domain-Invariant Representation Improves Transfer Learning

Published 1 Jun 2023 in cs.CV and cs.LG | (2306.00262v4)

Abstract: The most effective domain adaptation (DA) technique involves the decomposition of data representation into a domain-independent representation (DIRep) and a domain-dependent representation (DDRep). A classifier is trained by using the DIRep on the labeled source images. Since the DIRep is domain invariant, the classifier can be "transferred" to make predictions for the target domain with no (or few) labels. However, information useful for classification in the target domain can "hide" in the DDRep. Current DA algorithms, such as Domain-Separation Networks (DSN), do not adequately address this issue. DSN's weak constraint to enforce the orthogonality of DIRep and DDRep allows this hiding effect and can result in poor performance. To address this shortcoming, we develop a new algorithm wherein a stronger constraint is imposed to minimize the information content in DDRep to create a DIRep that retains relevant information about the target labels and, in turn, results in a better invariant representation. By using synthetic datasets, we show explicitly that depending on the initialization, DSN, with its weaker constraint, can lead to sub-optimal solutions with poorer DA performance. In contrast, our algorithm is robust against such perturbations. We demonstrate the equal-or-better performance of our approach against DSN and other recent DA methods by using several standard benchmark image datasets. We further highlight the compatibility of our algorithm with pre-trained models for classifying real-world images and showcase its adaptability and versatility through its application in network intrusion detection.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. Contour detection and hierarchical image segmentation. IEEE transactions on pattern analysis and machine intelligence, 33(5):898–916, 2010.
  2. Domain separation networks. Advances in neural information processing systems, 29, 2016.
  3. Improved techniques for adversarial discriminative domain adaptation. IEEE Transactions on Image Processing, 29:2622–2637, 2019.
  4. Adversarial-learned loss for domain adaptation. In Proceedings of the AAAI conference on artificial intelligence, volume 34, pp.  3521–3528, 2020.
  5. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pp.  248–255. Ieee, 2009.
  6. Informative feature disentanglement for unsupervised domain adaptation. IEEE Transactions on Multimedia, 24:2407–2421, 2021.
  7. Domain-adversarial training of neural networks. The journal of machine learning research, 17(1):2096–2030, 2016.
  8. Deep reconstruction-classification networks for unsupervised domain adaptation. In European conference on computer vision, pp.  597–613. Springer, 2016.
  9. Generative adversarial networks. Communications of the ACM, 63(11):139–144, 2020.
  10. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  770–778, 2016.
  11. Variational interaction information maximization for cross-domain disentanglement. Advances in Neural Information Processing Systems, 33:22479–22491, 2020.
  12. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning, pp. 448–456. PMLR, 2015.
  13. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
  14. Transferable adversarial training: A general approach to adapting deep classifiers. In International Conference on Machine Learning, pp. 4013–4022. PMLR, 2019.
  15. Deep unsupervised domain adaptation: a review of recent advances and perspectives. APSIPA Transactions on Signal and Information Processing, 11(1), 2022.
  16. Conditional adversarial domain adaptation. Advances in neural information processing systems, 31, 2018.
  17. Reading digits in natural images with unsupervised feature learning. 2011.
  18. Domain agnostic learning with disentangled representations. In International Conference on Machine Learning, pp. 5102–5112. PMLR, 2019.
  19. Pedro O Pinheiro. Unsupervised domain adaptation with similarity learning. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  8004–8013, 2018.
  20. Adapting visual category models to new domains. In Computer Vision–ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Crete, Greece, September 5-11, 2010, Proceedings, Part IV 11, pp.  213–226. Springer, 2010.
  21. Generate to adapt: Aligning domains using generative adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  8503–8512, 2018.
  22. Preparing network intrusion detection deep learning models with minimal data using adversarial domain adaptation. In Proceedings of the 15th ACM Asia Conference on Computer and Communications Security, pp.  127–140, 2020.
  23. Adversarial discriminative domain adaptation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  7167–7176, 2017.
  24. Deep visual domain adaptation: A survey. Neurocomputing, 312:135–153, 2018.
  25. Larger norm more transferable: An adaptive feature norm approach for unsupervised domain adaptation. In Proceedings of the IEEE/CVF international conference on computer vision, pp.  1426–1435, 2019.
  26. Transfer adaptation learning: A decade survey. IEEE Transactions on Neural Networks and Learning Systems, 2022.
  27. Youshan Zhang. A survey of unsupervised domain adaptation for visual recognition. arXiv preprint arXiv:2112.06745, 2021.
  28. A comprehensive survey on transfer learning. Proceedings of the IEEE, 109(1), 2020.
  29. Unsupervised domain adaptation for semantic segmentation via class-balanced self-training. In Proceedings of the European conference on computer vision (ECCV), pp.  289–305, 2018.

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 0 likes about this paper.