
Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning

Published 11 Oct 2023 in cs.AI and cs.LG (arXiv:2310.07800v3)

Abstract: Attention mechanisms have demonstrated significant potential in enhancing learning models by identifying key portions of input data, particularly in scenarios with limited training samples. Inspired by human perception, we propose that focusing on essential data segments, rather than the entire dataset, can improve the accuracy and reliability of the learning models. However, identifying these critical data segments, or "hard attention finding," is challenging, especially in few-shot learning, due to the scarcity of training data and the complexity of model parameters. To address this, we introduce LaHA, a novel framework that leverages language-guided deep reinforcement learning to identify and utilize informative data regions, thereby improving both interpretability and performance. Extensive experiments on benchmark datasets validate the effectiveness of LaHA.

