Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings
Abstract: Word embedding is one of the most important components in natural language processing, but interpreting high-dimensional embeddings remains a challenging problem. To address this problem, Independent Component Analysis (ICA) is identified as an effective solution. ICA-transformed word embeddings reveal interpretable semantic axes; however, the order of these axes are arbitrary. In this study, we focus on this property and propose a novel method, Axis Tour, which optimizes the order of the axes. Inspired by Word Tour, a one-dimensional word embedding method, we aim to improve the clarity of the word embedding space by maximizing the semantic continuity of the axes. Furthermore, we show through experiments on downstream tasks that Axis Tour yields better or comparable low-dimensional embeddings compared to both PCA and ICA.
- Abdulrahman Almuhareb and Massimo Poesio. 2005. Concept learning and categorization from the web. Proceedings of the Annual Meeting of the Cognitive Science Society, 27.
- Proceedings of the ESSLLI Workshop on Distributional Lexical Semantics: Bridging the Gap between Semantic Theory and Computational Simulations. European Summer School in Logic, Language and Information (ESSLLI), Hamburg, Germany.
- Marco Baroni and Alessandro Lenci. 2011. How we blessed distributional semantic evaluation. In Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics, Edinburgh, UK, July 31, 2011, pages 1–10. Association for Computational Linguistics.
- William F. Battig and William E. Montague. 1969. Category norms of verbal items in 56 categories: A replication and extension of the connecticut category norms. Journal of Experimental Psychology, 80(3, Pt.2):1–46.
- Michael W Browne. 2001. An overview of analytic rotation in exploratory factor analysis. Multivariate behavioral research, 36(1):111–150.
- Multimodal distributional semantics. Journal of Artificial Intelligence Research, 49:1–47.
- Charles B Crawford and George A Ferguson. 1970. A general rotation criterion and its use in orthogonal rotation. Psychometrika, 35(3):321–332.
- Sparse autoencoders find highly interpretable features in language models. CoRR, abs/2309.08600.
- Placing search in context: The concept revisited. ACM Transactions on information systems, 20(1):116–131.
- Keld Helsgaun. 2000. An effective implementation of the lin-kernighan traveling salesman heuristic. Eur. J. Oper. Res., 126(1):106–130.
- Keld Helsgaun. 2018. LKH (Keld Helsgaun).
- Simlex-999: Evaluating semantic models with (genuine) similarity estimation. Computational Linguistics, 41(4):665–695.
- Aapo Hyvärinen. 1999. Fast and robust fixed-point algorithms for independent component analysis. IEEE Trans. Neural Networks, 10(3):626–634.
- Topographic independent component analysis. Neural Comput., 13(7):1527–1558.
- Independent Component Analysis. Wiley.
- Aapo Hyvärinen and Erkki Oja. 2000. Independent component analysis: algorithms and applications. Neural Networks, 13(4-5):411–430.
- How to evaluate word embeddings? on importance of data efficiency and simple supervised tasks. CoRR, abs/1702.02170.
- Teuvo Kohonen. 2001. Self-Organizing Maps, Third Edition. Springer Series in Information Sciences. Springer.
- Shen Lin and Brian W. Kernighan. 1973. An effective heuristic algorithm for the traveling-salesman problem. Oper. Res., 21(2):498–516.
- Better word representations with recursive neural networks for morphology. In Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pages 104–113.
- Hidden in the Layers: Interpretation of Neural Networks for Natural Language Processing, volume 20 of Studies in Computational and Theoretical Linguistics. Institute of Formal and Applied Linguistics, Prague, Czechia.
- Efficient estimation of word representations in vector space. In 1st International Conference on Learning Representations, ICLR 2013, Scottsdale, Arizona, USA, May 2-4, 2013, Workshop Track Proceedings.
- Linguistic regularities in continuous space word representations. In Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, June 9-14, 2013, Westin Peachtree Plaza Hotel, Atlanta, Georgia, USA, pages 746–751. The Association for Computational Linguistics.
- Tomás Musil. 2019. Examining structure of word embeddings with PCA. In Text, Speech, and Dialogue - 22nd International Conference, TSD 2019, Ljubljana, Slovenia, September 11-13, 2019, Proceedings, volume 11697 of Lecture Notes in Computer Science, pages 211–223. Springer.
- Tomás Musil and David Marecek. 2022. Independent components of word embeddings represent semantic features. CoRR, abs/2212.09580.
- Rotated word vector representations and their interpretability. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, Copenhagen, Denmark, September 9-11, 2017, pages 401–411. Association for Computational Linguistics.
- Scikit-learn: Machine learning in python. J. Mach. Learn. Res., 12:2825–2830.
- Glove: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, pages 1532–1543. ACL.
- A word at a time: Computing word relatedness using temporal semantic analysis. In Proceedings of the 20th International Conference on World Wide Web, page 337–346.
- Herbert Rubenstein and John B. Goodenough. 1965. Contextual correlates of synonymy. Commun. ACM, 8(10):627–633.
- Ryoma Sato. 2022. Word tour: One-dimensional word embeddings via the traveling salesman problem. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022, Seattle, WA, United States, July 10-15, 2022, pages 2166–2172. Association for Computational Linguistics.
- Discovering universal geometry in embeddings with ICA. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023, pages 4647–4675. Association for Computational Linguistics.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.