Papers
Topics
Authors
Recent
Search
2000 character limit reached

Hierarchical NeuroSymbolic Approach for Comprehensive and Explainable Action Quality Assessment

Published 20 Mar 2024 in cs.CV, cs.AI, cs.LG, and cs.SC | (2403.13798v2)

Abstract: Action quality assessment (AQA) applies computer vision to quantitatively assess the performance or execution of a human action. Current AQA approaches are end-to-end neural models, which lack transparency and tend to be biased because they are trained on subjective human judgements as ground-truth. To address these issues, we introduce a neuro-symbolic paradigm for AQA, which uses neural networks to abstract interpretable symbols from video data and makes quality assessments by applying rules to those symbols. We take diving as the case study. We found that domain experts prefer our system and find it more informative than purely neural approaches to AQA in diving. Our system also achieves state-of-the-art action recognition and temporal segmentation, and automatically generates a detailed report that breaks the dive down into its elements and provides objective scoring with visual evidence. As verified by a group of domain experts, this report may be used to assist judges in scoring, help train judges, and provide feedback to divers. Annotated training data and code: https://github.com/laurenok24/NSAQA.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (26)
  1. The kimore dataset: Kinematic assessment of movement and clinical scores for remote monitoring of physical rehabilitation. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 27(7):1436–1448, 2019.
  2. Neural-symbolic learning systems - foundations and applications. In Perspectives in Neural Computing, 2012.
  3. Who’s better, who’s best: Skill determination in video using deep ranking.
  4. Who’s better? who’s best? pairwise deep ranking for skill determination. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 6057–6066, 2018.
  5. The pros and cons: Rank-aware temporal attention for skill determination in long videos. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 7862–7871, 2019.
  6. Action quality assessment using siamese network-based deep metric learning. IEEE Transactions on Circuits and Systems for Video Technology, 31(6):2260–2273, 2020.
  7. Amy Kwan. Usa diving elearning center.
  8. A survey of vision-based human action evaluation methods. Sensors, 19(19):4129, 2019.
  9. End-to-end learning for action quality assessment. In Pacific Rim Conference on Multimedia, pages 125–134. Springer, 2018a.
  10. Scoringnet: Learning key fragment for action quality assessment with ranking loss in skilled sports. In Asian Conference on Computer Vision, pages 149–164. Springer, 2018b.
  11. Manipulation-skill assessment from videos with spatial attention network. In Proceedings of the IEEE/CVF international conference on computer vision workshops, pages 0–0, 2019.
  12. Towards unified surgical skill assessment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9522–9531, 2021.
  13. Extraction and classification of diving clips from continuous video footage. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 38–48, 2017.
  14. Action assessment by joint relation graphs. In Proceedings of the IEEE/CVF international conference on computer vision, pages 6331–6340, 2019.
  15. Action quality assessment across multiple actions. In 2019 IEEE winter conference on applications of computer vision (WACV), pages 1468–1476. IEEE, 2019.
  16. Measuring the quality of exercises. In 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pages 2241–2244. IEEE, 2016.
  17. Learning to score olympic events. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 20–28, 2017.
  18. What and how well you performed? a multitask learning approach to action quality assessment. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 304–313, 2019.
  19. Vi-net—view-invariant quality of human movement assessment. Sensors, 20(18):5258, 2020.
  20. Neurosymbolic programming for science. arXiv preprint arXiv:2210.05050, 2022.
  21. Uncertainty-aware score distribution learning for action quality assessment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
  22. Python tutorial. Centrum voor Wiskunde en Informatica Amsterdam, The Netherlands, 1995.
  23. Learning to score figure skating sport videos. IEEE transactions on circuits and systems for video technology, 30(12):4578–4590, 2019.
  24. Finediving: A fine-grained dataset for procedure-aware action quality assessment, 2022.
  25. Neural-symbolic vqa: Disentangling reasoning from vision and language understanding. Advances in neural information processing systems, 31, 2018.
  26. Group-aware contrastive regression for action quality assessment. In Proceedings of the IEEE/CVF international conference on computer vision, pages 7919–7928, 2021.
Citations (2)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 1 like about this paper.