Count What You Want: Exemplar Identification and Few-shot Counting of Human Actions in the Wild

Published 28 Dec 2023 in cs.CV and cs.AI | (2312.17330v1)

Abstract: This paper addresses the task of counting human actions of interest using sensor data from wearable devices. We propose a novel exemplar-based framework, allowing users to provide exemplars of the actions they want to count by vocalizing the predefined sounds "one", "two", and "three". Our method first localizes the temporal positions of these utterances in the audio sequence. These positions serve as the basis for identifying exemplars representing the action class of interest. A similarity map is then computed between the exemplars and the entire sensor data sequence, which is further fed into a density estimation module to generate a sequence of estimated density values. Summing these density values provides the final count. To develop and evaluate our approach, we introduce a diverse and realistic dataset consisting of real-world data from 37 subjects and 50 action categories, encompassing both sensor and audio data. The experiments on this dataset demonstrate the viability of the proposed method in counting instances of actions from new classes and subjects that were not part of the training data. On average, the discrepancy between the predicted count and the ground truth value is 7.47, significantly lower than the errors of the frequency-based and transformer-based methods. Our project, code, and dataset can be found at https://github.com/cvlab-stonybrook/ExRAC.
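
The counting steps described in the abstract (similarity map, density sequence, summation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of cosine similarity over sliding windows, and the toy numbers are all assumptions made for illustration, and the learned density estimation module is not reproduced here. Reading the reported "discrepancy" as a mean absolute error is likewise an assumption.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_map(exemplars, sequence):
    """One row per exemplar; each entry compares the exemplar with one sliding window.

    Assumption: exemplars and the sequence are 1-D lists of sensor values.
    Real wearable data would be multi-channel and use learned features.
    """
    rows = []
    for exemplar in exemplars:
        width = len(exemplar)
        rows.append([
            cosine(exemplar, sequence[t:t + width])
            for t in range(len(sequence) - width + 1)
        ])
    return rows


def count_from_density(density):
    """The final count is the sum of the per-step density values."""
    return sum(density)


def mean_absolute_error(predicted, truth):
    """Average absolute gap between predicted and ground-truth counts."""
    return sum(abs(p - t) for p, t in zip(predicted, truth)) / len(predicted)


# Toy data (hypothetical): one exemplar that recurs twice in the sequence.
exemplar = [1.0, 0.0, -1.0]
sequence = [1.0, 0.0, -1.0, 0.0, 1.0, 0.0, -1.0]
sim = similarity_map([exemplar], sequence)

# Hypothetical density values standing in for the output of a learned module.
density = [0.0, 0.5, 0.5, 0.0, 0.25, 0.75]
count = count_from_density(density)
```

In this toy case the similarity row peaks at the two positions where the exemplar recurs, and the hand-written density values sum to a count of two. In the paper the density values come from a trained module that takes the similarity map as input.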
