A Self-supervised Pressure Map Human Keypoint Detection Approach: Optimizing Generalization and Computational Efficiency Across Datasets

Published 22 Feb 2024 in cs.CV and cs.AI | (2402.14241v1)

Abstract: In environments where RGB images are inadequate, pressure maps are a viable alternative and have attracted growing scholarly attention. This study introduces a novel self-supervised pressure map keypoint detection (SPMKD) method, addressing the lack of designs specialized for human keypoint extraction from pressure maps. Central to our contribution is the Encoder-Fuser-Decoder (EFD) model, a robust framework that integrates a lightweight encoder for precise human keypoint detection, a fuser for efficient gradient propagation, and a decoder that transforms human keypoints into reconstructed pressure maps. This structure is further enhanced by the Classification-to-Regression Weight Transfer (CRWT) method, which improves accuracy by first training on a classification task. The approach improves human keypoint generalization without manual annotations and is computationally efficient: it requires only $5.96\%$ of the FLOPs and $1.11\%$ of the parameters of the baseline methods.
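The self-supervised idea in the abstract, where keypoints are decoded back into a pressure map and the reconstruction error replaces manual labels, can be illustrated with a toy sketch. This is not the paper's EFD model: the abstract does not specify layers, losses, or the fuser design, so everything below is a hypothetical stand-in. The "encoder" is a plain pressure-weighted centroid, the "decoder" renders a Gaussian blob at the keypoint, the fuser is omitted, and all function names are invented for illustration.

```python
import math

def centroid_keypoint(pressure):
    """Hypothetical stand-in 'encoder': pressure-weighted centroid (row, col)."""
    total = sum(sum(row) for row in pressure)
    r = sum(i * v for i, row in enumerate(pressure) for v in row) / total
    c = sum(j * v for row in pressure for j, v in enumerate(row)) / total
    return r, c

def render_gaussian(keypoint, height, width, sigma=1.0):
    """Hypothetical stand-in 'decoder': draw a Gaussian blob at the keypoint."""
    kr, kc = keypoint
    return [
        [math.exp(-((i - kr) ** 2 + (j - kc) ** 2) / (2 * sigma ** 2))
         for j in range(width)]
        for i in range(height)
    ]

def mse(a, b):
    """Mean squared error between two equally sized 2D maps."""
    n = len(a) * len(a[0])
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb)) / n

# Synthetic 5x7 "pressure map": a single blob centred at row 2, column 3.
HEIGHT, WIDTH = 5, 7
pressure = render_gaussian((2.0, 3.0), HEIGHT, WIDTH)

# Encode to a keypoint, decode back to a map, and score the reconstruction.
# No keypoint annotation is used anywhere: the input map is its own target.
keypoint = centroid_keypoint(pressure)
reconstruction = render_gaussian(keypoint, HEIGHT, WIDTH)
loss = mse(pressure, reconstruction)

# A deliberately wrong keypoint reconstructs the map worse, so the
# reconstruction error carries a signal about keypoint quality.
bad_loss = mse(pressure, render_gaussian((0.0, 0.0), HEIGHT, WIDTH))

print(keypoint, loss, bad_loss)
```

In a learned model the centroid and the fixed Gaussian renderer would be replaced by trainable networks, and the reconstruction error would be minimized by gradient descent; the sketch only shows why a keypoint bottleneck can be trained without labels.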
