ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images
Abstract: Designing egocentric 3D hand pose estimation systems that can perform reliably in complex, real-world scenarios is crucial for downstream applications. Previous approaches using RGB or NIR imagery struggle in challenging conditions: RGB methods are susceptible to lighting variations and obstructions like handwear, while NIR techniques can be disrupted by sunlight or interference from other NIR-equipped devices. To address these limitations, we present ThermoHands, the first benchmark focused on thermal image-based egocentric 3D hand pose estimation, demonstrating the potential of thermal imaging to achieve robust performance under these conditions. The benchmark includes a multi-view and multi-spectral dataset collected from 28 subjects performing hand-object and hand-virtual interactions under diverse scenarios, accurately annotated with 3D hand poses through an automated process. We introduce a new baseline method, TherFormer, utilizing dual transformer modules for effective egocentric 3D hand pose estimation in thermal imagery. Our experimental results highlight TherFormer's leading performance and affirm thermal imaging's effectiveness in enabling robust 3D hand pose estimation in adverse conditions.
- Apple: Apple Vision Pro. https://www.apple.com/apple-vision-pro/ (2024), accessed: 2024-02-23
- Apple: Use gestures with Apple Vision Pro. https://support.apple.com/en-us/117741 (2024), accessed: 2024-02-23
- Intel RealSense: Depth Camera D455. https://www.intelrealsense.com/depth-camera-d455/ (2023), accessed: 2024-02-27
- Intel RealSense: LiDAR Camera L515. https://www.intelrealsense.com/lidar-camera-l515/ (2023), accessed: 2024-02-27
- Lloyd, J.M.: Thermal imaging systems. Springer Science & Business Media (2013)
- Meta: Meta Quest. https://www.meta.com/gb/quest/ (2024), accessed: 2024-02-23
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.