Papers
Topics
Authors
Recent
Search
2000 character limit reached

Multi-Scale and Multi-Modal Contrastive Learning Network for Biomedical Time Series

Published 6 Dec 2023 in cs.LG and cs.AI | (2312.03796v1)

Abstract: Multi-modal biomedical time series (MBTS) data offers a holistic view of the physiological state, holding significant importance in various bio-medical applications. Owing to inherent noise and distribution gaps across different modalities, MBTS can be complex to model. Various deep learning models have been developed to learn representations of MBTS but still fall short in robustness due to the ignorance of modal-to-modal variations. This paper presents a multi-scale and multi-modal biomedical time series representation learning (MBSL) network with contrastive learning to migrate these variations. Firstly, MBTS is grouped based on inter-modal distances, then each group with minimum intra-modal variations can be effectively modeled by individual encoders. Besides, to enhance the multi-scale feature extraction (encoder), various patch lengths and mask ratios are designed to generate tokens with semantic information at different scales and diverse contextual perspectives respectively. Finally, cross-modal contrastive learning is proposed to maximize consistency among inter-modal groups, maintaining useful information and eliminating noises. Experiments against four bio-medical applications show that MBSL outperforms state-of-the-art models by 33.9% mean average errors (MAE) in respiration rate, by 13.8% MAE in exercise heart rate, by 1.41% accuracy in human activity recognition, and by 1.14% F1-score in obstructive sleep apnea-hypopnea syndrome.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (21)
  1. “A comparison of signal combinations for deep learning-based simultaneous sleep staging and respiratory event detection,” IEEE Transactions on Biomedical Engineering, vol. 70, no. 5, pp. 1704–1714, 2023.
  2. “An empirical evaluation of generic convolutional and recurrent networks for sequence modeling,” arXiv:1803.01271 [cs.LG], 2018, http://arxiv.org/abs/1803.01271.
  3. “Ts2vec: Towards universal representation of time series,” in Association for the Advancement of Artificial Intelligence (AAAI), 2022.
  4. “Multi-scale convolutional neural networks for time series classification,” arXiv:1603.06995 [cs.CV], 2016, https://arxiv.org/abs/1603.06995.
  5. “InceptionTime: Finding AlexNet for time series classification,” Data Mining and Knowledge Discovery, vol. 34, no. 6, pp. 1936–1962, sep 2020.
  6. “Rrwavenet: A compact end-to-end multiscale residual cnn for robust ppg respiratory rate estimation,” IEEE Internet of Things Journal, vol. 10, no. 18, pp. 15943–15952, 2023.
  7. “Concad: Contrastive learning-based cross attention for sleep apnea detection,” in European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 2021.
  8. “Self-supervised contrastive pre-training for time series via time-frequency consistency,” in 36th Conference on Neural Information Processing Systems (NeurIPS), 2022.
  9. “First de-trend then attend: Rethinking attention for time-series forecasting,” in 36th Conference on Neural Information Processing Systems (NeurIPS), 2022.
  10. “The capacity and robustness trade-off: Revisiting the channel independent strategy for multivariate time series forecasting,” arXiv:2304.05206 [cs.LG], 2023, https://arxiv.org/abs/2304.05206.
  11. “Unsupervised time-series representation learning with iterative bilinear temporal-spectral fusion,” in Proceedings of the 39 th International Conference on Machine Learning (ICML), 2022.
  12. “Timemae: Self-supervised representations of time series with decoupled masked autoencoders,” arXiv:2303.00320 [cs.LG], 2023, https://arxiv.org/pdf/2303.00320.pdf.
  13. “Respnet: A deep learning model for extraction of respiration from photoplethysmogram,” in 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2019, pp. 5556–5559.
  14. “A transformer-based framework for multivariate time series representation learning,” in Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2021.
  15. “Toward a robust estimation of respiratory rate from pulse oximeters,” IEEE Transactions on Biomedical Engineering, vol. 64, no. 8, pp. 1914–1923, 2017.
  16. “Troika: A general framework for heart rate monitoring using wrist-type photoplethysmographic signals during intensive physical exercise,” IEEE Transactions on Biomedical Engineering, vol. 62, no. 2, pp. 522–531, 2015.
  17. “A public domain dataset for human activity recognition using smartphones,” in The European Symposium on Artificial Neural Networks (ESANN), 2013.
  18. “Probabilistic modelling of sleep stage and apneaic events in the university college of dublin database (ucddb),” in 2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON). IEEE, 2019, pp. 0133–0139.
  19. “Time-series representation learning via temporal and contextual contrasting,” in International Joint Conferences on Artificial Intelligence (IJCAI), 2021.
  20. “Respiratory event detection during sleep using electrocardiogram and respiratory related signals: Using polysomnogram and patch-type wearable device data,” IEEE Journal of Biomedical and Health Informatics, vol. 26, no. 2, pp. 550–560, 2022.
  21. “A co-training approach for noisy time series learning,” in Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (CIKM), 2023.

Summary

  • The paper introduces a novel contrastive learning framework for biomedical time series that groups data by modality for targeted processing.
  • It employs adaptive patching and multi-scale transformation to capture crucial features while maintaining computational efficiency.
  • Experimental validation shows the approach outperforms state-of-the-art models in predicting physiological parameters like heart and respiration rates.

Introduction to Biomedical Time Series Learning

Biomedical time series data, which tracks physiological changes over time, is critical for understanding health states and diagnosing diseases. Harnessing this data, deep learning models can uncover patterns that indicate a patient's condition. However, these models often face robustness issues due to variations between different types of data, known as modalities.

Addressing Multi-Modal Challenges

To tackle the challenges associated with multi-modal biomedical time series (MBTS), a novel approach uses contrastive learning for effective representation. In traditional methods, variations between modalities, such as differences in the scale of measurements, can affect the model's accuracy. This paper proposes a strategy that groups MBTS based on similarities in modality distributions, allowing for targeted processing with individual encoders customized for each group.

Enhancing Feature Extraction

Another layer of complexity is added by the need to extract meaningful patterns from MBTS at various scales. Previous attempts require extensive computation and memory, or they risk losing valuable information. The proposed model introduces an efficient, multi-scale data transformation with adaptive patching and masking strategies. These techniques ensure that important features are captured at all relevant scales while maintaining computational efficiency.

Contrastive Learning for Modal Consistency

Contrastive learning, a technique for learning representations by comparing different data instances, is further refined in this model to focus on cross-modal learning. Instead of relying on augmentations that might distort MBTS, the approach guarantees constructing meaningful pairs from different but related modalities. This strategy enhances the model's ability to learn representations that are invariant to modality-specific differences while focusing on shared physiological information.

Experimental Validation

The performance of this new model outshines current state-of-the-art models in a range of biomedical tasks, including predicting respiration rates and heart rates during exercise. Ablation studies also validate the distinct contributions of inter-modal grouping, multi-scale transformation, and cross-modal contrastive learning. This affirms the approach as both effective and robust.

Conclusion

By successfully addressing modal variation and capturing complex temporal dynamics within MBTS, the proposed model advances the field of bio-medical analysis. These improvements could enable more accurate predictions and diagnoses from physiological data, demonstrating the potential of deep learning in healthcare.

The study provides a route toward more robust and capable biomedical time series learning models. With advancements like these, the future of healthcare data analysis is looking increasingly precise, personalized, and powerful.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (4)

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 0 likes about this paper.