Uncertainty Quantification Using Ensemble Learning and Monte Carlo Sampling for Performance Prediction and Monitoring in Cell Culture Processes
Abstract: Biopharmaceutical products, particularly monoclonal antibodies (mAbs), have gained prominence in the pharmaceutical market due to their high specificity and efficacy. As these products are projected to constitute a substantial portion of global pharmaceutical sales, the application of machine learning models in mAb development and manufacturing is gaining momentum. This paper addresses the critical need for uncertainty quantification in machine learning predictions, particularly in scenarios with limited training data. Leveraging ensemble learning and Monte Carlo simulations, our proposed method generates additional input samples to enhance the robustness of the model in small training datasets. We evaluate the efficacy of our approach through two case studies: predicting antibody concentrations in advance and real-time monitoring of glucose concentrations during bioreactor runs using Raman spectra data. Our findings demonstrate the effectiveness of the proposed method in estimating the uncertainty levels associated with process performance predictions and facilitating real-time decision-making in biopharmaceutical manufacturing. This contribution not only introduces a novel approach for uncertainty quantification but also provides insights into overcoming challenges posed by small training datasets in bioprocess development. The evaluation demonstrates the effectiveness of our method in addressing key challenges related to uncertainty estimation within upstream cell cultivation, illustrating its potential impact on enhancing process control and product quality in the dynamic field of biopharmaceuticals.
- Lu, R.-M. et al. Development of therapeutic antibodies for the treatment of diseases. \JournalTitleJournal of Biomedical Science 27, 1–30 (2020).
- Hong, M. S. et al. Smart process analytics for the end-to-end batch manufacturing of monoclonal antibodies. \JournalTitleComputers & Chemical Engineering 179, 108445 (2023).
- Pharma, E. World preview 2021, outlook to 2026. Tech. Rep., Evaluate Ltd, London, UK (2021).
- Assisting continuous biomanufacturing through advanced control in downstream purification. \JournalTitleComputers and Chemical Engineering 125, 232–248, DOI: 10.1016/j.compchemeng.2019.03.013 (2019).
- Wurm, F. M. Production of recombinant protein therapeutics in cultivated mammalian cells. \JournalTitleNature Biotechnology 22, 1393–1398 (2004).
- Papathanasiou, M. M. et al. Advanced model-based control strategies for the intensification of upstream and downstream processing in mAb production. \JournalTitleBiotechnology Progress 33, 966–988, DOI: 10.1002/btpr.2483 (2017).
- Satheka, A. C. Upscaling of clinical grade stem cell production: Upstream processing (usp) and downstream processing (dsp) operations of cell expansion, harvesting, detachment, separation, washing and concentration steps, and the regulatory requirements. In Stem Cell Production, 159–184 (Springer, 2022).
- Applications of machine learning in antibody discovery, process development, manufacturing and formulation: Current trends, challenges, and opportunities. \JournalTitleComputers & Chemical Engineering 108585 (2024).
- Kelley, B. Industrialization of mab production technology: the bioprocessing industry at a crossroads. \JournalTitleMAbs 1, 443–452 (2009).
- Cell culture processes for monoclonal antibody production. \JournalTitleMAbs 2, 466–479 (2010).
- Derfus, G. E. et al. Cell culture monitoring via an auto-sampler and an integrated multi-functional off-line analyzer. \JournalTitleBiotechnology progress 26, 284–292 (2010).
- McRae, M. et al. Facilitating multisite bioprocess transfer: Multi-instrument and multi-platform comparability and long term stability of nova biomedical’s bioprofile chemistry and gas analyzers. In BioProcess International Europe (2012).
- Characterizing uncertainty in machine learning for chemistry. \JournalTitleJournal of Chemical Information and Modeling 63, 4012–4029 (2023).
- Knowledge transfer across cell lines using hybrid gaussian process models with entity embedding vectors. \JournalTitleBiotechnology and Bioengineering 118, 4389–4401 (2021).
- Advances in industrial biopharmaceutical batch process monitoring: Machine-learning methods for small data problems. \JournalTitleBiotechnology and Bioengineering 115, 1915–1924, DOI: 10.1002/bit.26605 (2018).
- Banner, M. et al. A decade in review: use of data analytics within the biopharmaceutical sector. \JournalTitleCurrent Opinion in Chemical Engineering 34, 100758, DOI: 10.1016/j.coche.2021.100758 (2021).
- Machine learning on small size samples: A synthetic knowledge synthesis. \JournalTitleScience Progress 105, 00368504211029777 (2022).
- Does pls have advantages for small sample size or non-normal data? \JournalTitleMIS quarterly 981–1001 (2012).
- Xu, J. et al. Improving titer while maintaining quality of final formulated drug substance via optimization of cho cell culture conditions in low-iron chemically defined media. \JournalTitleMAbs 10, 488–499 (2018).
- Effects of supplementation of various medium components on chinese hamster ovary cell cultures producing recombinant antibody. \JournalTitleCytotechnology 47, 37–49 (2005).
- Gangadharan, N. et al. Data intelligence for process performance prediction in biologics manufacturing. \JournalTitleComputers & Chemical Engineering 146, 107226 (2021).
- Pedregosa, F. et al. Scikit-learn: Machine learning in python. \JournalTitleJournal of machine Learning research 12, 2825–2830 (2011).
- Optuna: A next-generation hyperparameter optimization framework. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, 2623–2631 (2019).
- Tanemura, H. et al. Comprehensive modeling of cell culture profile using raman spectroscopy and machine learning. \JournalTitleScientific Reports 13, 21805 (2023).
- Gillespie, C. et al. Systematic assessment of process analytical technologies for biologics. \JournalTitleBiotechnology and Bioengineering 119, 423–434 (2022).
- Gibbons, L. et al. An assessment of the impact of raman based glucose feedback control on cho cell bioreactor process development. \JournalTitleBiotechnology Progress 39, e3371 (2023).
- Model-based pre-processing in raman spectroscopy of biological samples. \JournalTitleJournal of Raman Spectroscopy 47, 643–650 (2016).
- Extensive evaluation of machine learning models and data preprocessings for raman modeling in bioprocessing. \JournalTitleJournal of Raman Spectroscopy 53, 1580–1591 (2022).
- Smoothing and differentiation of data by simplified least squares procedures. \JournalTitleAnalytical chemistry 36, 1627–1639 (1964).
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.