Model-aware reinforcement learning for high-performance Bayesian experimental design in quantum metrology
Abstract: Quantum sensors offer control flexibility during estimation by allowing manipulation by the experimenter across various parameters. For each sensing platform, pinpointing the optimal controls to enhance the sensor's precision remains a challenging task. While an analytical solution might be out of reach, machine learning offers a promising avenue for many systems of interest, especially given the capabilities of contemporary hardware. We have introduced a versatile procedure capable of optimizing a wide range of problems in quantum metrology, estimation, and hypothesis testing by combining model-aware reinforcement learning (RL) with Bayesian estimation based on particle filtering. To achieve this, we had to address the challenge of incorporating the many non-differentiable steps of the estimation in the training process, such as measurements and the resampling of the particle filter. Model-aware RL is a gradient-based method, where the derivatives of the sensor's precision are obtained through automatic differentiation (AD) in the simulation of the experiment. Our approach is suitable for optimizing both non-adaptive and adaptive strategies, using neural networks or other agents. We provide an implementation of this technique in the form of a Python library called qsensoropt, alongside several pre-made applications for relevant physical platforms, namely NV centers, photonic circuits, and optical cavities. This library will be released soon on PyPI. Leveraging our method, we've achieved results for many examples that surpass the current state-of-the-art in experimental design. In addition to Bayesian estimation, leveraging model-aware RL, it is also possible to find optimal controls for the minimization of the Cram\'er-Rao bound, based on Fisher information.
- Flamini, F. et al. Photonic architecture for reinforcement learning. New Journal of Physics 22, 045002 (2020). URL https://iopscience.iop.org/article/10.1088/1367-2630/ab783c.
- Broughton, M. et al. TensorFlow Quantum: A Software Framework for Quantum Machine Learning (2021). URL http://arxiv.org/abs/2003.02989.
- Bergholm, V. et al. PennyLane: Automatic differentiation of hybrid quantum-classical computations (2022). URL http://arxiv.org/abs/1811.04968.
- Bukov, M. et al. Reinforcement Learning in Different Phases of Quantum Control. Physical Review X 8, 031086 (2018). URL https://link.aps.org/doi/10.1103/PhysRevX.8.031086.
- When does reinforcement learning stand out in quantum control? A comparative study on state preparation. npj Quantum Information 5, 85 (2019). URL http://www.nature.com/articles/s41534-019-0201-8.
- Universal quantum control through deep reinforcement learning. npj Quantum Information 5, 33 (2019). URL http://www.nature.com/articles/s41534-019-0141-3.
- Deep Reinforcement Learning for Quantum State Preparation with Weak Nonlinear Measurements. Quantum 6, 747 (2022). URL https://quantum-journal.org/papers/q-2022-06-28-747/.
- Gradient-Ascent Pulse Engineering with Feedback. PRX Quantum 4, 030305 (2023). URL https://link.aps.org/doi/10.1103/PRXQuantum.4.030305.
- Reinforcement Learning with Neural Networks for Quantum Feedback. Physical Review X 8, 031084 (2018). URL https://link.aps.org/doi/10.1103/PhysRevX.8.031084.
- Cimini, V. et al. Calibration of Quantum Sensors by Neural Networks. Physical Review Letters 123, 230502 (2019). URL https://link.aps.org/doi/10.1103/PhysRevLett.123.230502.
- Neural-network-based parameter estimation for quantum detection. Quantum Science and Technology 6, 045012 (2021). URL https://iopscience.iop.org/article/10.1088/2058-9565/ac16ed.
- A machine learning approach to Bayesian parameter estimation. npj Quantum Information 7, 169 (2021). URL https://www.nature.com/articles/s41534-021-00497-w.
- Frequentist parameter estimation with supervised learning. AVS Quantum Science 3, 034401 (2021). URL https://avs.scitation.org/doi/10.1116/5.0058163.
- Nguyen, V. et al. Deep reinforcement learning for efficient measurement of quantum devices. npj Quantum Information 7, 100 (2021). URL http://www.nature.com/articles/s41534-021-00434-x.
- Palmieri, A. M. et al. Experimental neural network enhanced quantum tomography. npj Quantum Information 6, 20 (2020). URL http://www.nature.com/articles/s41534-020-0248-6.
- Adaptive quantum state tomography with neural networks. npj Quantum Information 7, 105 (2021). URL http://www.nature.com/articles/s41534-021-00436-9.
- Hsieh, H.-Y. et al. Direct Parameter Estimations from Machine Learning-Enhanced Quantum State Tomography. Symmetry 14, 874 (2022). URL https://www.mdpi.com/2073-8994/14/5/874.
- Marquardt, F. Machine learning and quantum devices. SciPost Physics Lecture Notes 29 (2021). URL https://scipost.org/10.21468/SciPostPhysLectNotes.29.
- Marquardt, F. Online Course: Advanced Machine Learning for Physics, Science, and Artificial Scientific Discovery (2021).
- Artificial intelligence and machine learning for quantum technologies. Physical Review A 107, 010101 (2023). URL https://link.aps.org/doi/10.1103/PhysRevA.107.010101.
- Fisher, R. A. The design of experiments. The design of experiments (Oliver & Boyd, Oxford, England, 1935).
- Foster, A. E. Variational, Monte Carlo and policy-based approaches to Bayesian experimental design. http://purl.org/dc/dcmitype/Text, University of Oxford (2021). URL https://ora.ox.ac.uk/objects/uuid:4a3e13ca-e6c6-4669-955e-f1a87e201228.
- Baydin, A. G. et al. Toward Machine Learning Optimization of Experimental Design. Nuclear Physics News 31, 25–28 (2021). URL https://www.tandfonline.com/doi/full/10.1080/10619127.2021.1881364.
- Machine learning and computation-enabled intelligent sensor design. Nature Machine Intelligence 3, 556–565 (2021). URL http://www.nature.com/articles/s42256-021-00360-9.
- Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods. In Advances in Neural Information Processing Systems, vol. 34, 25785–25798 (Curran Associates, Inc., 2021). URL https://proceedings.neurips.cc/paper/2021/hash/d811406316b669ad3d370d78b51b1d2e-Abstract.html.
- Deep Bayesian Experimental Design for Quantum Many-Body Systems (2023). URL http://arxiv.org/abs/2306.14510.
- Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design. In Proceedings of the 38th International Conference on Machine Learning, 3384–3395 (PMLR, 2021). URL https://proceedings.mlr.press/v139/foster21a.html.
- Differentiable Particle Filtering without Modifying the Forward Pass (2021). URL http://arxiv.org/abs/2106.10314.
- Chen, M. et al. Quantum metrology with single spins in diamond under ambient conditions. National Science Review 5, 346–355 (2018). URL https://academic.oup.com/nsr/article/5/3/346/4430770.
- Rembold, P. et al. Introduction to quantum optimal control for quantum sensing with nitrogen-vacancy centers in diamond. AVS Quantum Science 2, 024701 (2020). URL http://avs.scitation.org/doi/10.1116/5.0006785.
- Neural-Network Heuristics for Adaptive Bayesian Quantum Estimation. PRX Quantum 2, 020303 (2021). URL https://link.aps.org/doi/10.1103/PRXQuantum.2.020303.
- Arshad, M. J. et al. Online adaptive estimation of decoherence timescales for a single qubit (2022). URL http://arxiv.org/abs/2210.06103.
- Joas, T. Online adaptive quantum characterization of a nuclear spin. npj Quantum Information 8 (2021).
- An agnostic-Dolinar receiver for coherent states classification. Physical Review A 104, 042606 (2021). URL http://arxiv.org/abs/2106.11909.
- Learning Feedback Control Strategies for Quantum Metrology. PRX Quantum 3, 020310 (2022). URL https://link.aps.org/doi/10.1103/PRXQuantum.3.020310.
- In preparation: The qsensoropt library: examples and applications .
- A variational toolbox for quantum multi-parameter estimation. npj Quantum Information 7, 1–5 (2021). URL https://www.nature.com/articles/s41534-021-00425-y.
- Zhang, M. et al. QuanEstimation: An open-source toolkit for quantum parameter estimation. Physical Review Research 4, 043057 (2022). URL https://link.aps.org/doi/10.1103/PhysRevResearch.4.043057.
- Granade, C. et al. QInfer: Statistical inference software for quantum applications. Quantum 1, 5 (2017). URL http://arxiv.org/abs/1610.00336.
- Optbayesexpt: Sequential Bayesian Experiment Design for Adaptive Measurements. Journal of Research of the National Institute of Standards and Technology 126, 126002 (2021). URL https://nvlpubs.nist.gov/nistpubs/jres/126/jres.126.002.pdf.
- Designing optimal protocols in Bayesian quantum parameter estimation with higher-order operations (2023). URL http://arxiv.org/abs/2311.01513.
- Quantum parameter estimation with optimal control. Physical Review A 96, 012117 (2017). URL http://link.aps.org/doi/10.1103/PhysRevA.96.012117.
- Xu, H. et al. Generalizable control for quantum parameter estimation through reinforcement learning. npj Quantum Information 5, 1–8 (2019). URL https://www.nature.com/articles/s41534-019-0198-z.
- Improving the dynamics of quantum sensors with reinforcement learning. New Journal of Physics 22, 035001 (2020). URL https://iopscience.iop.org/article/10.1088/1367-2630/ab6f1f.
- Generalizable control for multiparameter quantum metrology. Physical Review A 103, 042615 (2021). URL http://arxiv.org/abs/2012.13377.
- Optimal Scheme for Quantum Metrology. Advanced Quantum Technologies 5, 2100080 (2022). URL http://arxiv.org/abs/2111.12279.
- Parameter estimation in quantum sensing based on deep reinforcement learning. npj Quantum Information 8, 1–12 (2022). URL https://www.nature.com/articles/s41534-021-00513-z.
- Efficient and robust entanglement generation with deep reinforcement learning for quantum metrology. New Journal of Physics 24, 083011 (2022). URL https://dx.doi.org/10.1088/1367-2630/ac8285.
- Framework for Learning and Control in the Classical and Quantum Domains (2023). URL http://arxiv.org/abs/2307.04256.
- Gebhart, V. et al. Learning quantum systems. Nature Reviews Physics 5, 141–156 (2023). URL https://www.nature.com/articles/s42254-022-00552-1.
- Ma, Z. et al. Adaptive Circuit Learning for Quantum Metrology. In 2021 IEEE International Conference on Quantum Computing and Engineering (QCE), 419–430 (2021). URL http://arxiv.org/abs/2010.08702.
- Quantum Variational Optimization of Ramsey Interferometry and Atomic Clocks. Physical Review X 11, 041045 (2021). URL https://link.aps.org/doi/10.1103/PhysRevX.11.041045.
- Marciniak, C. D. et al. Optimal metrology with programmable quantum sensors. Nature 603, 604–609 (2022). URL https://www.nature.com/articles/s41586-022-04435-4.
- Optimal and Variational Multi-Parameter Quantum Metrology and Vector Field Sensing (2023). URL http://arxiv.org/abs/2302.07785.
- Superresolution imaging with multiparameter quantum metrology in passive remote sensing. Physical Review A 107, 032607 (2023). URL https://link.aps.org/doi/10.1103/PhysRevA.107.032607.
- Heras, A. M. d. l. et al. Photonic quantum metrology with variational quantum optical non-linearities (2023). URL http://arxiv.org/abs/2309.09841.
- Del Moral, P. Nonlinear filtering: Interacting particle resolution. Comptes Rendus de l’Académie des Sciences - Series I - Mathematics 325, 653–658 (1997). URL https://linkinghub.elsevier.com/retrieve/pii/S0764444297847787.
- A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Transactions on Signal Processing 50, 174–188 (2002). URL https://ieeexplore.ieee.org/document/978374.
- Sequential Monte Carlo Methods for Dynamic Systems. Journal of the American Statistical Association 93, 1032–1044 (1998). URL https://www.jstor.org/stable/2669847.
- On the approximation of functions by tanh neural networks. Neural Networks 143, 732–750 (2021). URL https://www.sciencedirect.com/science/article/pii/S0893608021003208.
- Adam: A Method for Stochastic Optimization. In Bengio, Y. & LeCun, Y. (eds.) 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings (2015). URL http://arxiv.org/abs/1412.6980.
- Particle Filter Networks with Application to Visual Localization. In Proceedings of The 2nd Conference on Robot Learning, 169–178 (PMLR, 2018). URL https://proceedings.mlr.press/v87/karkus18a.html.
- Gali, A. Ab initio theory of the nitrogen-vacancy center in diamond. Nanophotonics 8, 1907–1943 (2019). URL https://www.degruyter.com/document/doi/10.1515/nanoph-2019-0154/html?lang=en.
- Quantum science and technology based on color centers with accessible spin. Journal of Applied Physics 131, 010401 (2022). URL https://aip.scitation.org/doi/10.1063/5.0082219.
- Maze, J. Quantum manipulation of nitrogen-vacancy centers in diamond: From basic properties to applications. Ph.D. thesis (2010).
- Barry, J. F. et al. Sensitivity optimization for NV-diamond magnetometry. Rev. Mod. Phys. 92, 68 (2020).
- Schmitt, S. et al. Optimal frequency measurements with quantum probes. npj Quantum Information 7, 55 (2021). URL http://www.nature.com/articles/s41534-021-00391-5.
- How to best sample a periodic probability distribution, or on the accuracy of Hamiltonian finding strategies. Quantum Information Processing 12, 611–623 (2013). URL http://link.springer.com/10.1007/s11128-012-0407-6.
- Sequential Bayesian Experiment Design for Optically Detected Magnetic Resonance of Nitrogen-Vacancy Centers. Physical Review Applied 14, 054036 (2020). URL https://link.aps.org/doi/10.1103/PhysRevApplied.14.054036.
- Sequential Bayesian experiment design for adaptive Ramsey sequence measurements. Journal of Applied Physics 130, 144401 (2021). URL https://aip.scitation.org/doi/10.1063/5.0055630.
- Robust online Hamiltonian learning. New Journal of Physics 14, 103013 (2012). URL https://dx.doi.org/10.1088/1367-2630/14/10/103013.
- Oshnik, N. et al. Robust magnetometry with single nitrogen-vacancy centers via two-step optimization. Physical Review A 106, 013107 (2022). URL https://link.aps.org/doi/10.1103/PhysRevA.106.013107.
- Resource-efficient adaptive Bayesian tracking of magnetic fields with a quantum sensor. Journal of Physics: Condensed Matter 33, 195801 (2021). URL https://iopscience.iop.org/article/10.1088/1361-648X/abe34f.
- Bonato, C. et al. Optimized quantum sensing with a single electron spin using real-time adaptive measurements. Nature Nanotechnology 11, 247–252 (2016). URL https://www.nature.com/articles/nnano.2015.261.
- Santagati, R. et al. Magnetic-Field Learning Using a Single Electronic Spin in Diamond with One-Photon Readout at Room Temperature. Physical Review X 9, 021019 (2019). URL https://link.aps.org/doi/10.1103/PhysRevX.9.021019.
- Zohar, I. et al. Real-time frequency estimation of a qubit without single-shot-readout. Quantum Science and Technology 8, 035017 (2023). URL https://iopscience.iop.org/article/10.1088/2058-9565/acd415.
- High-dynamic-range magnetometry with a single electronic spin in diamond. Nature Nanotechnology 7, 109–113 (2012). URL http://www.nature.com/articles/nnano.2011.225.
- Wang, J. et al. Experimental quantum Hamiltonian learning. Nature Physics 13, 551–555 (2017). URL http://www.nature.com/articles/nphys4074.
- Bayesian estimation for quantum sensing in the absence of single-shot detection. Physical Review B 99, 125413 (2019). URL https://link.aps.org/doi/10.1103/PhysRevB.99.125413.
- Adaptive tracking of a time-varying field with a quantum sensor. Physical Review A 95, 052348 (2017). URL http://link.aps.org/doi/10.1103/PhysRevA.95.052348.
- Adaptive Hamiltonian estimation using Bayesian experimental design. AIP Conference Proceedings 1443, 165–173 (2012). URL https://doi.org/10.1063/1.3703632.
- Repetitive readout enhanced by machine learning. Machine Learning: Science and Technology 1, 015003 (2020). URL https://iopscience.iop.org/article/10.1088/2632-2153/ab4e24.
- Tsukamoto, M. et al. Machine-learning-enhanced quantum sensors for accurate magnetic field imaging. Scientific Reports 12, 13942 (2022). URL http://arxiv.org/abs/2202.00380.
- Hamiltonian Learning and Certification Using Quantum Resources. Physical Review Letters 112, 190501 (2014). URL https://link.aps.org/doi/10.1103/PhysRevLett.112.190501.
- Processing and transmission of information. Massachusetts Institute of Technology. Research Laboratory of Electronics. Quarterly Progress Report, no. 111 (1973).
- Geremia, J. Distinguishing between optical coherent states with imperfect detection. Physical Review A 70, 062303 (2004). URL https://link.aps.org/doi/10.1103/PhysRevA.70.062303.
- Izumi, S. et al. Displacement receiver for phase-shift-keyed coherent states. Physical Review A 86, 042328 (2012). URL https://link.aps.org/doi/10.1103/PhysRevA.86.042328.
- Revisiting the Dolinar receiver through multiple-copy state discrimination theory. Physical Review A 84, 022342 (2011). URL https://link.aps.org/doi/10.1103/PhysRevA.84.022342.
- Optical coherent state discrimination using a closed-loop quantum measurement. Nature 446, 774–777 (2007). URL https://www.nature.com/articles/nature05655.
- Adaptive discrimination scheme for quantum pulse-position-modulation signals. Physical Review A 89, 012339 (2014). URL https://link.aps.org/doi/10.1103/PhysRevA.89.012339.
- Implementation of projective measurements with linear optics and continuous photon counting. Physical Review A 71, 022318 (2005). URL https://link.aps.org/doi/10.1103/PhysRevA.71.022318.
- Real-time calibration of coherent-state receivers: Learning by trial and error. Physical Review Research 2, 033295 (2020). URL https://link.aps.org/doi/10.1103/PhysRevResearch.2.033295.
- Cui, C. et al. Quantum receiver enhanced by adaptive learning. Light: Science & Applications 11, 344 (2022). URL https://www.nature.com/articles/s41377-022-01039-5.
- Quantum learning of coherent states. EPJ Quantum Technology 2, 17 (2015). URL http://epjquantumtechnology.springeropen.com/articles/10.1140/epjqt/s40507-015-0030-4.
- Helstrom, C. W. Quantum detection and estimation theory. Journal of Statistical Physics 1, 231–252 (1969). URL https://doi.org/10.1007/BF01007479.
- Holevo, A. S. Statistical problems in quantum physics. In Maruyama, G. & Prokhorov, Y. V. (eds.) Proceedings of the Second Japan-USSR Symposium on Probability Theory, Lecture Notes in Mathematics, 104–119 (Springer, Berlin, Heidelberg, 1973).
- Hayashi, M. Asymptotic Theory of Quantum Statistical Inference (WORLD SCIENTIFIC, 2005). URL https://www.worldscientific.com/doi/abs/10.1142/5630.
- Gentile, A. A. et al. Learning models of quantum systems from experiments. Nature Physics 17, 837–843 (2021). URL http://www.nature.com/articles/s41567-021-01201-7.
- Resampling Methods for Particle Filtering: Classification, implementation, and strategies. IEEE Signal Processing Magazine 32, 70–86 (2015). URL https://ieeexplore.ieee.org/document/7079001/.
- Towards Differentiable Resampling (2020). URL http://arxiv.org/abs/2004.11938.
- Particle Filter Recurrent Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence 34, 5101–5108 (2020).
- Holevo, A. S. Probabilistic and Statistical Aspects of Quantum Theory. Monographs (Scuola Normale Superiore) (Edizioni della Normale, 2011). URL https://www.springer.com/gp/book/9788876423758.
- Policy Gradient Methods for Reinforcement Learning with Function Approximation. In Advances in Neural Information Processing Systems, vol. 12 (MIT Press, 1999). URL https://papers.nips.cc/paper/1999/hash/464d828b85b0bed98e80ade0a5c43b0f-Abstract.html.
- Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning. In Advances in Neural Information Processing Systems, vol. 32 (Curran Associates, Inc., 2019). URL https://proceedings.neurips.cc/paper/2019/hash/6fd6b030c6afec018415662d0db43f9d-Abstract.html.
- The optimal reward baseline for gradient-based reinforcement learning. In Proceedings of the Seventeenth conference on Uncertainty in artificial intelligence, UAI’01, 538–545 (Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 2001).
- Cimini, V. et al. Experimental metrology beyond the standard quantum limit for a wide resources range. npj Quantum Information 9, 1–9 (2023). URL https://www.nature.com/articles/s41534-023-00691-y.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.