Reinforcement learning
Published 16 May 2024 in astro-ph.IM, cs.AI, and cs.LG | (2405.10369v1)
Abstract: Observing celestial objects and advancing our scientific knowledge about them involve tedious planning, scheduling, data collection, and data post-processing. Many of these operational aspects of astronomy are guided and executed by expert astronomers. Reinforcement learning is a mechanism by which we (as humans and astronomers) can teach artificial intelligence agents to perform some of these tedious tasks. In this paper, we present a state-of-the-art overview of reinforcement learning and how it can benefit astronomy.