Figure 5 from

Deep Reinforcement Learning for Efficient Scheduling of Ground-based Astronomical Observations

Authors: Hai Cao, Shaoming Hu, Junju Du, Xu Chen, Shuqi Liu, Shuai Feng, Bo Zhang, Yuchen Jiang

Hai Cao et al 2025 The Astronomical Journal 170 .

Provider: AAS Journals

Caption: Figure 5.

The convergence of the total reward per episode in reinforcement training. The solid blue line shows the mean reward of the training batch, which serves as the baseline. The dashed red and green lines represent the maximum and minimum rewards within the batch, respectively.
