Image Details

Choose export citation format:

Deep Reinforcement Learning for Efficient Scheduling of Ground-based Astronomical Observations

  • Authors: Hai Cao, Shaoming Hu, Junju Du, Xu Chen, Shuqi Liu, Shuai Feng, Bo Zhang, Yuchen Jiang

Hai Cao et al 2025 The Astronomical Journal 170 .

  • Provider: AAS Journals

Caption: Figure 5.

The convergence of the total reward per episode in reinforcement training. The solid blue line shows the mean reward of the training batch, which serves as the baseline. The dashed red and green lines represent the maximum and minimum rewards within the batch, respectively.

Other Images in This Article
Copyright and Terms & Conditions

Additional terms of reuse