Bellman pseudospectral method

The Bellman pseudospectral method is a pseudospectral method for optimal control based on Bellman's principle of optimality. It is part of the larger theory of pseudospectral optimal control, a term coined by Ross.^[1] The method is named after Richard E. Bellman. It was introduced by Ross et al.^[2]^[3] first as a means to solve multiscale optimal control problems, and later expanded to obtain suboptimal solutions for general optimal control problems.

Theoretical foundations[edit]

The multiscale version of the Bellman pseudospectral method is based on the spectral convergence property of the Ross–Fahroo pseudospectral methods. That is, because the Ross–Fahroo pseudospectral method converges at an exponentially fast rate, pointwise convergence to a solution is obtained at very low number of nodes even when the solution has high-frequency components. This aliasing phenomenon in optimal control was first discovered by Ross et al.^[2] Rather than use signal processing techniques to anti-alias the solution, Ross et al. proposed that Bellman's principle of optimality can be applied to the converged solution to extract information between the nodes. Because the Gauss–Lobatto nodes cluster at the boundary points, Ross et al. suggested that if the node density around the initial conditions satisfy the Nyquist–Shannon sampling theorem, then the complete solution can be recovered by solving the optimal control problem in a recursive fashion over piecewise segments known as Bellman segments.^[2]

In an expanded version of the method, Ross et al.,^[3] proposed that method could also be used to generate feasible solutions that were not necessarily optimal. In this version, one can apply the Bellman pseudospectral method at even lower number of nodes even under the knowledge that the solution may not have converged to the optimal one. In this situation, one obtains a feasible solution.

A remarkable feature of the Bellman pseudospectral method is that it automatically determines several measures of suboptimality based on the original pseudospectral cost and the cost generated by the sum of the Bellman segments.^[2]^[3]

Computational efficiency[edit]

One of the computational advantages of the Bellman pseudospectral method is that it allows one to escape Gaussian rules in the distribution of node points. That is, in a standard pseudospectral method, the distribution of node points are Gaussian (typically Gauss-Lobatto for finite horizon and Gauss-Radau for infinite horizon). The Gaussian points are sparse in the middle of the interval (middle is defined in a shifted sense for infinite-horizon problems) and dense at the boundaries. The second-order accumulation of points near the boundaries have the effect of wasting nodes. The Bellman pseudospectral method takes advantage of the node accumulation at the initial point to anti-alias the solution and discards the remainder of the nodes. Thus the final distribution of nodes is non-Gaussian and dense while the computational method retains a sparse structure.

Applications[edit]

The Bellman pseudospectral method was first applied by Ross et al.^[2] to solve the challenging problem of very low thrust trajectory optimization. It has been successfully applied to solve a practical problem of generating very high accuracy solutions to a trans-Earth-injection problem of bringing a space capsule from a lunar orbit to a pin-pointed Earth-interface condition for successful reentry.^[4]^[5]

The Bellman pseudospectral method is most commonly used as an additional check on the optimality of a pseudospectral solution generated by the Ross–Fahroo pseudospectral methods. That is, in addition to the use of Pontryagin's minimum principle in conjunction with the solutions obtained by the Ross–Fahroo pseudospectral methods, the Bellman pseudospectral method is used as a primal-only test on the optimality of the computed solution.^[6]^[7]

References[edit]

^ Ross, I. M.; Karpenko, M. (2012). "A Review of Pseudospectral Optimal Control: From Theory to Flight". Annual Reviews in Control. 36 (2): 182–197. doi:10.1016/j.arcontrol.2012.09.002.
^ ^a ^b ^c ^d ^e Ross, I. M.; Gong, Q.; Sekhavat, P. (2007). "Low-Thrust, High-Accuracy Trajectory Optimization". Journal of Guidance, Control and Dynamics. 30 (4): 921–933. Bibcode:2007JGCD...30..921R. doi:10.2514/1.23181. hdl:10945/49785. S2CID 5991144.
^ ^a ^b ^c I. M. Ross, Q. Gong and P. Sekhavat, The Bellman pseudospectral method, AIAA/AAS Astrodynamics Specialist Conference and Exhibit, Honolulu, Hawaii, AIAA-2008-6448, August 18–21, 2008.
^ Yan, H.; Gong, Q.; Park, C.; Ross, I. M.; D'Souza, C. N. (2011). "High Accuracy Trajectory Optimization for a Trans-Earth Lunar Mission". Journal of Guidance, Control and Dynamics. 34 (4): 1219–1227. Bibcode:2011JGCD...34.1219Y. doi:10.2514/1.49237. S2CID 123555190.
^ H. Yan, Q. Gong, C. D. Park, I. M. Ross and C. N. D'Souza, High-Accuracy Moon to Earth trajectory optimization, AIAA Guidance, Navigation, and Control Conference, 2010.
^ Fleming, A.; Sekhavat, P.; Ross, I. M. (2010). "Minimum-Time Reorientation of a Rigid Body". Journal of Guidance, Control and Dynamics. 33 (1): 160–170. Bibcode:2010JGCD...33..160F. doi:10.2514/1.43549. S2CID 120117410.
^ Ross, I. M.; Sekhavat, P.; Fleming, A.; Gong, Q. (2008). "Optimal feedback control: foundations, examples, and experimental results for a new approach". Journal of Guidance, Control, and Dynamics. 31 (2): 307–321. Bibcode:2008JGCD...31..307R. doi:10.2514/1.29532.

[RK-1] Ross, I. M.; Karpenko, M. (2012). "A Review of Pseudospectral Optimal Control: From Theory to Flight". Annual Reviews in Control. 36 (2): 182–197. doi:10.1016/j.arcontrol.2012.09.002.

[Ross1-2] Ross, I. M.; Gong, Q.; Sekhavat, P. (2007). "Low-Thrust, High-Accuracy Trajectory Optimization". Journal of Guidance, Control and Dynamics. 30 (4): 921–933. Bibcode:2007JGCD...30..921R. doi:10.2514/1.23181. hdl:10945/49785. S2CID 5991144.

[Ross2-3] I. M. Ross, Q. Gong and P. Sekhavat, The Bellman pseudospectral method, AIAA/AAS Astrodynamics Specialist Conference and Exhibit, Honolulu, Hawaii, AIAA-2008-6448, August 18–21, 2008.

[Yan1-4] Yan, H.; Gong, Q.; Park, C.; Ross, I. M.; D'Souza, C. N. (2011). "High Accuracy Trajectory Optimization for a Trans-Earth Lunar Mission". Journal of Guidance, Control and Dynamics. 34 (4): 1219–1227. Bibcode:2011JGCD...34.1219Y. doi:10.2514/1.49237. S2CID 123555190.

[Yan2-5] H. Yan, Q. Gong, C. D. Park, I. M. Ross and C. N. D'Souza, High-Accuracy Moon to Earth trajectory optimization, AIAA Guidance, Navigation, and Control Conference, 2010.

[F1-6] Fleming, A.; Sekhavat, P.; Ross, I. M. (2010). "Minimum-Time Reorientation of a Rigid Body". Journal of Guidance, Control and Dynamics. 33 (1): 160–170. Bibcode:2010JGCD...33..160F. doi:10.2514/1.43549. S2CID 120117410.

[Ross3-7] Ross, I. M.; Sekhavat, P.; Fleming, A.; Gong, Q. (2008). "Optimal feedback control: foundations, examples, and experimental results for a new approach". Journal of Guidance, Control, and Dynamics. 31 (2): 307–321. Bibcode:2008JGCD...31..307R. doi:10.2514/1.29532.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

Theoretical foundations[edit]

Computational efficiency[edit]

Applications[edit]

See also[edit]

References[edit]