Online convex optimization for constrained control of nonlinear systems

Research output: Working paper/PreprintPreprint

Authors

Research Organisations

External Research Organisations

  • ETH Zurich
View graph of relations

Details

Original languageEnglish
Publication statusE-pub ahead of print - 1 Dec 2024

Abstract

This paper investigates the problem of controlling nonlinear dynamical systems subject to state and input constraints while minimizing time-varying and a priori unknown cost functions. We propose a modular approach that combines the online convex optimization framework and reference governors to solve this problem. Our method is general in the sense that we do not limit our analysis to a specific choice of online convex optimization algorithm or reference governor. We show that the dynamic regret of the proposed framework is bounded linearly in both the dynamic regret and the path length of the chosen online convex optimization algorithm, even though the online convex optimization algorithm does not account for the underlying dynamics. We prove that a linear bound with respect to the online convex optimization algorithm's dynamic regret is optimal, i.e., cannot be improved upon. Furthermore, for a standard class of online convex optimization algorithms, our proposed framework attains a bound on its dynamic regret that is linear only in the variation of the cost functions, which is known to be an optimal bound. Finally, we demonstrate implementation and flexibility of the proposed framework by comparing different combinations of online convex optimization algorithms and reference governors to control a nonlinear chemical reactor in a numerical experiment.

Keywords

    eess.SY, cs.SY, math.OC

Cite this

Online convex optimization for constrained control of nonlinear systems. / Nonhoff, Marko; Köhler, Johannes; Müller, Matthias A.
2024.

Research output: Working paper/PreprintPreprint

Nonhoff M, Köhler J, Müller MA. Online convex optimization for constrained control of nonlinear systems. 2024 Dec 1. Epub 2024 Dec 1. doi: 10.48550/arXiv.2412.00922
Download
@techreport{15fc22467dd642918f8b4b55d85d1574,
title = "Online convex optimization for constrained control of nonlinear systems",
abstract = "This paper investigates the problem of controlling nonlinear dynamical systems subject to state and input constraints while minimizing time-varying and a priori unknown cost functions. We propose a modular approach that combines the online convex optimization framework and reference governors to solve this problem. Our method is general in the sense that we do not limit our analysis to a specific choice of online convex optimization algorithm or reference governor. We show that the dynamic regret of the proposed framework is bounded linearly in both the dynamic regret and the path length of the chosen online convex optimization algorithm, even though the online convex optimization algorithm does not account for the underlying dynamics. We prove that a linear bound with respect to the online convex optimization algorithm's dynamic regret is optimal, i.e., cannot be improved upon. Furthermore, for a standard class of online convex optimization algorithms, our proposed framework attains a bound on its dynamic regret that is linear only in the variation of the cost functions, which is known to be an optimal bound. Finally, we demonstrate implementation and flexibility of the proposed framework by comparing different combinations of online convex optimization algorithms and reference governors to control a nonlinear chemical reactor in a numerical experiment. ",
keywords = "eess.SY, cs.SY, math.OC",
author = "Marko Nonhoff and Johannes K{\"o}hler and M{\"u}ller, {Matthias A.}",
year = "2024",
month = dec,
day = "1",
doi = "10.48550/arXiv.2412.00922",
language = "English",
type = "WorkingPaper",

}

Download

TY - UNPB

T1 - Online convex optimization for constrained control of nonlinear systems

AU - Nonhoff, Marko

AU - Köhler, Johannes

AU - Müller, Matthias A.

PY - 2024/12/1

Y1 - 2024/12/1

N2 - This paper investigates the problem of controlling nonlinear dynamical systems subject to state and input constraints while minimizing time-varying and a priori unknown cost functions. We propose a modular approach that combines the online convex optimization framework and reference governors to solve this problem. Our method is general in the sense that we do not limit our analysis to a specific choice of online convex optimization algorithm or reference governor. We show that the dynamic regret of the proposed framework is bounded linearly in both the dynamic regret and the path length of the chosen online convex optimization algorithm, even though the online convex optimization algorithm does not account for the underlying dynamics. We prove that a linear bound with respect to the online convex optimization algorithm's dynamic regret is optimal, i.e., cannot be improved upon. Furthermore, for a standard class of online convex optimization algorithms, our proposed framework attains a bound on its dynamic regret that is linear only in the variation of the cost functions, which is known to be an optimal bound. Finally, we demonstrate implementation and flexibility of the proposed framework by comparing different combinations of online convex optimization algorithms and reference governors to control a nonlinear chemical reactor in a numerical experiment.

AB - This paper investigates the problem of controlling nonlinear dynamical systems subject to state and input constraints while minimizing time-varying and a priori unknown cost functions. We propose a modular approach that combines the online convex optimization framework and reference governors to solve this problem. Our method is general in the sense that we do not limit our analysis to a specific choice of online convex optimization algorithm or reference governor. We show that the dynamic regret of the proposed framework is bounded linearly in both the dynamic regret and the path length of the chosen online convex optimization algorithm, even though the online convex optimization algorithm does not account for the underlying dynamics. We prove that a linear bound with respect to the online convex optimization algorithm's dynamic regret is optimal, i.e., cannot be improved upon. Furthermore, for a standard class of online convex optimization algorithms, our proposed framework attains a bound on its dynamic regret that is linear only in the variation of the cost functions, which is known to be an optimal bound. Finally, we demonstrate implementation and flexibility of the proposed framework by comparing different combinations of online convex optimization algorithms and reference governors to control a nonlinear chemical reactor in a numerical experiment.

KW - eess.SY

KW - cs.SY

KW - math.OC

U2 - 10.48550/arXiv.2412.00922

DO - 10.48550/arXiv.2412.00922

M3 - Preprint

BT - Online convex optimization for constrained control of nonlinear systems

ER -

By the same author(s)