For learning-based control in complex physical systems, bridging the "sim-to-real" gap while preserving strict stability guarantees remains a primary challenge.
To address this, we introduce a framework that seamlessly integrates offline generative pre-training with online Bayesian adaptation to reformulate perturbed nonlinear dynamics into a tractable linear system, thereby facilitating straightforward control synthesis.
Specifically, Generative Adversarial Networks (GANs) are employed offline to extract underlying structural dynamics as a robust prior. An online Gaussian Process (GP) subsequently leverages this learned structure to refine the residual dynamics. Conditioned on this informed prior, the GP posterior inference achieves dramatically accelerated estimation convergence and enhanced overall modeling precision, ultimately enabling highly accurate real-time feedback linearization. Concurrently, the GP's predictive variance endows the downstream linear controller with critical uncertainty awareness. Theoretically, the framework is supported by strict convergence and approximation bounds. Practically, hardware experiments involving a multi-quadrotor cooperative payload task demonstrate its superior reliability in disturbance-sensitive scenarios.
Front View
Rear View
Front View
Rear View
Front View
Rear View
Front View
Rear View
Front View
Rear View
The offline phase utilizes GANs to extract structural priors f0, g0, initializing the linearization control law. Subsequently, the online Bayesian GP incorporates these priors as the baseline mean, iteratively refining the dynamics estimate against real-time data to ensure the system converges to the linear form. Ultimately, the closed-loop controller operates on these linearized dynamics to execute precise trajectory tracking, explicitly leveraging the predictive uncertainty quantified by the online Bayesian update.
The offline phase establishes structural priors f0, g0 via adversarial gradient dynamics. During deployment, the online Bayesian phase continuously fuses these priors with real-time data to update posterior estimates, which subsequently drive the robust feedback linearization and adaptive gain scheduling.
Hardware Architecture and UWB Positioning Setup