Uncertainty-Aware Dynamics Learning

Abstract

Achieving robust generalization with rigorous stability guarantees remains the central challenge for the reliable application of learning-based control in complex physical systems.

To bridge the gap between offline training (Sim) and responsive online adaptation (Real) in unconstrained practical environments, this work introduces a framework that synergizes generative adversarial networks (GANs)-based structural priors with online Bayesian Gaussian process (GP) refinement, enabling rapid dynamic linearization and adaptive modeling.

Specifically, these offline priors provide a warm-start for the online adaptation, significantly enhancing its convergence speed and precision, while the GP's predictive variance provides the subsequent closed-loop controller with uncertainty awareness. Definitive convergence and approximation bounds are rigorously established for both components. Finally, physical multi-quadrotor cooperative payload experiments confirm the framework's superior reliability in disturbance-sensitive scenarios.

Video

Offline Data Collection

Front View

Rear View

Baseline 1 (No Intentional Disturbance)

Front View

Rear View

Baseline 2 (Offline + LMPC)

Front View

Rear View

Baseline 3 (Online + RTMPC)

Front View

Rear View

Proposed (Offline + Online + RTMPC)

Front View

Rear View

Methodology

The offline phase utilizes GANs to extract structural priors f0, g0, initializing the linearization control law. Subsequently, the online Bayesian GP incorporates these priors as the baseline mean, iteratively refining the dynamics estimate against real-time data to ensure the system converges to the linear form. Ultimately, the closed-loop controller operates on these linearized dynamics to execute precise trajectory tracking, explicitly leveraging the predictive uncertainty quantified by the online Bayesian update.

The offline phase establishes structural priors f0, g0 via adversarial gradient dynamics. During deployment, the online Bayesian phase continuously fuses these priors with real-time data to update posterior estimates, which subsequently drive the robust feedback linearization and adaptive gain scheduling.

Hardware Architecture and UWB Positioning Setup

UWB Positioning System Setup

Ground Station Architecture: The experimental area is surrounded by six fixed UWB anchors to establish a global coordinate reference.
Onboard Receiver: Each UAV is equipped with a bottom-mounted UWB tag to acquire real-time position signals and transmit them to the onboard processor.
Coordinate Alignment: An external compass ensures precise alignment between the UWB reference frame and the flight controller's internal coordinate system.
Altitude Measurement: Due to the inherent inaccuracy of UWB in the vertical (Z-axis) direction, a Time-of-Flight (ToF) sensor is utilized for precise altitude estimation.

UAV Platform & Physical Specifications

Airframe Architecture: Built on a classic QAV250 quadrotor frame, equipped with high-performance TMOTOR VELOX V3 (KV1950) motors.
Weight Distribution: The single-UAV Maximum Takeoff Weight (MTOW) is 0.9 kg, carrying an experimental payload of 224 g.
Dynamic Characteristics: The combined lateral aerodynamic drag generated by industrial fans and the transient tension spikes from payload swinging account for approximately 20%–25% of the vehicle's nominal hovering thrust.

Onboard Computing Unit

Core Processor: A Raspberry Pi 4B (8GB) serves as the onboard computer, executing the high-level online learning algorithms and real-time closed-loop control for outer-loop position and velocity.
Flight Controller: A Pixhawk 6 mini autopilot running PX4 firmware handles high-frequency attitude estimation, inner-loop attitude control, and motor driving.
Data Flow Integration: The onboard computer achieves centimeter-level positioning resolution by fusing UWB positional data with IMU inertial measurements via an EKF2 filter.

Uncertainty-Aware Dynamics Learning via Offline Generative Prior Extraction and Online Bayesian Posterior Refinement

Abstract

Video

Offline Data Collection

Baseline 1 (No Intentional Disturbance)

Baseline 2 (Offline + LMPC)

Baseline 3 (Online + RTMPC)

Proposed (Offline + Online + RTMPC)

Methodology

UWB Positioning System Setup

UAV Platform & Physical Specifications

Onboard Computing Unit