论文标题
连续时间系统的碰撞概率,而无需取样[附录]
Collision Probabilities for Continuous-Time Systems Without Sampling [with Appendices]
论文作者
论文摘要
近年来,对高性能,健壮和安全自治系统的需求已大大增长。这些目标激发了人们对有效的安全理论推理的渴望,这些理论推理可以嵌入到核心决策任务中,例如运动计划,尤其是在受约束的环境中。一方面,蒙特卡洛(MC)和其他基于抽样的技术为多种运动模型提供了准确的碰撞概率估计值,但在连续优化的背景下很麻烦。另一方面,“直接”近似旨在计算(或上限)故障概率作为决策变量的平滑函数,因此可以方便地进行优化。但是,现有的直接方法从根本上采用离散的时间动力学,并且在现实世界中无处不在的连续时间系统应用时可以不可预测,通常表现为严重的保守主义。在常规离散时间框架内解决此问题的最新尝试需要额外的高斯近似,最终会产生自己的不一致性。在本文中,我们采用了一种根本不同的方法,直接在连续的时间内直接得出风险近似框架,并产生轻巧的估计值,该估算实际上随着基本离散化的形式而实际收敛。我们的近似值显示在复制MC估计的同时,在维持直接方法的功能和计算益处时,可以显着胜过最新技术。这可以为一类广泛的非线性和/或部分观察的系统提供强大的,风险感知的连续运动规划。
Demand for high-performance, robust, and safe autonomous systems has grown substantially in recent years. These objectives motivate the desire for efficient safety-theoretic reasoning that can be embedded in core decision-making tasks such as motion planning, particularly in constrained environments. On one hand, Monte-Carlo (MC) and other sampling-based techniques provide accurate collision probability estimates for a wide variety of motion models but are cumbersome in the context of continuous optimization. On the other, "direct" approximations aim to compute (or upper-bound) the failure probability as a smooth function of the decision variables, and thus are convenient for optimization. However, existing direct approaches fundamentally assume discrete-time dynamics and can perform unpredictably when applied to continuous-time systems ubiquitous in the real world, often manifesting as severe conservatism. State-of-the-art attempts to address this within a conventional discrete-time framework require additional Gaussianity approximations that ultimately produce inconsistency of their own. In this paper we take a fundamentally different approach, deriving a risk approximation framework directly in continuous time and producing a lightweight estimate that actually converges as the underlying discretization is refined. Our approximation is shown to significantly outperform state-of-the-art techniques in replicating the MC estimate while maintaining the functional and computational benefits of a direct method. This enables robust, risk-aware, continuous motion-planning for a broad class of nonlinear and/or partially-observable systems.
