论文标题
夜间汽车索赔从远程信息处理衍生的功能中预测:多级方法
Nightly Automobile Claims Prediction from Telematics-Derived Features: A Multilevel Approach
论文作者
论文摘要
近年来,已经有可能从驾驶员那里收集GPS数据,并将这些数据纳入驾驶员的汽车保险定价中。该数据被连续收集和处理到元数据中,包括每次离散旅行的里程和时间汇总,以及描述旅行属性的一组行为分数(例如,驾驶员疲劳或驾驶员的分心),因此我们可以检查它是否可以通过在驾驶员事件之前成功地识别出驾驶员的行程,以确定驾驶员的行程,以确定风险增加的时期。确定驾驶员风险增加的时期是有价值的,因为它为干预和可能避免索赔创造了机会。我们检查了驱动程序带和训练分类器的每次旅行的元数据,以预测\ textit {以下旅行}是该驾驶员发生索赔的一种。通过在0.6高于0.6的接收器操作员特征下实现一个区域,我们表明可以提前预测索赔。此外,我们比较了预测能力,该预测能力是由受过训练的XGBoost分类器的接收器操作器特征下的区域衡量的,以预测驾驶员是否使用曝光功能(例如驱动器里程)和使用行为功能(例如计算速度得分)进行培训的驾驶功能。
In recent years it has become possible to collect GPS data from drivers and to incorporate this data into automobile insurance pricing for the driver. This data is continuously collected and processed nightly into metadata consisting of mileage and time summaries of each discrete trip taken, and a set of behavioral scores describing attributes of the trip (e.g, driver fatigue or driver distraction) so we examine whether it can be used to identify periods of increased risk by successfully classifying trips that occur immediately before a trip in which there was an incident leading to a claim for that driver. Identification of periods of increased risk for a driver is valuable because it creates an opportunity for intervention and, potentially, avoidance of a claim. We examine metadata for each trip a driver takes and train a classifier to predict whether \textit{the following trip} is one in which a claim occurs for that driver. By achieving a area under the receiver-operator characteristic above 0.6, we show that it is possible to predict claims in advance. Additionally, we compare the predictive power, as measured by the area under the receiver-operator characteristic of XGBoost classifiers trained to predict whether a driver will have a claim using exposure features such as driven miles, and those trained using behavioral features such as a computed speed score.