论文标题
用于秘密网络分析的合成网络生成器
A Synthetic Network Generator for Covert Network Analytics
论文作者
论文摘要
我们研究社交网络,并专注于秘密(也称为隐藏)网络,例如恐怖分子或犯罪网络。他们的结构,会员和活动是非法的。因此,有关秘密网络的数据通常不完整且部分不正确,从而使这些网络的结构和活动充满挑战。出于法律原因,研究人员无法访问有关主动秘密网络的真实数据。为了应对这些挑战,我们在这里介绍一个网络生成器,用于合成网络,该网络在统计上与真实的网络相似,但没有有关其成员的个人信息。发电机使用有关真实或想象中的秘密组织网络的统计数据。它生成了网络组的随机块模型的随机实例,但保留了该网络组织结构。这种匿名网络的直接使用是为他们培训研究和分析工具,以查找秘密网络的结构和动态。由于这些合成网络的边缘和社区集有所不同,因此可以用作网络分析的新来源。首先,它们提供了有关原始网络的数据的替代解释。这些替代解释的概率分布可以实现新的网络分析。分析师可以找到频繁的社区结构,因此在扰动下稳定。他们还可以分析稳定性如何随扰动的强度而变化。对于秘密网络,分析师可以量化统计上预期的拦截结果。这种分析适用于数据不完整或部分不正确的所有复杂网络。
We study social networks and focus on covert (also known as hidden) networks, such as terrorist or criminal networks. Their structures, memberships and activities are illegal. Thus, data about covert networks is often incomplete and partially incorrect, making interpreting structures and activities of such networks challenging. For legal reasons, real data about active covert networks is inaccessible to researchers. To address these challenges, we introduce here a network generator for synthetic networks that are statistically similar to a real network but void of personal information about its members. The generator uses statistical data about a real or imagined covert organization network. It generates randomized instances of the Stochastic Block model of the network groups but preserves this network organizational structure. The direct use of such anonymized networks is for training on them the research and analytical tools for finding structure and dynamics of covert networks. Since these synthetic networks differ in their sets of edges and communities, they can be used as a new source for network analytics. First, they provide alternative interpretations of the data about the original network. The distribution of probabilities for these alternative interpretations enables new network analytics. The analysts can find community structures which are frequent, therefore stable under perturbations. They may also analyze how the stability changes with the strength of perturbation. For covert networks, the analysts can quantify statistically expected outcomes of interdiction. This kind of analytics applies to all complex network in which the data are incomplete or partially incorrect.