论文标题
端到端多标签网站指纹攻击:检测观点
End-to-End Multi-Tab Website Fingerprinting Attack: A Detection Perspective
论文作者
论文摘要
网站指纹攻击(WFA)的目的是使用户通过匿名网络渠道访问的网站(例如TOR)访问该网站。尽管过去几年取得了显着的进展,但大多数现有方法都会隐含一些人为的假设,即(1)每次仅访问一个网站(即单个TAB),并且(2)网站指纹数据被预处理到每个网站的单个跟踪中。实际上,用户通常会自发地为多个网站打开多个选项卡。的确,在最近的几部作品中已经研究了这种多标签WFA(MT-WFA)设置,但是所有这些设置仍然未能完全尊重现实世界的情况。特别是,网站指纹之间的重叠挑战从未深入研究。在这项工作中,我们将MT-WFA的问题重新定义为检测多个受监视的轨迹,并给定自然的未修剪流量数据,包括受监视的轨迹,未监视的痕迹以及它们之间潜在的不受限制的重叠。这消除了上述假设,超出了所有以前所有WFA方法的传统网站指纹分类的观点。为了解决这个现实的MT-WFA问题,我们制定了一个新颖的网站指纹检测(WFD)模型,能够准确检测所有受监视的痕迹的起点和终点,并共同对其进行分类,鉴于长时间,未绘制的原始流量数据。 WFD是端到端的,将跟踪本地化和网站分类集成到单个统一管道中。为了在MT-WFA环境中启用定量评估,我们介绍了新的性能指标。对几个新建造的基准测试的广泛实验表明,即使有很小的训练集,我们的WFD的精度和效率都超过了准确性和效率的最先进方法。代码可从https://github.com/wfdetector/wfdetection获得
Website fingerprinting attack (WFA) aims to deanonymize the website a user is visiting through anonymous networks channels (e.g., Tor). Despite of remarkable progress in the past years, most existing methods make implicitly a couple of artificial assumptions that (1) only a single website (i.e., single-tab) is visited each time, and (2) website fingerprinting data are pre-trimmed into a single trace per website manually. In reality, a user often open multiple tabs for multiple websites spontaneously. Indeed, this multi-tab WFA (MT-WFA) setting has been studied in a few recent works, but all of them still fail to fully respect the real-world situations. In particular, the overlapping challenge between website fingerprinting has never been investigated in depth. In this work, we redefine the problem of MT-WFA as detecting multiple monitored traces, given a natural untrimmed traffic data including monitored traces, unmonitored traces, and potentially unconstrained overlapping between them. This eliminates the above assumptions, going beyond the conventional single website fingerprint classification perspective taken by all previous WFA methods. To tackle this realistic MT-WFA problem, we formulate a novel Website Fingerprint Detection (WFD) model capable of detecting accurately the start and end points of all the monitored traces and classifying them jointly, given long, untrimmed raw traffic data. WFD is end-to-end, with the trace localization and website classification integrated in a single unified pipeline. To enable quantitative evaluation in our MT-WFA setting, we introduce new performance metrics. Extensive experiments on several newly constructed benchmarks show that our WFD outperforms the state-of-the-art alternative methods in both accuracy and efficiency by a large margin, even with a very small training set. Code is available at https://github.com/WFDetector/WFDetection