Paper Title

"Way back then": A Data-driven View of 25+ years of Web Evolution

Authors

Vibhor Agarwal, Nishanth Sastry

Abstract

Since the inception of the first web page three decades ago, the Web has evolved considerably: from the static HTML pages of the early days to today's dynamic web pages, and from the mainly text-based pages of the 1990s to today's multimedia-rich pages. Although much of this is known anecdotally, to our knowledge there is no quantitative documentation of the extent and timing of these changes. This paper attempts to address this gap in the literature by looking at the top 100 Alexa websites over more than 25 years, using snapshots from the Internet Archive's "Wayback Machine" (archive.org). We study the changes in popularity, from Geocities and Yahoo! in the mid-to-late 1990s to the likes of Google, Facebook, and TikTok today. We also look at different categories of websites and their popularity over the years, and find evidence for a decline in the popularity of news and education-related websites, which have been replaced by streaming media and social networking sites. We explore the emergence and relative prevalence of different MIME types (text vs. image vs. video vs. JavaScript and JSON) and study whether the use of text on the Internet is declining.
