论文标题

Nela-Local:美国本地新闻文章的数据集,用于研究县级新闻生态系统

NELA-Local: A Dataset of U.S. Local News Articles for the Study of County-level News Ecosystems

论文作者

Horne, Benjamin D., Gruppi, Maurício, Joseph, Kenneth, Green, Jon, Wihbey, John P., Adalı, Sibel

论文摘要

在本文中,我们介绍了一个超过140万个在线新闻文章的数据集,该文章来自313个本地新闻媒体,在20个月内(2020年4月4日至2021年12月31日之间)发表了一个数据集。这些媒体涵盖了美国各地的各种社区。为了估算本文文章的当地观众的特征,数据包括县级元数据,包括人口统计,2020年总统选举投票股和美国人口普查局的社区韧性估计。 Nela-Local数据集可在以下网址找到:https://dataverse.harvard.edu/dataset.xhtml?

In this paper, we present a dataset of over 1.4M online news articles from 313 local U.S. news outlets published over 20 months (between April 4th, 2020 and December 31st, 2021). These outlets cover a geographically diverse set of communities across the United States. In order to estimate characteristics of the local audience, included with this news article data is a wide range of county-level metadata, including demographics, 2020 Presidential Election vote shares, and community resilience estimates from the U.S. Census Bureau. The NELA-Local dataset can be found at: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/GFE66K.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源