Paper title
NELA-Local: A Dataset of U.S. Local News Articles for the Study of County-level News Ecosystems
Paper authors
Paper abstract
In this paper, we present a dataset of over 1.4M online news articles from 313 local U.S. news outlets published over 20 months (between April 4th, 2020 and December 31st, 2021). These outlets cover a geographically diverse set of communities across the United States. In order to estimate characteristics of the local audience, included with this news article data is a wide range of county-level metadata, including demographics, 2020 Presidential Election vote shares, and community resilience estimates from the U.S. Census Bureau. The NELA-Local dataset can be found at: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/GFE66K.