多个文本是在线学习中的限制因素：跨语言的知识网络的相似性

论文标题

多个文本是在线学习中的限制因素：跨语言的知识网络的相似性

Multiple Texts as a Limiting Factor in Online Learning: Quantifying (Dis-)similarities of Knowledge Networks across Languages

论文作者

Mehler, Alexander, Hemati, Wahed, Welke, Pascal, Konca, Maxim, Uslu, Tolga

论文摘要

我们检验了以下假设：通过Wikipedia获得有关给定主题的信息的程度取决于咨询的语言。控制尺寸因子，我们研究了25个主题领域的这一假设。由于Wikipedia是基于Web的信息格局的核心部分，因此这表明与语言有关的语言偏见。因此，本文讨论了维基百科是否表现出这种语言相对论。从教育科学的角度来看，本文开发了信息格局的计算模型，从中绘制了多个文本作为基于Web的阅读的典型输入。为此，它开发了信息景观不同部分的内部和互文性相似性的混合模型，并以35种语言和相应的Wikipedias的示例来测试该模型。通过这种方式，本文在阅读研究，教育科学，维基百科研究和计算语言学之间建立了桥梁。

We test the hypothesis that the extent to which one obtains information on a given topic through Wikipedia depends on the language in which it is consulted. Controlling the size factor, we investigate this hypothesis for a number of 25 subject areas. Since Wikipedia is a central part of the web-based information landscape, this indicates a language-related, linguistic bias. The article therefore deals with the question of whether Wikipedia exhibits this kind of linguistic relativity or not. From the perspective of educational science, the article develops a computational model of the information landscape from which multiple texts are drawn as typical input of web-based reading. For this purpose, it develops a hybrid model of intra- and intertextual similarity of different parts of the information landscape and tests this model on the example of 35 languages and corresponding Wikipedias. In this way the article builds a bridge between reading research, educational science, Wikipedia research and computational linguistics.

下载PDF全文

下载文献需遵守相关版权规定

论文标题