大规模结构性潜在属性分析的联合MLE方法

论文标题

大规模结构性潜在属性分析的联合MLE方法

A Joint MLE Approach to Large-Scale Structured Latent Attribute Analysis

论文作者

Gu, Yuqi, Xu, Gongjun

论文摘要

结构化潜在属性模型（SLAM）是一个离散的潜在变量模型的家族，广泛用于教育，心理学和流行病学，用于对多元分类数据进行建模。 SLAM假设多个离散的潜在属性以高度结构化的方式解释了观察到的变量的依赖性。通常，采用最大边缘可能性估计方法，将潜在属性视为随机效应。现代评估数据的增加范围涉及大量观察到的变量和高维的潜在属性。这给经典估计方法带来了挑战，需要新的方法论和对潜在变量建模的理解。在此激励的情况下，我们考虑了对猛击的关节最大似然估计（MLE）方法，将潜在属性视为固定的未知参数。我们研究了样本量，变量数量和潜在属性数量的估计性，一致性和计算，所有这些都可以分歧。我们建立了关节MLE的统计一致性，并提出了有效的算法，可很好地扩展到几个流行的大满贯的大规模数据。仿真研究证明了所提出的方法的出色经验性能。来自国际教育评估的真实数据的应用提供了可解释的认知诊断发现。

Structured Latent Attribute Models (SLAMs) are a family of discrete latent variable models widely used in education, psychology, and epidemiology to model multivariate categorical data. A SLAM assumes that multiple discrete latent attributes explain the dependence of observed variables in a highly structured fashion. Usually, the maximum marginal likelihood estimation approach is adopted for SLAMs, treating the latent attributes as random effects. The increasing scope of modern assessment data involves large numbers of observed variables and high-dimensional latent attributes. This poses challenges to classical estimation methods and requires new methodology and understanding of latent variable modeling. Motivated by this, we consider the joint maximum likelihood estimation (MLE) approach to SLAMs, treating latent attributes as fixed unknown parameters. We investigate estimability, consistency, and computation in the regime where sample size, number of variables, and number of latent attributes all can diverge. We establish the statistical consistency of the joint MLE and propose efficient algorithms that scale well to large-scale data for several popular SLAMs. Simulation studies demonstrate the superior empirical performance of the proposed methods. An application to real data from an international educational assessment gives interpretable findings of cognitive diagnosis.

下载PDF全文

下载文献需遵守相关版权规定

论文标题