论文标题
识别的一般理论
A general theory of identification
论文作者
论文摘要
说可以从数据中识别数量是什么意思?在参数统计模型的上下文中,统计学家似乎同意一个定义。这个定义提出了重要的问题:参数是唯一可以识别的数量吗?在参数统计外,识别概念是否有意义?它甚至需要统计模型的概念吗?在计量经济学,生物建模以及因果推断等统计的某些子领域,已经讨论了对这些问题的部分和特质答案。本文提出了一种统一的识别理论,该理论结合了参数和非参数模型的现有定义,并正式化了识别分析过程。通过一系列示例和两个扩展案例研究来说明该框架的适用性。
What does it mean to say that a quantity is identifiable from the data? Statisticians seem to agree on a definition in the context of parametric statistical models --- roughly, a parameter $θ$ in a model $\mathcal{P} = \{P_θ: θ\in Θ\}$ is identifiable if the mapping $θ\mapsto P_θ$ is injective. This definition raises important questions: Are parameters the only quantities that can be identified? Is the concept of identification meaningful outside of parametric statistics? Does it even require the notion of a statistical model? Partial and idiosyncratic answers to these questions have been discussed in econometrics, biological modeling, and in some subfields of statistics like causal inference. This paper proposes a unifying theory of identification that incorporates existing definitions for parametric and nonparametric models and formalizes the process of identification analysis. The applicability of this framework is illustrated through a series of examples and two extended case studies.