论文标题
一项有关法律判断预测的调查:数据集,指标,模型和挑战
A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges
论文作者
论文摘要
法律判断预测(LJP)采用自然语言处理(NLP)技术来自动根据事实描述来预测判断结果。最近,大规模的公共数据集和NLP研究的进步导致对LJP的兴趣增加。尽管机器和人类性能之间存在明显的差距,但在各种基准数据集中取得了令人印象深刻的结果。在本文中,为了解决当前对现有LJP任务,数据集,模型和评估的全面调查,(1)我们以6种语言分析了31个LJP数据集,请提供其施工过程并定义具有3种不同属性的LJP的分类方法; (2)我们总结了LJP任务不同输出的四个类别下的14个评估指标; (3)我们回顾了12种法律域预识别模型,并突出显示了LJP的3个主要研究指示; (4)我们显示了来自不同法院案件的8个代表性数据集的最新结果,并讨论了公开挑战。本文可以提供最新的全面评论,以帮助读者了解LJP的状态。我们希望促进NLP研究人员和法律专业人员在此问题上的进一步共同努力。
Legal judgment prediction (LJP) applies Natural Language Processing (NLP) techniques to predict judgment results based on fact descriptions automatically. Recently, large-scale public datasets and advances in NLP research have led to increasing interest in LJP. Despite a clear gap between machine and human performance, impressive results have been achieved in various benchmark datasets. In this paper, to address the current lack of comprehensive survey of existing LJP tasks, datasets, models and evaluations, (1) we analyze 31 LJP datasets in 6 languages, present their construction process and define a classification method of LJP with 3 different attributes; (2) we summarize 14 evaluation metrics under four categories for different outputs of LJP tasks; (3) we review 12 legal-domain pretrained models in 3 languages and highlight 3 major research directions for LJP; (4) we show the state-of-art results for 8 representative datasets from different court cases and discuss the open challenges. This paper can provide up-to-date and comprehensive reviews to help readers understand the status of LJP. We hope to facilitate both NLP researchers and legal professionals for further joint efforts in this problem.