论文标题

EXFOR-NSR PDF数据库:核知识保存和数据策划的系统

EXFOR-NSR PDF database: a system for nuclear knowledge preservation and data curation

论文作者

Zerkin, V. V., Pritychenko, B., Totans, J., Vrapcenjak, L., Rodionov, A., Shulyak, G. I.

论文摘要

核科学和技术的当前需求包括完整,有据可查的核数据。完整的数据记录需要支持核参考书目,此外仍存储在专用库中的核参考书目,此外还需要用于实际数据。实验核反应数据(EXFOR)和核科学参考(NSR)数据库包含基于主要(期刊)和次要(会议论文集,论文,预印本等)出版物的汇编,以及通过私人通信从作者那里收到的数据。二级图书馆材料和私人通信通常代表用于核数据验证,汇编,评估和传播活动的瓶颈。为了解决此问题,将书目材料扫描到PDF(便携式文档格式)文件中,并在关系数据库中上传。传统的核数据库范围包括元数据和以专用格式得出的数据得出的数字,以适应大量原始核数据出版物。完整的PDF出版物文件存储在关系数据库中,作为二进制大型对象(BLOB)。这种独特的核数据汇编和支持出版物的收集为机器学习应用带来了许多机会。

Current needs of nuclear science and technology include complete, well-documented, and easily verifiable nuclear data. The complete data records require supporting nuclear bibliography, presently stored in dedicated libraries, in addition, to actual data. Experimental nuclear reaction data (EXFOR) and Nuclear Science References (NSR) databases contain compilations based on primary (journals) and secondary (conference proceedings, theses, preprints, etc.) publications, and data received from authors via private communications. The secondary library materials and private communications often represent a bottleneck for nuclear data verification, compilation, evaluation, and dissemination activities. To address this issue, bibliographic materials were scanned into PDF (Portable Document Format) files and uploaded in a relational database. The traditional scope of nuclear databases that includes meta-data and numbers derived from data in specialized formats was broadened to accommodate the large volumes of original nuclear data publications. The complete PDF publication files were stored in a relational database as Binary Large OBjects (BLOB). This unique collection of nuclear data compilations and supporting publications generate many opportunities for machine learning applications.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源