Relative Quantitative Predictions of Unseen Events with Limited Data Availability using Machine Learning

Year
2022
Author(s)
Nidhi Parikh - Los Alamos National Laboratory
Garrison Flynn - Los Alamos National Laboratory
Anand Iyer - Los Alamos National Laboratory
Dan Archer - Oak Ridge National Laboratory
Thomas Karnowski - ORNL
Andrew Nicholson - NNSA, Defense Nuclear Nonproliferation R&D (NA-22)
Monica Maceira - Oak Ridge National Laboratory
Omar Marcillo - Oak Ridge National Laboratory
Will Ray - Oak Ridge National Laboratory
Grady Wetherington - Oak Ridge National Laboratory
Michael Willis - Oak Ridge National Laboratory
Abstract

One of the activities of interest for nuclear non-proliferation is monitoring nuclear facilities. This is a challenging task because in realistic scenarios, the events of interest occur under variable environmental or operating conditions and a limited set of labeled data, covering only a subset of events, is available. Predictions about new or future events needs to be made based on the subset of data available. Traditional machine learning algorithms such as supervised (e.g., classification) and unsupervised learning work entirely with a labeled and unlabeled dataset, respectively, and couldn’t be used to make predictions about unseen events directly. In this work, we use concepts from transductive learning (a form of semi-supervised learning), clustering, and other distance-based methods to make relative quantitative predictions about events that may not be covered by available labeled data. We apply our approach to prediction of reactor power level where data about intermittent power level may not be covered by the labeled dataset. We term this problem as an intermittent data evaluation and evaluate our approach using sensors positioned near a collocated research nuclear reactor and reprocessing facility at Oak Ridge National Laboratory for the Multi-Informatics for Nuclear Operations Scenarios (MINOS) venture.