首页>
外国专利>
Tracking missing data using provenance traces and data simulation
Tracking missing data using provenance traces and data simulation
展开▼
机译:使用出处跟踪和数据模拟来跟踪丢失的数据
展开▼
页面导航
摘要
著录项
相似文献
摘要
Methods, systems, and computer program products for tracking missing data using provenance traces and data simulation are provided herein. A computer-implemented method includes generating, for each of multiple stages in a data curation sequence, a machine learning model of the data curation sequence, wherein the model is based on historical input records within the data curation sequence, historical output records within the data curation sequence, and provenance data within the data curation sequence; creating a simulated output record based on a detected anomaly corresponding to the data curation sequence; predicting the content of absent input records that precede the simulated output record in the data curation sequence and provenance data corresponding to the simulated output record; and outputting, to a user, in response to a query pertaining to the detected anomaly, the predicted input records and information relating the predicted input records to the detected anomaly.
展开▼