首页> 外国专利> METHOD AND SYSTEM FOR GENERATING SYNTHETIC DATA USING A REGRESSION MODEL WHILE PRESERVING STATISTICAL PROPERTIES OF UNDERLYING DATA

METHOD AND SYSTEM FOR GENERATING SYNTHETIC DATA USING A REGRESSION MODEL WHILE PRESERVING STATISTICAL PROPERTIES OF UNDERLYING DATA

机译:用于使用回归模型生成合成数据的方法和系统,同时保留底层数据的统计特性

摘要

A method for generating a synthetic dataset involves generating discretized synthetic data based on driving a model of a cumulative distribution function (CDF) with random numbers. The CDF is based on a source dataset. The method further includes generating the synthetic dataset from the discretized synthetic data by selecting, for inclusion into the synthetic dataset, values from a multitude of entries of the source dataset, based on the discretized synthetic data, and providing the synthetic dataset to a downstream application that is configured to operate on the source dataset.
机译:生成合成数据集的方法涉及基于驱动具有随机数的累积分布函数(CDF)的模型来生成离散化的合成数据。 CDF基于源数据集。该方法还包括通过选择以将合成数据集选择到源数据集的多个条目的值,基于离散化的合成数据来生成来自离散的合成数据的合成数据集,并将合成数据集提供给下游应用程序这被配置为在源数据集上运行。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号