SCALABLE IMPLEMENTATIONS OF EXACT DISTINCT COUNTS AND MULTIPLE EXACT DISTINCT COUNTS IN DISTRIBUTED QUERY PROCESSING SYSTEMS

Abstract

Scalable implementations of exact distinct counts and multiple exact distinct counts in distributed query processing systems are provided via systems and devices. Distinct counts and multiple exact distinct counts for identifiers/values are performed based on keys. For distinct counts, datasets including data fields are sorted by values of fields and divided into balanced partitions in distributed servers. Subsets of fields with the same value are partitioned together. Key presence is determined for subsets on each partition, and the number of instances for the key is aggregated for exact distinct counts of values. For multiple distinct counts, fields of a dataset are combined by un-pivoting field columns. Compound keys are generated for combined fields from field identifiers of the combined fields and values of another field. Totals of unique values of the combined fields are determined for values in the counted field based on the compound keys.
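
The two procedures described in the abstract can be illustrated with a small single-process sketch. This is only one reading of the abstract, not the patented implementation: partitions are plain Python lists standing in for distributed servers, and every name here (`range_partition`, `local_presence`, `exact_distinct_count`, `unpivot`, `multiple_distinct_counts`, `sample_rows`, and the `region`/`user`/`device` columns) is hypothetical. It also assumes that all counted values are mutually comparable so they can be sorted.

```python
from collections import defaultdict


def range_partition(rows, field, num_partitions):
    """Sort rows by `field` and cut them into roughly equal partitions.

    Cuts happen only where the value changes, so every row sharing a
    value lands on the same partition (fewer partitions may result).
    """
    ordered = sorted(rows, key=lambda r: r[field])
    target = max(1, -(-len(ordered) // num_partitions))  # ceiling division
    partitions, current = [], []
    for row in ordered:
        if len(current) >= target and row[field] != current[-1][field]:
            partitions.append(current)
            current = []
        current.append(row)
    if current:
        partitions.append(current)
    return partitions


def local_presence(partition, key_field, value_field):
    """Record each (key, value) pair present on this partition once."""
    return {(row[key_field], row[value_field]) for row in partition}


def exact_distinct_count(rows, key_field, value_field, num_partitions=3):
    """Count distinct `value_field` values per `key_field` value."""
    totals = defaultdict(int)
    for partition in range_partition(rows, value_field, num_partitions):
        # A value lives on exactly one partition, so a (key, value) pair
        # is never seen twice across partitions and summing stays exact.
        for key, _value in local_presence(partition, key_field, value_field):
            totals[key] += 1
    return dict(totals)


def unpivot(rows, key_field, counted_fields):
    """Turn each counted column into its own row with a compound key.

    The compound key pairs the value of `key_field` with the identifier
    (column name) of the counted field.
    """
    return [
        {"compound_key": (row[key_field], name), "value": row[name]}
        for row in rows
        for name in counted_fields
    ]


def multiple_distinct_counts(rows, key_field, counted_fields, num_partitions=3):
    """Distinct counts for several columns in one pass over unpivoted rows."""
    long_rows = unpivot(rows, key_field, counted_fields)
    return exact_distinct_count(long_rows, "compound_key", "value", num_partitions)


# Hypothetical sample data, not taken from the patent.
sample_rows = [
    {"region": "east", "user": "u1", "device": "d1"},
    {"region": "east", "user": "u2", "device": "d1"},
    {"region": "east", "user": "u1", "device": "d2"},
    {"region": "west", "user": "u3", "device": "d1"},
    {"region": "west", "user": "u3", "device": "d1"},
]

users_per_region = exact_distinct_count(sample_rows, "region", "user")
counts_per_field = multiple_distinct_counts(sample_rows, "region", ["user", "device"])
```

In the sample rows, region `east` has two distinct users and two distinct devices, and `west` has one of each. `users_per_region` is keyed by region, while `counts_per_field` is keyed by the compound `(region, field name)` pair. Cutting partitions only at value boundaries is what makes the per-partition counts safe to sum, at the cost of partitions that may be slightly unbalanced when one value is very frequent.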

Bibliographic data

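  • Publication number: WO2020112421A1

  • Patent type: (not specified)

  • Publication date: 2020-06-04

  • Original format: PDF

  • Applicant/Assignee: MICROSOFT TECHNOLOGY LICENSING LLC

  • Application number: WO2019US62085

  • Inventor: VISWANADHA SREENIVASA

  • Filing date: 2019-11-19

  • Classification codes: G06F16/2455; G06F16/27; G06F16/2453

  • Country: WO

  • Date indexed: 2022-08-21 11:10:55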
