首页> 外国专利> SCALABLE IMPLEMENTATIONS OF EXACT DISTINCT COUNTS AND MULTIPLE EXACT DISTINCT COUNTS IN DISTRIBUTED QUERY PROCESSING SYSTEMS

SCALABLE IMPLEMENTATIONS OF EXACT DISTINCT COUNTS AND MULTIPLE EXACT DISTINCT COUNTS IN DISTRIBUTED QUERY PROCESSING SYSTEMS

机译:分布式查询处理系统中的精确不同计数和多种精确不同计数的可扩展实现

摘要

Scalable implementations of exact distinct counts and multiple exact distinct counts in distributed query processing systems are implemented via systems and devices. Distinct counts and multiple exact distinct counts for identifiers/values are performed based on keys. For distinct counts, datasets including data fields are sorted by values of fields and divided into balanced partitions in distributed servers. Subsets of fields with the same value are partitioned together. Key presence is determined for subsets on each partition, and the number of instances for the key are aggregated for exact distinct counts of values. For multiple distinct counts, fields of a dataset are combined by un-pivoting field columns. Compound keys are generated for combined fields from field identifiers of the combined fields and values of another field. Totals of unique values of the combined fields are determined for values in the counted field based on the compound keys.
机译:通过系统和设备实现了分布式查询处理系统中精确不同的计数和多种精确不同计数的可扩展实现。 基于键执行标识符/值的不同计数和多种精确的不同计数。 对于不同的计数,包括数据字段的数据集按字段的值排序,并分为分布式服务器中的平衡分区。 具有相同值的字段的子集在一起分区。 对于每个分区上的子集确定键的存在,并且键的实例数量被聚合以用于精确不同的值计数。 对于多个不同的计数,数据集的字段由未旋转的字段列组合。 从组合字段的字段标识符和另一个字段的值的字段标识符生成复合键。 基于复合键的计数字段中的值确定组合字段的唯一值的总数。

著录项

  • 公开/公告号EP3867773A1

    专利类型

  • 公开/公告日2021-08-25

    原文格式PDF

  • 申请/专利权人 MICROSOFT TECHNOLOGY LICENSING LLC;

    申请/专利号EP20190820974

  • 发明设计人 VISWANADHA SREENIVASA;

    申请日2019-11-24

  • 分类号G06F16/2455;

  • 国家 EP

  • 入库时间 2022-08-24 20:46:40

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号