首页>
外国专利>
SCALABLE IMPLEMENTATIONS OF EXACT DISTINCT COUNTS AND MULTIPLE EXACT DISTINCT COUNTS IN DISTRIBUTED QUERY PROCESSING SYSTEMS
SCALABLE IMPLEMENTATIONS OF EXACT DISTINCT COUNTS AND MULTIPLE EXACT DISTINCT COUNTS IN DISTRIBUTED QUERY PROCESSING SYSTEMS
展开▼
机译:分布式查询处理系统中的精确明显计数和多种精确不同计数的可扩展实现
展开▼
页面导航
摘要
著录项
相似文献
摘要
Scalable implementations of exact distinct counts and multiple exact distinct counts in distributed query processing systems are implemented via systems and devices. Distinct counts and multiple exact distinct counts for identifiers/values are performed based on keys. For distinct counts, datasets including data fields are sorted by values of fields and divided into balanced partitions in distributed servers. Subsets of fields with the same value are partitioned together. Key presence is determined for subsets on each partition, and the number of instances for the key are aggregated for exact distinct counts of values. For multiple distinct counts, fields of a dataset are combined by un-pivoting field columns. Compound keys are generated for combined fields from field identifiers of the combined fields and values of another field. Totals of unique values of the combined fields are determined for values in the counted field based on the compound keys.
展开▼