Parallelization and scheduling of data intensive particle physics analysis jobs on clusters of PCs

Abstract

Summary form only given. Scheduling policies are proposed for parallelizing data intensive particle physics analysis applications on computer clusters. Particle physics analysis jobs require the analysis of tens of thousands of particle collision events, each event typically requiring 200 ms of processing time and 600 KB of data. Many jobs are launched concurrently by a large number of physicists. At first glance, particle physics jobs seem easy to parallelize, since particle collision events can be processed independently of one another. However, since large amounts of data need to be accessed, the real challenge lies in making efficient use of the underlying computing resources. We propose several job parallelization and scheduling policies aimed at reducing job processing times and at increasing the sustainable load of a cluster server. Since particle collision events are usually reused by several jobs, cache based job splitting strategies considerably increase cluster utilization and reduce job processing times. Compared with straightforward job scheduling on a processing farm, cache based first in first out job splitting speeds up average response times by an order of magnitude and reduces job waiting times in the system's queues from hours to minutes. By scheduling the jobs out of order, according to the availability of their collision events in the node disk caches, response times are further reduced, especially at high loads. In the delayed scheduling policy, job requests are accumulated during a time period, divided into subjob requests according to a parameterizable subjob size, and scheduled at the beginning of the next time period according to the availability of their data segments within the node disk caches. Delayed scheduling sustains a load close to the maximal theoretically sustainable load of a cluster, but at the cost of longer average response times. Finally, we propose an adaptive delay scheduling approach, where the scheduling delay is adapted to the current load. This last scheduling approach sustains very high loads and offers low response times at normal loads.
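
The cache based splitting and delayed scheduling ideas above can be illustrated with a minimal Python sketch. It is not the authors' implementation. Only the 200 ms per event figure comes from the abstract; the fixed segment size of 1000 events, the per-node set of cached segment ids, and the names `Node`, `split_job`, `cached_fraction`, `assign` and `schedule_period` are assumptions made for illustration.

```python
from dataclasses import dataclass, field

EVENT_CPU_S = 0.2       # 200 ms of processing per event (figure from the abstract)
SUBJOB_EVENTS = 1000    # assumed subjob size in events (hypothetical value)


@dataclass
class Node:
    name: str
    cached: set = field(default_factory=set)   # segment ids held in the node's disk cache
    queued_s: float = 0.0                      # processing time already assigned


def split_job(first_event, n_events, subjob_events=SUBJOB_EVENTS):
    """Return the ids of the fixed-size segments that cover the job's events."""
    first = first_event // subjob_events
    last = (first_event + n_events - 1) // subjob_events
    return list(range(first, last + 1))


def cached_fraction(segments, nodes):
    """Fraction of a job's segments present in at least one node cache."""
    hits = sum(any(seg in n.cached for n in nodes) for seg in segments)
    return hits / len(segments)


def assign(segments, nodes, subjob_events=SUBJOB_EVENTS):
    """Send each subjob to a node caching its segment, else to the least loaded node."""
    plan = {}
    for seg in segments:
        holders = [n for n in nodes if seg in n.cached]
        target = min(holders or nodes, key=lambda n: n.queued_s)
        # Simplification: every subjob is charged the cost of a full segment.
        target.queued_s += subjob_events * EVENT_CPU_S
        target.cached.add(seg)          # the segment is cached there once processed
        plan[seg] = target.name
    return plan


def schedule_period(jobs, nodes):
    """Delayed scheduling for one period: jobs with more cached data go first."""
    split = {job_id: split_job(first, n) for job_id, (first, n) in jobs.items()}
    order = sorted(split, key=lambda j: cached_fraction(split[j], nodes), reverse=True)
    return [(job_id, assign(split[job_id], nodes)) for job_id in order]


nodes = [Node("n0", cached={0, 1}), Node("n1", cached={5})]
jobs = {"A": (3000, 2000), "B": (0, 2000)}     # job id -> (first event, event count)
for job_id, plan in schedule_period(jobs, nodes):
    print(job_id, plan)
```

In this toy run, job B is scheduled ahead of job A even though it was submitted later, because both of its segments are already cached on `n0`; job A's uncached segments go to the least loaded node. An adaptive variant would vary the length of the accumulation period with the current load, which this sketch does not model.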

Bibliographic details

Source: International Parallel and Distributed Processing Symposium, 2004, 1 page
Authors: Ponce S.; Hersch R.D.
Original format: PDF
Chinese Library Classification: TP311.133-53
Keywords: workstation clusters; physics computing; cache storage; job shop scheduling; adaptive scheduling; resource allocation; processor scheduling; physics; queueing theory; data analysis; data intensive particle physics analysis job; PC clusters; particle collision event; job parallelization policy; job scheduling policy; cache based first in first out job splitting; system queue; disk node cache; adaptive delay scheduling approach

