Results 1 - 10
of
12
A taxonomy of Data Grids for distributed data sharing, management, and processing
- ACM Computing Surveys
"... Data Grids have been adopted as the platform for scientific communities that need to share, access, transport, process and manage large data collections distributed worldwide. They combine high-end computing technologies with high-performance networking and wide-area storage management techniques. I ..."
Abstract
-
Cited by 61 (9 self)
- Add to MetaCart
(Show Context)
Data Grids have been adopted as the platform for scientific communities that need to share, access, transport, process and manage large data collections distributed worldwide. They combine high-end computing technologies with high-performance networking and wide-area storage management techniques. In this paper, we discuss the key concepts behind Data Grids and compare them with other data sharing and distribution paradigms such as content delivery networks, peer-to-peer networks and distributed databases. We then provide comprehensive taxonomies that cover various aspects of architecture, data transportation, data replication and resource allocation and scheduling. Finally, we map the proposed taxonomy to various Data Grid systems not only to validate the taxonomy but also to identify areas for future exploration. Through this taxonomy, we aim to categorise existing systems to better understand their goals and their methodology. This would help evaluate their applicability for solving similar problems. This taxonomy also provides a ”gap analysis ” of this area through which researchers can potentially identify new issues for investigation. Finally, we hope that the proposed taxonomy and mapping also helps to provide an easy way for new practitioners to understand this complex area of research. 1
Auto-scaling to minimize cost and meet application deadlines in cloud workflows"
- in Proceedings of International Conference for High Performance Computing, Networking, Storage and Analysis,
, 2011
"... ABSTRACT A goal in cloud computing is to allocate (and thus pay for) only those cloud resources that are truly needed. To date, cloud practitioners have pursued schedule-based (e.g., time-of-day) and rule-based mechanisms to attempt to automate this matching between computing requirements and compu ..."
Abstract
-
Cited by 51 (1 self)
- Add to MetaCart
(Show Context)
ABSTRACT A goal in cloud computing is to allocate (and thus pay for) only those cloud resources that are truly needed. To date, cloud practitioners have pursued schedule-based (e.g., time-of-day) and rule-based mechanisms to attempt to automate this matching between computing requirements and computing resources. However, most of these "auto-scaling" mechanisms only support simple resource utilization indicators and do not specifically consider both user performance requirements and budget concerns. In this paper, we present an approach whereby the basic computing elements are virtual machines (VMs) of various sizes/costs, jobs are specified as workflows, users specify performance requirements by assigning (soft) deadlines to jobs, and the goal is to ensure all jobs are finished within their deadlines at minimum financial cost. We accomplish our goal by dynamically allocating/deallocating VMs and scheduling tasks on the most cost-efficient instances. We evaluate our approach in four representative cloud workload patterns and show cost savings from 9.8% to 40.4% compared to other approaches.
Scientific workflow management: between generality and applicability
- International Workshop on Grid and Peer-to-Peer based Workflows in conjunction with the 5th International Conference on Quality Software
"... In a Problem Solving Environment (PSE), a Scientific Workflow Management System (SWMS) provides a meta environment for managing activities and data in scientific experiments, for prototyping experimental computing systems and for orchestrating the runtime system behaviour. The realisation of a SWMS ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
(Show Context)
In a Problem Solving Environment (PSE), a Scientific Workflow Management System (SWMS) provides a meta environment for managing activities and data in scientific experiments, for prototyping experimental computing systems and for orchestrating the runtime system behaviour. The realisation of a SWMS is often driven by domain specific applications and thus is at application level. Investigating the common characteristics in domain specific SWMSs and encapsulating them in a generic framework improve the reusability of the SWMS components and reduce the costs for introducing an e-Science framework in a new science domain. In this position paper, we present our research in an ongoing project: Virtual Laboratory for e-Science (VLe). In the VL-e project, we are building a generic e-Science framework which will support scientists from different domains to share their knowledge and to perform specific experiments. We summarise the lessons we have learned from a previous VL-e implementation, and discuss the plan for improving the quality of the SWMS support in the VL-e framework. 1
Title Security-driven heuristics and a fast genetic algorithm for trusted grid job scheduling Security-Driven Heuristics and A Fast Genetic Algorithm for Trusted Grid Job Scheduling
"... Abstract In this paper, our contributions are two-fold: First ..."
(Show Context)
grid workflow
"... On the use of meta-heuristics to increase the efficiency of online ..."
(Show Context)
Cost-based Scheduling of Scientific Workflow Applications on Utility Grids
"... Abstract—over the last few years, Grid technologies have progressed towards a service-oriented paradigm that enables a new way of service provisioning based on utility computing models. Users consume these services based on their QoS (Quality of Service) requirements. In such “pay-per-use ” Grids, w ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract—over the last few years, Grid technologies have progressed towards a service-oriented paradigm that enables a new way of service provisioning based on utility computing models. Users consume these services based on their QoS (Quality of Service) requirements. In such “pay-per-use ” Grids, workflow execution cost must be considered during scheduling based on users ’ QoS constraints. In this paper, we propose a cost-based workflow scheduling algorithm that minimizes execution cost while meeting the deadline for delivering results. It can also adapt to the delays of service executions by rescheduling unexecuted tasks. We also attempt to optimally solve the task scheduling problem in branches with several sequential tasks by modeling the branch as a Markov Decision Process and using the value iteration method. I.
People’s Republic of China SUMMARY
"... A probabilistic strategy for temporal constraint management in scientific workflow systems ‡ ..."
Abstract
- Add to MetaCart
A probabilistic strategy for temporal constraint management in scientific workflow systems ‡
Smart Distribution of Bio-Signal Processing Tasks in M-health
"... Abstract. The past few years have witnessed a rapid advancement of mobile healthcare systems. However, in the mobile computing environment, the resource fluctuations, stringent application requirements and user mobility have severely hindered the performance and reliability of the healthcare service ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract. The past few years have witnessed a rapid advancement of mobile healthcare systems. However, in the mobile computing environment, the resource fluctuations, stringent application requirements and user mobility have severely hindered the performance and reliability of the healthcare service delivery. The current approaches to solve this resource supply and service demand mismatch problem either limit the adaptation to an isolated node or require significant user's involvement. Given the distributed processing paradigm of mhealth system, we propose that a new adaptation approach could be dynamically redistributing processing tasks across distributed nodes. This PhD research addresses two main issues to validate this approach: (1) computation of a suitable assignment of tasks at compile-time or run-time; and (2) dynamic distribution of tasks across the nodes according to this new assignment at run-time. 1
Copyright © 2007 Inderscience Enterprises Ltd. Error recovery mechanism for grid-based workflow within SLA context
"... Service Level Agreements (SLAs), as stated by Sahai et al. (2003), are currently one of the major research topics in grid computing, as they serve as a foundation for reliable and predictable job execution at remote grid sites. SLAs are ..."
Abstract
- Add to MetaCart
Service Level Agreements (SLAs), as stated by Sahai et al. (2003), are currently one of the major research topics in grid computing, as they serve as a foundation for reliable and predictable job execution at remote grid sites. SLAs are
Toward a Tool for Scheduling Application Workflows onto Distributed Grid Systems
, 2006
"... In this dissertation, we present a design and implementation of a tool for auto-matic mapping and scheduling of large scientific application workflows onto dis-tributed, heterogeneous Grid environments. The thesis of this work is that plan-ahead, application-independent scheduling of workflow applic ..."
Abstract
- Add to MetaCart
In this dissertation, we present a design and implementation of a tool for auto-matic mapping and scheduling of large scientific application workflows onto dis-tributed, heterogeneous Grid environments. The thesis of this work is that plan-ahead, application-independent scheduling of workflow applications based on perfor-mance models can reduce the turnaround time for Grid execution of the application, reducing burden of Grid application development. We applied the scheduling strate-gies successfully to Grid applications from the domains of bio-imaging and astronomy and demonstrated the effectiveness and efficiency of the scheduling approaches. We also proposed and evaluated a novel scheduling heuristic based on a middle-out traver-sal of the application workflow. A study showed that jobs have to wait in batch queues for a considerable amount of time before they begin execution. Schedulers must consider batch queue waiting times when scheduling Grid applications onto resources with batch queue front ends. Hence, we developed a smart scheduler that considers estimates of batch queue wait