Data Intensive Workflows (a.k.a. scientific workflows) are routinely used in most scientific disciplines today, especially in the context of parallel and distributed computing. Workflows provide a systematic way of describing the analysis and rely on workflow management systems to execute the complex analyses on a variety of distributed resources. They are at the interface of end-users and computing infrastructures. With the drastic increase of raw data volume in every domain, they play an even more critical role to assist scientists in organizing and processing their data and to leverage HPC or HTC resources.
This workshop focuses on the many facets of data-intensive workflow management systems, ranging from job execution to service management and the coordination of data, service and job dependencies. The workshop therefore covers a broad range of issues in the scientific workflow lifecycle that include: data intensive workflows representation and enactment; designing workflow composition interfaces; workflow mapping techniques that may optimize the execution of the workflow; workflow enactment engines that need to deal with failures in the application and execution environment; and a number of computer science problems related to scientific workflows such as semantic technologies, compiler methods, fault detection and tolerance. The topics of the workshop include but are not limited to:
Processing tasks with dependencies is a very long-standing research topic for the SuperComputing community. Modern workflow environment are integrating core parallel and/or distributed-computing capability with a rich experiment design and support environment. They have become companions for scientists in nearly any discipline. They facilitate the access to compute-intensive resources for non-expert users. Scientific workflow support environment and the tooling attached for data management, traceability, experiments reporting, community management, sharing, etc, have become critical components for producing scientific results from wealth of data and complex high performance computers. This is the tenth WORKS event. Out of the nine former events, the two first ones were collocated with the HPDC conference and the seven following ones were collocated with SuperComputing. This event is now well recognized by its community and it attracts a regular audience of computer scientists, workflow designers and workflow users over the years.