TY - GEN
T1 - The benefits of service choreography for data-intensive computing
AU - Barker, Adam
AU - Besana, Paolo
AU - Robertson, David
AU - Weissman, Jon B.
PY - 2009
Y1 - 2009
N2 - As the number of services and the size of data involved in workflows increases, centralised orchestration techniques are reaching the limits of scalability. In the classic orchestration model, all data pass through a centralised engine, which results in unnecessary data transfer, wasted bandwidth and the engine to become a bottleneck to the execution of a workflow. Choreography techniques, although more complex to model offer a decentralised alternative and are the optimal architecture for data-centric workflows; data are passed directly to where they are required, at the next service in the workflow. While orchestration is the dominant architectural approach, there are relatively few choreography languages and even fewer concrete implementations. This papers contributions are twofold. Firstly we argue the case for choreography in data-intensive computing, and demonstrate through workflow patterns the advantages in terms of scalability when a choreography architecture is adopted. Secondly we introduce the Light Weight Coordination Calculus (LCC), a type of process calculus used to formally define choreographies, and the OpenKnowledge framework, a choreography-based architecture, providing the functionality for peers to coordinate in an open peer-to-peer system. Through LCC and the OpenKnowledge framework we practically demonstrate how choreography can be achieved in a lightweight manner with a comparatively simple process language.
AB - As the number of services and the size of data involved in workflows increases, centralised orchestration techniques are reaching the limits of scalability. In the classic orchestration model, all data pass through a centralised engine, which results in unnecessary data transfer, wasted bandwidth and the engine to become a bottleneck to the execution of a workflow. Choreography techniques, although more complex to model offer a decentralised alternative and are the optimal architecture for data-centric workflows; data are passed directly to where they are required, at the next service in the workflow. While orchestration is the dominant architectural approach, there are relatively few choreography languages and even fewer concrete implementations. This papers contributions are twofold. Firstly we argue the case for choreography in data-intensive computing, and demonstrate through workflow patterns the advantages in terms of scalability when a choreography architecture is adopted. Secondly we introduce the Light Weight Coordination Calculus (LCC), a type of process calculus used to formally define choreographies, and the OpenKnowledge framework, a choreography-based architecture, providing the functionality for peers to coordinate in an open peer-to-peer system. Through LCC and the OpenKnowledge framework we practically demonstrate how choreography can be achieved in a lightweight manner with a comparatively simple process language.
KW - Algorithms
KW - Design
UR - https://www.scopus.com/pages/publications/70450233896
UR - https://www.scopus.com/pages/publications/70450233896#tab=citedBy
U2 - 10.1145/1552315.1552317
DO - 10.1145/1552315.1552317
M3 - Conference contribution
AN - SCOPUS:70450233896
SN - 9781605585888
T3 - Proc. 7th Int. Workshop on Challenges of Large Applications in Distributed Environments, CLADE'09, Co-located with the 2009 Int. Symposium on High Performance Distributed Computing Conf., HPDC'09
SP - 1
EP - 10
BT - Proc. 7th Int. Workshop on Challenges of Large Applications in Distributed Environments, CLADE'09, Co-located with the 2009 Int. Symposium on High Performance Distributed Computing Conf., HPDC'09
T2 - 7th International Workshop on Challenges of Large Applications in Distributed Environments, CLADE'09, Co-located with the 2009 International Symposium on High Performance Distributed Computing Conference, HPDC'09
Y2 - 9 June 2009 through 10 June 2009
ER -