Arvados-DCO-1.1-Signed-off-by: Peter Amstutz <pamstutz@veritasgenetics.com>
SPDX-License-Identifier: CC-BY-SA-3.0
{% endcomment %}
-Arvados supports federated workflows, where different steps of a running workflow may execute on different clusters. Arvados manages data transfer and delegation of credentials, so this as easy as simply adding cluster target hints to your existing workflow. This supports running analysis on geographically dispersed data (avoiding expensive data transfers by sending the computation to the data) and "hybrid cloud" configurations where an on-premise cluster can expand its capabilities by delegating work to a cloud-base cluster.
+To support running analysis on geographically dispersed data (avoiding expensive data transfers by sending the computation to the data) and "hybrid cloud" configurations where an on-premise cluster can expand its capabilities by delegating work to a cloud-base cluster, Arvados supports federated workflows. In a federated workflow, different steps of a workflow may execute on different clusters. Arvados manages data transfer and delegation of credentials, so this as easy as simply adding cluster target hints to your existing workflow.
h2. Federated scatter/gather example
+
+
{% codeblock as yaml %}
{% include 'federated_cwl' %}
{% endcodeblock %}
hints:
arv:ClusterTarget:
cluster_id: $(inputs.shards.cluster)
+ project_uuid: $(inputs.shards.project)
out: [out]
run: md5sum.cwl
gather-results:
shards:
- cluster: clsr1
+ project: clsr1-j7d0g-qxc4jcji7n4lafx
file:
class: File
location: keep:485df2c5cec3207a32f49c42f1cdcca9+61/file-on-clsr1.dat
- cluster: clsr2
+ project: clsr2-j7d0g-ivdrm1hyym21vkq
file:
class: File
location: keep:ae6e9c3e9bfa52a0122ecb489d8198ff+61/file-on-clsr2.dat
- cluster: clsr3
+ project: clsr3-j7d0g-e3njz2s53lyb0ka
file:
class: File
location: keep:0b43a0ef9ea592d5d7b299978dfa8643+61/file-on-clsr3.dat