3 navsection: installguide
4 title: Install Node Manager
7 Copyright (C) The Arvados Authors. All rights reserved.
9 SPDX-License-Identifier: CC-BY-SA-3.0
12 Arvados Node Manager provides elastic computing for Arvados and SLURM by creating and destroying virtual machines on demand. Node Manager currently supports Amazon Web Services (AWS), Google Cloud Platform (GCP) and Microsoft Azure.
14 Note: node manager is only required for elastic computing cloud environments. Fixed size clusters (such as on-premise HPC) do not require node manager.
18 Node manager may run anywhere, however it must be able to communicate with the cloud provider's APIs, and use the command line tools @sinfo@, @squeue@ and @scontrol@ to communicate with the cluster's SLURM controller.
20 On Debian-based systems:
23 <pre><code>~$ <span class="userinput">sudo apt-get install arvados-node-manager</span>
27 On Red Hat-based systems:
30 <pre><code>~$ <span class="userinput">sudo yum install arvados-node-manager</span>
34 h2. Create compute image
36 Configure a virtual machine following the "instructions to set up a compute node.":{{site.baseurl}}/install/crunch2-slurm/install-compute-node.html and set it up to run a "ping script":{{site.baseurl}}/install/install-compute-ping.html at boot.
38 Create a virtual machine image using the commands provided by your cloud provider. We recommend using a tool such as "Packer":https://www.packer.io/ to automate this process.
40 Configure node manager to use the image with the @image@ or @image_id@ parameter.
42 h2. Configure node manager
44 The configuration file at @/etc/arvados-node-manager/config.ini@ . Some configuration details are specific to the cloud provider you are using:
46 * "Amazon Web Services":#aws
47 * "Google Cloud Platform":#gcp
48 * "Microsoft Azure":#azure
50 h3(#aws). Amazon Web Services
53 # EC2 configuration for Arvados Node Manager.
54 # All times are in seconds unless specified otherwise.
57 # The management server responds to http://addr:port/status.json with
58 # a snapshot of internal state.
60 # Management server listening address (default 127.0.0.1)
63 # Management server port number (default -1, server is disabled)
67 # The dispatcher can customize the start and stop procedure for
68 # cloud nodes. For example, the SLURM dispatcher drains nodes
69 # through SLURM before shutting them down.
72 # Node Manager will ensure that there are at least this many nodes running at
73 # all times. If node manager needs to start new idle nodes for the purpose of
74 # satisfying min_nodes, it will use the cheapest node type. However, depending
75 # on usage patterns, it may also satisfy min_nodes by keeping alive some
76 # more-expensive nodes
79 # Node Manager will not start any compute nodes when at least this
83 # Upper limit on rate of spending (in $/hr), will not boot additional nodes
84 # if total price of already running nodes meets or exceeds this threshold.
85 # default 0 means no limit.
88 # Poll EC2 nodes and Arvados for new information every N seconds.
91 # Polls have exponential backoff when services fail to respond.
92 # This is the longest time to wait between polls.
95 # If Node Manager can't succesfully poll a service for this long,
96 # it will never start or stop compute nodes, on the assumption that its
97 # information is too outdated.
98 poll_stale_after = 600
100 # If Node Manager boots a cloud node, and it does not pair with an Arvados
101 # node before this long, assume that there was a cloud bootstrap failure and
102 # shut it down. Note that normal shutdown windows apply (see the Cloud
103 # section), so this should be shorter than the first shutdown window value.
104 boot_fail_after = 1800
106 # "Node stale time" affects two related behaviors.
107 # 1. If a compute node has been running for at least this long, but it
108 # isn't paired with an Arvados node, do not shut it down, but leave it alone.
109 # This prevents the node manager from shutting down a node that might
110 # actually be doing work, but is having temporary trouble contacting the
112 # 2. When the Node Manager starts a new compute node, it will try to reuse
113 # an Arvados node that hasn't been updated for this long.
114 node_stale_after = 14400
116 # Number of consecutive times a node must report as "idle" before it
117 # will be considered eligible for shutdown. Node status is checked
118 # each poll period, and node can go idle at any point during a poll
119 # period (meaning a node could be reported as idle that has only been
120 # idle for 1 second). With a 60 second poll period, three consecutive
121 # status updates of "idle" suggests the node has been idle at least
123 consecutive_idle_count = 3
125 # Scaling factor to be applied to nodes' available RAM size. Usually there's a
126 # variable discrepancy between the advertised RAM value on cloud nodes and the
127 # actual amount available.
128 # If not set, this value will be set to 0.95
129 node_mem_scaling = 0.95
131 # File path for Certificate Authorities
132 certs_file = /etc/ssl/certs/ca-certificates.crt
136 file = /var/log/arvados/node-manager.log
138 # Log level for most Node Manager messages.
139 # Choose one of DEBUG, INFO, WARNING, ERROR, or CRITICAL.
140 # WARNING lets you know when polling a service fails.
141 # INFO additionally lets you know when a compute node is started or stopped.
144 # You can also set different log levels for specific libraries.
145 # Pykka is the Node Manager's actor library.
146 # Setting this to DEBUG will display tracebacks for uncaught
147 # exceptions in the actors, but it's also very chatty.
150 # Setting apiclient to INFO will log the URL of every Arvados API request.
154 host = zyxwv.arvadosapi.com
155 token = ARVADOS_TOKEN
158 # Accept an untrusted SSL certificate from the API server?
164 # It's usually most cost-effective to shut down compute nodes during narrow
165 # windows of time. For example, EC2 bills each node by the hour, so the best
166 # time to shut down a node is right before a new hour of uptime starts.
167 # Shutdown windows define these periods of time. These are windows in
168 # full minutes, separated by commas. Counting from the time the node is
169 # booted, the node WILL NOT shut down for N1 minutes; then it MAY shut down
170 # for N2 minutes; then it WILL NOT shut down for N3 minutes; and so on.
171 # For example, "54, 5, 1" means the node may shut down from the 54th to the
172 # 59th minute of each hour of uptime.
173 # Specify at least two windows. You can add as many as you need beyond that.
174 shutdown_windows = 54, 5, 1
183 # This section defines filters that find compute nodes.
184 # Tags that you specify here will automatically be added to nodes you create.
185 # Replace colons in Amazon filters with underscores
186 # (e.g., write "tag:mytag" as "tag_mytag").
187 instance-state-name = running
188 tag_arvados-class = dynamic-compute
192 # New compute nodes will send pings to Arvados at this host.
193 # You may specify a port, and use brackets to disambiguate IPv6 addresses.
194 ping_host = hostname:port
196 # Give the name of an SSH key on AWS...
199 # ... or a file path for an SSH key that can log in to the compute node.
200 # (One or the other, not both.)
203 # The EC2 IDs of the image and subnet compute nodes should use.
207 # Comma-separated EC2 IDs for the security group(s) assigned to each
209 security_groups = idstring1, idstring2
211 # Apply an Instance Profile ARN to the newly created compute nodes
212 # For more info, see:
213 # https://aws.amazon.com/premiumsupport/knowledge-center/iam-policy-restrict-vpc/
214 # ex_iamprofile = arn:aws:iam::ACCOUNTNUMBER:instance-profile/ROLENAME
217 # You can define any number of Size sections to list EC2 sizes you're
218 # willing to use. The Node Manager should boot the cheapest size(s) that
219 # can run jobs in the queue.
221 # Each size section MUST define the number of cores are available in this
222 # size class (since libcloud does not provide any consistent API for exposing
224 # You may also want to define the amount of scratch space (expressed
225 # in GB) for Crunch jobs. You can also override Amazon's provided
226 # data fields (such as price per hour) by setting them here.
239 h3(#gcp). Google Cloud Platform
242 # Google Compute Engine configuration for Arvados Node Manager.
243 # All times are in seconds unless specified otherwise.
246 # The management server responds to http://addr:port/status.json with
247 # a snapshot of internal state.
249 # Management server listening address (default 127.0.0.1)
252 # Management server port number (default -1, server is disabled)
256 # Node Manager will ensure that there are at least this many nodes running at
257 # all times. If node manager needs to start new idle nodes for the purpose of
258 # satisfying min_nodes, it will use the cheapest node type. However, depending
259 # on usage patterns, it may also satisfy min_nodes by keeping alive some
260 # more-expensive nodes
263 # Node Manager will not start any compute nodes when at least this
264 # running at all times. By default, these will be the cheapest node size.
267 # Poll compute nodes and Arvados for new information every N seconds.
270 # Upper limit on rate of spending (in $/hr), will not boot additional nodes
271 # if total price of already running nodes meets or exceeds this threshold.
272 # default 0 means no limit.
275 # Polls have exponential backoff when services fail to respond.
276 # This is the longest time to wait between polls.
279 # If Node Manager can't succesfully poll a service for this long,
280 # it will never start or stop compute nodes, on the assumption that its
281 # information is too outdated.
282 poll_stale_after = 600
284 # "Node stale time" affects two related behaviors.
285 # 1. If a compute node has been running for at least this long, but it
286 # isn't paired with an Arvados node, do not shut it down, but leave it alone.
287 # This prevents the node manager from shutting down a node that might
288 # actually be doing work, but is having temporary trouble contacting the
290 # 2. When the Node Manager starts a new compute node, it will try to reuse
291 # an Arvados node that hasn't been updated for this long.
292 node_stale_after = 14400
294 # Number of consecutive times a node must report as "idle" before it
295 # will be considered eligible for shutdown. Node status is checked
296 # each poll period, and node can go idle at any point during a poll
297 # period (meaning a node could be reported as idle that has only been
298 # idle for 1 second). With a 60 second poll period, three consecutive
299 # status updates of "idle" suggests the node has been idle at least
301 consecutive_idle_count = 3
303 # Scaling factor to be applied to nodes' available RAM size. Usually there's a
304 # variable discrepancy between the advertised RAM value on cloud nodes and the
305 # actual amount available.
306 # If not set, this value will be set to 0.95
307 node_mem_scaling = 0.95
309 # File path for Certificate Authorities
310 certs_file = /etc/ssl/certs/ca-certificates.crt
314 file = /var/log/arvados/node-manager.log
316 # Log level for most Node Manager messages.
317 # Choose one of DEBUG, INFO, WARNING, ERROR, or CRITICAL.
318 # WARNING lets you know when polling a service fails.
319 # INFO additionally lets you know when a compute node is started or stopped.
322 # You can also set different log levels for specific libraries.
323 # Pykka is the Node Manager's actor library.
324 # Setting this to DEBUG will display tracebacks for uncaught
325 # exceptions in the actors, but it's also very chatty.
328 # Setting apiclient to INFO will log the URL of every Arvados API request.
332 host = zyxwv.arvadosapi.com
333 token = ARVADOS_TOKEN
336 # Accept an untrusted SSL certificate from the API server?
342 # Shutdown windows define periods of time when a node may and may not
343 # be shut down. These are windows in full minutes, separated by
344 # commas. Counting from the time the node is booted, the node WILL
345 # NOT shut down for N1 minutes; then it MAY shut down for N2 minutes;
346 # then it WILL NOT shut down for N3 minutes; and so on. For example,
347 # "54, 5, 1" means the node may shut down from the 54th to the 59th
348 # minute of each hour of uptime.
349 # GCE bills by the minute, and does not provide information about when
350 # a node booted. Node Manager will store this information in metadata
351 # when it boots a node; if that information is not available, it will
352 # assume the node booted at the epoch. These shutdown settings are
353 # very aggressive. You may want to adjust this if you want more
354 # continuity of service from a single node.
355 shutdown_windows = 20, 999999
358 user_id = client_email_address@developer.gserviceaccount.com
359 key = path_to_certificate.pem
360 project = project-id-from-google-cloud-dashboard
363 # Valid location (zone) names: https://cloud.google.com/compute/docs/zones
364 datacenter = us-central1-a
366 # Optional settings. For full documentation see
367 # http://libcloud.readthedocs.org/en/latest/compute/drivers/gce.html#libcloud.compute.drivers.gce.GCENodeDriver
369 # auth_type = SA # SA, IA or GCE
370 # scopes = https://www.googleapis.com/auth/compute
374 # A comma-separated list of tags that must be applied to a node for it to
375 # be considered a compute node.
376 # The driver will automatically apply these tags to nodes it creates.
377 tags = zyxwv, compute
380 # New compute nodes will send pings to Arvados at this host.
381 # You may specify a port, and use brackets to disambiguate IPv6 addresses.
382 ping_host = hostname:port
384 # A file path for an SSH key that can log in to the compute node.
387 # The GCE image name and network zone name to use when creating new nodes.
389 # network = your_network_name
391 # JSON string of service account authorizations for this cluster.
392 # See http://libcloud.readthedocs.org/en/latest/compute/drivers/gce.html#specifying-service-account-scopes
393 # service_accounts = [{'email':'account@example.com', 'scopes':['storage-ro']}]
396 # You can define any number of Size sections to list node sizes you're
397 # willing to use. The Node Manager should boot the cheapest size(s) that
398 # can run jobs in the queue.
400 # The Size fields are interpreted the same way as with a libcloud NodeSize:
401 # http://libcloud.readthedocs.org/en/latest/compute/api.html#libcloud.compute.base.NodeSize
403 # See https://cloud.google.com/compute/docs/machine-types for a list
404 # of known machine types that may be used as a Size parameter.
406 # Each size section MUST define the number of cores are available in this
407 # size class (since libcloud does not provide any consistent API for exposing
409 # You may also want to define the amount of scratch space (expressed
410 # in GB) for Crunch jobs.
411 # You can also override Google's provided data fields (such as price per hour)
412 # by setting them here.
425 h3(#azure). Microsoft Azure
428 # Azure configuration for Arvados Node Manager.
429 # All times are in seconds unless specified otherwise.
432 # The management server responds to http://addr:port/status.json with
433 # a snapshot of internal state.
435 # Management server listening address (default 127.0.0.1)
438 # Management server port number (default -1, server is disabled)
442 # The dispatcher can customize the start and stop procedure for
443 # cloud nodes. For example, the SLURM dispatcher drains nodes
444 # through SLURM before shutting them down.
447 # Node Manager will ensure that there are at least this many nodes running at
448 # all times. If node manager needs to start new idle nodes for the purpose of
449 # satisfying min_nodes, it will use the cheapest node type. However, depending
450 # on usage patterns, it may also satisfy min_nodes by keeping alive some
451 # more-expensive nodes
454 # Node Manager will not start any compute nodes when at least this
458 # Upper limit on rate of spending (in $/hr), will not boot additional nodes
459 # if total price of already running nodes meets or exceeds this threshold.
460 # default 0 means no limit.
463 # Poll Azure nodes and Arvados for new information every N seconds.
466 # Polls have exponential backoff when services fail to respond.
467 # This is the longest time to wait between polls.
470 # If Node Manager can't succesfully poll a service for this long,
471 # it will never start or stop compute nodes, on the assumption that its
472 # information is too outdated.
473 poll_stale_after = 600
475 # If Node Manager boots a cloud node, and it does not pair with an Arvados
476 # node before this long, assume that there was a cloud bootstrap failure and
477 # shut it down. Note that normal shutdown windows apply (see the Cloud
478 # section), so this should be shorter than the first shutdown window value.
479 boot_fail_after = 1800
481 # "Node stale time" affects two related behaviors.
482 # 1. If a compute node has been running for at least this long, but it
483 # isn't paired with an Arvados node, do not shut it down, but leave it alone.
484 # This prevents the node manager from shutting down a node that might
485 # actually be doing work, but is having temporary trouble contacting the
487 # 2. When the Node Manager starts a new compute node, it will try to reuse
488 # an Arvados node that hasn't been updated for this long.
489 node_stale_after = 14400
491 # Number of consecutive times a node must report as "idle" before it
492 # will be considered eligible for shutdown. Node status is checked
493 # each poll period, and node can go idle at any point during a poll
494 # period (meaning a node could be reported as idle that has only been
495 # idle for 1 second). With a 60 second poll period, three consecutive
496 # status updates of "idle" suggests the node has been idle at least
498 consecutive_idle_count = 3
500 # Scaling factor to be applied to nodes' available RAM size. Usually there's a
501 # variable discrepancy between the advertised RAM value on cloud nodes and the
502 # actual amount available.
503 # If not set, this value will be set to 0.95
504 node_mem_scaling = 0.95
506 # File path for Certificate Authorities
507 certs_file = /etc/ssl/certs/ca-certificates.crt
511 file = /var/log/arvados/node-manager.log
513 # Log level for most Node Manager messages.
514 # Choose one of DEBUG, INFO, WARNING, ERROR, or CRITICAL.
515 # WARNING lets you know when polling a service fails.
516 # INFO additionally lets you know when a compute node is started or stopped.
519 # You can also set different log levels for specific libraries.
520 # Pykka is the Node Manager's actor library.
521 # Setting this to DEBUG will display tracebacks for uncaught
522 # exceptions in the actors, but it's also very chatty.
525 # Setting apiclient to INFO will log the URL of every Arvados API request.
529 host = zyxwv.arvadosapi.com
530 token = ARVADOS_TOKEN
533 # Accept an untrusted SSL certificate from the API server?
539 # Shutdown windows define periods of time when a node may and may not be shut
540 # down. These are windows in full minutes, separated by commas. Counting from
541 # the time the node is booted, the node WILL NOT shut down for N1 minutes; then
542 # it MAY shut down for N2 minutes; then it WILL NOT shut down for N3 minutes;
543 # and so on. For example, "20, 999999" means the node may shut down between
544 # the 20th and 999999th minutes of uptime.
545 # Azure bills by the minute, so it makes sense to agressively shut down idle
546 # nodes. Specify at least two windows. You can add as many as you need beyond
548 shutdown_windows = 20, 999999
551 # Use "azure account list" with the azure CLI to get these values.
552 tenant_id = 00000000-0000-0000-0000-000000000000
553 subscription_id = 00000000-0000-0000-0000-000000000000
555 # The following directions are based on
556 # https://azure.microsoft.com/en-us/documentation/articles/resource-group-authenticate-service-principal/
557 # and updated for v2 of the Azure cli tool.
559 # az ad app create --display-name "Node Manager" --homepage "https://arvados.org" --identifier-uris "https://<Your_Application_Uri>" --password <Your_Password> --end-date <Desired_credential_expiry_date>
560 # az ad sp create "<Application_Id>"
561 # az role assignment create --assignee "<Application_Id>" --role Owner --resource-group "<Your_Azure_Arvados_Resource_Group>"
563 # Use <Application_Id> for "key" and the <Your_Password> for "secret"
565 key = 00000000-0000-0000-0000-000000000000
571 # The resource group in which the compute node virtual machines will be created
573 ex_resource_group = ArvadosResourceGroup
576 # The compute node image, as a link to a VHD in Azure blob store.
577 image = https://example.blob.core.windows.net/system/Microsoft.Compute/Images/images/zyxwv-compute-osDisk.vhd
579 # Path to a local ssh key file that will be used to provision new nodes.
580 ssh_key = /home/arvadosuser/.ssh/id_rsa.pub
582 # The account name for the admin user that will be provisioned on new nodes.
583 ex_user_name = arvadosuser
585 # The Azure storage account that will be used to store the node OS disk images.
586 ex_storage_account = arvadosstorage
588 # The virtual network the VMs will be associated with.
589 ex_network = ArvadosNetwork
591 # Optional subnet of the virtual network.
595 tag_arvados-class = dynamic-compute
598 # the API server to ping
599 ping_host = hostname:port
601 # You can define any number of Size sections to list Azure sizes you're willing
602 # to use. The Node Manager should boot the cheapest size(s) that can run jobs
603 # in the queue. You must also provide price per hour as the Azure driver
604 # compute currently does not report prices.
606 # See https://azure.microsoft.com/en-us/pricing/details/virtual-machines/
607 # for a list of known machine types that may be used as a Size parameter.
609 # Each size section MUST define the number of cores are available in this
610 # size class (since libcloud does not provide any consistent API for exposing
612 # You may also want to define the amount of scratch space (expressed
613 # in GB) for Crunch jobs. You can also override Microsoft's provided
614 # data fields by setting them here.
628 $ arvados-node-manager --config /etc/arvados-node-manager/config.ini