X-Git-Url: https://git.arvados.org/arvados.git/blobdiff_plain/6c0bf267d795a3ca49c3258c9490714c9e18d333..fd507a52e72e992a3fd19309de65905341630396:/doc/install/crunch2-slurm/install-test.html.textile.liquid diff --git a/doc/install/crunch2-slurm/install-test.html.textile.liquid b/doc/install/crunch2-slurm/install-test.html.textile.liquid index d51cfceead..ffd75a7793 100644 --- a/doc/install/crunch2-slurm/install-test.html.textile.liquid +++ b/doc/install/crunch2-slurm/install-test.html.textile.liquid @@ -1,54 +1,53 @@ --- layout: default navsection: installguide -title: Test SLURM dispatch +title: Test Slurm dispatch ... +{% comment %} +Copyright (C) The Arvados Authors. All rights reserved. + +SPDX-License-Identifier: CC-BY-SA-3.0 +{% endcomment %} + +{% include 'notebox_begin_warning' %} +@crunch-dispatch-slurm@ is only relevant for on premises clusters that will spool jobs to Slurm. Skip this section if you use LSF or if you are installing a cloud cluster. +{% include 'notebox_end' %} h2. Test compute node setup -You should now be able to submit SLURM jobs that run in Docker containers. On the node where you're running the dispatcher, you can test this by running: +You should now be able to submit Slurm jobs that run in Docker containers. On the node where you're running the dispatcher, you can test this by running:
~$ sudo -u crunch srun -N1 docker run busybox echo OK
 
-If it works, this command should print @OK@ (it may also show some status messages from SLURM and/or Docker). If it does not print @OK@, double-check your compute node setup, and that the @crunch@ user can submit SLURM jobs. +If it works, this command should print @OK@ (it may also show some status messages from Slurm and/or Docker). If it does not print @OK@, double-check your compute node setup, and that the @crunch@ user can submit Slurm jobs. h2. Test the dispatcher +Make sure all of your compute nodes are set up with "Docker":../crunch2/install-compute-node-docker.html or "Singularity":../crunch2/install-compute-node-singularity.html. + On the dispatch node, start monitoring the crunch-dispatch-slurm logs: -
~$ sudo journalctl -o cat -fu crunch-dispatch-slurm.service
+
# journalctl -o cat -fu crunch-dispatch-slurm.service
 
-*On your shell server*, submit a simple container request: +In another terminal window, use the diagnostics tool to run a simple container. -
shell:~$ arv container_request create --container-request '{
-  "name":            "test",
-  "state":           "Committed",
-  "priority":        1,
-  "container_image": "arvados/jobs:latest",
-  "command":         ["echo", "Hello, Crunch!"],
-  "output_path":     "/out",
-  "mounts": {
-    "/out": {
-      "kind":        "tmp",
-      "capacity":    1000
-    }
-  },
-  "runtime_constraints": {
-    "vcpus": 1,
-    "ram": 8388608
-  }
-}'
+
# arvados-client sudo diagnostics
+INFO       5: running health check (same as `arvados-server check`)
+INFO      10: getting discovery document from https://zzzzz.arvadosapi.com/discovery/v1/apis/arvados/v1/rest
+...
+INFO     160: running a container
+INFO      ... container request submitted, waiting up to 10m for container to run
 
-This command should return a record with a @container_uuid@ field. Once crunch-dispatch-slurm polls the API server for new containers to run, you should see it dispatch that same container. It will log messages like: +Once @crunch-dispatch-slurm@ polls the API server for new containers to run, you should see it dispatch the new container. It will log messages like:
2016/08/05 13:52:54 Monitoring container zzzzz-dz642-hdp2vpu9nq14tx0 started
@@ -57,9 +56,7 @@ This command should return a record with a @container_uuid@ field.  Once crunch-
 
-If you do not see crunch-dispatch-slurm try to dispatch the container, double-check that it is running and that the API hostname and token in @/etc/arvados/crunch-dispatch-slurm/crunch-dispatch-slurm.yml@ are correct. - -Before the container finishes, SLURM's @squeue@ command will show the new job in the list of queued and running jobs. For example, you might see: +Before the container finishes, Slurm's @squeue@ command will show the new job in the list of queued and running jobs. For example, you might see:
~$ squeue --long
@@ -101,9 +98,9 @@ You can use standard Keep tools to view the container's output and logs from the
 ./crunch-run.txt
 ./stderr.txt
 ./stdout.txt
-~$ arv keep get a01df2f7e5bc1c2ad59c60a837e90dc6+166/stdout.txt
+~$ arv-get a01df2f7e5bc1c2ad59c60a837e90dc6+166/stdout.txt
 2016-08-05T13:53:06.201011Z Hello, Crunch!
 
-If the container does not dispatch successfully, refer to the crunch-dispatch-slurm logs for information about why it failed. +If the container does not dispatch successfully, refer to the @crunch-dispatch-slurm@ logs for information about why it failed.