Peter Amstutz [Thu, 23 Mar 2017 20:16:33 +0000 (16:16 -0400)]
Merge branch '11325-no-broken-nodes' refs #11323, refs #11324, refs #11325
Peter Amstutz [Thu, 23 Mar 2017 17:30:35 +0000 (13:30 -0400)]
11323: Don't try to offer_arvados_pair on unpaired nodes which are being shut down.
Peter Amstutz [Thu, 23 Mar 2017 18:05:51 +0000 (14:05 -0400)]
11324: Fix crash in NodeManagerDaemonActor when receiving a node_can_shutdown
message for a node that has already been shut down.
Tom Clegg [Thu, 23 Mar 2017 20:10:48 +0000 (16:10 -0400)]
Merge branch '10766-install-ws'
closes #10766
Tom Clegg [Thu, 23 Mar 2017 19:56:44 +0000 (15:56 -0400)]
Merge branch '11065-rotate-logs'
closes #11065
Tom Clegg [Thu, 23 Mar 2017 19:56:16 +0000 (15:56 -0400)]
11065: Avoid scientific notation in PostgreSQL literal.
Tom Clegg [Thu, 23 Mar 2017 19:18:51 +0000 (15:18 -0400)]
11065: Update comments.
Tom Clegg [Thu, 23 Mar 2017 18:51:21 +0000 (14:51 -0400)]
Merge branch '11221-always-restart-services'
closes #11221
Peter Amstutz [Thu, 23 Mar 2017 17:23:54 +0000 (13:23 -0400)]
11325: Remove "broken node" check. Assume that if the node really isn't
functioning, it should be "down" in SLURM anyway. Remove
test_broken_node_not_counted because the broken node check is removed.
Tom Clegg [Thu, 23 Mar 2017 18:05:06 +0000 (14:05 -0400)]
10766: Fix copy-paste error in godoc link.
Tom Clegg [Thu, 23 Mar 2017 17:46:37 +0000 (13:46 -0400)]
10766: Migrate install guide from puma to arvados-ws.
Tom Clegg [Thu, 23 Mar 2017 14:56:34 +0000 (10:56 -0400)]
Merge branch '10669-safe-http-cache'
closes #10669
Tom Clegg [Thu, 23 Mar 2017 14:43:01 +0000 (10:43 -0400)]
10669: Improve error reporting in test case.
Peter Amstutz [Thu, 23 Mar 2017 13:52:39 +0000 (09:52 -0400)]
Merge branch '11255-docker-host-networking' closes #11255
Peter Amstutz [Thu, 23 Mar 2017 13:49:56 +0000 (09:49 -0400)]
11255: Fix whitespace
Peter Amstutz [Wed, 22 Mar 2017 16:56:34 +0000 (12:56 -0400)]
11255: Add documentation about configuration to enable host networking always.
Peter Amstutz [Thu, 23 Mar 2017 13:49:31 +0000 (09:49 -0400)]
11255: Add -container-enable-networking and -container-network-mode options.
Lucas Di Pentima [Thu, 23 Mar 2017 13:33:14 +0000 (10:33 -0300)]
Merge branch '11170-stale-squeue-procs'
Closes #11170
Tom Clegg [Thu, 23 Mar 2017 04:19:38 +0000 (00:19 -0400)]
10669: pep8
Tom Clegg [Thu, 23 Mar 2017 04:16:26 +0000 (00:16 -0400)]
10669: Test that SafeHTTPCache is actually used.
Tom Clegg [Wed, 22 Mar 2017 21:57:47 +0000 (17:57 -0400)]
10669: Only delete *.tmp when cleaning old cache files.
Tom Clegg [Wed, 22 Mar 2017 21:45:32 +0000 (17:45 -0400)]
10669: Delete >2-day old cache entries.
Tom Clegg [Wed, 22 Mar 2017 21:44:52 +0000 (17:44 -0400)]
10669: Bust old (possibly broken) cache entries by naming our files *.tmp
Tom Clegg [Wed, 22 Mar 2017 19:16:37 +0000 (15:16 -0400)]
10669: Robust cache for httplib2
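The cache cleanup rule described in the 10669 entries above (remove entries older than two days, and only touch files named *.tmp) could be sketched roughly as follows. This is an illustration only, not the code from the branch: the function name clean_old_cache_files, the use of file modification time, and the directory layout are all assumptions.

```python
import os
import time

# Two days, per the "Delete >2-day old cache entries" commit message.
MAX_AGE_SECONDS = 2 * 24 * 60 * 60


def clean_old_cache_files(cache_dir, now=None, max_age=MAX_AGE_SECONDS):
    """Remove *.tmp files in cache_dir older than max_age seconds.

    Hypothetical helper: age is judged by mtime, which is an assumption.
    Returns the sorted list of file names that were removed.
    """
    now = time.time() if now is None else now
    removed = []
    for name in os.listdir(cache_dir):
        # Leave anything that is not one of our *.tmp cache files alone.
        if not name.endswith(".tmp"):
            continue
        path = os.path.join(cache_dir, name)
        try:
            if now - os.stat(path).st_mtime > max_age:
                os.unlink(path)
                removed.append(name)
        except OSError:
            # Another process may have removed the file already.
            pass
    return sorted(removed)
```

Restricting deletion to the *.tmp suffix means the cleanup cannot remove unrelated files that happen to share the directory.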
Lucas Di Pentima [Wed, 22 Mar 2017 20:18:59 +0000 (17:18 -0300)]
Merge branch '10218-record-node-info'
Closes #10218
Lucas Di Pentima [Wed, 22 Mar 2017 19:56:58 +0000 (16:56 -0300)]
10218: Avoid running df commands more than necessary. Updated test to check for disk inodes report. Added deferred Close() call on CallRaw reader.
Tom Clegg [Wed, 22 Mar 2017 19:30:52 +0000 (15:30 -0400)]
11065: Merge branch 'master' into 11065-rotate-logs
Lucas Di Pentima [Wed, 22 Mar 2017 19:07:37 +0000 (16:07 -0300)]
10218: Added calls to "df -i" to record free i-nodes. Combined stdout and stderr when running node info commands.
Lucas Di Pentima [Wed, 22 Mar 2017 18:51:13 +0000 (15:51 -0300)]
10218: Ask for disk space on 2 specific filesystems (/ & /tmp) so that the df command does not return an error in special cases.
Lucas Di Pentima [Wed, 22 Mar 2017 18:45:49 +0000 (15:45 -0300)]
10218: Merge branch 'master' into 10218-record-node-info
Lucas Di Pentima [Wed, 22 Mar 2017 18:01:48 +0000 (15:01 -0300)]
10218: Get the container record directly from the API server, instead of Marshaling runner.Container.
Tom Clegg [Wed, 22 Mar 2017 17:33:55 +0000 (13:33 -0400)]
Merge branch '7709-sdk-cli-active_support'
refs #7709
Tom Clegg [Wed, 22 Mar 2017 16:07:48 +0000 (12:07 -0400)]
Fix broken link from crunch2 to crunch1 docs.
refs #6520
Tom Clegg [Wed, 22 Mar 2017 16:04:43 +0000 (12:04 -0400)]
Do not let golang.org/x/tools/cmd/bundle get ahead of Ruby bundler.
refs https://dev.arvados.org/projects/arvados/wiki/Emacs_and_Go
No issue #
Tom Clegg [Sun, 19 Mar 2017 01:12:45 +0000 (21:12 -0400)]
7709: arvados-cli allows activesupport>=4
Lucas Di Pentima [Wed, 22 Mar 2017 14:18:55 +0000 (11:18 -0300)]
11170: Calling close method from an ensure block.
Tom Clegg [Wed, 22 Mar 2017 14:18:43 +0000 (10:18 -0400)]
11065: Defer audit log cleanup until after_commit.
Lucas Di Pentima [Wed, 22 Mar 2017 13:21:17 +0000 (10:21 -0300)]
11170: Updated tests to reflect the use of IO instead of File.
Peter Amstutz [Tue, 21 Mar 2017 15:41:15 +0000 (11:41 -0400)]
Fix test broken by change of default behavior in arv-keepdocker. refs #8567
Tom Clegg [Tue, 21 Mar 2017 15:39:20 +0000 (11:39 -0400)]
11065: Delete old audit logs.
Lucas Di Pentima [Mon, 20 Mar 2017 21:39:43 +0000 (18:39 -0300)]
11170: Treat the squeue/scancel calls as files instead of treating them as processes. Calling close when finished reading from them.
Lucas Di Pentima [Mon, 20 Mar 2017 19:07:40 +0000 (16:07 -0300)]
10218: Merge branch 'master' into 10218-record-node-info
Peter Amstutz [Mon, 20 Mar 2017 19:04:38 +0000 (15:04 -0400)]
If API server doesn't publish supported Docker image versions, make that an
error instead of a warning (can be overridden with --force-image-format)
refs #8567
Lucas Di Pentima [Mon, 20 Mar 2017 18:56:55 +0000 (15:56 -0300)]
11170: Set up a thread to reap the status of squeue runs so that they don't become zombie processes.
Peter Amstutz [Mon, 20 Mar 2017 18:34:17 +0000 (14:34 -0400)]
Merge branch '8567-docker-migrator' refs #8567
Peter Amstutz [Mon, 20 Mar 2017 15:58:37 +0000 (11:58 -0400)]
8567: Bugfix check for existing repo+tag links.
radhika [Mon, 20 Mar 2017 15:04:44 +0000 (11:04 -0400)]
closes #11290
Merge branch '11290-caret-font'
Peter Amstutz [Mon, 20 Mar 2017 14:02:16 +0000 (10:02 -0400)]
8567: arv-keepdocker catches EPIPE so it doesn't print backtrace when piped to
other commands such as "head".
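The EPIPE handling mentioned in the entry above can be illustrated with a small sketch. It is not the arv-keepdocker code; the function name write_lines and its return convention are assumptions made for the example.

```python
import errno
import sys


def write_lines(lines, out=sys.stdout):
    """Write each line to out.

    Returns False if the reader closed the pipe (EPIPE), for example when
    output is piped to "head"; returns True otherwise. Any other I/O error
    is re-raised.
    """
    try:
        for line in lines:
            out.write(line + "\n")
        out.flush()
    except IOError as e:
        # A closed pipe is expected when the consumer exits early, so
        # swallow it quietly instead of printing a backtrace.
        if e.errno == errno.EPIPE:
            return False
        raise
    return True
```

A caller can simply stop producing output when the function returns False.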
Peter Amstutz [Mon, 20 Mar 2017 13:36:17 +0000 (09:36 -0400)]
8567: Add migrate19
radhika [Mon, 20 Mar 2017 10:34:32 +0000 (06:34 -0400)]
11290: Reverted "fa-caret-down" usage back to 'span class="caret"' with href added to it. Working with Firefox and Chrome now.
Peter Amstutz [Fri, 17 Mar 2017 21:54:11 +0000 (17:54 -0400)]
8567: Better command line UI
Peter Amstutz [Fri, 17 Mar 2017 20:36:41 +0000 (16:36 -0400)]
Merge branch '11288-slurm-down-reason' closes #11288
Lucas Di Pentima [Fri, 17 Mar 2017 20:29:56 +0000 (17:29 -0300)]
10218: Save the container record as a JSON file and also write it to the logs.
Peter Amstutz [Fri, 17 Mar 2017 17:27:03 +0000 (13:27 -0400)]
11288: Slurm requires reason to put node in DOWN state.
Lucas Di Pentima [Fri, 17 Mar 2017 14:44:20 +0000 (11:44 -0300)]
Merge branch '11014-hide-node-status'
Closes #11014
Lucas Di Pentima [Fri, 17 Mar 2017 14:16:05 +0000 (11:16 -0300)]
11014: When the PipelineInstance API is off and the show_recent_collections_on_dashboard configuration is also off, the "Recent processes" panel now takes the full screen width.
Corrected test name.
Avoid calling PipelineInstance.api_exist?(:index) more than once.
Peter Amstutz [Thu, 16 Mar 2017 20:52:11 +0000 (16:52 -0400)]
Merge branch '11254-nodemanager-no-actor' closes #11254
Peter Amstutz [Thu, 16 Mar 2017 20:49:32 +0000 (16:49 -0400)]
11254: Refactor _node_states
Peter Amstutz [Thu, 16 Mar 2017 20:11:00 +0000 (16:11 -0400)]
11254: Cloud nodes where "actor is None" are considered to be in shutdown. The
only time it should be "None" is the period between a successful shutdown and
when the node disappears from the cloud node list.
Lucas Di Pentima [Thu, 16 Mar 2017 19:16:34 +0000 (16:16 -0300)]
10218: Use a []string for the entire command instead of splitting it.
Lucas Di Pentima [Thu, 16 Mar 2017 18:18:23 +0000 (15:18 -0300)]
11014: Check that the PipelineIndex#index API exists before rendering the compute node status pane on the Dashboard.
Added related test.
Tom Clegg [Thu, 16 Mar 2017 17:24:58 +0000 (13:24 -0400)]
10218: Wait for container to be started (not just created) before trying to cancel it.
Lucas Di Pentima [Thu, 16 Mar 2017 14:01:19 +0000 (11:01 -0300)]
10218: Split multi-line command output so that each line is written to the logs independently.
Lucas Di Pentima [Wed, 15 Mar 2017 22:21:10 +0000 (19:21 -0300)]
10218: Log node information (cpu, mem, disk) by storing command outputs in the log collection. Added relevant test.
Peter Amstutz [Wed, 15 Mar 2017 15:03:20 +0000 (11:03 -0400)]
8567: Add check that admin token is used and ensure that migration links are
created owned by system user. Also fix tests now that arv-keepdocker uses
logging instead of printing directly to sys.stderr.
radhika [Tue, 14 Mar 2017 20:01:14 +0000 (16:01 -0400)]
closes #11071
Merge branch '11071-fts-perf-test'
radhika [Tue, 14 Mar 2017 20:00:17 +0000 (16:00 -0400)]
11071: formatting on the long if statement
Peter Amstutz [Tue, 14 Mar 2017 19:35:29 +0000 (15:35 -0400)]
Add missing documentation file. refs #6520
Peter Amstutz [Tue, 14 Mar 2017 19:17:49 +0000 (15:17 -0400)]
Merge branch '8567-api-select-docker-fmt' refs #8567
Peter Amstutz [Tue, 14 Mar 2017 19:17:34 +0000 (15:17 -0400)]
Merge branch '8567-cwl-docker-img' refs #8567
Peter Amstutz [Tue, 14 Mar 2017 19:09:00 +0000 (15:09 -0400)]
Merge branch '6520-nodemanager-docs' closes #11123
Peter Amstutz [Tue, 14 Mar 2017 16:44:35 +0000 (12:44 -0400)]
8567: Add note about updating API server configuration.
Peter Amstutz [Tue, 14 Mar 2017 16:39:24 +0000 (12:39 -0400)]
8567: Add docker19 migration instructions to install guide.
Peter Amstutz [Thu, 9 Mar 2017 22:41:55 +0000 (17:41 -0500)]
8567: Fix migrate links to use PDH instead of UUID.
Better error reporting.
Migrate script cleans up /var/lib/docker inside container.
Peter Amstutz [Tue, 7 Mar 2017 15:02:39 +0000 (10:02 -0500)]
8567: Add check for ARVADOS_API_HOST_INSECURE
Peter Amstutz [Tue, 7 Mar 2017 14:26:17 +0000 (09:26 -0500)]
8567: Rename docker19-migrate to migrate-docker19 for consistency with
arv-migrate-docker19. Add docstring to migrate19() function.
Peter Amstutz [Mon, 6 Mar 2017 19:52:55 +0000 (14:52 -0500)]
8567: Move out of tools/ into sdk/python and docker/docker19-migrate.
Peter Amstutz [Mon, 6 Mar 2017 19:31:17 +0000 (14:31 -0500)]
8567: Add status reporting to migrate script.
Peter Amstutz [Mon, 6 Mar 2017 15:39:57 +0000 (10:39 -0500)]
8567: Creates migration links.
Peter Amstutz [Mon, 6 Mar 2017 15:22:52 +0000 (10:22 -0500)]
8567: Docker image migration WIP.
Peter Amstutz [Tue, 14 Mar 2017 16:25:09 +0000 (12:25 -0400)]
8567: Fix tests now that container_image provides docker repo+tag and not PDH.
Peter Amstutz [Tue, 14 Mar 2017 16:03:33 +0000 (12:03 -0400)]
8567: If a search_term looks like a PDH, always treat it as one. Also use
correct optional parameter syntax.
Peter Amstutz [Tue, 14 Mar 2017 15:38:46 +0000 (11:38 -0400)]
6520: Add page with sample ping script. Slightly improve the instructions
for creating a compute node.
Peter Amstutz [Tue, 14 Mar 2017 15:06:43 +0000 (11:06 -0400)]
6520: Add node_mem_scaling to documentation.
Peter Amstutz [Mon, 20 Feb 2017 21:13:29 +0000 (16:13 -0500)]
6520: Add information about setting up SLURM to crunchv2 documentation.
Peter Amstutz [Thu, 16 Feb 2017 22:20:19 +0000 (17:20 -0500)]
6520: Node manager docs WIP
Peter Amstutz [Fri, 10 Mar 2017 19:38:16 +0000 (14:38 -0500)]
8567: Use Docker image repo+tag name instead of PDH so that API server can select correct image format.
radhika [Mon, 13 Mar 2017 22:05:26 +0000 (18:05 -0400)]
11071: test count=none in groups#contents method.
Peter Amstutz [Mon, 13 Mar 2017 21:37:31 +0000 (17:37 -0400)]
8567: Tests default to image format v1 to avoid breaking all the tests that
use the :docker_image collection.
Peter Amstutz [Mon, 13 Mar 2017 21:31:20 +0000 (17:31 -0400)]
8567: Adjust job container resolve test now that images are filtered based on
supported version.
Peter Amstutz [Mon, 13 Mar 2017 20:40:26 +0000 (16:40 -0400)]
8567: Refactor code that queries migration links into get_compatible_images.
Peter Amstutz [Fri, 10 Mar 2017 19:06:22 +0000 (14:06 -0500)]
8567: Add & tweak tests for selecting compatible Docker image format.
Peter Amstutz [Fri, 10 Mar 2017 18:51:49 +0000 (13:51 -0500)]
8567: find_all_for_docker_image() returns only Docker images compatible with Rails.configuration.docker_image_formats. Follows migration links.
Peter Amstutz [Thu, 9 Mar 2017 22:44:26 +0000 (17:44 -0500)]
Bugfix: python Collection class sets _portable_data_hash in _populate_from_api_server
refs #10956
Tom Clegg [Thu, 9 Mar 2017 20:28:36 +0000 (15:28 -0500)]
Merge branch '5036-arv-mount-type'
refs #5036
Tom Clegg [Thu, 9 Mar 2017 19:53:51 +0000 (14:53 -0500)]
5036: Add "--subtype foo" flag to set mounted filesystem type to "fuse.foo".
Peter Amstutz [Thu, 9 Mar 2017 19:44:26 +0000 (14:44 -0500)]
Merge branch '11226-discovery-doc-cache' closes #11226
Peter Amstutz [Thu, 9 Mar 2017 18:49:15 +0000 (13:49 -0500)]
11226: Disable google api client discovery doc caching, use only httplib2 caching.
Google API client has its own caching mechanism. By default, this goes to
/tmp/google-api-python-client-discovery-doc.cache, which is a problem on a
multi-user system. Arvados already provides discovery doc caching via
httplib2.
Peter Amstutz [Wed, 8 Mar 2017 20:33:50 +0000 (15:33 -0500)]
Add ability to use "arvbox start test" to re-run tests without tearing down &
restarting whole container. no issue #
Peter Amstutz [Wed, 8 Mar 2017 20:33:00 +0000 (15:33 -0500)]
Arvbox now uses Go websockets server instead of Puma. no issue #