radhika [Fri, 24 Feb 2017 05:54:24 +0000 (00:54 -0500)]
Merge branch 'master' into 10979-cancelled-job-nodes
radhika [Fri, 24 Feb 2017 05:52:18 +0000 (00:52 -0500)]
10979: ruby way of doing it
Lucas Di Pentima [Thu, 23 Feb 2017 20:56:24 +0000 (17:56 -0300)]
11002: Don't save the state and log the stack trace before quitting upon catching an exception. Also, when receiving SIGINT (KeyboardInterrupt), just quit without any logging.
Updated tests to reflect this new behaviour.
Tom Clegg [Thu, 23 Feb 2017 19:04:42 +0000 (14:04 -0500)]
11156: Fix infinite loop condition.
closes #11156
Tom Clegg [Wed, 22 Feb 2017 21:36:50 +0000 (16:36 -0500)]
Merge branch '7995-keep-balance-docs'
closes #7995
Tom Clegg [Wed, 22 Feb 2017 21:33:35 +0000 (16:33 -0500)]
Remove pidfiles after shutting down test servers.
No issue #
Peter Amstutz [Wed, 22 Feb 2017 21:28:22 +0000 (16:28 -0500)]
10629: Don't flush dirhandles.
Peter Amstutz [Wed, 22 Feb 2017 21:08:41 +0000 (16:08 -0500)]
10629: Make tracking and dirtying of _committed flag efficient.
Peter Amstutz [Wed, 22 Feb 2017 20:40:22 +0000 (20:40 +0000)]
10629: improve debug logging
--debug includes Keep logging.
--logfile includes timestamps.
radhika [Wed, 22 Feb 2017 19:49:37 +0000 (14:49 -0500)]
Merge branch 'master' into 10979-cancelled-job-nodes
radhika [Wed, 22 Feb 2017 19:48:47 +0000 (14:48 -0500)]
10979: refactor squeue invocations
Lucas Di Pentima [Wed, 22 Feb 2017 18:26:16 +0000 (15:26 -0300)]
11002: Track this specific error with its own exception class, for future-proofing.
Tom Clegg [Wed, 22 Feb 2017 18:19:09 +0000 (13:19 -0500)]
Merge branch '11097-reuse-impure'
closes #11097
Tom Clegg [Wed, 22 Feb 2017 16:45:35 +0000 (11:45 -0500)]
7995: Add note about one keep-balance process at a time.
Tom Clegg [Wed, 22 Feb 2017 16:29:48 +0000 (11:29 -0500)]
7995: Fix up dry-run instructions.
Tom Clegg [Wed, 22 Feb 2017 16:24:13 +0000 (11:24 -0500)]
7995: Fix up inconsistent "e.g." vs. "e.g.,".
Tom Clegg [Wed, 22 Feb 2017 16:02:32 +0000 (11:02 -0500)]
7995: Copy edits.
Tom Clegg [Wed, 22 Feb 2017 15:29:31 +0000 (10:29 -0500)]
Fix dispatch panic when processing an update after tracker has been closed/deleted.
refs #11151
Tom Morris [Tue, 21 Feb 2017 21:35:00 +0000 (16:35 -0500)]
A few copy edits
Tom Morris [Tue, 21 Feb 2017 21:34:07 +0000 (16:34 -0500)]
Document epydoc dependency
Tom Clegg [Tue, 21 Feb 2017 20:27:54 +0000 (15:27 -0500)]
11097: Clarify reuse query.
Tom Clegg [Tue, 21 Feb 2017 20:27:37 +0000 (15:27 -0500)]
11097: Update docs to reflect new container reuse behavior.
radhika [Tue, 21 Feb 2017 18:29:58 +0000 (13:29 -0500)]
10979: scancel orphaned job nodes in crunch1.
Lucas Di Pentima [Tue, 21 Feb 2017 17:41:58 +0000 (14:41 -0300)]
11002: Do not try to save internal state when receiving a KeyboardInterrupt exception.
Updated test accordingly.
Lucas Di Pentima [Tue, 21 Feb 2017 16:22:07 +0000 (13:22 -0300)]
11002: Merge branch 'master' into 11002-arvput-crash-fix
Lucas Di Pentima [Tue, 21 Feb 2017 13:00:19 +0000 (10:00 -0300)]
11002: When trying to save the cache's state before quitting, if an exception
is caught because of a BlockManager problem induced by an interruption,
print a warning message and quit without saving the last checkpoint.
Lucas Di Pentima [Tue, 21 Feb 2017 12:54:07 +0000 (09:54 -0300)]
11002: Added missing assertion to test.
Lucas Di Pentima [Tue, 21 Feb 2017 11:41:07 +0000 (08:41 -0300)]
11002: Improved test mocking a more suitable method and catching the specific exception type.
Tom Clegg [Mon, 20 Feb 2017 22:03:10 +0000 (17:03 -0500)]
7995: Add "dry run" note.
Peter Amstutz [Mon, 20 Feb 2017 21:18:44 +0000 (16:18 -0500)]
Merge branch '6520-pending-reason' refs #6520
Peter Amstutz [Mon, 20 Feb 2017 18:54:47 +0000 (13:54 -0500)]
6520: Add ReqNodeNotAvail to list of reasons (along with "Resources") to boot a new node.
Tom Clegg [Mon, 20 Feb 2017 20:45:40 +0000 (15:45 -0500)]
11097: Update test to match new behavior.
Tom Clegg [Mon, 20 Feb 2017 20:34:15 +0000 (15:34 -0500)]
11097: Merge branch 'master' into 11097-reuse-impure
Lucas Di Pentima [Mon, 20 Feb 2017 20:25:05 +0000 (17:25 -0300)]
11002: Added test to make the bug happen.
Tom Clegg [Mon, 20 Feb 2017 15:41:52 +0000 (10:41 -0500)]
7995: Add keep-balance to install guide.
Tom Clegg [Fri, 17 Feb 2017 22:32:53 +0000 (17:32 -0500)]
Merge branch '11127-delete-trash-with-links'
refs #11127
Lucas Di Pentima [Fri, 17 Feb 2017 22:18:54 +0000 (19:18 -0300)]
Merge branch '11121-crunch-output-collection-owner'
Closes #11121
Lucas Di Pentima [Fri, 17 Feb 2017 22:17:40 +0000 (19:17 -0300)]
11121: Merge branch 'master' into 11121-crunch-output-collection-owner
Tom Clegg [Fri, 17 Feb 2017 21:17:26 +0000 (16:17 -0500)]
11127: Delete dependent links too when emptying trash.
radhika [Fri, 17 Feb 2017 20:15:14 +0000 (15:15 -0500)]
10979: TrackContainer implementation and test
Tom Clegg [Fri, 17 Feb 2017 19:49:13 +0000 (14:49 -0500)]
11127: Do not crash server if SweepTrashedCollections thread has an exception.
radhika [Thu, 16 Feb 2017 23:22:07 +0000 (18:22 -0500)]
Merge branch 'master' into 10979-cancelled-job-nodes
Peter Amstutz [Thu, 16 Feb 2017 21:44:22 +0000 (16:44 -0500)]
Merge branch '6520-nodemanager-crunchv2' refs #6520
Peter Amstutz [Thu, 16 Feb 2017 21:43:06 +0000 (16:43 -0500)]
6520: Crunch v1 recognizes 'mix' and 'drng' as busy states for
consistency (won't affect node manager behavior going forward, however.)
Peter Amstutz [Thu, 16 Feb 2017 21:34:00 +0000 (16:34 -0500)]
Merge branch '6520-skip-compute0' refs #6520
Peter Amstutz [Thu, 16 Feb 2017 21:31:43 +0000 (16:31 -0500)]
6520: Expand recognized 'busy' states.
Lucas Di Pentima [Thu, 16 Feb 2017 21:11:09 +0000 (18:11 -0300)]
11121: Added test.
radhika [Thu, 16 Feb 2017 21:07:02 +0000 (16:07 -0500)]
Merge branch 'master' into 10979-cancelled-job-nodes
Peter Amstutz [Thu, 16 Feb 2017 20:13:40 +0000 (15:13 -0500)]
Merge branch '9277-trash-container-outputs' closes #9277
Peter Amstutz [Thu, 16 Feb 2017 20:10:34 +0000 (15:10 -0500)]
6520: Explicitly mark node as 'down' before shutting down.
radhika [Thu, 16 Feb 2017 20:04:39 +0000 (15:04 -0500)]
closes #11104
Merge branch '11104-work-unit-expand-on-caret'
radhika [Thu, 16 Feb 2017 19:08:40 +0000 (14:08 -0500)]
11104: add fa-caret-down class to hyperlink rather than adding a span with icon.
Peter Amstutz [Thu, 16 Feb 2017 17:51:38 +0000 (12:51 -0500)]
9277: Include trashed collections when creating copy collection for container
request.
radhika [Thu, 16 Feb 2017 17:49:09 +0000 (12:49 -0500)]
closes #11103
Merge branch '11103-fts-crs'
radhika [Thu, 16 Feb 2017 17:48:11 +0000 (12:48 -0500)]
11103: ensure the index exists before attempting to drop it.
Peter Amstutz [Thu, 16 Feb 2017 17:00:30 +0000 (12:00 -0500)]
9277: crunch-run Log and output collections set is_trashed to true instead of
setting trash_at.
radhika [Thu, 16 Feb 2017 17:27:51 +0000 (12:27 -0500)]
11103: Omit mounts from container_requests and definition from workflows for full text search index.
Lucas Di Pentima [Thu, 16 Feb 2017 17:00:59 +0000 (14:00 -0300)]
11121: Add owner_uuid to output collection on crunch-job
Tom Clegg [Thu, 16 Feb 2017 15:17:34 +0000 (10:17 -0500)]
Merge branch '11017-docker-migration'
closes #11017
Tom Clegg [Wed, 15 Feb 2017 22:41:21 +0000 (17:41 -0500)]
Merge branch '10797-ruby-2.3'
closes #10797
Tom Clegg [Wed, 15 Feb 2017 22:40:40 +0000 (17:40 -0500)]
10797: Merge branch 'master' into 10797-ruby-2.3
Tom Clegg [Wed, 15 Feb 2017 22:40:16 +0000 (17:40 -0500)]
10797: Ruby 2.3.3
Ward Vandewege [Wed, 15 Feb 2017 22:35:40 +0000 (17:35 -0500)]
Update golang version to 1.7.5. Also make the equivalent changes to the
(new) ubuntu1604 build scripts.
refs #10797
Ward Vandewege [Wed, 15 Feb 2017 22:25:24 +0000 (17:25 -0500)]
Merge branch 'master' into 10797-ruby-2.3
refs #10797
Peter Amstutz [Wed, 15 Feb 2017 20:15:09 +0000 (15:15 -0500)]
6520: Don't ever assign compute0, start at slot 1 when assigning hostname.
Peter Amstutz [Wed, 15 Feb 2017 20:08:39 +0000 (15:08 -0500)]
6520: Check that hostname is set before setting weight.
Peter Amstutz [Wed, 15 Feb 2017 19:56:16 +0000 (14:56 -0500)]
6520: Apply node size fixup in CloudNodeListMonitorActor
Tom Clegg [Wed, 15 Feb 2017 19:47:57 +0000 (14:47 -0500)]
11097: Drop check for conflicting outputs when reusing completed containers.
Peter Amstutz [Wed, 15 Feb 2017 17:28:36 +0000 (12:28 -0500)]
6520: Update cloud_node.size to match server calculator configuration.
Tom Clegg [Wed, 15 Feb 2017 16:30:29 +0000 (11:30 -0500)]
11017: Add arv-migrate-docker19 to package.
Peter Amstutz [Wed, 15 Feb 2017 16:29:25 +0000 (11:29 -0500)]
6520: Assign weight to node based on price to preferentially schedule on
cheaper nodes.
radhika [Tue, 14 Feb 2017 21:26:39 +0000 (16:26 -0500)]
10979: trackContainers func in crunch-dispatch-slurm.go
Peter Amstutz [Tue, 14 Feb 2017 21:22:09 +0000 (16:22 -0500)]
6520: Add testcase using sinfo to set crunch_worker_state
Peter Amstutz [Tue, 14 Feb 2017 21:01:21 +0000 (16:01 -0500)]
6520: Add testcase getting wishlist from squeue.
Peter Amstutz [Mon, 6 Feb 2017 19:00:53 +0000 (14:00 -0500)]
6520: Call squeue to include slurm jobs in wishlist. Call sinfo to update
crunch_worker_state in arvados node records.
Tom Clegg [Tue, 14 Feb 2017 20:16:22 +0000 (15:16 -0500)]
11017: When compute nodes use image format v2, prefer migrated docker images.
Peter Amstutz [Tue, 14 Feb 2017 20:02:49 +0000 (15:02 -0500)]
Merge branch 'arvbox-docker-imgfmt' no issue #
Peter Amstutz [Tue, 14 Feb 2017 20:02:26 +0000 (15:02 -0500)]
Update test_with_arvbox script.
Peter Amstutz [Tue, 14 Feb 2017 19:11:20 +0000 (14:11 -0500)]
Set docker_image_formats to v1.
Peter Amstutz [Tue, 14 Feb 2017 17:09:13 +0000 (12:09 -0500)]
Merge branch 'fix-arvbox' no issue #
Peter Amstutz [Tue, 14 Feb 2017 16:05:56 +0000 (11:05 -0500)]
Fix arvbox build.
* Set up fuse groups separately.
* Separate generating superuser token from writing file so as not to write file on failure.
Tom Clegg [Tue, 14 Feb 2017 16:08:49 +0000 (11:08 -0500)]
Merge branch '10701-refactor-dispatch'
closes #10701
Tom Clegg [Tue, 14 Feb 2017 15:50:19 +0000 (10:50 -0500)]
10701: Update error text in test.
Tom Clegg [Tue, 14 Feb 2017 15:42:12 +0000 (10:42 -0500)]
10701: Improve comments and identifiers.
Tom Clegg [Mon, 13 Feb 2017 22:18:11 +0000 (17:18 -0500)]
11017: Add arv-migrate-docker19
Peter Amstutz [Mon, 13 Feb 2017 21:19:52 +0000 (16:19 -0500)]
Merge branch '9397-cwl-initialworkdir-crunchv2' closes #9397
Peter Amstutz [Mon, 13 Feb 2017 20:20:00 +0000 (15:20 -0500)]
9397: Improve container InitialWorkDir test.
Peter Amstutz [Fri, 10 Feb 2017 21:29:02 +0000 (16:29 -0500)]
9397: Fix keep URI splitting. Fix error when container has error log
collection with single file.
Peter Amstutz [Wed, 8 Feb 2017 16:11:42 +0000 (11:11 -0500)]
9397: arvados-cwl-runner implementation of InitialWorkDir on container API.
Peter Amstutz [Mon, 13 Feb 2017 19:34:15 +0000 (14:34 -0500)]
Update schema-salad pin. refs #10993
Peter Amstutz [Mon, 13 Feb 2017 18:03:47 +0000 (13:03 -0500)]
Update cwltool pin. closes #10993
Peter Amstutz [Mon, 13 Feb 2017 17:29:55 +0000 (12:29 -0500)]
Merge branch '9397-prepopulate-output-directory' refs #9397
Peter Amstutz [Mon, 13 Feb 2017 17:01:41 +0000 (12:01 -0500)]
Merge branch '10846-nodemanager-errors' closes #10846
Tom Clegg [Fri, 10 Feb 2017 23:03:10 +0000 (18:03 -0500)]
Merge branch '11088-keep-cache-units'
closes #11088
Tom Clegg [Fri, 10 Feb 2017 22:08:46 +0000 (17:08 -0500)]
Merge branch '11070-ws-listener-problem'
closes #11070
radhika [Fri, 10 Feb 2017 21:36:13 +0000 (16:36 -0500)]
closes #10903
Merge branch '10903-wb-cancel-cascade'
Tom Clegg [Fri, 10 Feb 2017 21:21:37 +0000 (16:21 -0500)]
11088: Fix keep_cache_ram units.
Lucas Di Pentima [Fri, 10 Feb 2017 21:00:22 +0000 (18:00 -0300)]
Merge branch '10956-arvput-pdh-fix'
Closes #10956
Lucas Di Pentima [Fri, 10 Feb 2017 19:18:21 +0000 (16:18 -0300)]
Merge branch '3900-trash-collections-delete-api'
Closes #3900
radhika [Fri, 10 Feb 2017 19:15:11 +0000 (14:15 -0500)]
closes #10945
Merge branch '10945-used-in-section-for-cwl-jobs'