Peter Amstutz [Fri, 10 Feb 2017 16:04:23 +0000 (11:04 -0500)]
Merge branch 'master' into origin-9397-prepopulate-output-directory
Peter Amstutz [Fri, 10 Feb 2017 16:03:42 +0000 (11:03 -0500)]
Merge branch '9397-go-manifest' refs #9397
Peter Amstutz [Fri, 10 Feb 2017 16:02:33 +0000 (11:02 -0500)]
9397: Update comment & tests for CollectionFileReader to reflect that it is
more lenient in the paths it accepts, as a result of updates to the
implementation of manifest.FileSegmentIterByName.
Peter Amstutz [Thu, 9 Feb 2017 21:58:23 +0000 (16:58 -0500)]
9397: Tweak method documentation for Extract()
Peter Amstutz [Thu, 9 Feb 2017 21:55:10 +0000 (16:55 -0500)]
9397: Update method documentation for Extract()
Peter Amstutz [Thu, 9 Feb 2017 21:46:21 +0000 (16:46 -0500)]
9397: Fix major bug in firstBlock(). Refactor ManifestTextForPath() to
Extract(). Test returning errors from Extract().
Peter Amstutz [Thu, 9 Feb 2017 19:28:26 +0000 (14:28 -0500)]
9397: Fix collection.go to use blockdigest.LocatorPattern
Peter Amstutz [Thu, 9 Feb 2017 19:28:26 +0000 (14:28 -0500)]
9397: Fix collection.go to use blockdigest.LocatorPattern
Peter Amstutz [Thu, 9 Feb 2017 19:25:19 +0000 (14:25 -0500)]
9397: Use ManifestTextForPath
Peter Amstutz [Thu, 9 Feb 2017 19:11:53 +0000 (14:11 -0500)]
Merge branch '9397-go-manifest' into origin-9397-prepopulate-output-directory
Peter Amstutz [Thu, 9 Feb 2017 19:11:35 +0000 (14:11 -0500)]
9397: Remove conflicting code for easier merge
Peter Amstutz [Thu, 9 Feb 2017 18:25:40 +0000 (13:25 -0500)]
9397: Use idiomatic loop form.
Peter Amstutz [Thu, 9 Feb 2017 18:21:28 +0000 (13:21 -0500)]
9397: Move most functions into private API.
* Document ManifestTextForPath().
* Check for invalid file segments, and fix tests that were using invalid file segments(!)
Tom Clegg [Thu, 9 Feb 2017 08:35:59 +0000 (03:35 -0500)]
Merge branch '10700-dispatch'
refs #10700
Tom Clegg [Thu, 9 Feb 2017 07:28:47 +0000 (02:28 -0500)]
10700: Fix error message.
Peter Amstutz [Mon, 6 Feb 2017 22:16:01 +0000 (17:16 -0500)]
9397: Add manifest normalization and sub-manifest extraction by path.
Introduces "SegmentedManifest" which stores streams -> files -> file segments.
Enables reexport of manifest in normalized form, as well as extraction of
individual files, streams or sets of streams. Also adds binary search for
efficiently determining first block to access for some stream offset.
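
The binary search described in this commit can be sketched as follows. This is a minimal Python illustration, not the Go code from the commit; `first_block` and its list-of-sizes input are hypothetical names chosen for the example, and it assumes the stream is described by its block lengths in order.

```python
import bisect
from itertools import accumulate


def first_block(block_sizes, offset):
    """Return (block_index, offset_within_block) for a stream offset.

    block_sizes is a list of block lengths in stream order
    (a hypothetical input shape used only for this sketch).
    """
    total = sum(block_sizes)
    if offset < 0 or offset >= total:
        raise ValueError("offset outside stream")
    # ends[i] is the stream offset one past the last byte of block i.
    ends = list(accumulate(block_sizes))
    # Binary search for the first block whose end lies beyond the offset.
    i = bisect.bisect_right(ends, offset)
    start = ends[i] - block_sizes[i]
    return i, offset - start
```

For example, with blocks of 10, 20 and 30 bytes, offset 10 falls at the very start of the second block, so `first_block([10, 20, 30], 10)` returns `(1, 0)`.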
radhika [Mon, 6 Feb 2017 21:56:38 +0000 (16:56 -0500)]
9397: add pre-population of output directory documentation
radhika [Mon, 6 Feb 2017 03:41:34 +0000 (22:41 -0500)]
Merge branch 'master' into 9397-prepopulate-output-directory
radhika [Mon, 6 Feb 2017 03:40:00 +0000 (22:40 -0500)]
9397: add normalized manifest test
radhika [Mon, 6 Feb 2017 02:03:11 +0000 (21:03 -0500)]
9397: add handling of denormalized manifest
radhika [Mon, 6 Feb 2017 00:14:48 +0000 (19:14 -0500)]
9397: NormalizeManifest
Lucas Di Pentima [Fri, 3 Feb 2017 22:34:49 +0000 (19:34 -0300)]
Merge branch '10968-arvput-resume-progress'
Closes #10968
Lucas Di Pentima [Fri, 3 Feb 2017 22:34:02 +0000 (19:34 -0300)]
10968: Deleted space char at the end of a comment
radhika [Fri, 3 Feb 2017 21:00:33 +0000 (16:00 -0500)]
9397: get manifest segment for a subdir or file using manifest from sdk.
Lucas Di Pentima [Fri, 3 Feb 2017 19:21:24 +0000 (16:21 -0300)]
10968: Added a notification when uploading at least one directory, to let the user know that computing the expected byte count can take some time when uploading many files.
Tom Clegg [Fri, 3 Feb 2017 18:15:43 +0000 (13:15 -0500)]
Merge branch '10969-check-docker-format'
closes #10969
Lucas Di Pentima [Fri, 3 Feb 2017 17:52:16 +0000 (14:52 -0300)]
10968: Changed the periodic update thread to run every second while arv-put is checking which files to skip, only notifying the user via the progress indicator.
When it starts uploading the rest of the files, the update thread returns to its previous behaviour, running once every minute and checkpointing to the cache.
Lucas Di Pentima [Fri, 3 Feb 2017 15:38:52 +0000 (12:38 -0300)]
Merge branch '10932-arvput-slow-resuming'
Closes #10932
Lucas Di Pentima [Fri, 3 Feb 2017 15:09:58 +0000 (12:09 -0300)]
10932: Changed _file_paths from a list to a set so that it is not copied when checking for files missing from the local collection at resume start.
Added comments on cache saving explaining why it is better to use json.dumps() instead of copy.deepcopy().
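
The benefit of a set for this kind of check can be sketched as below. This is an illustration only, assuming the check is a membership test per path; `missing_from_local` and its arguments are hypothetical names, not part of arv-put.

```python
def missing_from_local(collection_paths, local_paths):
    """Return collection paths that are absent from the local file list."""
    # Build the set once; each membership test is then O(1) on average,
    # instead of a linear scan through a list for every path.
    local = set(local_paths)
    return [p for p in collection_paths if p not in local]
```

With a list, the same loop would scan the whole local file list for every collection path, which grows quadratically on large uploads.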
radhika [Fri, 3 Feb 2017 00:16:25 +0000 (19:16 -0500)]
9397: Use manifest.FileSegmentForPath to get manifest segment for a file path. Cache collections to avoid fetching
the same collection repeatedly. If no manifest segment found for a mounted path, log that fact.
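
The caching idea in this commit can be sketched as a small memoizing wrapper. The original change is in Go; this Python version is only an illustration, and `CollectionCache` and the caller-supplied `fetch` function are hypothetical.

```python
class CollectionCache:
    """Fetch each collection at most once.

    fetch is a caller-supplied function taking a collection identifier
    and returning the collection record (hypothetical for this sketch).
    """

    def __init__(self, fetch):
        self._fetch = fetch
        self._cache = {}

    def get(self, uuid):
        # Only call the (possibly slow) fetch function on a cache miss.
        if uuid not in self._cache:
            self._cache[uuid] = self._fetch(uuid)
        return self._cache[uuid]
```

Several mounts that refer to the same collection would then trigger a single fetch.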
Tom Clegg [Thu, 2 Feb 2017 22:32:31 +0000 (17:32 -0500)]
10969: Add docker_image_formats server config, and corresponding check in `arv keep docker`.
Lucas Di Pentima [Thu, 2 Feb 2017 22:10:45 +0000 (19:10 -0300)]
10932: Replaced a list with a set when checking whether files in the local collection are on the local file list, which greatly reduces resume start time.
Also, the save_state method was spending too much time on two operations: deepcopy() and json.dump(). Replaced both with a single call to json.dumps(), which is a lot faster than json.dump().
This improves overall performance when uploading large file collections.
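
One way to read the save_state change is that serializing under a lock already yields an independent snapshot, so a separate deepcopy() is unnecessary. The sketch below illustrates that reading; `StateCache` and its fields are hypothetical and do not reproduce the arv-put implementation.

```python
import json
import threading


class StateCache:
    """Hypothetical state holder, used only to illustrate the idea."""

    def __init__(self):
        self._lock = threading.Lock()
        self._state = {"files": {}, "manifest": None}

    def record(self, path, size):
        with self._lock:
            self._state["files"][path] = {"size": size}

    def snapshot(self):
        # json.dumps() returns an immutable string, so later changes to
        # self._state cannot affect it; no deepcopy() is needed first.
        with self._lock:
            return json.dumps(self._state)
```

The returned string can be written to the cache file after the lock is released.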
Tom Clegg [Wed, 1 Feb 2017 21:31:06 +0000 (16:31 -0500)]
Merge branch '10990-keep-web-ranges'
closes #10990
Tom Clegg [Wed, 1 Feb 2017 21:29:16 +0000 (16:29 -0500)]
10990: Add Seek tests.
radhika [Wed, 1 Feb 2017 16:50:42 +0000 (11:50 -0500)]
9397: Add ExcludeFromOutput to Mount struct.
Tom Clegg [Wed, 1 Feb 2017 16:46:26 +0000 (11:46 -0500)]
Merge branch '10705-dedup-log-message'
refs #10705
Tom Clegg [Wed, 1 Feb 2017 16:38:29 +0000 (11:38 -0500)]
10705: Fix double-logged error.
Tom Clegg [Tue, 31 Jan 2017 21:32:05 +0000 (16:32 -0500)]
Merge branch '10700-dispatch'
closes #10700
refs #10701
closes #10702
closes #10703
closes #10704
Conflicts:
services/crunch-dispatch-slurm/crunch-dispatch-slurm.go
Lucas Di Pentima [Tue, 31 Jan 2017 17:27:03 +0000 (14:27 -0300)]
Merge branch '10694-provenance-graph-consistency'
Closes #10694
Tom Clegg [Tue, 31 Jan 2017 15:52:29 +0000 (10:52 -0500)]
Merge branch '10868-ip-addr-conflict'
refs #10868
Lucas Di Pentima [Tue, 31 Jan 2017 15:13:33 +0000 (12:13 -0300)]
Merge branch '10813-arv-put-six-threads'
Closes #10813
radhika [Tue, 31 Jan 2017 14:39:43 +0000 (09:39 -0500)]
9397: update comments
Tom Clegg [Tue, 31 Jan 2017 00:19:49 +0000 (19:19 -0500)]
10700: Rename squeue identifiers (sqCheck = SqueueChecker{})
Tom Clegg [Mon, 30 Jan 2017 23:32:49 +0000 (18:32 -0500)]
10700: Simplify squeue checker.
Lucas Di Pentima [Mon, 30 Jan 2017 21:29:10 +0000 (18:29 -0300)]
10813: Merge branch 'master' into 10813-arv-put-six-threads
Lucas Di Pentima [Mon, 30 Jan 2017 21:26:35 +0000 (18:26 -0300)]
10813: Added only_committed argument to ArvFile.manifest_text() method to maintain consistency with its counterpart from Collection class.
Lucas Di Pentima [Mon, 30 Jan 2017 21:25:34 +0000 (18:25 -0300)]
10813: Removed unnecessary argument
radhika [Mon, 30 Jan 2017 21:20:24 +0000 (16:20 -0500)]
9397: if mount.Path ends with "/", trim it.
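
A minimal sketch of this normalization, in Python rather than the Go of the original change. `trim_mount_path` is a hypothetical helper, and leaving a bare "/" untouched is an assumption made for the example.

```python
def trim_mount_path(path):
    """Drop one trailing "/" from a mount path.

    A bare "/" is returned unchanged (an assumption of this sketch).
    """
    if len(path) > 1 and path.endswith("/"):
        return path[:-1]
    return path
```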
radhika [Mon, 30 Jan 2017 20:59:29 +0000 (15:59 -0500)]
Merge branch 'master' into 9397-prepopulate-output-directory
radhika [Mon, 30 Jan 2017 20:58:28 +0000 (15:58 -0500)]
9397: more testing
Tom Clegg [Mon, 30 Jan 2017 20:17:52 +0000 (15:17 -0500)]
10868: Remove old DNS entry immediately when a new node re-uses an old node's IP address.
Lucas Di Pentima [Mon, 30 Jan 2017 19:39:02 +0000 (16:39 -0300)]
10694: Flipped graph orientations so they are all (collections, jobs, pipelines) rendered top-down.
radhika [Mon, 30 Jan 2017 18:34:42 +0000 (13:34 -0500)]
9397: test with multiple mount points under subdirectories
Lucas Di Pentima [Mon, 30 Jan 2017 18:31:16 +0000 (15:31 -0300)]
10694: Changed Pipeline Instance's Graph orientation to match Collections and Jobs.
Lucas Di Pentima [Mon, 30 Jan 2017 18:05:33 +0000 (15:05 -0300)]
10694: Changed the Provenance Graph orientation of Jobs to match how it is rendered on Collections.
Tom Clegg [Mon, 30 Jan 2017 16:33:28 +0000 (11:33 -0500)]
10700: Rephrase "should cancel" condition to be less unclear.
Tom Clegg [Mon, 30 Jan 2017 15:31:34 +0000 (10:31 -0500)]
Merge branch '10690-config-dump'
closes #10690
Tom Clegg [Fri, 27 Jan 2017 21:46:51 +0000 (16:46 -0500)]
10690: Fix conflicting uses of "config" name.
Tom Clegg [Fri, 27 Jan 2017 20:45:27 +0000 (15:45 -0500)]
10690: Fix missing return after exit().
Tom Clegg [Fri, 27 Jan 2017 20:28:36 +0000 (15:28 -0500)]
10690: De-duplicate dumpConfig.
Tom Clegg [Thu, 12 Jan 2017 15:33:16 +0000 (10:33 -0500)]
10690: Add -dump-config to arv-git-httpd, crunch-dispatch-slurm, keep-balance, keep-web, keepproxy.
Ward Vandewege [Fri, 27 Jan 2017 19:05:21 +0000 (14:05 -0500)]
Doc update: add trademark symbol next to the word 'Arvados' in 2 places.
No issue #
Tom Clegg [Fri, 27 Jan 2017 18:30:33 +0000 (13:30 -0500)]
Merge branch '10990-keep-web-ranges'
refs #10990
Tom Clegg [Fri, 27 Jan 2017 07:27:23 +0000 (02:27 -0500)]
10990: Remove superfluous code for headers now handled by http.ServeContent.
Tom Clegg [Fri, 27 Jan 2017 07:10:46 +0000 (02:10 -0500)]
10990: Adjust readahead to work better on small segments. Improve comments.
Tom Clegg [Fri, 27 Jan 2017 03:46:06 +0000 (22:46 -0500)]
10990: Fix collectionreader tests.
Tom Clegg [Fri, 27 Jan 2017 03:23:22 +0000 (22:23 -0500)]
10990: Support Range headers with non-zero start offset.
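
For readers unfamiliar with byte ranges, the sketch below shows how a single `bytes=start-end` Range header with a non-zero start maps onto a byte span. It is a simplified Python illustration, not the keep-web code (which is written in Go); `parse_range` is a hypothetical helper, and suffix ranges and multi-range requests are deliberately left out.

```python
import re

_RANGE_RE = re.compile(r"^bytes=(\d+)-(\d*)$")


def parse_range(header, size):
    """Return (start, end_inclusive) for a single "bytes=start-end" range.

    Returns None when the header is not a simple single range or cannot
    be satisfied for a resource of the given size.
    """
    m = _RANGE_RE.match(header.strip())
    if not m:
        return None
    start = int(m.group(1))
    # An omitted end means "through the last byte".
    end = int(m.group(2)) if m.group(2) else size - 1
    # An end past the resource is clamped to the last byte.
    end = min(end, size - 1)
    if start >= size or start > end:
        return None
    return start, end
```

A server would then seek to `start` and send `end - start + 1` bytes with a 206 response.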
Tom Clegg [Thu, 26 Jan 2017 22:40:04 +0000 (17:40 -0500)]
10990: Add tests for byte range requests that start at byte >0.
Ward Vandewege [Thu, 26 Jan 2017 22:38:27 +0000 (17:38 -0500)]
Merge branch '10980-add-ubuntu1604-support'
refs #10980
Ward Vandewege [Thu, 26 Jan 2017 22:30:58 +0000 (17:30 -0500)]
Add support for Ubuntu 16.04 ("xenial") to our build scripts.
refs #10980
Ward Vandewege [Thu, 26 Jan 2017 22:24:10 +0000 (17:24 -0500)]
Remove old debian7 package testing file.
refs #10785
Ward Vandewege [Thu, 26 Jan 2017 22:23:04 +0000 (17:23 -0500)]
Remove deprecated --force-yes flag during package testing, replace with
--allow-unauthenticated
refs #10980
Tom Clegg [Thu, 26 Jan 2017 19:40:30 +0000 (14:40 -0500)]
10700: Rename PollInterval to PollPeriod in library to match commands and config files.
Tom Clegg [Thu, 26 Jan 2017 19:37:54 +0000 (14:37 -0500)]
10704: Rate-limit startup attempts per container.
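
Per-container rate limiting of this kind can be sketched as a table of last-attempt times. This is an illustration under assumptions, not the dispatcher's Go code: `StartLimiter`, `min_interval`, and passing the clock value in as `now` are all choices made for the example.

```python
class StartLimiter:
    """Allow at most one start attempt per container per min_interval seconds."""

    def __init__(self, min_interval):
        self.min_interval = min_interval
        self._last_attempt = {}  # container UUID -> time of last allowed attempt

    def allow(self, container_uuid, now):
        last = self._last_attempt.get(container_uuid)
        if last is not None and now - last < self.min_interval:
            return False  # too soon since the previous attempt
        self._last_attempt[container_uuid] = now
        return True
```

Taking `now` as a parameter keeps the limiter deterministic; a caller would typically pass `time.monotonic()`.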
Peter Amstutz [Thu, 26 Jan 2017 16:10:00 +0000 (11:10 -0500)]
Bump schema-salad version for schema loading performance fix. no issue #
Tom Clegg [Wed, 25 Jan 2017 21:37:26 +0000 (16:37 -0500)]
10703: Un-pyramid run().
Tom Clegg [Wed, 25 Jan 2017 21:04:30 +0000 (16:04 -0500)]
10701: Remove unneeded complexity in squeue invocation.
Peter Amstutz [Wed, 25 Jan 2017 19:31:52 +0000 (14:31 -0500)]
arv_docker_get_image should not modify dockerRequirement parameter. refs #10812
Peter Amstutz [Wed, 25 Jan 2017 18:51:58 +0000 (13:51 -0500)]
Merge branch '10812-docker-upload-fix' refs #10812
Peter Amstutz [Wed, 25 Jan 2017 18:20:28 +0000 (13:20 -0500)]
10812: Fix bug and add test that docker dependencies are found and uploaded.
Tom Clegg [Wed, 25 Jan 2017 15:59:48 +0000 (10:59 -0500)]
10703: Do not catch signals in crunch-dispatch-slurm. Simplify "stop dispatcher loop" API.
Peter Amstutz [Wed, 25 Jan 2017 15:09:19 +0000 (10:09 -0500)]
Merge branch '10895-run-in-single-container-reuse' refs #10895
Peter Amstutz [Wed, 25 Jan 2017 14:53:13 +0000 (09:53 -0500)]
10895: Don't report "reused" for Failed/Cancelled containers. Tweak log
message. Tweak behavior of --debug to also enable arvados SDK debugging.
Tom Clegg [Tue, 24 Jan 2017 20:28:37 +0000 (15:28 -0500)]
Merge branch '9998-no-count-items-available'
closes #9998
Tom Clegg [Tue, 24 Jan 2017 20:28:04 +0000 (15:28 -0500)]
9998: Update CLI usage.
Tom Clegg [Tue, 24 Jan 2017 20:18:52 +0000 (15:18 -0500)]
9998: Remove superfluous word.
Tom Clegg [Tue, 24 Jan 2017 20:17:00 +0000 (15:17 -0500)]
Merge branch '10868-job-log-node-info'
refs #10868
Tom Clegg [Tue, 24 Jan 2017 18:51:40 +0000 (13:51 -0500)]
9998: Use links instead of specimens to test count param.
Tom Clegg [Tue, 24 Jan 2017 18:42:30 +0000 (13:42 -0500)]
9998: Update API docs.
Tom Clegg [Tue, 24 Jan 2017 03:47:57 +0000 (22:47 -0500)]
10868: Include node UUIDs and properties in job logs.
Peter Amstutz [Mon, 23 Jan 2017 22:03:30 +0000 (17:03 -0500)]
10895: Don't add uploaded files to pathmap if they are not referenced.
Peter Amstutz [Mon, 23 Jan 2017 21:21:24 +0000 (16:21 -0500)]
10895: Improve reuse for RunInSingleContainer jobs
* Consolidate and de-duplicate requirements and hints
* Use json.dumps() with sort_keys=True to avoid issues with round trip YAML
formatting and comments that are irrelevant to reuse semantics.
* Remember subworkflow packing for more efficient scattering.
* Logging distinguishes when jobs and containers are reused.
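
The role of key sorting in reuse comparisons can be sketched as below: two specifications that differ only in key order serialize to the same string. `reuse_key` is a hypothetical helper for illustration, and the compact `separators` setting is an extra choice of this sketch, not something stated in the commit.

```python
import json


def reuse_key(spec):
    """Serialize a dict-like spec deterministically for equality comparison."""
    # sort_keys=True makes the output independent of key insertion order.
    return json.dumps(spec, sort_keys=True, separators=(",", ":"))
```

Because the comparison works on parsed data, YAML comments and formatting differences never reach the serialized form.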
Tom Clegg [Mon, 23 Jan 2017 20:31:23 +0000 (15:31 -0500)]
9998: Handle timestamp collisions in collection index.
Tom Clegg [Mon, 23 Jan 2017 15:41:24 +0000 (10:41 -0500)]
Merge branch 'master' into 9998-no-count-items-available
Peter Amstutz [Mon, 23 Jan 2017 15:32:05 +0000 (10:32 -0500)]
Merge branch '10812-cwl-separate-dependencies' refs #10812
Peter Amstutz [Mon, 23 Jan 2017 15:20:46 +0000 (10:20 -0500)]
10812: Handle expected NotFoundError error when checking if file exists in keep.
Peter Amstutz [Mon, 23 Jan 2017 14:52:34 +0000 (09:52 -0500)]
10812: Add num_retries to CollectionFetcher and collectionResolver.
Peter Amstutz [Fri, 20 Jan 2017 16:36:05 +0000 (11:36 -0500)]
10812: Fix indentation typo.
Peter Amstutz [Thu, 19 Jan 2017 18:29:11 +0000 (13:29 -0500)]
10812: Update cwltool and schema-salad dependencies.
Peter Amstutz [Wed, 18 Jan 2017 22:46:54 +0000 (17:46 -0500)]
10812: Handle $schema references.