arvados.git
Peter Amstutz [Fri, 10 Feb 2017 16:04:23 +0000 (11:04 -0500)]
Merge branch 'master' into origin-9397-prepopulate-output-directory

Peter Amstutz [Fri, 10 Feb 2017 16:03:42 +0000 (11:03 -0500)]
Merge branch '9397-go-manifest' refs #9397

Peter Amstutz [Fri, 10 Feb 2017 16:02:33 +0000 (11:02 -0500)]
9397: Update comment & tests for CollectionFileReader to reflect it is more
lenient in the paths it accepts as a result of updates to implementation of
manifest.FileSegmentIterByName.

Peter Amstutz [Thu, 9 Feb 2017 21:58:23 +0000 (16:58 -0500)]
9397: Tweak method documentation for Extract()

Peter Amstutz [Thu, 9 Feb 2017 21:55:10 +0000 (16:55 -0500)]
9397: Update method documentation for Extract()

Peter Amstutz [Thu, 9 Feb 2017 21:46:21 +0000 (16:46 -0500)]
9397: Fix major bug in firstBlock().  Refactor ManifestTextForPath() to
Extract().  Test returning errors from Extract().

Peter Amstutz [Thu, 9 Feb 2017 19:28:26 +0000 (14:28 -0500)]
9397: Fix collection.go to use blockdigest.LocatorPattern

Peter Amstutz [Thu, 9 Feb 2017 19:28:26 +0000 (14:28 -0500)]
9397: Fix collection.go to use blockdigest.LocatorPattern

Peter Amstutz [Thu, 9 Feb 2017 19:25:19 +0000 (14:25 -0500)]
9397: Use ManifestTextForPath

Peter Amstutz [Thu, 9 Feb 2017 19:11:53 +0000 (14:11 -0500)]
Merge branch '9397-go-manifest' into origin-9397-prepopulate-output-directory

Peter Amstutz [Thu, 9 Feb 2017 19:11:35 +0000 (14:11 -0500)]
9397: Remove conflicting code for easier merge

Peter Amstutz [Thu, 9 Feb 2017 18:25:40 +0000 (13:25 -0500)]
9397: Use idiomatic loop form.

Peter Amstutz [Thu, 9 Feb 2017 18:21:28 +0000 (13:21 -0500)]
9397: Move most functions into private API.

* Document ManifestTextForPath().
* Check for invalid file segments, and fix tests that were using invalid file segments(!)

Tom Clegg [Thu, 9 Feb 2017 08:35:59 +0000 (03:35 -0500)]
Merge branch '10700-dispatch'

refs #10700

Tom Clegg [Thu, 9 Feb 2017 07:28:47 +0000 (02:28 -0500)]
10700: Fix error message.

Peter Amstutz [Mon, 6 Feb 2017 22:16:01 +0000 (17:16 -0500)]
9397: Add manifest normalization and sub-manifest extraction by path.

Introduces "SegmentedManifest" which stores streams -> files -> file segments.
Enables reexport of manifest in normalized form, as well as extraction of
individual files, streams or sets of streams.  Also adds binary search for
efficiently determining first block to access for some stream offset.
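The binary search mentioned above can be illustrated with a minimal sketch. It is written in Python for brevity and is not the SDK's Go implementation; the function name `first_block` and the representation of a stream as a plain list of block sizes are assumptions made for this example only.

```python
from bisect import bisect_right
from itertools import accumulate


def first_block(block_sizes, offset):
    """Return (block_index, offset_within_block) for a stream offset.

    block_sizes is a hypothetical stand-in for the sizes of the data
    blocks that make up one manifest stream, in order.
    """
    total = sum(block_sizes)
    if not 0 <= offset < total:
        raise ValueError("offset is outside the stream")
    # starts[i] is the stream offset at which block i begins.
    starts = [0] + list(accumulate(block_sizes))[:-1]
    # Binary search for the last block that starts at or before offset.
    i = bisect_right(starts, offset) - 1
    return i, offset - starts[i]
```

A linear scan gives the same answer; the search only pays off for streams with many blocks.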

radhika [Mon, 6 Feb 2017 21:56:38 +0000 (16:56 -0500)]
9397: add pre-population of output directory documentation

radhika [Mon, 6 Feb 2017 03:41:34 +0000 (22:41 -0500)]
Merge branch 'master' into 9397-prepopulate-output-directory

radhika [Mon, 6 Feb 2017 03:40:00 +0000 (22:40 -0500)]
9397: add normalized manifest test

radhika [Mon, 6 Feb 2017 02:03:11 +0000 (21:03 -0500)]
9397: add handling of denormalized manifest

radhika [Mon, 6 Feb 2017 00:14:48 +0000 (19:14 -0500)]
9397: NormalizeManifest

Lucas Di Pentima [Fri, 3 Feb 2017 22:34:49 +0000 (19:34 -0300)]
Merge branch '10968-arvput-resume-progress'
Closes #10968

Lucas Di Pentima [Fri, 3 Feb 2017 22:34:02 +0000 (19:34 -0300)]
10968: Deleted space char at the end of a comment

radhika [Fri, 3 Feb 2017 21:00:33 +0000 (16:00 -0500)]
9397: get manifest segment for a subdir or file using manifest from sdk.

Lucas Di Pentima [Fri, 3 Feb 2017 19:21:24 +0000 (16:21 -0300)]
10968: Added a notification when uploading at least one directory, to let the user know that computing the expected byte count can take some time when uploading lots of files.

Tom Clegg [Fri, 3 Feb 2017 18:15:43 +0000 (13:15 -0500)]
Merge branch '10969-check-docker-format'

closes #10969

Lucas Di Pentima [Fri, 3 Feb 2017 17:52:16 +0000 (14:52 -0300)]
10968: Changed the periodic update thread to run every 1 second while arv-put is checking which files to skip, only notifying the user via the progress indicator.
When it starts uploading the rest of the files, the update thread returns to the previous behaviour, running once every minute and checkpointing to the cache.

Lucas Di Pentima [Fri, 3 Feb 2017 15:38:52 +0000 (12:38 -0300)]
Merge branch '10932-arvput-slow-resuming'
Closes #10932

Lucas Di Pentima [Fri, 3 Feb 2017 15:09:58 +0000 (12:09 -0300)]
10932: Changed _file_paths from being a list to a set so we're not going to copy it when checking for missing files on local collection on resume start.
Added comments on cache saving explaining why it is better to use json.dumps() instead of copy.deepcopy().

radhika [Fri, 3 Feb 2017 00:16:25 +0000 (19:16 -0500)]
9397: Use manifest.FileSegmentForPath to get manifest segment for a file path. Cache collections to avoid fetching
the same collection repeatedly. If no manifest segment found for a mounted path, log that fact.

Tom Clegg [Thu, 2 Feb 2017 22:32:31 +0000 (17:32 -0500)]
10969: Add docker_image_formats server config, and corresponding check in `arv keep docker`.

Lucas Di Pentima [Thu, 2 Feb 2017 22:10:45 +0000 (19:10 -0300)]
10932: Replaced the use of a list with a set to check if files on local collection are on the local file list, so that the resume start time is greatly reduced.
Also, the save_state method was taking too much time on two operations: deepcopy() and json.dump(). Replaced both with just one call to json.dumps() that's a lot faster than json.dump().
This will improve overall performance when uploading large file collections.
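The two ideas in this message (set membership instead of list scans, and a single json.dumps() call to snapshot state) can be sketched as follows. This is an illustration only, not arv-put's code; `missing_from_local`, `snapshot_state`, and the shape of the state dictionary are hypothetical names chosen for the example.

```python
import json


def missing_from_local(cached_paths, local_paths):
    """Return cached paths that are absent from the local file list.

    Converting local_paths to a set makes each membership test O(1)
    on average, instead of an O(n) scan of a list per lookup.
    """
    local = set(local_paths)
    return [p for p in cached_paths if p not in local]


def snapshot_state(state):
    """Serialize state to a JSON string in one pass.

    The string is independent of later mutations to state, so for
    JSON-compatible data no separate copy.deepcopy() is needed
    before writing the checkpoint.
    """
    return json.dumps(state)


# Hypothetical usage: the snapshot keeps the values seen at call time.
state = {"files": {"a.txt": {"size": 3}}}
saved = snapshot_state(state)
state["files"]["a.txt"]["size"] = 99
```

The snapshot approach only works when the state holds JSON-serializable values, which is already a requirement if the cache is stored as JSON.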

Tom Clegg [Wed, 1 Feb 2017 21:31:06 +0000 (16:31 -0500)]
Merge branch '10990-keep-web-ranges'

closes #10990

Tom Clegg [Wed, 1 Feb 2017 21:29:16 +0000 (16:29 -0500)]
10990: Add Seek tests.

radhika [Wed, 1 Feb 2017 16:50:42 +0000 (11:50 -0500)]
9397: Add ExcludeFromOutput to Mount struct.

Tom Clegg [Wed, 1 Feb 2017 16:46:26 +0000 (11:46 -0500)]
Merge branch '10705-dedup-log-message'

refs #10705

Tom Clegg [Wed, 1 Feb 2017 16:38:29 +0000 (11:38 -0500)]
10705: Fix double-logged error.

Tom Clegg [Tue, 31 Jan 2017 21:32:05 +0000 (16:32 -0500)]
Merge branch '10700-dispatch'

closes #10700
  refs #10701
closes #10702
closes #10703
closes #10704

Conflicts:
services/crunch-dispatch-slurm/crunch-dispatch-slurm.go

Lucas Di Pentima [Tue, 31 Jan 2017 17:27:03 +0000 (14:27 -0300)]
Merge branch '10694-provenance-graph-consistency'
Closes #10694

Tom Clegg [Tue, 31 Jan 2017 15:52:29 +0000 (10:52 -0500)]
Merge branch '10868-ip-addr-conflict'

refs #10868

Lucas Di Pentima [Tue, 31 Jan 2017 15:13:33 +0000 (12:13 -0300)]
Merge branch '10813-arv-put-six-threads'
Closes #10813

radhika [Tue, 31 Jan 2017 14:39:43 +0000 (09:39 -0500)]
9397: update comments

Tom Clegg [Tue, 31 Jan 2017 00:19:49 +0000 (19:19 -0500)]
10700: Rename squeue identifiers (sqCheck = SqueueChecker{})

Tom Clegg [Mon, 30 Jan 2017 23:32:49 +0000 (18:32 -0500)]
10700: Simplify squeue checker.

Lucas Di Pentima [Mon, 30 Jan 2017 21:29:10 +0000 (18:29 -0300)]
10813: Merge branch 'master' into 10813-arv-put-six-threads

Lucas Di Pentima [Mon, 30 Jan 2017 21:26:35 +0000 (18:26 -0300)]
10813: Added only_committed argument to ArvFile.manifest_text() method to maintain consistency with its counterpart from Collection class.

Lucas Di Pentima [Mon, 30 Jan 2017 21:25:34 +0000 (18:25 -0300)]
10813: Removed unnecessary argument

radhika [Mon, 30 Jan 2017 21:20:24 +0000 (16:20 -0500)]
9397: if mount.Path ends with "/", trim it.

radhika [Mon, 30 Jan 2017 20:59:29 +0000 (15:59 -0500)]
Merge branch 'master' into 9397-prepopulate-output-directory

radhika [Mon, 30 Jan 2017 20:58:28 +0000 (15:58 -0500)]
9397: more testing

Tom Clegg [Mon, 30 Jan 2017 20:17:52 +0000 (15:17 -0500)]
10868: Remove old DNS entry immediately when a new node re-uses an old node's IP address.

Lucas Di Pentima [Mon, 30 Jan 2017 19:39:02 +0000 (16:39 -0300)]
10694: Flipped graph orientations so they are all (collections, jobs, pipelines) rendered top-down.

radhika [Mon, 30 Jan 2017 18:34:42 +0000 (13:34 -0500)]
9397: test with multiple mount points under subdirectories

Lucas Di Pentima [Mon, 30 Jan 2017 18:31:16 +0000 (15:31 -0300)]
10694: Changed Pipeline Instance's Graph orientation to match Collections and Jobs.

Lucas Di Pentima [Mon, 30 Jan 2017 18:05:33 +0000 (15:05 -0300)]
10694: Changed the Provenance Graph orientation of Jobs to match the way it is rendered on Collections.

Tom Clegg [Mon, 30 Jan 2017 16:33:28 +0000 (11:33 -0500)]
10700: Rephrase "should cancel" condition to be less unclear.

Tom Clegg [Mon, 30 Jan 2017 15:31:34 +0000 (10:31 -0500)]
Merge branch '10690-config-dump'

closes #10690

Tom Clegg [Fri, 27 Jan 2017 21:46:51 +0000 (16:46 -0500)]
10690: Fix conflicting uses of "config" name.

Tom Clegg [Fri, 27 Jan 2017 20:45:27 +0000 (15:45 -0500)]
10690: Fix missing return after exit().

Tom Clegg [Fri, 27 Jan 2017 20:28:36 +0000 (15:28 -0500)]
10690: De-duplicate dumpConfig.

Tom Clegg [Thu, 12 Jan 2017 15:33:16 +0000 (10:33 -0500)]
10690: Add -dump-config to arv-git-httpd, crunch-dispatch-slurm, keep-balance, keep-web, keepproxy.

Ward Vandewege [Fri, 27 Jan 2017 19:05:21 +0000 (14:05 -0500)]
Doc update: add trademark symbol next to the word 'Arvados' in 2 places.

No issue #

Tom Clegg [Fri, 27 Jan 2017 18:30:33 +0000 (13:30 -0500)]
Merge branch '10990-keep-web-ranges'

refs #10990

Tom Clegg [Fri, 27 Jan 2017 07:27:23 +0000 (02:27 -0500)]
10990: Remove superfluous code for headers now handled by http.ServeContent.

Tom Clegg [Fri, 27 Jan 2017 07:10:46 +0000 (02:10 -0500)]
10990: Adjust readahead to work better on small segments. Improve comments.

Tom Clegg [Fri, 27 Jan 2017 03:46:06 +0000 (22:46 -0500)]
10990: Fix collectionreader tests.

Tom Clegg [Fri, 27 Jan 2017 03:23:22 +0000 (22:23 -0500)]
10990: Support Range headers with non-zero start offset.

Tom Clegg [Thu, 26 Jan 2017 22:40:04 +0000 (17:40 -0500)]
10990: Add tests for byte range requests that start at byte >0.

Ward Vandewege [Thu, 26 Jan 2017 22:38:27 +0000 (17:38 -0500)]
Merge branch '10980-add-ubuntu1604-support'

refs #10980

Ward Vandewege [Thu, 26 Jan 2017 22:30:58 +0000 (17:30 -0500)]
Add support for Ubuntu 1604 ("xenial") to our build scripts.

refs #10980

Ward Vandewege [Thu, 26 Jan 2017 22:24:10 +0000 (17:24 -0500)]
Remove old debian7 package testing file.

refs #10785

Ward Vandewege [Thu, 26 Jan 2017 22:23:04 +0000 (17:23 -0500)]
Remove deprecated --force-yes flag during package testing, replace with
--allow-unauthorized

refs #10980

Tom Clegg [Thu, 26 Jan 2017 19:40:30 +0000 (14:40 -0500)]
10700: Rename PollInterval to PollPeriod in library to match commands and config files.

Tom Clegg [Thu, 26 Jan 2017 19:37:54 +0000 (14:37 -0500)]
10704: Rate-limit startup attempts per container.

Peter Amstutz [Thu, 26 Jan 2017 16:10:00 +0000 (11:10 -0500)]
Bump schema-salad version for schema loading performance fix.  no issue #

Tom Clegg [Wed, 25 Jan 2017 21:37:26 +0000 (16:37 -0500)]
10703: Un-pyramid run().

Tom Clegg [Wed, 25 Jan 2017 21:04:30 +0000 (16:04 -0500)]
10701: Remove unneeded complexity in squeue invocation.

Peter Amstutz [Wed, 25 Jan 2017 19:31:52 +0000 (14:31 -0500)]
arv_docker_get_image should not modify dockerRequirement parameter.  refs #10812

Peter Amstutz [Wed, 25 Jan 2017 18:51:58 +0000 (13:51 -0500)]
Merge branch '10812-docker-upload-fix' refs #10812

Peter Amstutz [Wed, 25 Jan 2017 18:20:28 +0000 (13:20 -0500)]
10812: Fix bug and add test that docker dependencies are found and uploaded.

Tom Clegg [Wed, 25 Jan 2017 15:59:48 +0000 (10:59 -0500)]
10703: Do not catch signals in crunch-dispatch-slurm. Simplify "stop dispatcher loop" API.

Peter Amstutz [Wed, 25 Jan 2017 15:09:19 +0000 (10:09 -0500)]
Merge branch '10895-run-in-single-container-reuse' refs #10895

Peter Amstutz [Wed, 25 Jan 2017 14:53:13 +0000 (09:53 -0500)]
10895: Don't report "reused" for Failed/Cancelled containers.  Tweak log
message.  Tweak behavior of --debug to also enable arvados SDK debugging.

Tom Clegg [Tue, 24 Jan 2017 20:28:37 +0000 (15:28 -0500)]
Merge branch '9998-no-count-items-available'

closes #9998

Tom Clegg [Tue, 24 Jan 2017 20:28:04 +0000 (15:28 -0500)]
9998: Update CLI usage.

Tom Clegg [Tue, 24 Jan 2017 20:18:52 +0000 (15:18 -0500)]
9998: Remove superfluous word.

Tom Clegg [Tue, 24 Jan 2017 20:17:00 +0000 (15:17 -0500)]
Merge branch '10868-job-log-node-info'

refs #10868

Tom Clegg [Tue, 24 Jan 2017 18:51:40 +0000 (13:51 -0500)]
9998: Use links instead of specimens to test count param.

Tom Clegg [Tue, 24 Jan 2017 18:42:30 +0000 (13:42 -0500)]
9998: Update API docs.

Tom Clegg [Tue, 24 Jan 2017 03:47:57 +0000 (22:47 -0500)]
10868: Include node UUIDs and properties in job logs.

Peter Amstutz [Mon, 23 Jan 2017 22:03:30 +0000 (17:03 -0500)]
10895: Don't add uploaded files to pathmap if they are not referenced.

Peter Amstutz [Mon, 23 Jan 2017 21:21:24 +0000 (16:21 -0500)]
10895: Improve reuse for RunInSingleContainer jobs

* Consolidate and de-duplicate requirements and hints
* Use json.dumps() with sort_keys=True to avoid issues with round trip YAML
  formatting and comments that are irrelevant to reuse semantics.
* Remember subworkflow packing for more efficient scattering.
* Logging distinguishes when jobs and containers are reused.

Tom Clegg [Mon, 23 Jan 2017 20:31:23 +0000 (15:31 -0500)]
9998: Handle timestamp collisions in collection index.

Tom Clegg [Mon, 23 Jan 2017 15:41:24 +0000 (10:41 -0500)]
Merge branch 'master' into 9998-no-count-items-available

Peter Amstutz [Mon, 23 Jan 2017 15:32:05 +0000 (10:32 -0500)]
Merge branch '10812-cwl-separate-dependencies' refs #10812

Peter Amstutz [Mon, 23 Jan 2017 15:20:46 +0000 (10:20 -0500)]
10812: Handle expected NotFoundError error when checking if file exists in keep.

Peter Amstutz [Mon, 23 Jan 2017 14:52:34 +0000 (09:52 -0500)]
10812: Add num_retries to CollectionFetcher and collectionResolver.

Peter Amstutz [Fri, 20 Jan 2017 16:36:05 +0000 (11:36 -0500)]
10812: Fix indentation typo.

Peter Amstutz [Thu, 19 Jan 2017 18:29:11 +0000 (13:29 -0500)]
10812: Update cwltool and schema-salad dependencies.

Peter Amstutz [Wed, 18 Jan 2017 22:46:54 +0000 (17:46 -0500)]
10812: Handle $schema references.