X-Git-Url: https://git.arvados.org/arvados.git/blobdiff_plain/8c90084189d5ddb42359688aabd57f77ae867000..cf2a30aa6449f7ed9ac6c455842a23af44fb1509:/doc/user/topics/keep.html.textile.liquid diff --git a/doc/user/topics/keep.html.textile.liquid b/doc/user/topics/keep.html.textile.liquid index b8d787d540..f2ffa62406 100644 --- a/doc/user/topics/keep.html.textile.liquid +++ b/doc/user/topics/keep.html.textile.liquid @@ -8,7 +8,7 @@ In Keep, information is stored in *data blocks*. Data blocks are normally betwe In order to reassemble the file, Keep stores a *collection* data block which lists in sequence the data blocks that make up the original file. A collection data block may store the information for multiple files, including a directory structure. -In this example we will use @c1bad4b39ca5a924e481008009d94e32+210@ which we added to Keep in "the first Keep tutorial":{{ site.baseurl }}/users/tutorial/tutorial-keep.html. First let us examine the contents of this collection using @arv keep get@: +In this example we will use @c1bad4b39ca5a924e481008009d94e32+210@ which we added to Keep in "the first Keep tutorial":{{ site.baseurl }}/user/tutorials/tutorial-keep.html. First let us examine the contents of this collection using @arv keep get@:
~$ arv keep get c1bad4b39ca5a924e481008009d94e32+210
@@ -16,14 +16,24 @@ In this example we will use @c1bad4b39ca5a924e481008009d94e32+210@ which we adde
 
-The command @arv keep get@ fetches the contents of the locator @c1bad4b39ca5a924e481008009d94e32+210@. This is a locator for a collection data block, so it fetches the contents of the collection. In this example, this collection consists of a single file @var-GS000016015-ASM.tsv.bz2@ which is 227212247 bytes long, and is stored using four sequential data blocks, 204e43b8a1185621ca55a94839582e6f+67108864, b9677abbac956bd3e86b1deb28dfac03+67108864, fc15aff2a762b13f521baf042140acec+67108864, 323d2a3ce20370c4ca1d3462a344f8fd+25885655. +The command @arv keep get@ fetches the contents of the collection @c1bad4b39ca5a924e481008009d94e32+210@. In this example, this collection includes a single file @var-GS000016015-ASM.tsv.bz2@ which is 227212247 bytes long, and is stored using four sequential data blocks, @204e43b8a1185621ca55a94839582e6f+67108864@, @b9677abbac956bd3e86b1deb28dfac03+67108864@, @fc15aff2a762b13f521baf042140acec+67108864@, and @323d2a3ce20370c4ca1d3462a344f8fd+25885655@. -Let's use @arv keep get@ to download the first datablock: +Let's use @arv keep get@ to download the first data block: notextile.
~$ cd /scratch/you
 /scratch/you$ arv keep get 204e43b8a1185621ca55a94839582e6f+67108864 > block1
-Let's look at the size and compute the md5 hash of @block1@: +{% include 'notebox_begin' %} + +When you run this command, you may get this API warning: + +notextile.
WARNING:root:API lookup failed for collection 204e43b8a1185621ca55a94839582e6f+67108864 (<class 'apiclient.errors.HttpError'>: <HttpError 404 when requesting https://qr1hi.arvadosapi.com/arvados/v1/collections/204e43b8a1185621ca55a94839582e6f%2B67108864?alt=json returned "Not Found">)
+ +This happens because @arv keep get@ tries to find a collection with this identifier. When that fails, it emits this warning, then looks for a datablock instead, which succeeds. + +{% include 'notebox_end' %} + +Let's look at the size and compute the MD5 hash of @block1@:
/scratch/you$ ls -l block1
@@ -34,5 +44,5 @@ Let's look at the size and compute the md5 hash of @block1@:
 
 
 Notice that the block identifer 204e43b8a1185621ca55a94839582e6f+67108864 consists of:
-* the md5 hash @204e43b8a1185621ca55a94839582e6f@ which matches the md5 hash of @block1@
-* a size hint @67108864@ which matches the size of @block1@
+* the MD5 hash of @block1@, @204e43b8a1185621ca55a94839582e6f@, plus
+* the size of @block1@, @67108864@.