X-Git-Url: https://git.arvados.org/arvados.git/blobdiff_plain/9fcd276e29493c76dd3a54d7d0b10c1570996478..2c5a3dbd6b56ca39d7b105b821f7061d2b102a56:/doc/install/install-keepproxy.html.textile.liquid diff --git a/doc/install/install-keepproxy.html.textile.liquid b/doc/install/install-keepproxy.html.textile.liquid index 2b2c56e236..20021bd42e 100644 --- a/doc/install/install-keepproxy.html.textile.liquid +++ b/doc/install/install-keepproxy.html.textile.liquid @@ -21,7 +21,7 @@ h2(#introduction). Introduction The Keepproxy server is a gateway into your Keep storage. Unlike the Keepstore servers, which are only accessible on the local LAN, Keepproxy is suitable for clients located elsewhere on the internet. Specifically, in contrast to Keepstore: * A client writing through Keepproxy sends a single copy of a data block, and Keepproxy distributes copies to the appropriate Keepstore servers. -* A client can write through Keepproxy without precomputing content hashes. Notably, the browser-based upload feature in Workbench requires Keepproxy. +* A client can write through Keepproxy without precomputing content hashes. * Keepproxy checks API token validity before processing requests. (Clients that can connect directly to Keepstore can use it as scratch space even without a valid API token.) By convention, we use the following hostname for the Keepproxy server: @@ -29,47 +29,47 @@ By convention, we use the following hostname for the Keepproxy server:
table(table table-bordered table-condensed). |_. Hostname| -|keep.@uuid_prefix@.your.domain| +|@keep.ClusterID.example.com@|
This hostname should resolve from anywhere on the internet. h2(#update-config). Update config.yml -Edit the cluster config at @/etc/arvados/config.yml@ and set @Services.Keepproxy.ExternalURL@ and @Services.Keepproxy.InternalURLs@. +Edit the cluster config at @config.yml@ and set @Services.Keepproxy.ExternalURL@ and @Services.Keepproxy.InternalURLs@.
    Services:
       Keepproxy:
         ExternalURL: https://keep.ClusterID.example.com
         InternalURLs:
-	  "http://keep.ClusterID.example.com:25107": {}
+          "http://localhost:25107": {}
 
h2(#update-nginx). Update Nginx configuration -Put a reverse proxy with SSL support in front of Keepproxy. Keepproxy itself runs on port 25107 by default; the reverse proxy runs on port 443 and forwards requests to Keepproxy on port 25107. +Put a reverse proxy with SSL support in front of Keepproxy. Keepproxy itself runs on the port 25107 (or whatever is specified in @Services.Keepproxy.InternalURL@) while the reverse proxy runs on port 443 and forwards requests to Keepproxy. -Use a text editor to create a new file @/etc/nginx/conf.d/keepproxy.conf@ with the following configuration. Options that need attention are marked with “TODO”. +Use a text editor to create a new file @/etc/nginx/conf.d/keepproxy.conf@ with the following configuration. Options that need attention are marked in red.
upstream keepproxy {
   server                127.0.0.1:25107;
 }
 
 server {
-  listen                  [TODO your public IP address]:443 ssl;
-  server_name             keep.ClusterID.example.com;
+  listen                  443 ssl;
+  server_name             keep.ClusterID.example.com;
 
   proxy_connect_timeout   90s;
   proxy_read_timeout      300s;
   proxy_set_header        X-Real-IP $remote_addr;
   proxy_http_version      1.1;
   proxy_request_buffering off;
+  proxy_max_temp_file_size 0;
 
-  ssl on;
-  ssl_certificate     /TODO/YOUR/PATH/TO/cert.pem;
-  ssl_certificate_key /TODO/YOUR/PATH/TO/cert.key;
+  ssl_certificate     /YOUR/PATH/TO/cert.pem;
+  ssl_certificate_key /YOUR/PATH/TO/cert.key;
 
   # Clients need to be able to upload blocks of data up to 64MiB in size.
   client_max_body_size    64m;
@@ -82,57 +82,43 @@ server {
 
 Note: if the Web uploader is failing to upload data and there are no logs from keepproxy, be sure to check the nginx proxy logs.  In addition to "GET" and "PUT", The nginx proxy must pass "OPTIONS" requests to keepproxy, which should respond with appropriate Cross-origin resource sharing headers.  If the CORS headers are not present, brower security policy will cause the upload request to silently fail.  The CORS headers are generated by keepproxy and should not be set in nginx.
 
-h2(#install-packages). Install Keepproxy package
+{% assign arvados_component = 'keepproxy' %}
 
-h3. Centos 7
+{% include 'install_packages' %}
 
-
-
# yum install keepproxy
-
-
+{% include 'start_service' %} -h3. Debian and Ubuntu +{% include 'restart_api' %} - -
# apt-get install keepproxy
-
-
+h2(#confirm-working). Confirm working installation -h2(#start-service). Start the service +We recommend using the "Cluster diagnostics tool.":diagnostics.html Because Keepproxy is specifically a gateway used by outside clients, for this test you should run the diagnostics from a client machine outside the Arvados private network, and provide the @-external-client@ parameter. -If your system does not use systemd, skip this section and follow the "runit instructions":#runit instead. +Here are some other checks you can perform manually. -If your system uses systemd, the keepproxy service should already be set up. Start it and check its status: +Log into a host that is on a network external to your private Arvados network. The host should be able to contact your keepproxy server (eg @keep.ClusterID.example.com@), but not your keepstore servers (eg keep[0-9].ClusterID.example.com). - -
# systemctl restart keepproxy
-# systemctl status keepproxy
-● keepproxy.service - Arvados Keep Proxy
-   Loaded: loaded (/lib/systemd/system/keepproxy.service; enabled)
-   Active: active (running) since Tue 2019-07-23 09:33:47 EDT; 3 weeks 1 days ago
-     Docs: https://doc.arvados.org/
- Main PID: 1150 (Keepproxy)
-   CGroup: /system.slice/keepproxy.service
-           └─1150 /usr/bin/keepproxy
-[...]
-
-
+@ARVADOS_API_HOST@ and @ARVADOS_API_TOKEN@ must be set in the environment. + +@ARVADOS_API_HOST@ should be the hostname of the API server. + +@ARVADOS_API_TOKEN@ should be the system root token. -h2(#restart-api). Restart the API server and controller +Install the "Command line SDK":{{site.baseurl}}/sdk/cli/install.html -After adding keeproxy to the Services section, make sure the cluster config file is up to date on the API server host, and restart the API server and controller processes to ensure the changes are applied. +Check that the keepproxy server is in the @keep_service@ "accessible" list: -
# systemctl restart nginx arvados-controller
+

+$ arv keep_service accessible
+[...]
 
-h2(#confirm-working). Confirm working installation - -Log into a host that is on a network external to your private Arvados network. The host should be able to contact your keepproxy server (eg @keep.ClusterID.example.com@), but not your keepstore servers (eg keep[0-9].ClusterID.example.com). +If keepstore does not show up in the "accessible" list, and you are accessing it from within the private network, check that you have "properly configured the @geo@ block for the API server":install-api-server.html#update-nginx . Install the "Python SDK":{{site.baseurl}}/sdk/python/sdk-python.html -@ARVADOS_API_HOST@ and @ARVADOS_API_TOKEN@ must be set in the environment. +You should now be able to use @arv-put@ to upload collections and @arv-get@ to fetch collections. Be sure to execute this from _outside_ the cluster's private network. -You should now be able to use @arv-put@ to upload collections and @arv-get@ to fetch collections, for an example see "Testing keep.":install-keepstore.html#testing on the keepstore install page. +{% include 'arv_put_example' %}