X-Git-Url: https://git.arvados.org/arvados.git/blobdiff_plain/19ae770973482257117fe8ded5619c3018c4b60f..0dab89df8040f203a33bc1922df0ff893791def7:/doc/install/install-api-server.html.textile.liquid diff --git a/doc/install/install-api-server.html.textile.liquid b/doc/install/install-api-server.html.textile.liquid index 91e2c69892..6c3eabba4f 100644 --- a/doc/install/install-api-server.html.textile.liquid +++ b/doc/install/install-api-server.html.textile.liquid @@ -1,338 +1,223 @@ --- layout: default navsection: installguide -title: Install the API server +title: Install API server and Controller ... +{% comment %} +Copyright (C) The Arvados Authors. All rights reserved. -h2. Install prerequisites +SPDX-License-Identifier: CC-BY-SA-3.0 +{% endcomment %} -The Arvados package repository includes an API server package that can help automate much of the deployment. +# "Introduction":#introduction +# "Install dependencies":#dependencies +# "Set up database":#database-setup +# "Update config.yml":#update-config +# "Update nginx configuration":#update-nginx +# "Install arvados-api-server and arvados-controller":#install-packages +# "Confirm working installation":#confirm-working -h3(#install_ruby_and_bundler). Install Ruby and Bundler +h2(#introduction). Introduction -{% include 'install_ruby_and_bundler' %} +The Arvados core API server consists of four services: PostgreSQL, Arvados Rails API, Arvados Controller, and Nginx. -h3(#install_postgres). Install PostgreSQL +Here is a simplified diagram showing the relationship between the core services. Client requests arrive at the public-facing Nginx reverse proxy. The request is forwarded to Arvados controller. The controller is able handle some requests itself, the rest are forwarded to the Arvados Rails API. The Rails API server implements the majority of business logic, communicating with the PostgreSQL database to fetch data and make transactional updates. All services are stateless, except the PostgreSQL database. This guide assumes all of these services will be installed on the same node, but it is possible to install these services across multiple nodes. -{% include 'install_postgres' %} +!(full-width){{site.baseurl}}/images/proxy-chain.svg! -h2(#install_apiserver). Install API server and dependencies +h2(#dependencies). Install dependencies -On a Debian-based system, install the following packages: +# "Install PostgreSQL":install-postgresql.html +# "Install Ruby and Bundler":ruby.html +# "Install nginx":nginx.html +# "Install Phusion Passenger":https://www.phusionpassenger.com/library/walkthroughs/deploy/ruby/ownserver/nginx/oss/install_passenger_main.html - -
~$ sudo apt-get install bison build-essential libcurl4-openssl-dev git arvados-api-server
-
-
+h2(#database-setup). Set up database -On a Red Hat-based system, install the following packages: +{% assign service_role = "arvados" %} +{% assign service_database = "arvados_production" %} +{% assign use_contrib = true %} +{% include 'install_postgres_database' %} - -
~$ sudo yum install bison make automake gcc gcc-c++ libcurl-devel git arvados-api-server
-
-
- -{% include 'install_git' %} +h2(#update-config). Update config.yml -h2(#configure). Set up the database +Starting from an "empty config.yml file,":config.html#empty add the following configuration keys. -Generate a new database password. Nobody ever needs to memorize it or type it, so we'll make a strong one: +h3. Tokens -
~$ ruby -e 'puts rand(2**128).to_s(36)'
-6gqa1vu492idd7yca9tfandj3
-
- -Create a new database user. - - -
~$ sudo -u postgres createuser --encrypted -R -S --pwprompt arvados
-[sudo] password for you: yourpassword
-Enter password for new role: paste-password-you-generated
-Enter it again: paste-password-again
-
- -{% assign pg_hba_path = "/opt/rh/postgresql92/root/var/lib/pgsql/data/pg_hba.conf" %} -{% assign pg_service = "postgresql92-postgresql" %} -{% include 'install_redhat_postgres_auth' %} - -Create the database: - - -
~$ sudo -u postgres createdb arvados_production -T template0 -E UTF8 -O arvados
+
    SystemRootToken: "$system_root_token"
+    ManagementToken: "$management_token"
+    Collections:
+      BlobSigningKey: "$blob_signing_key"
 
-h2. Configure the database connection - -Edit @/etc/arvados/api/database.yml@ and replace the @xxxxxxxx@ database password placeholders with the PostgreSQL password you generated above. - -h2(#configure_application). Configure the API server - -Edit @/etc/arvados/api/application.yml@ to configure the settings described in the following sections. The API server reads both @application.yml@ and its own @config/application.default.yml@ file. The settings in @application.yml@ take precedence over the defaults that are defined in @config/application.default.yml@. The @config/application.yml.example@ file is not read by the API server and is provided as a starting template only. - -@config/application.default.yml@ documents additional configuration settings not listed here. You can "view the current source version":https://dev.arvados.org/projects/arvados/repository/revisions/master/entry/services/api/config/application.default.yml for reference. - -Only put local configuration in @application.yml@. Do not edit @application.default.yml@. - -h3(#uuid_prefix). uuid_prefix - -Define your @uuid_prefix@ in @application.yml@ by setting the @uuid_prefix@ field in the section for your environment. This prefix is used for all database identifiers to identify the record as originating from this site. It must be exactly 5 lowercase ASCII letters and digits. - -Example @application.yml@: - - -
  uuid_prefix: zzzzz
-
- -h3. secret_token - -The @secret_token@ is used for for signing cookies. IMPORTANT: This is a site secret. It should be at least 50 characters. Generate a random value and set it in @application.yml@: - - -
~$ ruby -e 'puts rand(2**400).to_s(36)'
-yyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyy
-
+These secret tokens are used to authenticate messages between Arvados components. +* @SystemRootToken@ is used by Arvados system services to authenticate as the system (root) user when communicating with the API server. +* @ManagementToken@ is used to authenticate access to system metrics. +* @Collections.BlobSigningKey@ is used to control access to Keep blocks. -Example @application.yml@: +Each token should be a string of at least 50 alphanumeric characters. You can generate a suitable token with the following command: -
  secret_token: yyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyy
-
- -h3(#blob_signing_key). blob_signing_key - -The @blob_signing_key@ is used to enforce access control to Keep blocks. This same key must be provided to the Keepstore daemons when "installing Keepstore servers.":install-keepstore.html IMPORTANT: This is a site secret. It should be at least 50 characters. Generate a random value and set it in @application.yml@: - - -
~$ ruby -e 'puts rand(2**400).to_s(36)'
-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
-
- -Example @application.yml@: - - -
  blob_signing_key: xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
-
- -h3(#omniauth). sso_app_secret, sso_app_id, sso_provider_url - -The following settings enable the API server to communicate with the "Single Sign On (SSO) server":install-sso.html to authenticate user log in. - -Set @sso_provider_url@ to the base URL where your SSO server is installed. This should be a URL consisting of the scheme and host (and optionally, port), without a trailing slash. - -Set @sso_app_secret@ and @sso_app_id@ to the corresponding values for @app_secret@ and @app_id@ used in the "Create arvados-server client for Single Sign On (SSO)":install-sso.html#client step. - -Example @application.yml@: - - -
  sso_app_id: arvados-server
-  sso_app_secret: wwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwww
-  sso_provider_url: https://sso.example.com
+
~$ tr -dc 0-9a-zA-Z </dev/urandom | head -c50 ; echo
 
-h3. workbench_address - -Set @workbench_address@ to the URL of your workbench application after following "Install Workbench.":install-workbench-app.html - -Example @application.yml@: - - -
  workbench_address: https://workbench.zzzzz.example.com
-
- -h3. websocket_address - -Set @websocket_address@ to the @wss://@ URL of the API server websocket endpoint after following "Set up Web servers":#set_up. The path of the default endpoint is @/websocket@. - -Example @application.yml@: - - -
  websocket_address: wss://ws.zzzzz.example.com/websocket
-
- -h3(#git_repositories_dir). git_repositories_dir - -The @git_repositories_dir@ setting specifies the directory where user git repositories will be stored. - -The git server setup process is covered on "its own page":install-arv-git-httpd.html. For now, create an empty directory in the default location: - - -
~$ sudo mkdir -p /var/lib/arvados/git/repositories
-
- -If you intend to store your git repositories in a different location, specify that location in @application.yml@. - -Default setting in @application.default.yml@: +h3. PostgreSQL.Connection -
  git_repositories_dir: /var/lib/arvados/git/repositories
+
    PostgreSQL:
+      Connection:
+        host: localhost
+        user: arvados
+        password: $postgres_password
+        dbname: arvados_production
 
-h3(#git_internal_dir). git_internal_dir - -The @git_internal_dir@ setting specifies the location of Arvados' internal git repository. By default this is @/var/lib/arvados/internal.git@. This repository stores git commits that have been used to run Crunch jobs. It should _not_ be a subdirectory of @git_repositories_dir@. +Replace the @$postgres_password@ placeholder with the password you generated during "database setup":#database-setup . -Example @application.yml@: +h3. Services -
  git_internal_dir: /var/lib/arvados/internal.git
+
    Services:
+      Controller:
+        ExternalURL: "https://ClusterID.example.com"
+        InternalURLs:
+          "http://localhost:8003": {}
+      RailsAPI:
+        # Does not have an ExternalURL
+        InternalURLs:
+          "http://localhost:8004": {}
 
-h2(#set_up). Set up Web servers - -For best performance, we recommend you use Nginx as your Web server front-end, with a Passenger backend for the main API server and a Puma backend for API server Websockets. To do that: - - -
    -
  1. Install Nginx and Phusion Passenger.
  2. - -
  3. Install runit to supervise the Puma daemon. {% include 'install_runit' %}

  4. - -
  5. Install the script below as the run script for the Puma service, modifying it as directed by the comments.

    - -
    #!/bin/bash
    -
    -set -e
    -exec 2>&1
    -
    -# Uncomment the line below if you're using RVM.
    -#source /etc/profile.d/rvm.sh
    -
    -envdir="`pwd`/env"
    -mkdir -p "$envdir"
    -echo ws-only > "$envdir/ARVADOS_WEBSOCKETS"
    -
    -cd /var/www/arvados-api/current
    -echo "Starting puma in `pwd`"
    -
    -# Change arguments below to match your deployment, "webserver-user" and
    -# "webserver-group" should be changed to the user and group of the web server
    -# process.  This is typically "www-data:www-data" on Debian systems by default,
    -# other systems may use different defaults such the name of the web server
    -# software (for example, "nginx:nginx").
    -exec chpst -m 1073741824 -u webserver-user:webserver-group -e "$envdir" \
    -  bundle exec puma -t 0:512 -e production -b tcp://127.0.0.1:8100
    -
    -
  6. - -
  7. Edit the http section of your Nginx configuration to run the Passenger server, and act as a front-end for both it and Puma. You might add a block like the following, adding SSL and logging parameters to taste:

    - -
    server {
    -  listen 127.0.0.1:8000;
    -  server_name localhost-api;
    -
    -  root /var/www/arvados-api/current/public;
    -  index  index.html index.htm index.php;
    +Replace @ClusterID.example.com@ with the hostname that you previously selected for the API server.
     
    -  passenger_enabled on;
    -  # If you're using RVM, uncomment the line below.
    -  #passenger_ruby /usr/local/rvm/wrappers/default/ruby;
    +The @Services@ section of the configuration helps Arvados components contact one another (service discovery).  Each service has one or more @InternalURLs@ and an @ExternalURL@.  The @InternalURLs@ describe where the service runs, and how the Nginx reverse proxy will connect to it.  The @ExternalURL@ is how external clients contact the service.
     
    -  # This value effectively limits the size of API objects users can
    -  # create, especially collections.  If you change this, you should
    -  # also ensure the following settings match it:
    -  # * `client_max_body_size` in the server section below
    -  # * `client_max_body_size` in the Workbench Nginx configuration (twice)
    -  # * `max_request_size` in the API server's application.yml file
    -  client_max_body_size 128m;
    -}
    +h2(#update-nginx). Update nginx configuration
     
    -upstream api {
    -  server     127.0.0.1:8000  fail_timeout=10s;
    -}
    +Use a text editor to create a new file @/etc/nginx/conf.d/arvados-api-and-controller.conf@ with the following configuration.  Options that need attention are marked in red.
     
    -upstream websockets {
    -  # The address below must match the one specified in puma's -b option.
    -  server     127.0.0.1:8100  fail_timeout=10s;
    -}
    +
    +
    proxy_http_version 1.1;
     
    -proxy_http_version 1.1;
    +# When Keep clients request a list of Keep services from the API
    +# server, use the origin IP address to determine if the request came
    +# from the internal subnet or it is an external client.  This sets the
    +# $external_client variable which in turn is used to set the
    +# X-External-Client header.
    +#
    +# The API server uses this header to choose whether to respond to a
    +# "available keep services" request with either a list of internal keep
    +# servers (0) or with the keepproxy (1).
    +#
    +# Following the example here, update the 10.20.30.0/24 netmask
    +# to match your private subnet.
    +# Update 1.2.3.4 and add lines as necessary with the public IP
    +# address of all servers that can also access the private network to
    +# ensure they are not considered 'external'.
     
    -# When Keep clients request a list of Keep services from the API server, the
    -# server will automatically return the list of available proxies if
    -# the request headers include X-External-Client: 1.  Following the example
    -# here, at the end of this section, add a line for each netmask that has
    -# direct access to Keep storage daemons to set this header value to 0.
     geo $external_client {
       default        1;
    +  127.0.0.0/24   0;
       10.20.30.0/24  0;
    +  1.2.3.4/32     0;
    +}
    +
    +# This is the port where nginx expects to contact arvados-controller.
    +upstream controller {
    +  server     localhost:8003  fail_timeout=10s;
     }
     
     server {
    -  listen       [your public IP address]:443 ssl;
    -  server_name  uuid_prefix.your.domain;
    +  # This configures the public https port that clients will actually connect to,
    +  # the request is reverse proxied to the upstream 'controller'
    +
    +  listen       443 ssl;
    +  server_name  ClusterID.example.com;
     
    -  ssl on;
       ssl_certificate     /YOUR/PATH/TO/cert.pem;
       ssl_certificate_key /YOUR/PATH/TO/cert.key;
     
    -  index  index.html index.htm index.php;
    -
    -  # Refer to the comment about this setting in the server section above.
    +  # Refer to the comment about this setting in the passenger (arvados
    +  # api server) section of your Nginx configuration.
       client_max_body_size 128m;
     
       location / {
    -    proxy_pass            http://api;
    +    proxy_pass            http://controller;
         proxy_redirect        off;
         proxy_connect_timeout 90s;
         proxy_read_timeout    300s;
     
    -    proxy_set_header      X-Forwarded-Proto https;
    -    proxy_set_header      Host $http_host;
    +    proxy_set_header      Host              $http_host;
    +    proxy_set_header      Upgrade           $http_upgrade;
    +    proxy_set_header      Connection        "upgrade";
         proxy_set_header      X-External-Client $external_client;
    -    proxy_set_header      X-Real-IP $remote_addr;
    -    proxy_set_header      X-Forwarded-For $proxy_add_x_forwarded_for;
    +    proxy_set_header      X-Forwarded-For   $proxy_add_x_forwarded_for;
    +    proxy_set_header      X-Forwarded-Proto https;
    +    proxy_set_header      X-Real-IP         $remote_addr;
       }
     }
     
     server {
    -  listen       [your public IP address]:443 ssl;
    -  server_name  ws.uuid_prefix.your.domain;
    +  # This configures the Arvados API server.  It is written using Ruby
    +  # on Rails and uses the Passenger application server.
     
    -  ssl on;
    -  ssl_certificate     /YOUR/PATH/TO/cert.pem;
    -  ssl_certificate_key /YOUR/PATH/TO/cert.key;
    +  listen localhost:8004;
    +  server_name localhost-api;
     
    +  root /var/www/arvados-api/current/public;
       index  index.html index.htm index.php;
     
    -  location / {
    -    proxy_pass            http://websockets;
    -    proxy_redirect        off;
    -    proxy_connect_timeout 90s;
    -    proxy_read_timeout    300s;
    +  passenger_enabled on;
     
    -    proxy_set_header      Upgrade $http_upgrade;
    -    proxy_set_header      Connection "upgrade";
    -    proxy_set_header      Host $host;
    -    proxy_set_header      X-Real-IP $remote_addr;
    -    proxy_set_header      X-Forwarded-For $proxy_add_x_forwarded_for;
    -  }
    +  # If you are using RVM, uncomment the line below.
    +  # If you're using system ruby, leave it commented out.
    +  #passenger_ruby /usr/local/rvm/wrappers/default/ruby;
    +
    +  # This value effectively limits the size of API objects users can
    +  # create, especially collections.  If you change this, you should
    +  # also ensure the following settings match it:
    +  # * `client_max_body_size` in the previous server section
    +  # * `API.MaxRequestSize` in config.yml
    +  client_max_body_size 128m;
     }
     
    -
  8. + -
  9. Restart Nginx:

    +{% assign arvados_component = 'arvados-api-server arvados-controller' %} -
    ~$ sudo nginx -s reload
    -
    +{% include 'install_packages' %} -
  10. +{% assign arvados_component = 'arvados-controller' %} -
-
+{% include 'start_service' %} + +h2(#confirm-working). Confirm working installation + +Confirm working controller: + +
$ curl https://ClusterID.example.com/arvados/v1/config
+
+ +Confirm working Rails API server: + +
$ curl https://ClusterID.example.com/discovery/v1/apis/arvados/v1/rest
+
+ +Confirm that you can use the system root token to act as the system root user: + +
$ curl -H "Authorization: Bearer $system_root_token" https://ClusterID.example.com/arvados/v1/users/current
+
+ +h3. Troubleshooting -h2. Prepare the API server deployment +If you are getting TLS errors, make sure the @ssl_certificate@ directive in your nginx configuration has the "full certificate chain":http://nginx.org/en/docs/http/configuring_https_servers.html#chains -{% assign railspkg = "arvados-api-server" %} -{% include 'install_rails_reconfigure' %} +Logs can be found in @/var/www/arvados-api/current/log/production.log@ and using @journalctl -u arvados-controller@. -{% include 'notebox_begin' %} -You can safely ignore the following messages if they appear while this command runs: -
Don't run Bundler as root. Bundler can ask for sudo if it is needed, and installing your bundle as root will
-break this application for all non-root users on this machine.
-
fatal: Not a git repository (or any of the parent directories): .git
-{% include 'notebox_end' %} +See also the admin page on "Logging":{{site.baseurl}}/admin/logging.html .