X-Git-Url: https://git.arvados.org/arvados.git/blobdiff_plain/e51a22dc5b9da795b68c87cb9d0a45e4732ed2f6..64c516079154f73da3f2a33a957fa8ae8eb23749:/doc/install/install-arv-git-httpd.html.textile.liquid diff --git a/doc/install/install-arv-git-httpd.html.textile.liquid b/doc/install/install-arv-git-httpd.html.textile.liquid index 21426daaef..3d70fc4de9 100644 --- a/doc/install/install-arv-git-httpd.html.textile.liquid +++ b/doc/install/install-arv-git-httpd.html.textile.liquid @@ -3,76 +3,54 @@ layout: default navsection: installguide title: Install the Git server ... +{% comment %} +Copyright (C) The Arvados Authors. All rights reserved. -Arvados allows users to create their own private and public git repositories, and clone/push them using SSH and HTTPS. +SPDX-License-Identifier: CC-BY-SA-3.0 +{% endcomment %} -The git hosting setup involves three components. -* The "arvados-git-sync.rb" script polls the API server for the current list of repositories, creates bare repositories, and updates the local permission cache used by gitolite. -* Gitolite provides SSH access. -* arvados-git-http provides HTTPS access. - -It is not strictly necessary to deploy _both_ SSH and HTTPS access, but we recommend deploying both: -* SSH is a more appropriate way to authenticate from a user's workstation because it does not require managing tokens on the client side; -* HTTPS is a more appropriate way to authenticate from a shell VM because it does not depend on SSH agent forwarding (SSH clients' agent forwarding features tend to behave as if the remote machine is fully trusted). - -The HTTPS instructions given below will not work if you skip the SSH setup steps. +# "Introduction":#introduction +# "Install dependencies":#dependencies +# "Create "git" user and storage directory":#create +# "Install gitolite":#gitolite +# "Configure gitolite":#config-gitolite +# "Configure git synchronization":#sync +# "Update config.yml":#update-config +# "Update nginx configuration":#update-nginx +# "Install arvados-git-httpd package":#install-packages +# "Restart the API server and controller":#restart-api +# "Confirm working installation":#confirm-working -h2. Set up DNS +h2(#introduction). Introduction -By convention, we use the following hostname for the git service: +Arvados support for git repository management enables using Arvados permissions to control access to git repositories. Users can create their own private and public git repositories and share them with others. - -
git.uuid_prefix.your.domain
-
-
- -{% include 'notebox_begin' %} -Here, we show how to install the git hosting services *on the same host as your API server.* Using a different host is not yet fully supported. On this page we will refer to it as your git server. -{% include 'notebox_end' %} - -DNS and network configuration should be set up so port 443 reaches your HTTPS proxy, and port 22 reaches your git server. - -h2. Generate an API token - -On the API server, if you are using RVM: - - -
gitserver:~$ cd /var/www/arvados-api/current
-gitserver:/var/www/arvados-api/current$ sudo -u www-data RAILS_ENV=production `which rvm-exec` default bundle exec ./script/create_superuser_token.rb
-4hdqaixi5a027jqn0vyjbwa3xmcue8logzhtsmk1bplgp064fe
-
-
- -If you are not using RVM: - - -
gitserver:~$ cd /var/www/arvados-api/current
-gitserver:/var/www/arvados-api/current$ sudo -u www-data RAILS_ENV=production bundle exec ./script/create_superuser_token.rb
-4hdqaixi5a027jqn0vyjbwa3xmcue8logzhtsmk1bplgp064fe
-
-
+The git hosting setup involves three components. +* The "arvados-git-sync.rb" script polls the API server for the current list of repositories, creates bare repositories, and updates the local permission cache used by gitolite. +* Gitolite provides SSH access. Users authenticate by SSH keys. +* arvados-git-http provides HTTPS access. Users authenticate by Arvados tokens. -Copy that token; you'll need it in a minute. +Git services must be installed on the same host as the Arvados Rails API server. -h2. Install git and other dependencies +h2(#dependencies). Install dependencies -On Debian-based systems: +h3. Centos 7 -
gitserver:~$ sudo apt-get install git openssh-server
+
# yum install git perl-Data-Dumper openssh-server
 
-On Red Hat-based systems: +h3. Debian and Ubuntu -
gitserver:~$ sudo yum install git perl-Data-Dumper openssh-server
+
# apt-get --no-install-recommends install git openssh-server
 
-h2. Create a "git" user and a storage directory +h2(#create). Create "git" user and storage directory -Hosted repositories will be stored under @/var/lib/arvados/git/@. If you choose a different location, make sure to update the @git_repositories_dir@ entry in your API server's @config/application.yml@ file, preserving the trailing @repositories/@ part. +Gitolite and some additional scripts will be installed in @/var/lib/arvados/git@, which means hosted repository data will be stored in @/var/lib/arvados/git/repositories@. If you choose to install gitolite in a different location, make sure to update the @git_repositories_dir@ entry in your API server's @application.yml@ file accordingly: for example, if you install gitolite at @/data/gitolite@ then your @git_repositories_dir@ will be @/data/gitolite/repositories@. A new UNIX account called "git" will own the files. This makes git URLs look familiar to users (git@[...]:username/reponame.git). @@ -85,7 +63,7 @@ gitserver:~$ sudo chown -R git:git ~git
-The git user needs its own SSH key. (It must be able to run @ssh git@localhost@ from scripts.) +The git user needs its own SSH key. (It must be able to run ssh git@localhost from scripts.)
gitserver:~$ sudo -u git -i bash
@@ -98,16 +76,17 @@ git@gitserver:~$ rm .ssh/authorized_keys
 
-h2. Install gitolite +h2(#gitolite). Install gitolite -Check https://github.com/sitaramc/gitolite/tags for the latest stable version (_e.g.,_ @v3.6.3@). +Check "https://github.com/sitaramc/gitolite/tags":https://github.com/sitaramc/gitolite/tags for the latest stable version. This guide was tested with @v3.6.11@. _Versions below 3.0 are missing some features needed by Arvados, and should not be used._ Download and install the version you selected. -
git@gitserver:~$ echo 'PATH=$HOME/bin:$PATH' >.profile
-git@gitserver:~$ source .profile
-git@gitserver:~$ git clone --branch v3.6.3 git://github.com/sitaramc/gitolite
+
$ sudo -u git -i bash
+git@gitserver:~$ echo 'PATH=$HOME/bin:$PATH' >.profile
+git@gitserver:~$ . .profile
+git@gitserver:~$ git clone --branch v3.6.11 https://github.com/sitaramc/gitolite
 ...
 Note: checking out '5d24ae666bfd2fa9093d67c840eb8d686992083f'.
 ...
@@ -121,6 +100,8 @@ WARNING: /var/lib/arvados/git/.ssh/authorized_keys missing; creating a new one
 
+_If this didn't go well, more detail about installing gitolite, and information about how it works, can be found on the "gitolite home page":http://gitolite.com/._ + Clone the gitolite-admin repository. The arvados-git-sync.rb script works by editing the files in this working directory and pushing them to gitolite. Here we make sure "git push" won't produce any errors or warnings. @@ -140,7 +121,7 @@ Everything up-to-date
-h2. Configure gitolite +h2(#config-gitolite). Configure gitolite Configure gitolite to look up a repository name like @username/reponame.git@ and find the appropriate bare repository storage directory. @@ -163,6 +144,13 @@ Add the following lines inside the section that begins @%RC = (@:
+Inside that section, adjust the 'UMASK' setting to @022@, to ensure the API server has permission to read repositories: + + +
    UMASK => 022,
+
+
+ Uncomment the 'Alias' line in the section that begins @ENABLE => [@: @@ -171,136 +159,140 @@ Uncomment the 'Alias' line in the section that begins @ENABLE => [@: -h2. Configure git synchronization +h2(#sync). Configure git synchronization Create a configuration file @/var/www/arvados-api/current/config/arvados-clients.yml@ using the following template, filling in the appropriate values for your system. -* For @arvados_api_token@, use the token you generated above. +* For @arvados_api_token@, use @SystemRootToken@ * For @gitolite_arvados_git_user_key@, provide the public key you generated above, i.e., the contents of @~git/.ssh/id_rsa.pub@.
production:
   gitolite_url: /var/lib/arvados/git/repositories/gitolite-admin.git
   gitolite_tmp: /var/lib/arvados/git
-  arvados_api_host: uuid_prefix.example.com
-  arvados_api_token: "4hdqaixi5a027jqn0vyjbwa3xmcue8logzhtsmk1bplgp064fe"
+  arvados_api_host: ClusterID.example.com
+  arvados_api_token: "zzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzz"
   arvados_api_host_insecure: false
   gitolite_arvados_git_user_key: "ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQC7aBIDAAgMQN16Pg6eHmvc+D+6TljwCGr4YGUBphSdVb25UyBCeAEgzqRiqy0IjQR2BLtSirXr+1SJAcQfBgI/jwR7FG+YIzJ4ND9JFEfcpq20FvWnMMQ6XD3y3xrZ1/h/RdBNwy4QCqjiXuxDpDB7VNP9/oeAzoATPZGhqjPfNS+RRVEQpC6BzZdsR+S838E53URguBOf9yrPwdHvosZn7VC0akeWQerHqaBIpSfDMtaM4+9s1Gdsz0iP85rtj/6U/K/XOuv2CZsuVZZ52nu3soHnEX2nx2IaXMS3L8Z+lfOXB2T6EaJgXF7Z9ME5K1tx9TSNTRcYCiKztXLNLSbp git@gitserver"
 
-h2. Enable the synchronization script +
+$ sudo chown git:git /var/www/arvados-api/current/config/arvados-clients.yml
+$ sudo chmod og-rwx /var/www/arvados-api/current/config/arvados-clients.yml
+
+ +h3. Test configuration + +notextile.
$ sudo -u git -i bash -c 'cd /var/www/arvados-api/current && bundle exec script/arvados-git-sync.rb production'
+ +h3. Enable the synchronization script -The API server package includes a script that retrieves the current set of repository names and permissions from the API, writes names and permissions to @arvadosaliases.pl@ in a format usable by gitolite, and creates new empty repositories if needed. This script should run every 2 to 5 minutes. +The API server package includes a script that retrieves the current set of repository names and permissions from the API, writes them to @arvadosaliases.pl@ in a format usable by gitolite, and triggers gitolite hooks which create new empty repositories if needed. This script should run every 2 to 5 minutes. -If you are using RVM, create @/etc/cron.d/arvados-git-sync@ with the following content: +Create @/etc/cron.d/arvados-git-sync@ with the following content: -
*/5 * * * * git cd /var/www/arvados-api/current && /usr/local/rvm/bin/rvm-exec default bundle exec script/arvados-git-sync.rb production
+
*/5 * * * * git cd /var/www/arvados-api/current && bundle exec script/arvados-git-sync.rb production
 
-Otherwise, create @/etc/cron.d/arvados-git-sync@ with the following content: +h2(#update-config). Update config.yml + +Edit the cluster config at @config.yml@ . -
*/5 * * * * git cd /var/www/arvados-api/current && bundle exec script/arvados-git-sync.rb production
+
    Services:
+      GitSSH:
+        ExternalURL: "ssh://git@git.ClusterID.example.com"
+      GitHTTP:
+        ExternalURL: https://git.ClusterID.example.com/
+        InternalURLs:
+	  "http://localhost:9001": {}
+    Git:
+      GitCommand: /var/lib/arvados/git/gitolite/src/gitolite-shell
+      GitoliteHome: /var/lib/arvados/git
+      Repositories: /var/lib/arvados/git/repositories
 
-h2. Install the arvados-git-httpd package +h2(#update-nginx). Update nginx configuration -This is needed only for HTTPS access. +Use a text editor to create a new file @/etc/nginx/conf.d/arvados-git.conf@ with the following configuration. Options that need attention are marked in red. -The arvados-git-httpd package provides HTTP access, using Arvados authentication tokens instead of passwords. It is intended to be installed on the system where your git repositories are stored, and accessed through a web proxy that provides SSL support. + +
upstream arvados-git-httpd {
+  server                  127.0.0.1:9001;
+}
+server {
+  listen                  443 ssl;
+  server_name             git.ClusterID.example.com;
+  proxy_connect_timeout   90s;
+  proxy_read_timeout      300s;
 
-On Debian-based systems:
+  ssl_certificate         /YOUR/PATH/TO/cert.pem;
+  ssl_certificate_key     /YOUR/PATH/TO/cert.key;
 
-
-
~$ sudo apt-get install git arvados-git-httpd
+  # The server needs to accept potentially large refpacks from push clients.
+  client_max_body_size 128m;
+
+  location  / {
+    proxy_pass            http://arvados-git-httpd;
+  }
+}
 
-On Red Hat-based systems: +h2(#install-packages). Install the arvados-git-httpd package + +The arvados-git-httpd package provides HTTP access, using Arvados authentication tokens instead of passwords. It must be installed on the system where your git repositories are stored. + +h3. Centos 7 -
~$ sudo yum install git arvados-git-httpd
+
# yum install arvados-git-httpd
 
-Verify that @arvados-git-httpd@ and @git-http-backend@ can be run: +h3. Debian and Ubuntu -
~$ arvados-git-httpd -h
-Usage of arvados-git-httpd:
-  -address="0.0.0.0:80": Address to listen on, "host:port".
-  -git-command="/usr/bin/git": Path to git executable. Each authenticated request will execute this program with a single argument, "http-backend".
-  -repo-root="/path/to/cwd": Path to git repositories.
-~$ git http-backend
-Status: 500 Internal Server Error
-Expires: Fri, 01 Jan 1980 00:00:00 GMT
-Pragma: no-cache
-Cache-Control: no-cache, max-age=0, must-revalidate
-
-fatal: No REQUEST_METHOD from server
+
# apt-get --no-install-recommends install arvados-git-httpd
 
-h3. Enable arvados-git-httpd +h2(#restart-api). Restart the API server and controller -Install "runit":http://smarden.org/runit/ (if it's not already installed) and configure it to run arvados-git-httpd. Update the API host to match your site. +After adding Workbench to the Services section, make sure the cluster config file is up to date on the API server host, and restart the API server and controller processes to ensure the changes are applied. -
~$ sudo apt-get install runit
-~$ cd /etc/sv
-/etc/sv$ sudo mkdir arvados-git-httpd; cd arvados-git-httpd
-/etc/sv/arvados-git-httpd$ sudo mkdir log
-/etc/sv/arvados-git-httpd$ sudo sh -c 'cat >log/run' <<'EOF'
-#!/bin/sh
-mkdir -p main
-chown git:git main
-exec chpst -u git:git svlogd -tt main
-EOF
-/etc/sv/arvados-git-httpd$ sudo sh -c 'cat >run' <<'EOF'
-#!/bin/sh
-export ARVADOS_API_HOST=uuid_prefix.your.domain
-export GITOLITE_HTTP_HOME=/var/lib/arvados/git
-export PATH="$PATH:/var/lib/arvados/git/bin"
-exec chpst -u git:git arvados-git-httpd -address=:9001 -git-command="$(which git)" -repo-root=/var/lib/arvados/git/repositories 2>&1
-EOF
-/etc/sv/arvados-git-httpd$ sudo chmod +x run log/run
+
# systemctl restart nginx arvados-controller
 
-h3. Set up a reverse proxy to provide SSL service +h2(#confirm-working). Confirm working installation + +Create 'testrepo' in the Arvados database. -The arvados-git-httpd service will be accessible from anywhere on the internet, so we recommend using SSL. + +
~$ arv --format=uuid repository create --repository '{"name":"myusername/testrepo"}'
+
+ +The arvados-git-sync cron job will notice the new repository record and create a repository on disk. Because it is on a timer (default 5 minutes) you may have to wait a minute or two for it to show up. + +h3. SSH -This is best achieved by putting a reverse proxy with SSL support in front of arvados-git-httpd, running on port 443 and passing requests to @arvados-git-httpd@ on port 9001 (or whichever port you used in your run script). +Before you do this, go to Workbench and choose *SSH Keys* from the menu, and upload your public key. Arvados uses the public key to identify you when you access the git repo. -
http {
-  upstream arvados-git-httpd {
-    server localhost:9001;
-  }
-  server {
-    listen *:443 ssl;
-    server_name git.uuid_prefix.example.com;
-    ssl_certificate /root/git.uuid_prefix.example.com.crt;
-    ssl_certificate_key /root/git.uuid_prefix.example.com.key;
-    location  / {
-      proxy_pass http://arvados-git-httpd;
-      proxy_set_header X-Forwarded-For $remote_addr;
-    }
-  }
-}
-
+
~$ git clone git@git.ClusterID.example.com:username/testrepo.git
 
-h3. Tell the API server about the arvados-git-httpd service +h3. HTTP -In your API server's @config/application.yml@ file, add the following entry: +Set up git credential helpers as described in "install shell server":install-shell-server.html#config-git for the git command to use your API token instead of prompting you for a username and password. -
git_http_base: git.uuid_prefix.your.domain
+
~$ git clone https://git.ClusterID.example.com/username/testrepo.git