--- title: Setup --- {% capture generic_tab_content %} # Setting up a practice repository We will create a new git repository and import a library of existing tool definitions that will help us build our workflow. Create a new empty git repository to hold our workflow with this command: ``` git init rnaseq-cwl-training-exercises ``` {: .language-bash } Next, import bio-cwl-tools with this command: ``` git submodule add https://github.com/common-workflow-library/bio-cwl-tools.git ``` {: .language-bash } # Downloading sample and reference data Start from your rnaseq-cwl-exercises directory. ``` mkdir rnaseq cd rnaseq wget --mirror --no-parent --no-host --cut-dirs=1 https://download.jutro.arvadosapi.com/c=9178fe1b80a08a422dbe02adfd439764+925/ ``` {: .language-bash } # Downloading or generating STAR index Running STAR requires index files generated from the reference. This is a rather large download (4 GB). Depending on your bandwidth, it may be faster to generate it yourself. ## Downloading ``` mkdir hg19-chr1-STAR-index cd hg19-chr1-STAR-index wget --mirror --no-parent --no-host --cut-dirs=1 https://download.jutro.arvadosapi.com/c=02a12ce9e2707610991bd29d38796b57+2912/ ``` {: .language-bash } ## Generating Create `chr1-star-index.yaml`: ``` InputFiles: - class: File location: rnaseq/reference_data/chr1.fa format: http://edamontology.org/format_1930 IndexName: 'hg19-chr1-STAR-index' Gtf: class: File location: rnaseq/reference_data/chr1-hg19_genes.gtf Overhang: 99 ``` {: .language-yaml } Generate the index with your local cwl-runner. ``` cwl-runner bio-cwl-tools/STAR/STAR-Index.cwl chr1-star-index.yaml ``` {: .language-bash } {% endcapture %} {% capture arvados_tab_content %} # Setting up a practice repository We will create a new git repository and import a library of existing tool definitions that will help us build our workflow. When using the recommended [VSCode environment to develop on Arvados](https://doc.arvados.org/v2.3/user/cwl/arvados-vscode-training.html), start by forking the [arvados-vscode-cwl-template](https://github.com/arvados/arvados-vscode-cwl-template) repository. 1. Vscode: On the left sidebar, choose `Explorer` ![](assets/img/Explorer.png) 1. Select `Clone Repository` and enter [https://github.com/arvados/arvados-vscode-cwl-template](https://github.com/arvados/arvados-vscode-cwl-template), then click `Open` 1. If asked `Would you like to open the cloned repository?` choose `Open` Next, import the [bio-cwl-tools](https://github.com/common-workflow-library/bio-cwl-tools) repository: 1. Vscode: In the top menu, select `Terminal` → `New Terminal` 1. This will open a terminal window in the lower part of the screen 1. Run this command: ``` git submodule add https://github.com/common-workflow-library/bio-cwl-tools.git ``` {: .language-bash } # Downloading sample and reference data > ## Note > > You may already have access to this collection. > > You can check by going to Workbench and pasting > `9178fe1b80a08a422dbe02adfd439764+925` into the search box. If you > arrived at a collection page instead of a "not found" error, then > you do not need to perform this download step. {: .callout} 1. Go to [https://workbench2.jutro.arvadosapi.com](https://workbench2.jutro.arvadosapi.com) and sign in, this will create an account 2. Go to `Get an API token` under the user menu 3. Log into the shell node of your Arvados cluster 4. On the shell node, copy the host name and token for the `jutro` cluster into the file `~/.config/arvados/jutro.conf` as described on the page for [arv-copy](https://doc.arvados.org/user/topics/arv-copy.html). Now, on shell node of your Arvados cluster, use `arv-copy` to copy the collection: ``` arv-copy --src jutro 9178fe1b80a08a422dbe02adfd439764+925 ``` {: .language-bash } # Downloading or generating STAR index Running STAR requires index files generated from the reference. This is a rather large download (4 GB). Depending on your bandwidth, it may be faster to generate it yourself. ## Downloading > ## Note > > As above, you can check by going to Workbench and pasting > `02a12ce9e2707610991bd29d38796b57+2912` into the search box to see > if you already have access to this collection. {: .callout} Use `arv-copy` to copy the collection: ``` arv-copy --src jutro 02a12ce9e2707610991bd29d38796b57+2912 ``` {: .language-bash } ## Generating Create `chr1-star-index.yaml`: ``` InputFiles: - class: File location: keep:9178fe1b80a08a422dbe02adfd439764+925/reference_data/chr1.fa format: http://edamontology.org/format_1930 IndexName: 'hg19-chr1-STAR-index' Gtf: class: File location: keep:9178fe1b80a08a422dbe02adfd439764+925/reference_data/chr1-hg19_genes.gtf Overhang: 99 ``` {: .language-yaml } Generate the index with arvados-cwl-runner. ``` arvados-cwl-runner bio-cwl-tools/STAR/STAR-Index.cwl chr1-star-index.yaml ``` {: .language-bash } {% endcapture %}
{{ generic_tab_content | markdownify}}
{{ arvados_tab_content | markdownify}}
{% include links.md %}