5 # Setting up a practice repository
7 We will create a new git repository and import a library of existing
8 tool definitions that will help us build our workflow.
10 Create a new git repository to hold our workflow with this command:
13 git init rnaseq-cwl-training-exercises
19 git clone https://github.com/arvados/arvados-vscode-cwl-template.git rnaseq-cwl-training-exercises
22 Next, import bio-cwl-tools with this command:
25 git submodule add https://github.com/common-workflow-library/bio-cwl-tools.git
28 # Downloading sample and reference data
30 Start from your rnaseq-cwl-exercises directory.
35 wget --mirror --no-parent --no-host --cut-dirs=1 https://download.pirca.arvadosapi.com/c=9178fe1b80a08a422dbe02adfd439764+925/
38 # Downloading or generating STAR index
40 Running STAR requires index files generated from the reference.
42 This is a rather large download (4 GB). Depending on your bandwidth, it may be faster to generate it yourself.
47 mkdir hg19-chr1-STAR-index
48 cd hg19-chr1-STAR-index
49 wget --mirror --no-parent --no-host --cut-dirs=1 https://download.pirca.arvadosapi.com/c=02a12ce9e2707610991bd29d38796b57+2912/
54 Create `chr1-star-index.yaml`:
59 location: rnaseq/reference_data/chr1.fa
60 format: http://edamontology.org/format_1930
61 IndexName: 'hg19-chr1-STAR-index'
64 location: rnaseq/reference_data/chr1-hg19_genes.gtf
68 Generate the index with your local cwl-runner.
71 cwl-runner bio-cwl-tools/STAR/STAR-Index.cwl chr1-star-index.yaml
75 {% include links.md %}