| Step | Annotation |
|---|---|
|
Step 1: Input dataset collection
select at runtime
|
|
|
Step 2: HISAT2
Use a built-in genome
hg38
Paired-end Dataset Collection
select at runtime
Reverse (RF)
Use default values
Summary Options:
True
False
Advanced Options:
Use default values
Use default values
Use default values
Specify spliced alignment options
0
12
Natural logarithm [f(x) = B + A * log(x)]
-8.0
1.0
Natural logarithm [f(x) = B + A * log(x)]
-8.0
1.0
20
500000
False
select at runtime
Report alignments tailored for transcript assemblers including StringTie
False
Use default values
Use default values
Use default values
Use default job resource parameters
|
Make sure to choose strand correctly after manual inspection with a preliminary mapping of one sample and filtering reads to view on IGV |
|
Step 3: Input dataset
select at runtime
|
|
|
Step 4: Trimmomatic
Paired-end (two separate input files)
Output dataset 'output' from step 1
Output dataset 'output' from step 1
True
Standard
TruSeq3 (paired-ended, for MiSeq and HiSeq)
2
30
10
8
True
Trimmomatic Operations
Trimmomatic Operation 1
Sliding window trimming (SLIDINGWINDOW)
4
20
Use default job resource parameters
|
choose the right options for Paired-end vs single read adaptors |
|
Step 5: Filter SAM or BAM, output SAM or BAM
Output dataset 'output_alignments' from step 2
Include header
20
yes
Read is paired
Read is mapped in a proper pair
Nothing selected.
Empty.
Empty.
select at runtime
False
Empty.
BAM (-b)
|
BAM cleanup |
|
Step 6: FastQC
Output dataset 'fastq_out_paired' from step 4
select at runtime
select at runtime
|
|
|
Step 7: StringTie
Output dataset 'output1' from step 5
Reverse (RF)
Do not use reference GTF/GFF3
Advanced Options:
True
Empty.
Empty.
0.15
200
10
1
2
50
0.95
True
False
Use default job resource parameters
|
choose strand correctly Input the read name Prefix according to sample group if desired To make custom GFF |
|
Step 8: StringTie merge
Output dataset 'output_gtf' from step 7
Output dataset 'output' from step 3
50
0
1.0
1.0
0.01
250
False
Use default job resource parameters
|
But in order to distinguish between novel and existing transcripts we will need provide Stringtie merge with a list of known annotated transcripts. For mouse genome version used in this tutorial (mm10) such a list can be downloaded from a Galaxy Library as was described above. Now we can finally run Stringtie merge |
|
Step 9: featureCounts
Output dataset 'output1' from step 5
Stranded (Reverse)
in your history
Output dataset 'out_gtf' from step 8
Gene-ID "\t" read-count (MultiQC/DESeq2/edgeR/limma-voom compatible)
True
Options for paired-end reads:
Disabled; all reads/mates will be counted individually
False
True
Advanced options:
exon
transcript_id
False
False
Disabled; multi-mapping reads are excluded (default)
12
False
False
False
False
1
0
0
0
0
Leave the read as it is
False
False
False
False
Use default job resource parameters
|
lho-w
All published workflows
Published workflows by lho-w