Description

Locate and tag duplicate reads in a BAM file

Input

name:type
description
pattern

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

reads{:bash}

:file

Sequence reads file, can be SAM/BAM/CRAM format

*.{bam,cram,sam}

meta2{:bash}

:map

Groovy Map containing reference information e.g. [ id:‘genome’ ]

fasta{:bash}

:file

Reference genome fasta file, required for CRAM input

*.{fasta,fa}

meta3{:bash}

:map

Groovy Map containing reference information e.g. [ id:‘genome’ ]

fai{:bash}

:file

Reference genome fasta index

*.{fai}

Output

name:type
description
pattern

bam{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.bam{:bash}

:file

BAM file with duplicate reads marked/removed

*.{bam}

bai{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.bai{:bash}

:file

An optional BAM index file. If desired, —CREATE_INDEX must be passed as a flag

*.{bai}

cram{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.cram{:bash}

:file

Output CRAM file

*.{cram}

metrics{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.metrics.txt{:bash}

:file

Duplicate metrics file generated by picard

*.{metrics.txt}

versions_picard{:bash}

${task.process}{:bash}

:string

The process the versions were collected from

picard{:bash}

:string

The tool name

picard MarkDuplicates --version 2>&1 | sed -n 's/^Version:*//p'{:bash}

:string

The command used to generate the version of the tool

Topics

name:type
description
pattern

versions{:bash}

${task.process}{:bash}

:string

The process the versions were collected from

picard{:bash}

:string

The tool name

picard MarkDuplicates --version 2>&1 | sed -n 's/^Version:*//p'{:bash}

:string

The command used to generate the version of the tool

Tools

picard
MIT

A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.