modules/ampcombi2_parsetables

A submodule that parses and standardizes the results from various antimicrobial peptide identification tools.

antimicrobial peptidesampsparsingreportingalignmacrelamplifyhmmsearchneubiampirampgramamptransformerDRAMPMMseqs2InterProScan

https://github.com/nf-core/modules/[...]/modules/nf-core/ampcombi2/parsetables

Description

A submodule that parses and standardizes the results from various antimicrobial peptide identification tools.

Input

name:type

description

pattern

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`amp_input{:bash}`
`:list`

The path to the directory containing the results for the AMP tools for each processed sample or a list of files corresponding to each file generated by AMP tools.

[*amptool.tsv, *amptool.tsv]

`faa_input{:bash}`
`:file`

The path to the file corresponding to the respective protein fasta files with ‘.faa’ extension. File names have to contain the corresponding sample name, i.e. sample_1.faa

*.faa

`gbk_input{:bash}`
`:file`

The path to the file corresponding to the respective annotated files with either ‘.gbk’ or ‘.gbff’ extensions. File names must contain the corresponding sample name, i.e. sample_1.faa where “sample_1” is the sample name.

*.gbk

`opt_amp_db{:bash}`
`:string`

The name of the database to download and set up. This can either be ‘DRAMP’, ‘APD’ or ‘UniRef100’.

DRAMP|APD|UniRef100

`opt_amp_db_dir{:bash}`
`:directory`

The path to the folder containing the fasta and tsv database files.

path/to/amp_*_database

`opt_interproscan{:bash}`
`:directory`

A path to a file corresponding to the respective tsv files containing protein classifications of the annotated CDSs. The file must be the raw output from InterProScan.

*.tsv

Output

name:type

description

pattern

`sample_dir{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`${prefix}/{:bash}`
`:directory`

The output directory that contains the summary output and related alignment files for one sample.

/*

`contig_gbks{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`${prefix}/contig_gbks/{:bash}`
`:directory`

The output subdirectory that contains the gbk files containing the AMP hits for each sample.

/*/contig_gbks

`db_tsv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`${prefix}/${prefix}_mmseqs_matches.tsv{:bash}`
`:file`

An alignment file containing the results from the MMseqs2 alignment step done on all AMP hits.

/*/*_mmseqs_matches.tsv

`tsv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`${prefix}/${prefix}_ampcombi.tsv{:bash}`
`:file`

A file containing the summary report of all predicted AMP hits from all AMP tools given as input, the corresponding taxonomic and functional classification from the alignment step and the estimated physiochemical properties.

/*/*_ampcombi.tsv

`faa{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`${prefix}/${prefix}_amp.faa{:bash}`
`:file`

A fasta file containing the amino acid sequences of all predicted AMP hits.

/*/*_amp.faa

`sample_log{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`${prefix}/${prefix}_ampcombi.log{:bash}`
`:file`

A log file that captures the standard output per sample in a log file. Can be activated by --log.

/*/*.log

`full_log{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`Ampcombi_parse_tables.log{:bash}`
`:file`

A log file that captures the standard output for the entire process in a log file. Can be activated by --log.

Ampcombi_parse_tables.log

`db{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`amp_${opt_amp_db}_database/{:bash}`
`:directory`

If the AMP reference database ID is not provided by the user using the flag `—amp_database’, by default the DRAMP database will be downloaded, filtered and stored in this folder.

/amp_*_database

`db_txt{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`amp_${opt_amp_db}_database/*.txt{:bash}`
`:file`

AMP reference database in tsv-format with two columns containing header and sequence.

/amp_*_database/*.txt

`db_fasta{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`amp_${opt_amp_db}_database/*.fasta{:bash}`
`:file`

AMP reference database fasta file in clean format.

/amp_*_database/*.fasta

`db_mmseqs{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

`amp_${opt_amp_db}_database/mmseqs2/{:bash}`
`:directory`

As alignment to the reference database is carried out by MMseqs2, this directory contains all the files generated by MMseqs2 on the fasta file of the database.

/amp_*_database/mmseqs2

`versions{:bash}`

`versions.yml{:bash}`
`:file`

File containing software versions

versions.yml

Tools

ampcombi2/parsetables
MIT

A parsing tool to convert and summarise the outputs from multiple AMP detection tools in a standardized format.

github.com/Darcy220606/AMPcombi ampcombi.readthedocs.io/en/main https://github.com/Darcy220606/AMPcombi/tree/dev

modules/ampcombi2_parsetables

Description

Input

meta{:bash}:map

amp_input{:bash}:list

faa_input{:bash}:file

gbk_input{:bash}:file

opt_amp_db{:bash}:string

opt_amp_db_dir{:bash}:directory

opt_interproscan{:bash}:directory

Output

sample_dir{:bash}

meta{:bash}:map

${prefix}/{:bash}:directory

contig_gbks{:bash}

meta{:bash}:map

${prefix}/contig_gbks/{:bash}:directory

db_tsv{:bash}

meta{:bash}:map

${prefix}/${prefix}_mmseqs_matches.tsv{:bash}:file

tsv{:bash}

meta{:bash}:map

${prefix}/${prefix}_ampcombi.tsv{:bash}:file

faa{:bash}

meta{:bash}:map

${prefix}/${prefix}_amp.faa{:bash}:file

sample_log{:bash}

meta{:bash}:map

${prefix}/${prefix}_ampcombi.log{:bash}:file

full_log{:bash}

meta{:bash}:map

Ampcombi_parse_tables.log{:bash}:file

db{:bash}

meta{:bash}:map

amp_${opt_amp_db}_database/{:bash}:directory

db_txt{:bash}

meta{:bash}:map

amp_${opt_amp_db}_database/*.txt{:bash}:file

db_fasta{:bash}

meta{:bash}:map

amp_${opt_amp_db}_database/*.fasta{:bash}:file

db_mmseqs{:bash}

meta{:bash}:map

amp_${opt_amp_db}_database/mmseqs2/{:bash}:directory

versions{:bash}

versions.yml{:bash}:file

Tools

ampcombi2/parsetables MIT

included in

maintainer

get in touch

`meta{:bash}`
`:map`

`amp_input{:bash}`
`:list`

`faa_input{:bash}`
`:file`

`gbk_input{:bash}`
`:file`

`opt_amp_db{:bash}`
`:string`

`opt_amp_db_dir{:bash}`
`:directory`

`opt_interproscan{:bash}`
`:directory`

`sample_dir{:bash}`

`meta{:bash}`
`:map`

`${prefix}/{:bash}`
`:directory`

`contig_gbks{:bash}`

`meta{:bash}`
`:map`

`${prefix}/contig_gbks/{:bash}`
`:directory`

`db_tsv{:bash}`

`meta{:bash}`
`:map`

`${prefix}/${prefix}_mmseqs_matches.tsv{:bash}`
`:file`

`tsv{:bash}`

`meta{:bash}`
`:map`

`${prefix}/${prefix}_ampcombi.tsv{:bash}`
`:file`

`faa{:bash}`

`meta{:bash}`
`:map`

`${prefix}/${prefix}_amp.faa{:bash}`
`:file`

`sample_log{:bash}`

`meta{:bash}`
`:map`

`${prefix}/${prefix}_ampcombi.log{:bash}`
`:file`

`full_log{:bash}`

`meta{:bash}`
`:map`

`Ampcombi_parse_tables.log{:bash}`
`:file`

`db{:bash}`

`meta{:bash}`
`:map`

`amp_${opt_amp_db}_database/{:bash}`
`:directory`

`db_txt{:bash}`

`meta{:bash}`
`:map`

`amp_${opt_amp_db}_database/*.txt{:bash}`
`:file`

`db_fasta{:bash}`

`meta{:bash}`
`:map`

`amp_${opt_amp_db}_database/*.fasta{:bash}`
`:file`

`db_mmseqs{:bash}`

`meta{:bash}`
`:map`

`amp_${opt_amp_db}_database/mmseqs2/{:bash}`
`:directory`

`versions{:bash}`

`versions.yml{:bash}`
`:file`

ampcombi2/parsetables
MIT