1533 tool(s) found

Tool Description Topics Available version(s)
AEGeAn CanonGFF3 pre-process GFF3 files, removing all features not directly related to protein-coding genes Genome annotation 1
2D Feature Extraction Feature Extraction 3
ABRicate Mass screening of contigs for antimicrobial and virulence genes Antimicrobial resistance prediction 9
ABRicate List List all of abricate's available databases. Antimicrobial resistance prediction 9
ABRicate Summary Combine ABRicate results into a simple matrix of gene presence/absence Antimicrobial resistance prediction 1
ABySS de novo sequence assembler 9
Add column to an existing dataset Editing 2
Add LoFreq alignment quality scores to aligned read SAM/BAM records 4
Add metadata to a BIOM table Conversion Format validation 8
AddCommentsToBam add comments to BAM dataset 14
AddOrReplaceReadGroups add or replaces read group information 14
AEGeAn GAEVAL compute coverage and integrity scores for gene models using transcript alignments. Genome annotation 1
AEGeAn LocusPocus calculate locus coordinates for the given gene annotation Genome annotation 1
AEGeAn ParsEval compare two sets of gene annotations for the same sequence. Genome annotation 1
Aggregate datapoints Appends the average, min, max of datapoints per interval 2
Align reads and estimate abundance on a de novo assembly of RNA-Seq data 12
Align.check Calculate the number of potentially misaligned bases 6
Align.seqs Align sequences to a template alignment 5
alimask append modelmask line to a multiple sequence alignments 7
Amino Acid Coverage Builds an aa census and returns its coverage 2
Amino Acid Variants Identifies amino acid mutations 2
Amova Analysis of molecular variance 5
Analyse canonical genes against 'peak' data using RnaChipIntegrator 3
AnnData Operations modifies metadata and flags genes 13
Annotate DESeq2/DEXSeq output tables Append annotation from GTF to differential expression tool outputs 2
annotateMyIDs annotate a generic set of identifiers 11
Anosim Non-parametric multivariate analysis of changes in community structure 5
antigenic Predicts potentially antigenic regions of a protein sequence, using the method of Kolaskar and Tongaonkar. 5
Antismash allows the genome-wide identification, annotation and analysis of secondary metabolite biosynthesis gene clusters 7
ASTRAL-III. Tool for estimating an unrooted species tree given a set of unrooted gene trees. 1
Augustus gene prediction for prokaryotic and eukaryotic genomes 8
Auto Threshold applies a standard threshold algorithm to an image 3
AXT to concatenated FASTA Converts an AXT formatted file to a concatenated FASTA alignment Conversion 1
AXT to FASTA Converts an AXT formatted file to FASTA format Conversion 1
AXT to LAV Converts an AXT formatted file to LAV format Conversion 1
backtranseq Back translate a protein sequence 7
Bakta genome annotation via alignment-free sequence identification 2
BAM filter Removes reads from a BAM file based on criteria 4
BAM to Wiggle converts all types of RNA-seq data from .bam to .wig 10
bam-to-bai converter 1
BAM-to-SAM convert BAM to SAM 5
BAM/SAM Mapping Stats reads mapping statistics for a provided BAM or SAM file. 10
bamCompare normalizes and compares two BAM or CRAM files to obtain the ratio, log2ratio or difference between them 16
bamCoverage generates a coverage bigWig file from a given BAM or CRAM file 18
BamLeftAlign indels in BAM datasets 12
bamPEFragmentSize Estimate the predominant cDNA fragment length from paired-end sequenced BAM/CRAM files 16
BamUtil diff two coordinate sorted SAM/BAM files Sequence cutting 1
banana Bending and curvature plot in B-DNA 5
Bandage Image visualize de novo assembly graphs 8
Bandage Info determine statistics of de novo assembly graphs 6
Bar chart for multiple columns 2
Barcode Splitter 4
barrnap Locate ribosomal RNA's in a fasta file. (GFF output) 4
Base Coverage of all intervals Quantification 5
Batch_correction Corrects intensities for signal drift and batch-effects 4
BBTools: BBduk decontamination using kmers Global alignment Pairwise sequence alignment 3
bcf_uncompressed-to-bcf converter 1
bcftools annotate Annotate and edit VCF/BCF files 11
bcftools call SNP/indel variant calling from VCF/BCF 12
bcftools cnv Call copy number variation from VCF B-allele frequency (BAF) and Log R Ratio intensity (LRR) values 14
bcftools concat Concatenate or combine VCF/BCF files 10
bcftools consensus Create consensus sequence by applying VCF variants to a reference fasta file 13
bcftools convert from vcf Converts VCF/BCF to IMPUTE2/SHAPEIT formats 10
bcftools convert to vcf Converts other formats to VCF/BCFk 10
bcftools counts plugin counts number of samples, SNPs, INDELs, MNPs and total sites 11
bcftools csq Haplotype aware consequence predictor 11
bcftools dosage plugin genotype dosage 10
bcftools fill-AN-AC plugin Fill INFO fields AN and AC 10
bcftools fill-tags plugin Set INFO tags AF, AN, AC, AC_Hom, AC_Het, AC_Hemi 10
bcftools filter Apply fixed-threshold filters 10
bcftools fixploidy plugin Fix ploidy 10
bcftools gtcheck Check sample identity 10
bcftools impute-info plugin Add imputation information metrics to the INFO field 10
bcftools isec Create intersections, unions and complements of VCF files 11
bcftools List Samples in VCF/BCF file 10
bcftools mendelian plugin Count Mendelian consistent / inconsistent genotypes 10
bcftools merge Merge multiple VCF/BCF files from non-overlapping sample sets to create one multi-sample file 11
bcftools missing2ref plugin Set missing genotypes 10
bcftools mpileup Generate VCF or BCF containing genotype likelihoods for one or multiple alignment (BAM or CRAM) files 11
bcftools norm Left-align and normalize indels; check if REF alleles match the reference; split multiallelic sites into multiple rows; recover multiallelics from multiple rows 13
bcftools query Extracts fields from VCF/BCF file and prints them in user-defined format 11
bcftools reheader Modify header of VCF/BCF files, change sample names 10
bcftools roh HMM model for detecting runs of homo/autozygosity 10
bcftools setGT plugin Sets genotypes 10
bcftools stats Parses VCF or BCF and produces stats which can be plotted using plot-vcfstats 12
bcftools tag2tag plugin Convert between similar tags, such as GL and GP 12
bcftools view VCF/BCF conversion, view, subset and filter VCF/BCF files 14
bed to protein map genomic location of proteins for MVP 2
BED-to-bigBed converter Conversion 2
BED-to-GFF converter Conversion 1
BedCov calculate read depth for a set of genomic intervals 5
bedgraph-to-bigwig converter 1
BedToIntervalList convert coordinate data into picard interval list format 14
bedtools AnnotateBed annotate coverage of features from multiple files 21
bedtools BAM to BED converter 23
bedtools BED to BAM converter 21
bedtools BED to IGV create batch script for taking IGV screenshots 16
bedtools BED12 to BED6 converter 20
bedtools BEDPE to BAM converter 21
bedtools ClosestBed find the closest, potentially non-overlapping interval 21
bedtools ClusterBed cluster overlapping/nearby intervals 20
bedtools ComplementBed Extract intervals not represented by an interval file 21
bedtools ExpandBed replicate lines based on lists of values in columns 20
bedtools FisherBed calculate Fisher statistic between two feature files 20
bedtools FlankBed create new intervals from the flanks of existing intervals 21
bedtools Genome Coverage compute the coverage over an entire genome 20
bedtools GroupByBed group by common cols and summarize other cols 20
bedtools JaccardBed calculate the distribution of relative distances between two files 20
bedtools LinksBed create a HTML page of links to UCSC locations 20
bedtools MakeWindowsBed make interval windows across a genome 21
bedtools MaskFastaBed use intervals to mask sequences from a FASTA file 20
bedtools Merge BedGraph files combines coverage intervals from multiple BEDGRAPH files 20
bedtools MergeBED combine overlapping/nearby intervals into a single interval 20
bedtools MultiCovBed counts coverage from multiple BAMs at specific intervals 20
bedtools Multiple Intersect identifies common intervals among multiple interval files 23
bedtools OverlapBed computes the amount of overlap from two intervals 20
bedtools RandomBed generate random intervals in a genome 21
bedtools ReldistBed calculate the distribution of relative distances 20
bedtools ShuffleBed randomly redistrubute intervals in a genome 21
bedtools SlopBed adjust the size of intervals 22
bedtools SortBED order the intervals 20
bedtools SpacingBed reports the distances between features 17
bedtools SubtractBed remove intervals based on overlaps 20
bedtools TagBed tag BAM alignments based on overlaps with interval files 20
bedtools WindowBed find overlapping intervals within a window around an interval 20
Berokka Trim, circularise, orient and filter long read bacterial genome assemblies 1
bigwig-to-wig converter 1
bigwigCompare normalizes and compares two bigWig files to obtain the ratio, log2ratio or difference between them 16
Bin.seqs Order Sequences by OTU 5
Binary 2 Label Converts Binary to Label Image 2
Biom.info create shared and taxonomy files from biom 2
biom2-to-biom1 converter 1
Bionano Hybrid Scaffold automates the scaffolding process Genome assembly 7
biosed Replace or delete sequence sections 5
biosyntheticSPAdes biosynthetic gene cluster assembly 6
BLAST XML to tabular Convert BLAST XML output to tabular Conversion 18
BlastXML to gapped GFF3 1
blockbuster detects blocks of overlapping reads using a gaussian-distribution approach 3
BlockClust efficient clustering and classification of non-coding RNAs from short read RNA-seq profiles 2
Bowtie2 - map reads against reference genome 21
Boxplot of quality statistics 2
breseq find mutations in haploid microbial genomes DNA mapping Genetic mapping Genome annotation Mapping assembly Protein SNP mapping Sequence word comparison 4
btwisted Calculates the twisting in a B-DNA sequence 5
Build expression matrix for a de novo assembly of RNA-Seq data by Trinity 12
Bundle Collection Download a collection of files 2
Busco assess genome assembly and annotation completeness 15
BWA-MEM2 - map medium and long reads (> 100 bp) against reference genome 1
bwameth Fast and accurate aligner of BS-Seq reads. 6
Cactus whole-genome multiple sequence alignment 7
Cactus: export whole-genome multiple sequence alignment to other formats 9
cai CAI codon adaptation index 5
cai custom CAI codon adaptation index using custom codon usage file 5
Calculate a Heinz score for each node 2
Calculate community metrics calculate community metrics from abundance data 3
Calculate presence absence table calculate presence absence table from observation data 3
Calculate the box parameters using RDKit for an AutoDock Vina job from a ligand or pocket input file (confounding box) 5
Call and phase heterozygous SNPs 4
Call variants with LoFreq 7
CAMERA.annotate CAMERA annotate function. Returns annotation results (isotope peaks, adducts and fragments) and a diffreport if more than one condition. 9
CAMERA.combinexsAnnos Wrapper function for the combinexsAnnos CAMERA function. Returns a dataframe with recalculated annotations. 9
Canu assembler Assembler optimized for long error-prone reads such as PacBio, Oxford Nanopore 4
cap3 Sequence Assembly tool DNA mapping Mapping Mapping assembly Sequence assembly 2
CAT bins annotate with taxonomic classification 2
CAT summarise the number of contigs or bins assigned to each taxonomic name 1
Categorize by collapsing hierarchical data to a specified functional level 2
Categorize Elements satisfying criteria 1
CD-HIT PROTEIN Cluster a protein dataset into representative sequences 3
CD-HIT-EST Cluster a nucleotide dataset into representative sequences 3
CellProfiler run a CellProfiler pipeline 2
Change Case of selected columns 2
chaos Create a chaos game representation plot for a sequence 5
charge Protein charge plot 5
Check Format Checking/formatting the sample and variable names of the dataMatrix, sampleMetadata, and variableMetadata files 2
checktrans Reports STOP codons and ORF statistics of a protein 5
ChEMBL structure pipeline for curation and standardizing of molecular structures 2
Chimera.bellerophon Find putative chimeras using bellerophon 5
Chimera.ccode Find putative chimeras using ccode 6
Chimera.check Find putative chimeras using chimeraCheck 5
Chimera.perseus Find putative chimeras using chimeraCheck 6
Chimera.pintail Find putative chimeras using pintail 5
Chimera.slayer Find putative chimeras using slayer 5
Chimera.uchime Find putative chimeras using uchime 5
Chimera.vsearch find potential chimeric sequences using vsearch 4
chips Codon usage statistics 5
ChIPseeker for ChIP peak annotation and visualization 6
Chop.seqs Trim sequences to a specified length 6
Chromeister ultra-fast pairwise genome comparisons 5
CIRCexplorer 2
Circos visualizes data in a circular layout Visualisation 11
Circos Builder creates circos plots from standard bioinformatics datatypes. 2
Circos: Alignments to links reformats alignment files to prepare for Circos Parsing 10
Circos: bigWig to Scatter reformats bigWig files to prepare for Circos 2d scatter/line/histogram plots Conversion 10
Circos: Bundle Links reduce numbers of links in datasets before plotting Aggregation 10
Circos: Interval to Circos Text Labels reformats interval files to prepare for Circos text labels Conversion 10
Circos: Interval to Tiles reformats interval files to prepare for Circos tile plots Conversion 10
Circos: Link Density Track reduce links to a density plot Aggregation 10
Circos: Resample 1/2D data reduce numbers of points in a dataset before plotting Aggregation 10
Circos: Stack bigWigs as Histogram reformats for use in Circos stacked histogram plots Formatting 6
Circos: Table viewer easily creates circos plots from tabular data Visualisation 10
cirdna Draws circular maps of DNA constructs 5
Classify.otu Assign sequences to taxonomy 5
Classify.rf description 2
Classify.seqs Assign sequences to taxonomy 5
Classify.tree Get a consensus taxonomy for each node on a tree 5
CleanSam perform SAM/BAM grooming 14
Clearcut Generate a tree using relaxed neighbor joining 5
CLIFinder Find chimerics transcripts containing LINEs sequences 1
climate stripes from timeseries Visualisation 3
clinker Automatic creation of publication-ready gene cluster comparison figures 1
Clip adapter sequences 6
Clipping Profile estimates clipping profile of RNA-seq reads from BAM or SAM file 10
ClustalW multiple sequence alignment program for DNA or proteins 5
Cluster Assign sequences to OTUs (Operational Taxonomic Unit) 4
Cluster the intervals of a dataset Sequence clustering Sequence merging 5
Cluster Inspection using RaceID examines gene expression within clusters 6
Cluster, infer trajectories and embed with scanpy 7
Cluster.classic Assign sequences to OTUs (Dotur implementation) 5
Cluster.fragments Group sequences that are part of a larger sequence 5
Cluster.split Assign sequences to OTUs and split large matrices 5
Clustering using RaceID performs clustering, outlier detection, dimensional reduction 6
cmalign Align sequences to a covariance model against a sequence database 3
cmbuild Build covariance models from sequence alignments 4
cml-to-mol2 converter 1
cml-to-smi converter 2
cmpress Prepare a covariance model database for cmscan 2
cmscan Search sequences against collections of covariance models 2
cmsearch Search covariance model(s) against a sequence database 3
cmstat Summary statistics for covariance model 3
codcmp Codon usage table comparison 5
coderet Extract CDS, mRNA and translations from feature tables 5
Codon Variants Identifies codon variants and non-synonymous/synonymous mutations 2
Collapse sequences 3
Collapse Collection into single dataset in order of the collection 5
Collect Alignment Summary Metrics writes a file containing summary alignment metrics 14
Collect.shared Generate collector's curves for calculators on OTUs 5
Collect.single Generate collector's curves for OTUs 6
CollectBaseDistributionByCycle charts the nucleotide distribution per cycle in a SAM or BAM dataset 14
CollectGcBiasMetrics charts the GC bias metrics 14
CollectHsMetrics compute metrics about datasets generated through hybrid-selection (e.g. exome) 4
CollectInsertSizeMetrics plots distribution of insert sizes 14
Collector’s curve of sequencing yield over time 2
CollectRnaSeqMetrics collect metrics about the alignment of RNA to various functional classes of loci in the genome 15
CollectWgsMetrics compute metrics for evaluating of whole genome sequencing experiments 14
ColorToGray converts color and channel-stacked images to grayscale 2
Column arrange by header name 2
Column Join on Collections 5
Column Regex Find And Replace 6
Combine FASTA and QUAL into FASTQ Aggregation 5
Combine MetaPhlAn2 and HUMAnN2 outputs to relate genus/species abundances and gene families/pathways abundances 3
Compare BIOM tables Compare the accuracy of biom files (expected and observed) either by observations (default) or by samples. 3
Compare two Datasets to find common or distinct rows Filtering 1
Complement intervals of a dataset Sequence coordinate conversion 6
Complexity BAM 2
Complexity FASTA 2
Compound conversion - interconvert between various chemistry and molecular modeling data files 6
Compress file(s) 1
compseq Count composition of dimer/trimer/etc words in a sequence 5
Compute on rows 10
Compute both the depth and breadth of coverage of features in file B on the features in file A (bedtools coverage) 21
Compute contig Ex90N50 statistic and Ex90 transcript count from a Trinity assembly 10
Compute GLM on community data Compute a GLM of your choice on community data 2
Compute GLM on population data Compute a GLM of your choice on population data 2
Compute Motif Frequencies in indel flanking regions 2
Compute Motif Frequencies For All Motifs motif by motif 2
Compute P-values and Correlation Coefficients for Feature Occurrences between two datasets using Discrete Wavelet Transfoms 2
Compute P-values and Correlation Coefficients for Occurrences of Two Set of Features between two datasets using Discrete Wavelet Transfoms 2
Compute P-values and Max Variances for Feature Occurrences in one dataset using Discrete Wavelet Transfoms 2
Compute P-values and Second Moments for Feature Occurrences between two datasets using Discrete Wavelet Transfoms 2
Compute quality statistics 3
Compute sequence length 6
computeGCBias Determine the GC bias of your sequenced reads 17
computeMatrix prepares data for plotting a heatmap or a profile of given regions 18
computeMatrixOperations Modify or combine the output of computeMatrix in a variety of ways. 11
Concatenate FASTA alignment by species 3
Concatenate two BED files Aggregation 6
Concatenate datasets tail-to-head Aggregation 1
Concatenate datasets tail-to-head (cat) 11
Concatenate multiple datasets tail-to-head 1
Concatenate multiple datasets tail-to-head 4
Condense consecutive characters 2
Consensus Sequence Generate a consensus sequence from a BAM file 2
Consensus.seqs Find a consensus sequence for each OTU or phylotype 6
ConsensusID Computes a consensus of peptide identifications of several identification engines. 6
Construct Expression Set Object Create an ExpressionSet object from tabular and textual data 5
Convert delimiters to TAB 1
Convert between BIOM table formats 9
Convert BED to Feature Location Index 1
Convert BED to GFF 2
Convert FASTA to 2bit 2
Convert FASTA to Bowtie base space Index 2
Convert FASTA to Bowtie color space Index 2
Convert FASTA to len file 2
Convert FASTA to Tabular 1
Convert from BAM to FastQ 13
Convert genome coordinates between assemblies and genomes 2
Convert Genomic Intervals To BED 1
Convert Genomic Intervals To Coverage 2
Convert Genomic Intervals To Strict BED 2
Convert Genomic Intervals To Strict BED12 1
Convert Genomic Intervals To Strict BED6 2
Convert GFF to BED 2
Convert GFF to Feature Location Index 1
Convert GFF3 to prot_table for TRANSIT 4
Convert gffCompare annotated GTF to BED for StringTie results 4
Convert GTF to BED12 1
Convert image Convert image 2
Convert Kraken data to Galaxy taxonomy representation 3
Convert Len file to Linecount 2
Convert lped to fped 2
Convert lped to plink pbed 2
Convert MAF to Fasta 2
Convert MAF to Genomic Intervals 2
Convert Picard Interval List to BED6 converter 2
Convert plink pbed to ld reduced format 2
Convert plink pbed to linkage lped 2
Convert SAM to interval 2
Convert VCF to MAF with vcf2maf 3
ConvertObjectsToImage convert the identified objects into an image 2
Cooccurrence tests whether presence-absence patterns differ from chance 6
Copernicus Climate Data Store for retrieveing climate data Data retrieval Format detection Formatting 1
coronaSPAdes SARS-CoV-2 de novo genome assembler 6
Corr.axes correlation of data to axes 5
correctGCBias uses the output from computeGCBias to generate GC-corrected BAM/CRAM files 16
Correlation for numeric columns 1
Cosine Content - measure the cosine content of the PCA projection 4
Count occurrences of each record 2
Count GFF Features 2
Count Objects in labled images 1
Count.groups counts the number of sequences represented by a specific group or set of groups 5
Count.seqs (aka make.table) counts the number of sequences represented by the representative 5
Coverage of a set of intervals on second set of intervals Comparison Filtering 5
cpgplot Plot CpG rich areas 5
cpgreport Reports all CpG rich regions 5
cram-to-bam converter 2
Create a deep learning model architecture using Keras 6
Create a genus level gene families file 4
Create a plot from GLM data as temporal trend 1
Create assemblies with Unicycler pipeline for bacterial genomes Genome assembly 10
Create deep learning model with an optimizer, loss function and fit parameters 6
Create Frankenstein ligand for docking active site definition 3
Create InterMine Interchange Dataset 1
Create nested list based on filenames and batch sizes 2
Create single interval as a new dataset 1
Create text file with recurring lines 9
Create.database creates a database file from a list, repnames, repfasta and contaxonomy file 5
csv-to-tabular converter 1
CTSM/FATES-EMERALD Functionally Assembled Terrestrial Ecosystem Simulator Ecological modelling Modelling and simulation 2
Cuffcompare compare assembled transcripts to a reference annotation and track Cufflinks transcripts across multiple experiments 8
Cufflinks transcript assembly and FPKM (RPKM) estimates for RNA-Seq data 8
Cuffmerge merge together several Cufflinks assemblies 8
Cuffnorm Create normalized expression levels 7
Cuffquant Precompute gene expression levels 4
cummeRbund visualize Cuffdiff output 3
cusp Create a codon usage table 5
CustomProDB Generate protein FASTAs from exosome or transcriptome data 2
Cut columns from a table Filtering 2
Cut columns from a table (cut) 9
cut_francais keep or remove selected column 1
Cutadapt Remove adapter sequences from FASTQ/FASTA Sequence editing 22
cuteSV detects long-read-based SVs 1
cutseq Removes a specified section from a sequence 5
dada2: assignTaxonomy and addSpecies Learn Error rates 6
dada2: dada Remove sequencing errors 6
dada2: filterAndTrim Filter and trim short read data 6
dada2: learnErrors Learn Error rates 6
dada2: makeSequenceTable construct a sequence table (analogous to OTU table) 6
dada2: mergePairs Merge denoised forward and reverse reads 6
dada2: plotComplexity Plot sequence complexity profile 6
dada2: plotQualityProfile plot a visual summary of the quality scores 6
dada2: removeBimeraDenovo Remove bimeras from collections of unique sequences 6
dada2: sequence counts 6
dan Calculates DNA RNA/DNA melting temperature 5
Data Fetch Query and retrieval 1
Datamash (operations on tabular data) 5
DAVID functional annotation for a list of genes 1
dbnsfp.tabular-to-snpsiftdbnsfp converter 3
DecoyDatabase Create decoy sequence database from forward sequence database. 5
Deep learning training and evaluation conduct deep training and evaluation either implicitly or explicitly 4
Degap.seqs Remove gap characters from sequences 5
degapseq Removes gap characters from sequences 5
Deletion Profile calculates the distributions of deleted nucleotides across reads 6
Delta-Filter Filters alignment (delta) file from nucmer Read mapping Sequence alignment 5
Describe samples and replicates 11
descseq Alter the name or description of a sequence 5
DESeq2 Determines differentially expressed features from count tables Differential gene expression profiling RNA-Seq quantification 22
Determine_batch_correction to choose between linear, lowess and loess methods 4
Deunique.seqs Return all sequences 5
Deunique.tree Reinsert the redundant sequence identiers back into a unique tree. 5
DEXSeq Determines differential exon usage from count tables 10
DEXSeq-Count Prepare and count exon abundancies from RNA-seq data 10
DIA_Umpire_SE DIA signal extraction 1
Diamond alignment tool for short sequences against a protein database Sequence alignment analysis 9
Diamond makedb Build database from a FASTA file Sequence alignment analysis 8
Diamond view generate formatted output from DAA files Sequence alignment analysis 6
diapysef library generation generates spectral library for DIA analysis 1
DiffBind differential binding analysis of ChIP-Seq peak data 12
Differential expression analysis using a Trinity assembly 11
diffseq Find differences between nearly identical sequences 5
digest Protein proteolytic enzyme or reagent cleavage digest 5
DisplayDataOnImage produce an image with data on top of identified objects 2
Dist.seqs calculate uncorrected pairwise distances between aligned sequences 5
Dist.shared Generate a phylip-formatted dissimilarity distance matrix among multiple groups 5
DNAdiff Evaluate similarities/differences between two sequences Read mapping Sequence alignment 5
dNdS Report Calculate the dN/dS value for each region in a bed file 2
dotmatcher Displays a thresholded dotplot of two sequences 5
dotpath Non-overlapping wordmatch dotplot of two sequences 5
dottup Displays a wordmatch dotplot of two sequences 5
Download and Extract Reads in BAM format from NCBI SRA Data retrieval Formatting 24
Download and Extract Reads in FASTQ format from NCBI SRA Data retrieval Formatting 20
Downsample SAM/BAM Downsample a file to retain a subset of the reads 14
dpocket to calculate descriptors for protein pockets 1
Draw nucleotides distribution chart 4
Draw quality score boxplot 4
Draw ROC plot on "Perform LDA" output 2
Draw Stacked Bar Plots for different categories and different criteria 1
dreg Regular expression search of a nucleotide sequence 6
DropletUtils Utilities for handling droplet-based single-cell RNA-seq data DNA barcoding Parsing 9
Drug Resistance Mutations 2
edgeR Perform differential expression of count data Differential gene expression profiling RNA-seq read count analysis 10
edgeR-repenrich Determines differentially expressed features from RepEnrich counts 1
eggNOG Mapper functional sequence annotation by orthology 10
EGSEA easy and efficient ensemble gene set testing 4
einverted Finds DNA inverted repeats 5
ENA Upload tool 9
EncyclopeDIA Library Searching Directly from Data-Independent Acquisition (DIA) MS/MS Data 3
EncyclopeDIA Quantify samples from Data-Independent Acquisition (DIA) MS/MS Data 3
EnhanceOrSuppressFeatures to improve subsequent identification of objects 2
Ensemble methods for classification and regression 12
Enumerate changes calculated with Dimorphite DL and RDKit 2
epestfind Finds PEST motifs as potential proteolytic cleavage sites 5
equicktandem Finds tandem repeats 5
est2genome Align EST and genomic DNA sequences 5
EstimateLibraryComplexity assess sequence library complexity from read sequences 14
Estimator attributes get important attributes from an estimator or scikit object 9
etandem Looks for tandem repeats in a nucleotide sequence 5
Exonerate pairwise sequence comparison 4
Export datasets to remote files source 1
Export to GraPhlAn 2
ExportToSpreadsheet export measurements into one or more files 2
eXpress Quantify the abundances of a set of target sequences from sampled subsequences 2
Extract and cluster differentially expressed transcripts from a Trinity assembly 12
Extract element identifiers of a list collection 2
Extract energy components with GROMACS 7
Extract FASTQ in tabular format from a set of FAST5 files 2
Extract features from GFF data 1
Extract MAF blocks given a set of genomic intervals 4
Extract MAF by block number given a set of block numbers and a MAF file 1
Extract nanopore events from a set of sequencing reads 2
Extract Pairwise MAF blocks given a set of genomic intervals 1
Extract reads in FASTA or FASTQ format from nanopore files 2
Extract time and channel information from a set of FAST5 files 2
Extract values from an SD-file into a tabular file using RDKit 4
extractfeat Extract features from a sequence 5
extractseq Extract regions from a sequence 5
FalseDiscoveryRate Estimates the false discovery rate on peptide and protein level using decoy searches. 7
faSplit Split a FASTA file 3
Fasta Extract Sequence Extract a single sequence from a fasta file. 2
FASTA Merge Files and Filter Unique Sequences Concatenate FASTA database files together 4
Fasta Statistics Display summary statistics for a fasta file. 5
FASTA Width formatter 3
fasta-to-fai converter 1
FASTA-to-Tabular converter 3
FastaCLI Appends decoy sequences to FASTA files 6
Faster Download and Extract Reads in FASTQ format from NCBI SRA Data retrieval Formatting 15
fastp - fast all-in-one preprocessing for FASTQ files 8
FASTQ de-interlacer on paired end reads Splitting 5
FASTQ Groomer convert between various FASTQ quality formats Sequence conversion 5
FASTQ interlacer on paired end reads Aggregation 6
FASTQ joiner on paired end reads Aggregation 7
FASTQ Masker by quality score Sequence masking 5
FASTQ Quality Trimmer by sliding window 5
FASTQ splitter on joined paired end reads Splitting 6
FASTQ Summary Statistics by column Sequence assembly validation 4
FASTQ to FASTA converter from FASTX-toolkit Sequence conversion 6
FASTQ to FASTA converter Conversion 5
FASTQ to Tabular converter Sequence conversion 5
FASTQ Trimmer by column Sequence trimming 5
Fastq.info Convert fastq to fasta and quality 6
FastQC Read Quality reports Sequence composition calculation Sequencing quality control Statistical calculation 18
FASTQE visualize fastq files with emoji's 🧬😎 5
fastqillumina-to-fqtoc converter 1
FastqToSam convert Fastq data into unaligned BAM 19
FASTTREE build maximum-likelihood phylogenetic trees Phylogenetic analysis Phylogenetic inference (from molecular sequences) 4
featureCounts Measure gene expression in RNA-Seq experiments from SAM or BAM files. Sequence assembly 25
FeatureFinderMultiplex Determination of peak ratios in LC-MS data 7
Fetch closest non-overlapping feature for every interval Filtering 4
fgsea - fast preranked gene set enrichment analysis 3
FidoAdapter Runs the protein inference engine Fido. 6
FileConverter Converts between different MS file formats. 5
FileFilter Extracts or manipulates portions of data from peak, feature or consensus-feature files. 5
FileInfo Shows basic information about the file, such as data ranges and file type. 5
FileMerger Merges several MS files into one file. 5
Filter data on any column using simple expressions Formatting 2
Filter with scanpy 7
Filter and merge chimeric reads from Arima Genomics 1
Filter BAM datasets on a variety of attributes 6
Filter by quality Filtering 6
Filter Combined Transcripts using tracking file 1
Filter FASTA on the headers and/or the sequences 5
Filter FASTQ reads by quality score and length Filtering 4
Filter GFF data by attribute using simple expressions 2
Filter GFF data by feature count using simple expressions 1
Filter GTF data by attribute values_list 2
Filter Image applies a standard filter to an image 2
Filter low expression transcripts from a Trinity assembly 11
Filter MAF by specified attributes 1
Filter MAF blocks by Species 1
Filter MAF blocks by Size 1
Filter pileup on coverage and SNPs 3
Filter SAM on bitwise flag values 1
Filter SAM or BAM, output SAM or BAM files on FLAG MAPQ RG LN or by region 6
Filter segmentation Filter segmentation by rules 1
Filter sequences by ID from a tabular file 5
Filter sequences by length Filtering 6
Filter Tabular 5
Filter with SortMeRNA of ribosomal RNAs in metatranscriptomic data 6
Filter.seqs removes columns from alignments 5
Filter.shared remove OTUs based on various critieria 6
FilterSamReads include or exclude aligned and unaligned reads and read lists 14
filtlong Filtering long reads by quality 3
Finds SNP sites from a multi-FASTA alignment file 1
Fit a BUM model with p-values 2
FixMateInformation ensure that all mate-pair information is in sync between each read and it's mate pair 14
FlashLFQ ultrafast label-free quantification for mass-spectrometry proteomics 5
Flye de novo assembler for single molecule sequencing reads Genome assembly 8
Format tree and trait tables 2
Format MetaPhlAn2 output for Krona 2
FPKM Count calculates raw read count, FPM, and FPKM for each gene 8
fpocket - find potential binding sites in protein structures 4
FragGeneScan for finding (fragmented) genes in short reads 2
freak Residue/base frequency table or plot 5
FreeBayes bayesian genetic variant detector 12
Funannotate assembly clean 5
Funannotate compare annotations 5
Funannotate functional annotation 5
Funannotate predict annotation 6
fuzznuc Nucleic acid pattern search 6
fuzzpro Protein pattern search 5
fuzztran Protein pattern search after translation 5
g:Profiler tools for functional profiling of gene lists 1
garnier Predicts protein secondary structure 5
Gblocks Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis Sequence masking 3
GC Skew calculates skew over genomic sequences Nucleic acid property calculation 10
Gecko Ungapped genome comparison 3
geecee Calculates fractional GC content of nucleic acid sequences 5
GEMINI actionable_mutations Retrieve genes with actionable somatic mutations via COSMIC and DGIdb 4
GEMINI amend Amend an already loaded GEMINI database. 5
GEMINI annotate the variants in an existing GEMINI database with additional information 7
GEMINI autosomal recessive/dominant Find variants meeting an autosomal recessive/dominant model 2
GEMINI burden perform sample-wise gene-level burden calculations 4
GEMINI comp_hets Identifying potential compound heterozygotes 2
GEMINI database info Retrieve information about tables, columns and annotation data stored in a GEMINI database 4
GEMINI de_novo Identifying potential de novo mutations 2
GEMINI dump Extract data from the Gemini DB 2
GEMINI fusions Identify somatic fusion genes from a GEMINI database 5
GEMINI gene_wise Custom genotype filtering by gene 5
GEMINI inheritance pattern based identification of candidate genes 2
GEMINI interactions Find genes among variants that are interacting partners 4
GEMINI load Loading a VCF file into GEMINI 7
GEMINI lof_sieve Filter LoF variants by transcript position and type 4
GEMINI mendel_errors Identify candidate violations of Mendelian inheritance 2
GEMINI pathways Map genes and variants to KEGG pathways 5
GEMINI qc Quality control tool 5
GEMINI query Querying the GEMINI database 5
GEMINI region Extracting variants from specific regions or genes 2
GEMINI roh Identifying runs of homozygosity 5
GEMINI set_somatic Tag somatic mutations in a GEMINI database 5
GEMINI stats Compute useful variant statistics 4
GEMINI windower Conducting analyses on genome "windows" 4
Genbank to GFF3 converter 2
Genbank-to-Fasta produces a Fasta file from a genbank file 1
genbank-to-genbank.gz converter 1
genbank.gz-to-genbank converter 1
Gene BED To Exon/Intron/Codon BED expander 1
Gene Body Coverage (BAM) Read coverage over gene body. 13
Gene Body Coverage (Bigwig) Read coverage over gene body 12
Gene length and GC content from GTF and FASTA file 4
Generalized linear models for classification and regression 11
Generate A Matrix for using PC and LDA 1
Generate box-whisker plot of quality score distribution over positions in nanopore reads 2
Generate count matrix from individual files 1
Generate gene to transcript map for Trinity assembly 10
Generate histogram of nanopore read lengths 2
Generate MD topologies for small molecules using acpype 9
Generate pileup from BAM dataset 3
Generation, personalization and annotation of tree for GraPhlAn 2
Generic_Filter Removes elements according to numerical or qualitative values 3
Genome annotation statistics 1
GenomeScope reference-free genome profiling 3
Genrich Detecting sites of genomic enrichment 3
Get flanks returns flanking region/s for every gene Sequence analysis 4
Get longest read from a set of FAST5 files. 2
Get open reading frames (ORFs) or coding sequences (CDSs) e.g. to get peptides from ESTs 5
Get organelle from reads 1
Get PDB file from Protein Data Bank 2
Get.communitytype description 5
Get.coremicrobiome fraction of OTUs for samples or abundances 6
Get.dists selects distances from a phylip or column file 5
Get.group group names from shared or from list and group 6
Get.groups Select groups 5
Get.label label names from list, sabund, or rabund file 5
Get.lineage Picks by taxon 6
Get.mimarkspackage creates a mimarks package form with your groups 5
Get.otulabels Selects OTU labels 6
Get.otulist Get otus for each distance in a otu list 5
Get.oturep Generate a fasta with a representative sequence for each OTU 5
Get.otus Get otus containing sequences from specified groups 5
Get.rabund Get rabund from a otu list or sabund 6
Get.relabund Calculate the relative abundance of each otu 5
Get.sabund Get sabund from a otu list or rabund 5
Get.seqs Picks sequences by name 5
Get.sharedseqs Get shared sequences at each distance from list and group 5
GetFastaBed use intervals to extract sequences from a FASTA file 20
getorf Finds and extracts open reading frames (ORFs) 5
GFA to FASTA Convert Graphical Fragment Assembly files to FASTA format 4
gfastats the swiss army knife for genome assembly 8
GFF-to-BED converter Conversion 1
gff3.bz2-to-gff3 converter 1
GffCompare compare assembled transcripts to a reference annotation 5
gffread Filters and/or converts GFF3/GTF2 records 7
GMAJ Multiple Alignment Viewer 1
GOEnrichment performs GO enrichment analysis of a set of gene products 2
goseq tests for overrepresented gene categories Gene-set enrichment analysis 12
GOSlimmer converts a set of annotation from GO to a given GOSlim version 2
GraPhlAn to produce graphical output of an input tree 2
GrayToColor take grayscale images and produces a color image from them 2
gro-to-pdb converter 1
GROMACS energy minimization of the system prior to equilibration and production MD 7
GROMACS initial setup of topology and GRO structure file 6
GROMACS simulation for system equilibration or data collection 9
GROMACS solvation and adding ions to structure and topology files 8
GROMACS structure configuration using editconf 5
Group data by a column and perform aggregate operation on other columns. 4
GTDB-Tk Classify genomes by placement in GTDB reference tree 1
GTF-to-BEDGraph converter Conversion 1
gtf-to-interval_index converter 1
GTF2GeneList extracts a complete annotation table or subsets thereof from an Ensembl GTF using rtracklayer 6
Gubbins Recombination detection in Bacteria 2
Gumbel - determine essential genes 6
Hapcut2 - haplotype assembly for diploid organisms 1
Hcluster Assign sequences to OTUs (Operational Taxonomic Unit) 2
Heatmap w ggplot 6
Heatmap.bin Generate a heatmap for OTUs 5
Heatmap.sim Generate a heatmap for pariwise similarity 5
heatmap2 6
helixturnhelix Report nucleic acid binding motifs 5
Hexamer frequency calculates hexamer (6mer) frequency for reads, genomes, and mRNA sequences 6
hicBuildMatrix create a contact matrix 18
hicFindTADs identify TAD boundaries by computing the degree of separation of each Hi-C matrix bin 16
hicMergeMatrixBins merge adjacent bins from a Hi-C contact matrix to reduce its resolution 15
hicPCA compute the principal components for A / B compartment analysis 12
hicPlotMatrix plot a Hi-C contact matrix heatmap 16
HiFi Adapter Filter Remove CCS reads with remnant PacBio adapter sequences 1
Hifiasm haplotype-resolved de novo assembler for PacBio Hifi reads 11
Hifiasm_meta for metagenome assembly using Hifi reads 2
HighResPrecursorMassCorrector Corrects the precursor mass and charge determined by the instrument software. 5
HISAT2 A fast and sensitive alignment program 19
Histogram of a numeric column 4
Histogram equalization automatic histogram equalization 1
HMDB MS search search by masses on HMDB online LCMS bank 6
hmmalign align sequences to a profile HMM 7
hmmbuild Build a profile HMM from an input multiple alignment 8
hmmconvert convert profile file to a HMMER format 7
hmmemit sample sequence(s) from a profile HMM 7
hmmfetch retrieve profile HMM(s) from a file 7
hmmscan search sequence(s) against a profile database 8
hmmsearch search profile(s) against a sequence database 8
hmoment Hydrophobic moment calculation 5
Homova Homogeneity of molecular variance 5
htseq-count - Count aligned reads in a BAM file that overlap features in a GFF file 9
HUMAnN to profile presence/absence and abundance of microbial pathways and gene families Nucleic acid sequence analysis Phylogenetic analysis 2
HUMAnN2 to profile presence/absence and abundance of microbial pathways and gene families 5
Hydra pipeline Identifies drug resistance within an NGS dataset 2
Hydrogen Bond Analysis using VMD between two segments of a trajectory 1
Hyperparameter Search performs hyperparameter optimization using various SearchCVs 11
HyPhy-aBSREL adaptive Branch Site Random Effects Likelihood 27
HyPhy-GARD Genetic Algorithm for Recombination Detection 26
IDConflictResolver Resolves ambiguous annotations of features with peptide identifications 5
Identification Parameters Sets the identification parameters to be used in SearchGUI and PeptideShaker apps 6
Identify optimal scoring subnetwork using Heinz 2
IdentifyPrimaryObjects identify biological objects of interest 2
IDFilter Filters results from protein or peptide identification engines based on different criteria. 6
IDMapper Assigns protein/peptide identifications to features or consensus features. 5
IDMerger Merges several protein/peptide identification files into one file. 6
IDPosteriorErrorProbability Estimates probabilities for incorrectly assigned peptide sequences and a set of search engine scores using a mixture model. 5
IDScoreSwitcher Switches between different scores of peptide or protein hits in identification data 6
IdxStats reports stats of the BAM index file 8
iep Calculates the isoelectric point of a protein 5
Illuminapairedend - Assembling pair-end reads Construct consensus reads from Illumina pair-end reads 3
Image Info Show Image Info 2
ImageMath perform simple mathematical operations on images 2
Import Anndata and loom from different format 9
inchi-to-mol converter 2
Indicator Identify indicator "species" for nodes on a tree 5
Infer Experiment speculates how RNA-seq were configured 12
infoseq Displays some simple information about sequences 5
Inner Distance calculate the inner distance (or insert size) between two paired RNA reads 12
Insert indel qualities into a BAM file 4
Insertion Profile calculates the distribution of inserted nucleotides across reads 6
Inspect and manipulate with scanpy 7
Inspect AnnData object 10
Inspect Expression Set Object Inspect an ExpressionSet object by a variety of attributes 5
Integron Finder is a program that detects integrons in DNA sequences Genome annotation Nucleic acid sequence feature detection Protein sequence feature detection Sequence motif recognition 2
InterProphet Combine Peptide Prophet results from multiple search engines 2
InterProScan functional annotation 9
Intersect the intervals of two datasets Filtering 6
Intersect multiple VCF datasets 2
Intersect intervals find overlapping intervals in various ways 22
IQ-TREE Phylogenomic / evolutionary tree construction from multiple sequences 7
ISEScan Insertion Sequence Elements detection in prokaryotic genomes 1
isochore Plots isochores in large DNA sequences 5
ivar consensus Call consensus from aligned BAM file 9
ivar filtervariants Filter variants across replicates or multiple samples aligned using the same reference 7
ivar getmasked Detect primer mismatches and get primer indices for the amplicon to be masked 4
ivar removereads Remove reads from trimmed BAM file 11
ivar trim Trim reads in aligned BAM 13
ivar variants Call variants from aligned BAM file 9
IWTomics Load Smooth and Plot 1
IWTomics Plot with Threshold on Test Scale 1
IWTomics Test and Plot 1
jackhmmer iteratively search a protein sequence against a protein database (PSIBLAST-like) 8
JBrowse genome browser Map drawing Sequence visualisation 31
JBrowse - Data Directory to Standalone upgrades the bare data directory to a full JBrowse instance Conversion 30
jellyfish 1
Join the intervals of two datasets side-by-side Aggregation 5
Join two files 12
Join +/- Ions Join positive and negative ionization-mode W4M datasets for the same samples 1
Join MAF blocks by Species 1
Join two Datasets side by side on a specified field 4
Join two files on column allowing a small difference 2
JQ process JSON 1
Junction Annotation compares detected splice junctions to reference gene model 12
Junction Saturation detects splice junctions from each subset and compares them to reference gene model 12
Kallisto pseudo - run pseudoalignment on RNA-Seq transcripts 6
Kallisto quant - quantify abundances of RNA-Seq transcripts 8
khmer: Abundance Distribution Calculate abundance distribution of k-mers using pre-made k-mer countgraphs 4
khmer: Abundance Distribution (all-in-one) Calculate abundance distribution of k-mers 4
khmer: Count Median Count the median/avg k-mer abundance for each sequence 4
khmer: Extract partitions Separate sequences that are annotated with partitions into grouped files 4
khmer: Filter reads by minimal k-mer abundance 4
khmer: Filter reads below k-mer abundance of 50 4
khmer: Normalize By Median Filter reads using digital normalization via k-mer abundances Sequence file editing Sequence word comparison 5
khmer: Sequence partition all-in-one Load, partition, and annotate sequences 4
Kraken assign taxonomic labels to sequencing reads 8
Kraken-filter filter classification by confidence score 4
Kraken-mpa-report view report of classification for multiple samples 5
Kraken-report view sample report of a classification 5
Kraken-translate convert taxonomy IDs to names 4
Kraken2 assign taxonomic labels to sequencing reads 5
Krona pie chart from taxonomic profile 10
LASTZ : align long sequences 7
Lastz paired reads map short paired reads against reference sequence 2
LASTZ_D : estimate substitution scores matrix 5
LAV to BED Converts a LAV formatted file to BED format 1
LCMS matching Annotation of LCMS peaks using matching on a in-house spectra database or on PeakForest spectra database. 3
LD linkage disequilibrium and tag SNPs 1
Lefse description 5
Length Distribution chart 2
Libshuff Cramer-von Mises tests communities for the same structure 5
limma Perform differential expression with limma-voom or limma-trend Differential gene expression profiling RNA-seq read count analysis 19
lindna Draws linear maps of DNA constructs 5
Line/Word/Character count of a dataset 1
Lineage Branch Analysis using StemID inspects branches of a lineage tree 6
Lineage computation using StemID generates lineage from prior clustering 6
LINKS - scaffold genome assemblies with long reads 1
List.otulabels Lists otu labels from shared or relabund file 5
List.seqs Lists the names (accnos) of the sequences 5
Lofreq filter called variants posteriorly 4
Machine Learning Visualization Extension includes several types of plotting for machine learning 6
MACS2 bdgbroadcall Call broad peaks from bedGraph output 9
MACS2 bdgcmp Deduct noise by comparing two signal tracks in bedGraph 9
MACS2 bdgdiff Differential peak detection based on paired four bedgraph files 10
MACS2 bdgpeakcall Call peaks from bedGraph output 9
MACS2 callpeak Call peaks from alignment results 12
MACS2 filterdup Remove duplicate reads at the same position 9
MACS2 predictd Predict 'd' or fragment size from alignment results 9
MACS2 randsample Randomly sample number or percentage of total reads 9
MACS2 refinepeak Refine peak summits and give scores measuring balance of forward- backward tags (Experimental) 9
MACS2.1.1 Model-based Analysis of ChIP-Seq: peak calling 2
MAF Coverage Stats Alignment coverage information 3
MAF to BED Converts a MAF formatted file to the BED format 1
MAF to FASTA Converts a MAF formatted file to FASTA format 1
MAF to Interval Converts a MAF formatted file to the Interval format 1
MAFFT Multiple alignment program for amino acid or nucleotide sequences 9
MAFFT add Align a sequence,alignment or fragments to an existing alignment. 6
MAGeCK count - collect sgRNA read counts from read mapping files 8
MAGeCK GSEA - a fast implementation of Gene Set Enrichment Analysis 4
MAGeCK mle - perform maximum-likelihood estimation of gene essentiality scores 5
MAGeCK pathway - given a ranked gene list, test whether one pathway is enriched 4
MAGeCKs test - given a table of read counts, perform the sgRNA and gene ranking 5
Make Design Assign groups to Sets 5
Make.biom Make biom files from a shared file 5
Make.contigs Aligns paired forward and reverse fastq files to contigs as fasta and quality 6
Make.fastq Convert fasta and quality to fastq 6
Make.group Make a group file 6
Make.lefse create a lefse formatted input file from mothur's output files 6
Make.lookup allows you to create custom lookup files for use with shhh.flows 5
Make.shared Make a shared file from a list and a group 5
Make.sra creates the necessary files for a NCBI submission 5
Maker genome annotation pipeline 10
MALDIquant peak detection Peak detection, binning and filtering for mass-spectrometry imaging data 7
MALDIquant preprocessing Preprocessing of mass-spectrometry imaging data 4
Manipulate AnnData object 10
Manipulate FASTQ reads on various attributes Sequence conversion 4
Mantel Mantel correlation coefficient between two matrices. 5
Map annotation ids on a Maker annotation 5
Map with Bowtie for Illumina 7
Map with BWA - map short reads (< 100 bp) against reference genome 12
Map with BWA-MEM - map medium and long reads (> 100 bp) against reference genome DNA mapping Genetic mapping Genome annotation Mapping Mapping assembly Protein SNP mapping Sequence assembly Sequence tag mapping 14
Map with minimap2 A fast pairwise aligner for genomic and spliced nucleotide sequences Sequence alignment 18
MapBed apply a function to a column for each overlapping interval 21
MarkDuplicates examine aligned records in BAM datasets to locate duplicate molecules 17
MarkDuplicatesWithMateCigar examine aligned records in BAM datasets to locate duplicate molecules 16
marscan Finds MAR/SAR sites in nucleic sequences 5
mash screen determines how well query sequences are contained within a pool of sequences 3
mash sketch Create a reduced representation of a sequence or set of sequences, based on min-hashes 1
maskfeat Mask off features of a sequence 5
MaskImage hide portions of an image based on previously identified objects 2
maskseq Mask off regions of a sequence 5
MasterVar to pgSnp Convert from MasterVar to pgSnp format 1
MaSuRCA simple The MaSuRCA (Maryland Super Read Cabog Assembler) genome assembly and analysis toolkit without config 1
matcher Finds the best local alignments between two sequences 5
Max SuCOS score - determine maximum SuCOS score of ligands against clustered fragment hits 7
MaxBin2 clusters metagenomic contigs into bins 4
MaxQuant Imputation Protein quantification Statistical calculation 17
MaxQuant (using mqpar.xml) Imputation Protein quantification Statistical calculation 9
MDTraj file converter - interconvert between MD trajectory file formats. 5
MeanQualityByCycle chart distribution of base qualities 14
MeasureGranularity output spectra of size measurements of the textures 2
MeasureImageAreaOccupied measure the area in an image occupied by objects 2
MeasureImageIntensity measure several intensity features across an entire image 2
MeasureImageQuality measure features that indicate image quality 2
MeasureObjectIntensity measure several intensity features for identified objects 2
MeasureObjectSizeShape measure area and shape features of identified objects 2
MeasureTexture quantify the roughness and smoothness of the textures 2
medaka consensus pipeline Assembly polishing via neural networks 11
medaka consensus tool Assembly polishing via neural networks 10
medaka variant tool decodes variant calls from medaka consensus output 14
MEGAHIT for metagenomics assembly 7
megamerger Merge two large overlapping nucleic acid sequences 5
Merge MetaPhlAn abundance tables Nucleic acid sequence analysis Phylogenetic analysis 8
Merge the overlapping intervals of a dataset Sequence merging 5
Merge multiple VCF datasets 2
Merge Columns together 4
Merge GROMACS topologies and GRO files 7
Merge.count Merge count tables 2
Merge.files Merge data 5
Merge.groups Merge groups in a shared file 5
Merge.sfffiles Merge SFF files 5
Merge.taxsummary Merge tax.summary files 6
MergeBamAlignment merge alignment data with additional info stored in an unmapped BAM dataset 14
merger Merge two overlapping nucleic acid sequences 5
MergeSamFiles merges multiple SAM/BAM datasets into one 14
Merqury evaluate the assembly quality Genome assembly 4
Meryl a genomic k-mer counter and sequence utility Genome assembly 8
MetaBAT2 metagenome binning 3
Metagenome Contributions of OTUs to user-specified functions 2
MetaPhlAn to profile the composition of microbial communities Nucleic acid sequence analysis Phylogenetic analysis 7
MetaPhlAn2 to profile the composition of microbial communities 2
metaplasmidSPAdes extract and assembly plasmids from metagenomic data 6
metaQuantome: create samples file by specifying the experiment's groups and associated column names 6
metaQuantome: database download the GO, EC, and NCBI databases 6
metaQuantome: expand a set of functional or taxonomy annotations 6
metaQuantome: filter for quality, redundancy, and sample coverage 6
metaQuantome: stat differential analysis of functional expression and taxonomic abundance 6
metaQuantome: visualize taxonomic analysis, functional analysis, and function-taxonomy analysis results 6
metaSPAdes metagenome assembler 12
Metastats generate principle components plot data 5
metaviralSPAdes extract and assembly viral genomes from metagenomic data 6
MetaWRAP metagenome binning pipeline 1
MethylDackel A tool for processing bisulfite sequencing alignments Gene methylation analysis 7
metilene calling differentially methylated regions from bisulfite sequencing data 5
Mimarks.attributes Reads bioSample Attributes xml and generates source for get.mimarkspackage command 5
MiModD Convert converts sequence data into different formats 2
MiModD Coverage Statistics calculates coverage statistics for a BCF file as generated by the MiModd Variant Calling tool 2
MiModD Deletion Calling (for PE data) predicts deletions in one or more aligned paired-end read samples based on coverage of the reference genome and on insert sizes 2
MiModD Extract Variant Sites from a BCF file 2
MiModD File Information provides summary reports for supported sequence data formats. 3
MiModD NacreousMap maps phenotypically selected variants by multi-variant linkage analysis 3
MiModD Read Alignment maps sequence reads to a reference genome using SNAP 1
MiModD Rebase Sites from a VCF file 3
MiModD Reheader takes a BAM file and generates a copy with the original header (if any) replaced or modified by that found in a template SAM file 2
MiModD Report Variants in a human-friendly format that simplifies data exploration 3
MiModD Run Annotation writes run metadata in SAM format for attaching it to sequenced reads data 2
MiModD Sort takes a SAM/BAM dataset and generates a coordinate/name-sorted copy 2
MiModD Variant Calling generates a BCF file of position-specific variant likelihoods and coverage information based on a reference sequence and reads aligned against it 2
MiModD VCF Filter extracts lines from a vcf variant file based on field-specific filters 3
Minia Short-read assembler based on a de Bruijn graph 3
miniasm Ultrafast de novo assembly for long noisy reads 7
miRcounts Counts miRNA alignments from small RNA sequence data 1
MiRDeep2 identification of novel and known miRNAs 3
MiRDeep2 Mapper process and map reads to a reference genome 3
MiRDeep2 Quantifier fast quantitation of reads mapping to known miRBase precursors 2
Mismatch Profile calculates the distribution of mismatches across reads 6
MITObim mitochondrial baiting and iterative mapping 1
MITOS2 de-novo annotation of metazoan mitochondrial genomes 3
MLST Scans genomes against PubMLST schemes 4
MLST List Lists available schemes for the MLST tool 4
MOB-Recon Type contigs and extract plasmid sequences 2
MOB-Typer Get the plasmid type and mobility given its sequence 2
Model Prediction predicts on new data using a preffited model 7
moFF extracts MS1 intensities from spectrum files 5
mol2-to-mol converter 2
Molecule to fingerprint conversion to several different fingerprint formats 5
Morpheus database search algorithm for high-resolution tandem mass spectra 2
mothur.ref.taxonomy-to-mothur.seq.taxonomy converter 2
msbar Mutate sequence beyond all recognition 5
msconvert Convert and/or filter mass spectrometry files 5
MSFraggerAdapter Peptide Identification with MSFragger.Important note:The Regents of the University of Michigan (“Michigan”) grants us permission to redistribute the MS Fragger application developed by Michigan within the OpenMS Pipeline and make available for use on related service offerings supported by the University of Tubingen and the Center forIntegrative Bioinformatics. Per the license agreement the use of the pipeline and associated materials is for academic research, non-commercial or educational purposes. Any commercial use inquiries must be directed to the University of Michigan Technology Transfer Office at techtransfer@umich.edu. All right title and interest in MS Fragger shall remain with the University of Michigan.For details, please see the supplied license file or https://raw.githubusercontent.com/OpenMS/THIRDPARTY/master/All/MSFragger/License.txt 4
MSGFPlusAdapter MS/MS database search using MS-GF+. 6
MSI classification spatial classification of mass spectrometry imaging data 10
MSI combine combine several mass spectrometry imaging datasets into one 11
MSI data exporter exports imzML and Analyze7.5 to tabular files 7
MSI filtering tool for filtering mass spectrometry imaging data 8
MSI mz images mass spectrometry imaging m/z heatmaps 8
MSI plot spectra mass spectrometry imaging mass spectra plots 10
MSI preprocessing mass spectrometry imaging preprocessing 11
MSI Qualitycontrol mass spectrometry imaging QC 11
MSI segmentation mass spectrometry imaging spatial clustering 8
MSnbase readMSData Imports mass-spectrometry data files 3
MSstats statistical relative protein significance analysis in DDA, SRM and DIA Mass Spectrometry 5
MSstatsTMT protein significance analysis in shotgun mass spectrometry-based proteomic experiments with tandem mass tag (TMT) labeling 4
Multi-Join (combine multiple files) 10
multiBamSummary calculates average read coverages for a list of two or more BAM/CRAM files 17
multiBigwigSummary calculates average scores for a list of two or more bigwig files 17
Multilevel Data transformation: Within matrix decomposition for repeated measurements (cross-over design) with mixOmics package 1
MultiplexResolver Completes peptide multiplets and resolves conflicts within them. 4
MultiQC aggregate results from bioinformatics analyses into a single report Statistical calculation Validation Visualisation 18
Mummer Align two or more sequences Read mapping Sequence alignment 5
Mummerplot Generate 2-D dotplot of aligned sequences Read mapping Sequence alignment 5
MuSiC Deconvolution estimate cell type proportions in bulk RNA-seq data 4
mz to sqlite Extract mzIdentML and associated proteomics datasets into a SQLite DB 5
MzTabExporter Exports various XML formats to an mzTab file. 5
Naive Variant Caller (NVC) - tabulate variable sites from BAM datasets 5
NanoFilt Filtering and trimming of long read sequencing data 1
NanoPlot Plotting suite for Oxford Nanopore sequencing data and alignments 6
Nanopolish eventalign - Align nanopore events to reference k-mers 3
Nanopolish methylation - Classify nucleotides as methylated or not. 3
Nanopolish polyA - Estimate the length of the poly-A tail on direct RNA reads. 3
Nanopolish variants - Find SNPs of basecalled merged Nanopore reads and polishes the consensus sequences 3
Natural Product likeness calculator - calculates the similarity of the molecule to the structure space covered by known natural products 3
NCBI Accession Download Download sequences from GenBank/RefSeq by accession through the NCBI ENTREZ API 3
NCBI BLAST+ blastdbcmd entry(s) Extract sequence(s) from BLAST database Data retrieval Database search 18
NCBI BLAST+ blastn Search nucleotide database with nucleotide query sequence(s) Data retrieval Database search Sequence similarity search 19
NCBI BLAST+ blastp Search protein database with protein query sequence(s) Data retrieval Database search Sequence similarity search 19
NCBI BLAST+ blastx Search protein database with translated nucleotide query sequence(s) Data retrieval Database search Sequence similarity search 18
NCBI BLAST+ convert2blastmask Convert masking information in lower-case masked FASTA input to file formats suitable for makeblastdb Conversion 15
NCBI BLAST+ database info Show BLAST database information from blastdbcmd Data retrieval 18
NCBI BLAST+ dustmasker masks low complexity regions Sequence complexity calculation 16
NCBI BLAST+ makeblastdb Make BLAST database Genome indexing 18
NCBI BLAST+ makeprofiledb Make profile database Genome indexing 14
NCBI BLAST+ rpsblast Search protein domain database (PSSMs) with protein query sequence(s) Data retrieval Database search Sequence similarity search 17
NCBI BLAST+ rpstblastn Search protein domain database (PSSMs) with translated nucleotide query sequence(s) Data retrieval Database search Sequence similarity search 17
NCBI BLAST+ segmasker low-complexity regions in protein sequences Sequence complexity calculation 15
NCBI BLAST+ tblastn Search translated nucleotide database with protein query sequence(s) Data retrieval Database search Sequence similarity search 19
NCBI BLAST+ tblastx Search translated nucleotide database with translated nucleotide query sequence(s) Data retrieval Database search Sequence similarity search 19
NCBI get species taxids 2
Nearest Neighbors Classification 11
needle Needleman-Wunsch global alignment 5
neostore.zip-to-neostore converter 1
NetCDF xarray Metadata Info summarize content of a Netcdf file 5
NetCDF xarray Selection extracts variable values with custom conditions on dimensions 5
newcpgreport Report CpG rich areas 5
newcpgseek Reports CpG rich region 5
Newick Display visualize a phylogenetic tree Phylogenetic tree visualisation 2
newseq Type in a short new sequence 5
Nextclade Viral genome clade assignment, mutation calling, and sequence quality checks 18
NGSfilter Assigns sequence records to the corresponding experiment/sample based on DNA tags and primers 3
nhmmer search a DNA model or alignment against a DNA database (BLASTN-like) 8
nhmmscan search DNA sequence(s) against a DNA profile database 8
Nmds generate non-metric multidimensional scaling data 6
NMR spectra alignment based on the Cluster-based Peak Alignment (CluPA) algorithm 2
NMR_Bucketing Bucketing and integration of NMR Bruker raw data 4
NMR_Preprocessing Preprocessing of 1D NMR spectra 4
NMR_Read Read Bruker NMR raw files 3
noreturn Removes carriage return from ASCII files 5
Normalization Normalization of (preprocessed) spectra 4
Normalize the relative abundance of each OTU by the predicted number of 16S copies 2
Normalize with scanpy 7
Normalize.shared Normalize the number of sequences per group to a specified level 5
NormalizeFasta normalize fasta datasets 14
notseq Exclude a set of sequences and write out the remaining ones 5
NOVOplasty de novo assembler for short circular genomes 3
nthseq Writes one sequence from a multiple set of sequences 5
NucBed profile the nucleotide content of intervals in a FASTA file 21
Nucleotide Variants Identifies nucleotide variants 2
Nucmer Align two or more sequences Read mapping Sequence alignment 5
Numeric Clustering 12
NxN clustering of molecular fingerprints 5
obiannotate Adds/Edits sequence record annotations 3
obiclean tags a set of sequences for PCR/sequencing errors identification 3
obiconvert converts sequence files to different output formats 3
obigrep Filters sequence file 3
obisort sorts sequence records according to the value of a given attribute 3
obistat computes basic statistics for attribute values 3
obitab converts sequence file to a tabular file 3
obiuniq 3
octanol Displays protein hydropathy 5
oddcomp Find protein sequence regions with a biased composition 5
Online data fetching ... 2
OpenSwathAnalyzer Picks peaks and finds features in an SWATH-MS or SRM experiment. 4
OpenSwathAssayGenerator Generates assays according to different models for a specific TraML 4
OpenSwathConfidenceScoring Compute confidence scores for OpenSwath results 5
OpenSwathDecoyGenerator Generates decoys according to different models for a specific TraML 5
OpenSwathDIAPreScoring Scoring spectra using the DIA scores. 4
OpenSwathFileSplitter Splits SWATH files into n files, each containing one window. 5
OpenSwathMzMLFileCacher This tool caches the spectra and chromatogram data of an mzML to disk. 4
OpenSwathRTNormalizer This tool will take a description of RT peptides and their normalized retention time to write out a transformation file on how to transform the RT space into the normalized space. 5
OpenSwathWorkflow Complete workflow to run OpenSWATH 4
OPLS-DA_Contrasts OPLS-DA Contrasts of Univariate Results 5
Otu.association Calculate the correlation coefficient for the otus 5
Otu.hierarchy Relate OTUs at different distances 5
OTUTable Convert UCLUST format from Vsearch to OTU Table 1
OverlayOutlines places outlines of objects over a desired image. 2
Paired-end histogram of insert size frequency 1
Pairwise.seqs calculate uncorrected pairwise distances between sequences 5
palindrome Looks for inverted repeats in a nucleotide sequence 5
Pangolin Phylogenetic Assignment of Outbreak Lineages 22
Parallel Coordinates Plot of tabular data 3
parquet-to-csv converter 1
Parse mitochondrial blast overlap-conscious coverage calculation between scaffold-accession number pairs from blast output 2
Parse.list Generate a List file for each group 5
Parsimony Describes whether two or more communities have the same structure 5
Partition genes into expression clusters after differential expression analysis using a Trinity assembly 12
Paste two files side by side 1
pasteseq Insert one sequence into another 5
Pathview for pathway based data integration and visualization 3
patmatdb Search a protein sequence with a motif 5
PCA - principal component analysis using Bio3D 2
Pca Principal Coordinate Analysis for a shared file 6
PCA visualization - generate trajectories of principal components of atomic motion 2
Pcoa Principal Coordinate Analysis for a distance matrix 5
Pcr.seqs Trim sequences 6
pdb-to-gro converter 1
PeakPickerHiRes Finds mass spectrometric peaks in profile mass spectra. 4
Pear Paired-End read merger 5
PEAR Statistics Generate paired-end reads overlap Statistic from PEAR log file 1
pepcoil Predicts coiled coil regions 5
pepinfo Plots simple amino acid properties in parallel 5
pepnet Displays proteins as a helical net 5
PepPointer classify genomic location of peptides 2
pepstats Protein statistics 5
Peptide Genomic Coordinate Get Peptide's genomic coordinate using mzsqlite DB and genomic mapping sqlite DB 2
Peptide Prophet Calculate Peptide Prophet statistics on search results 2
Peptide Shaker Perform protein identification using various search engines based on results from SearchGUI 19
PeptideIndexer Refreshes the protein references for all peptide hits. 6
pepwheel Shows protein sequences as helices 5
pepwindow Displays protein hydropathy 5
pepwindowall Displays protein hydropathy of a set of sequences 5
PepXML to Table Converts a pepXML file to a tab delimited text file 1
Percolator accurate peptide identification 2
phmmer search a protein sequence against a protein database (BLASTP-like) 8
Phylo.diversity Alpha Diversity calculates unique branch length 5
Phyloseq Abundance plot Phyloseq Abundance Plot with the factors of choice 1
Phyloseq Abundance Taxonomy Phyloseq Abundance Plot on Taxonomy level 1
Phyloseq Biom Filtering biom file filter 1
Phyloseq DESeq2 Perform differential expression analysis of BIOM file 1
Phyloseq Network Plot Phyloseq Network Plot 1
Phyloseq Ordination Plot ordination plotting 1
Phyloseq Richness Phyloseq Richness Plot 1
Phylotype Assign sequences to OTUs based on taxonomy 5
PhyML Phylogeny software based on the maximum-likelihood method. Phylogenetic analysis Phylogenetic inference (from molecular sequences) 4
Phyogenetic reconstruction with RAxML - Maximum Likelihood based inference of large phylogenetic trees 6
Picard Collect Sequencing Artifact Metrics Collect metrics to quantify single-base sequencing artifacts 6
Pileup-to-Interval condenses pileup format into ranges of bases 4
pilon An automated genome assembly improvement and variant detection tool 2
PIPE-T A tool for analyzing RTqPCR expression data 1
Pipeline Builder an all-in-one platform to build pipeline, single estimator, preprocessor and custom wrappers 11
plasmidSPAdes extract and assembly plasmids from WGS data 6
Plot with scanpy 8
Plot actual vs predicted curves and residual plots of tabular data 1
Plot confusion matrix, precision, recall and ROC and AUC curves of tabular data 2
Plot performance per cell in nanopore reads 2
Plot signals for nanopore reads 2
plotcon Plot quality of conservation of a sequence alignment 5
plotCorrelation Create a heatmap or scatterplot of correlation scores between different samples 16
plotCoverage assesses the sequencing depth of BAM/CRAM files 16
plotDEXSeq Visualization of the per gene DEXSeq results 5
plotEnrichment plots read/fragment coverage over sets of regions 12
plotFingerprint plots profiles of BAM files; useful for assesing ChIP signal strength 17
plotHeatmap creates a heatmap for score distributions across genomic regions 17
plotorf Plot potential open reading frames 5
plotPCA Generate principal component analysis (PCA) plots from multiBamSummary or multiBigwigSummary output 16
plotProfile creates a profile plot for score distributions across genomic regions 16
Plotting tool for multiple series and graph types 5
PMIDs to PubTator binary matrix 1
Poisson two-sample test 1
polydot Displays all-against-all dotplots of a set of sequences 5
Porechop adapter trimmer for Oxford Nanopore reads 3
Pout2mzid add Percolator scoring to mzIdentML 2
Pre.cluster Remove sequences due to pyrosequencing errors 5
Predict Metagenome based on the abundance of OTUs and a functional database 2
preg Regular expression search of a protein sequence 6
Prepare receptor Tool to prepare receptor for docking with Autodock Vina 2
Pretext Snapshot 2
PretextMap converts SAM or BAM files into genome contact maps 7
prettyplot Displays aligned sequences, with colouring and boxing 5
prettyseq Output sequence with translated ranges 5
Primer.design identify sequence fragments that are specific to particular OTUs 5
primersearch Searches DNA sequences for matches with primer pairs 5
proFIA Preprocessing of FIA-HRMS data 2
Prokka Prokaryotic genome annotation Coding region prediction Gene prediction Genome annotation 11
proportional venn from 2-3 sets 1
Protein Database Downloader 6
Protein Prophet Calculate Protein Prophet statistics on search results 2
ProteinQuantifier Compute peptide and protein abundances 4
ProtXML to Table Converts a ProtXML file to a table 1
PubChem Assay Downloader as canonical SMILES 3
PubChem Download as canonical SMILES 3
PubMed query download a defined number of abstracts or PMIDs from PubMed 1
Purge overlaps and haplotigs in an assembly based on read depth (purge_dups) 6
Pycoqc 1
pyGenomeTracks plot genomic data tracks 12
PyProphet merge Merge multiple osw files 1
PyProphet peptide Peptide error-rate estimation 1
PyProphet protein Protein error-rate estimation 1
PyProphet score Error-rate estimation for MS1, MS2 and transition-level data 3
QualiMap BamQC 5
QualiMap Multi-Sample BamQC 2
QualiMap RNA-Seq QC 3
Quality control Performs quality control on FASTQ reads. 2
Quality format converter (ASCII-Numeric) 3
Quality Metrics Metrics and graphics to check the quality of the data 2
QualityScoreDistribution chart quality score distribution 14
Quasispecies Distance Calculate the evolutionary distance between viral quasispecies. 2
Quast Genome assembly Quality 14
Query Tabular using sqlite sql 8
Racon Consensus module for raw de novo DNA assembly of long uncorrected reads 8
Rarefaction.shared Generate inter-sample rarefaction curves for OTUs 5
Rarefaction.single Generate intra-sample rarefaction curves for OTUs 6
Raven De novo assembly of Oxford Nanopore Technologies data 5
Raw Tools Perform scan data parsing, quantification and quality control analysis of Thermo Orbitrap raw mass spectrometer files. 1
Read Distribution calculates how mapped reads were distributed over genome feature 12
Read Duplication determines reads duplication rate with sequence-based and mapping-based strategies 10
Read GC determines GC% and read count 10
Read length statistics from a set of FAST5 files 2
Read NVC to check the nucleotide composition bias 10
Read Quality determines Phred quality score 10
Realign reads with LoFreq viterbi 5
Rebase GFF3 features against parent features 2
Regex Find And Replace 5
Regroup a HUMAnN2 generated table by features 4
Regroup HUMAnN table features Nucleic acid sequence analysis Phylogenetic analysis 2
reheader Rename sequence header in FASTQ file 1
Reheader copy SAM/BAM header between datasets 4
RelateObjects assign relationships parent-children between objects 2
Remove beginning of a file 1
Remove confounders with scanpy 7
Remove sequencing artifacts 5
Remove.dists Removes distances from a phylip or column file 6
Remove.groups Remove groups from groups,fasta,names,list,taxonomy 5
Remove.lineage Picks by taxon 5
Remove.otulabels Removes OTU labels 6
Remove.otus Removes OTUs from various file formats 5
Remove.rare Remove rare OTUs 5
Remove.seqs Remove sequences by name 6
Rename features of a HUMAnN generated table Nucleic acid sequence analysis Phylogenetic analysis 2
Rename sequences 5
Rename.seqs Rename sequences by concatenating the group name 2
Renormalize a HUMAnN2 generated table 5
Renormalize a HUMAnN generated table Nucleic acid sequence analysis Phylogenetic analysis 3
ReorderSam reorder reads to match ordering in reference sequences 17
RepeatMasker screen DNA sequences for interspersed repeats and low complexity regions Repeat sequence analysis 8
RepeatModeler Model repetitive DNA 2
RepEnrich Repeat Element Profiling 1
Replace parts of text 12
Replace column by values which are defined in a convert file 2
Replace Text in a specific column 11
Replace Text in entire line 11
ReplaceSamHeader replace header in a SAM/BAM dataset 17
Reverse columns in a tabular file 5
Reverse Complement a MAF file 1
Reverse-Complement 5
Reverse.seqs Reverse complement the sequences 5
RevertOriginalBaseQualitiesAndAddMateCigar revert the original base qualities and add the mate cigar tag 14
RevertSam revert SAM/BAM datasets to a previous state 14
revseq Reverse and complement a sequence 5
RmDup remove PCR duplicates 5
RMSD Analysis using Bio3D 2
RMSF Analysis using Bio3D 2
RNA fragment size calculates the fragment size for each gene/transcript 8
RNA STAR Gapped-read mapper for RNA-seq data Mapping assembly Sequence alignment 19
RNA STARSolo mapping, demultiplexing and gene quantification for single cell RNA-seq Sequence alignment 8
RNA/DNA converter 5
RnaChipIntegrator Integrated analysis of 'gene' and 'peak' data 3
RNASeq samples quality check for transcript quantification 11
rnaSPAdes de novo transcriptome assembler 13
rnaviralSPAdes de novo assembler for transcriptomes, metatranscriptomes and metaviromes 6
Roary the pangenome pipeline - Quickly generate a core gene alignment from gff3 files 8
Row Means Calculates the mean of a row of numbers for an entire table 1
RPKM Saturation calculates raw count and RPKM values for transcript at exon, intron, and mRNA level 12
rxDock cavity definition - generate the active site definition needed for rxDock docking 3
rxDock docking - perform protein-ligand docking with rxDock 4
Sailfish transcript quantification from RNA-seq data 4
Salmon quant Perform dual-phase, reads or mapping-based estimation of transcript abundance from RNA-seq reads 13
salmonKallistoMtxTo10x Transforms .mtx matrix and associated labels into a format compatible with tools expecting old-style 10X data 3
SALSA scaffold long read assemblies with Hi-C 5
SAM-to-BAM convert SAM to BAM 8
sam-to-bam converter 1
sam-to-bigwig converter 1
sam-to-qname_sorted.bam converter 1
sam-to-unsorted.bam converter 1
SampComp to compare Nanopolished datasets 2
Sample, Slice or Filter BAM on flags, fields, and tags using Sambamba 1
SamToFastq extract reads and qualities from SAM/BAM dataset and convert to fastq 19
Samtools calmd recalculate MD/NM tags 4
Samtools fastx extract FASTA or FASTQ from alignment files 3
Samtools flagstat tabulate descriptive stats for BAM datset 8
Samtools merge merge multiple sorted alignment files 3
samtools mpileup multi-way pileup of variants 11
Samtools sort order of storing aligned sequences 7
Samtools split BAM dataset on readgroups 5
Samtools stats generate statistics for BAM dataset 7
Samtools view - reformat, filter, or subsample SAM, BAM or CRAM 9
SaveImages or movie files 2
Scanpy BBKNN batch-balanced K-nearest neighbours 6
Scanpy ComBat adjust expression for variables that might introduce batch effect 6
Scanpy ComputeGraph to derive kNN graph 16
Scanpy DiffusionMap calculate diffusion components 11
Scanpy DPT diffusion pseudotime inference 11
Scanpy FilterCells based on counts and numbers of genes expressed 15
Scanpy FilterGenes based on counts and numbers of cells expressed 15
Scanpy FindCluster based on community detection on KNN graph 17
Scanpy FindMarkers to find differentially expressed genes between groups 17
Scanpy FindVariableGenes based on normalised dispersion of expression 13
Scanpy Harmony adjust principal components for variables that might introduce batch effect 7
Scanpy MNN correct batch effects by matching mutual nearest neighbors 6
Scanpy NormaliseData to make all cells having the same total expression 14
Scanpy PAGA trajectory inference 10
Scanpy ParameterIterator produce an iteration over a defined parameter 5
Scanpy Plot Scrublet visualise multiplet scoring distribution 2
Scanpy PlotEmbed visualise cell embeddings 12
Scanpy PlotTrajectory visualise cell trajectories 11
Scanpy Read10x into hdf5 object handled by scanpy 13
Scanpy RegressOut variables that might introduce batch effect 10
Scanpy RunFDG visualise cell clusters using force-directed graph 11
Scanpy RunPCA for dimensionality reduction 14
Scanpy RunTSNE visualise cell clusters using tSNE 15
Scanpy RunUMAP visualise cell clusters using UMAP 14
Scanpy ScaleData to make expression variance the same for all genes 13
Scanpy Scrublet remove multiplets from annData objects with Scrublet 2
Scater: Calculate QC metrics Computes QC metrics from single-cell expression matrix 4
Scater: filter SCE with user-defined parameters or PCA 5
Scater: PCA plot cell-level reduced dimension of a normalised SCE 4
Scater: plot library QC to visualise library size, feature counts and mito gene expression 4
Scatterplot with ggplot2 10
Screen.seqs Screen sequences 6
SDF to Fingerprint - extract fingerprints from sdf file metadata 5
Search ChEMBL database for compounds which are similar to a SMILES string 6
Search engine output to Pin converter to create Percolator input files 2
Search GUI Perform protein identification using various search engines and prepare results for input to Peptide Shaker 22
Search in textfiles (grep) 11
SearchToLib Build a Chromatogram Library from Data-Independent Acquisition (DIA) MS/MS Data 3
Secure Hash / Message Digest on a dataset 2
Select lines that match an expression Filtering 3
Select first lines from a dataset Filtering 4
Select first lines from a dataset (head) 9
Select last lines from a dataset 2
Select last lines from a dataset (tail) 9
Select random lines from a file 2
Select sequences by ID from a tabular file 3
Send to cloud 1
Sens.spec Determine the quality of OTU assignment 6
Seq.error assess error rates in sequencing data 5
seqmatchall All-against-all comparison of a set of sequences 5
seqret Reads and writes sequences 5
seqtk_comp get the nucleotide composition of FASTA/Q 6
seqtk_cutN cut sequence at long N 6
seqtk_dropse drop unpaired from interleaved Paired End FASTA/Q 6
seqtk_fqchk fastq QC (base/quality summary) 6
seqtk_hety regional heterozygosity 6
seqtk_listhet extract the position of each het 6
seqtk_mergefa merge two FASTA/Q files 6
seqtk_mergepe interleave two unpaired FASTA/Q files for a paired-end file 6
seqtk_mutfa point mutate FASTA at specified positions 6
seqtk_randbase choose a random base from hets 6
seqtk_sample random subsample of fasta or fastq sequences 6
seqtk_seq common transformation of FASTA/Q 6
seqtk_subseq extract subsequences from FASTA/Q files 6
seqtk_trimfq trim FASTQ using the Phred algorithm 6
Sequence Logo generator for fasta (eg Clustal alignments) 2
Seurat - toolkit for exploration of single-cell RNA-seq data 12
SFF converter 1
Sffinfo Summarize the quality of sequences 5
Shasta De novo assembly of long read sequencing data 1
Shhh.flows Denoise flowgrams (PyroNoise algorithm) 5
Shhh.seqs Denoise program (Quince SeqNoise) 5
Shovill Faster SPAdes assembly of Illumina reads 8
Show nucleotide distribution in nanopore sequencing reads 2
Show quality score distribution in nanopore sequencing reads 2
Show-Coords Parse delta file and report coordinates and other information Read mapping Sequence alignment 5
showfeat Show features of a sequence 5
shuffleseq Shuffles a set of sequences maintaining composition 5
SICER Statistical approach for the Identification of ChIP-Enriched Regions 3
sigcleave Reports protein signal cleavage sites 5
sirna Finds siRNA duplexes in mRNA 5
sistr_cmd Salmonella In Silico Typing Resource commandline tool for serovar prediction 3
sixpack Display a DNA sequence with 6-frame translation and ORFs 7
skipseq Reads and writes sequences, skipping first few 5
Slice BAM by genomic regions 5
Slice MD trajectories using the MDTraj package 3
smi-to-mol converter 2
smi-to-smi converter 2
Smudgeplot inference of ploidy and heterozygosity structure using whole genome sequencing 1
sniffles Structural variation caller using third generation sequencing 1
snippy Snippy finds SNPs between a haploid reference genome and your NGS sequence reads. 11
snippy-clean_full_aln Replace any non-standard sequence characters in snippy 'core.full.aln' file. 4
snippy-core Combine multiple Snippy outputs into a core SNP alignment 10
SNP distance matrix Compute distance in SNPs between all sequences in a FASTA file 2
SNP splitter splits multicharacter entries into separate lines 1
SNP table merger merges any number of SNP tables into one 1
SnpEff build: database from Genbank or GFF record 13
SnpEff chromosome-info: list chromosome names/lengths 8
SnpEff databases: list available databases 13
SnpEff download: download a pre-built database 13
SnpEff eff: annotate variants 11
SnpEff eff: annotate variants for SARS-CoV-2 1
snpFreq significant SNPs in case-control data 2
SnpSift Annotate SNPs from dbSnp 9
SnpSift CaseControl Count samples are in 'case' and 'control' groups. 9
SnpSift Extract Fields from a VCF file into a tabular file 7
SnpSift Filter Filter variants using arbitrary expressions 9
SnpSift Intervals Filter variants using intervals 9
SnpSift rmInfo remove INFO field annotations 8
SnpSift Variant Type Annotate with variant type 8
SnpSift vcfCheck basic checks for VCF specification compliance 8
Sort data in ascending or descending order Sorting 3
Sort data in ascending or descending order 11
Sort a row according to their columns 12
Sort assembly 5
Sort.seqs put sequences in different files in the same order 6
SortSam sort SAM/BAM dataset 14
SPAdes genome assembler for genomes of regular and single-cell projects Genome assembly 16
Split a HUMAnN table into 2 tables (one stratified and one unstratified) Nucleic acid sequence analysis Phylogenetic analysis 3
Split file to dataset collection 7
Split MAF blocks by Species 1
Split.abund Separate sequences into rare and abundant groups 5
Split.groups Generates a fasta file for each group 5
splitter Split a sequence into (overlapping) smaller sequences 5
SQLite to tabular for SQL query 3
Stacks2: clone filter Identify PCR clones 8
Stacks2: cstacks Generate catalog of loci 8
Stacks2: de novo map the Stacks pipeline without a reference genome (denovo_map.pl) 8
Stacks2: gstacks Call variants, genotypes and haplotype 8
Stacks2: kmer filter Identify PCR clones 8
Stacks2: populations Calculate population-level summary statistics 8
Stacks2: process radtags the Stacks demultiplexing script 8
Stacks2: process shortreads fast cleaning of randomly sheared genomic or transcriptomic data 8
Stacks2: reference map the Stacks pipeline with a reference genome (ref_map.pl) 8
Stacks2: sstacks Match samples to the catalog 8
Stacks2: tsv2bam Sort reads by RAD locus 8
Stacks2: ustacks Identify unique stacks 8
Stacks: assemble read pairs by locus run the STACKS sort_read_pairs.pl and exec_velvet.pl wrappers 6
Stacks: clone filter Identify PCR clones 2
Stacks: cstacks build a catalogue of loci 5
Stacks: de novo map the Stacks pipeline without a reference genome (denovo_map.pl) 5
Stacks: genotypes analyse haplotypes or genotypes in a genetic cross ('genotypes' program) 5
Stacks: populations analyze a population of individual samples ('populations' program) 5
Stacks: process radtags the Stacks demultiplexing script 5
Stacks: pstacks find stacks from short reads mapped to a reference genome 5
Stacks: reference map the Stacks pipeline with a reference genome (ref_map.pl) 5
Stacks: rxstacks make corrections to genotype and haplotype calls 5
Stacks: sstacks match stacks to a catalog 4
Stacks: statistics on stacks found for multiple samples 2
Stacks: ustacks align short reads into stacks 5
staramr Scans genome assemblies against the ResFinder, PlasmidFinder, and PointFinder databases searching for AMR genes. 11
Starting Modules load images and metadata into CellProfiler (Images, Metadata, NamesAndTypes, Groups) 2
Stitch Gene blocks given a set of coding exon intervals 4
Stitch MAF blocks given a set of genomic intervals 1
StringTie transcript assembly and quantification 17
StringTie merge transcripts 11
Sub.sample Create a sub sample 5
Subtract the intervals of two datasets Filtering 6
Subtract Whole Dataset from another dataset 2
Summary Statistics for any numerical column 3
Summary.qual Summarize the quality scores 6
Summary.seqs Summarize the quality of sequences 5
Summary.shared Summary of calculator values for OTUs 5
Summary.single Summary of calculator values for OTUs 6
Summary.tax Assign sequences to taxonomy 5
supermatcher Match large sequences against one or more other sequences 5
Support vector machines (SVMs) for classification 12
syco Synonymous codon usage Gribskov statistic plot 5
Symmetric Plot Symmetric Plot 1
T Test for Two Samples 2
Table Compute computes operations on table data 4
Table Merge Merging dataMatrix with a metadata table 2
Table to GFF3 2
Tabular to FASTQ converter Conversion 5
tabular-to-csv converter 1
Tabular-to-FASTA converts tabular file to FASTA format 2
tac reverse a file (reverse cat) 9
Tandem to pepXML Converts a tandem result file to pepXML 1
tar-to-directory converter 2
TargetedFileConverter Converts different transition files for targeted proteomics / metabolomics analysis. 4
TargetFinder plant small RNA target prediction tool Sequence alignment 2
Taxonomy-to-Krona convert a mothur taxonomy file to Krona input format 2
Taylor-Butina clustering of molecular fingerprints 5
TB Variant Filter M. tuberculosis H37Rv VCF filter 4
TB Variant Report - generate HTML report from SnpEff annotated M.tb VCF(s) 3
TB-Profiler Profile Infer strain types and drug resistance markers from sequences 15
tcode Fickett TESTCODE statistic to identify protein-coding DNA 5
Text reformatting with awk 13
Text transformation with sed 11
TextExporter Exports various XML formats to a text file. 4
textsearch Search sequence documentation. Slow, use SRS and Entrez! 5
Thermo RAW file converter 9
Tile tiles images together to form large montage images 2
tmap Displays membrane spanning regions 5
To categorical Converts a class vector (integers) to binary class matrix 2
TopHat Gapped-read mapper for RNA-seq data 9
Tophat Fusion Post post-processing to identify fusion genes 2
TrackObjects tracking objects throughout sequential frames of a series of images 2
Train Augustus ab-initio gene predictor 5
Train SNAP ab-initio gene predictor 1
TRAMS Tool for Rapid Annotation of Microbial SNPs 1
tranalign Align nucleic coding regions given the aligned proteins 5
Transcript Integrity Number evaluates RNA integrity at a transcript level 8
TransDecoder finds coding regions within transcripts 7
transeq Translate nucleic acid sequences 5
Translate BED transcripts cDNA in 3frames or CDS 1
Transpose rows/columns in a tabular file 5
Tree.shared Generate a newick tree for dissimilarity among groups 5
Trim leading or trailing characters 2
Trim Galore! Quality and adapter trimmer of reads 9
Trim sequences 5
Trim.flows partition by barcode, trim to length, cull by length and mismatches 5
Trim.seqs Trim sequences - primers, barcodes, quality 5
trimest Trim poly-A tails off EST sequences 5
Trimmomatic flexible read trimming tool for Illumina NGS data Sequence trimming 15
trimseq Trim ambiguous bits off the ends of sequences 5
Trinity de novo assembly of RNA-Seq data 14
Trinotate functional transcript annotation 4
tRNA and tmRNA prediction (Aragorn) 3
tRNA prediction (tRNAscan) 2
Trycycler cluster cluster the contigs of your input assemblies into per-replicon groups Genome assembly 4
Trycycler consensus generate a consensus contig sequence for each cluster Genome assembly 4
Trycycler partition assign the reads to the clusters Genome assembly 4
Trycycler reconcile/msa reconcile the contigs within each cluster and perform a multiple sequence alignment Genome assembly 4
Trycycler subsample make a maximally-independent read subsets of an appropiate depth for your genome Genome assembly 4
twofeat Finds neighbouring pairs of features in sequences 5
UMI-tools count performs quantification of UMIs from BAM files 9
UMI-tools deduplicate Extract UMI from fastq files 6
UMI-tools extract Extract UMI from fastq files 12
Unfold columns from a table 11
unifrac.unweighted Describes whether two or more communities have the same structure 5
unifrac.weighted Describes whether two or more communities have the same structure 5
union Reads sequence fragments and builds one sequence 5
Unipept retrieve taxonomy for peptides 6
UniProt ID mapping and retrieval 4
UniProt download proteome as XML or fasta 6
Unique occurrences of each record 9
Unique lines assuming sorted input file 11
Unique.seqs Return unique sequences 5
Univariate Univariate statistics 2
Unpack pathway abundances to show the genes for each Nucleic acid sequence analysis Phylogenetic analysis 3
Unpack pathway abundances to show genes included 3
unsorted.bam-to-bigwig converter 1
Unzip Unzip file 1
ValidateSamFile assess validity of SAM/BAM dataset 15
Variant Frequency Plot Generates a heatmap of allele frequencies grouped by variant type for SnpEff-annotated SARS-CoV-2 data Aggregation 4
VarScan for variant detection 2
VarScan somatic Call germline/somatic and LOH variants from tumor-normal sample pairs 9
VCF to MAF Custom Track for display at UCSC 2
VCF-BEDintersect: Intersect VCF and BED datasets 5
vcf-to-bgzip converter 2
vcf-to-bigwig converter 1
vcf-to-tabix converter 2
VCF-VCFintersect: Intersect two VCF datasets 5
vcf_bgzip-to-tabix converter 2
VCFaddinfo: Adds info fields from the second dataset which are not present in the first dataset 5
VcfAllelicPrimitives: Split alleleic primitives (gaps or mismatches) into multiple VCF lines 5
VCFannotate: Intersect VCF records with BED annotations 5
VCFannotateGenotypes: Annotate genotypes in a VCF dataset using genotypes from another VCF dataset 5
VCFbreakCreateMulti: Break multiple alleles into multiple records, or combine overallpoing alleles into a single record 5
VCFcheck: Verify that the reference allele matches the reference genome 5
VCFcombine: Combine multiple VCF datasets 6
VCFcommonSamples: Output records belonging to samples common between two datasets 5
VCFdistance: Calculate distance to the nearest variant 5
VCFfilter: filter VCF data in a variety of attributes 8
VCFfixup: Count the allele frequencies across alleles present in each record in the VCF file 5
VCFflatten: Removes multi-allelic sites by picking the most common alternate 5
VCFgenotype-to-haplotype: Convert genotype-based phased alleles into haplotype alleles 5
VCFgenotypes: Convert numerical representation of genotypes to allelic 5
VCFhetHomAlleles: Count the number of heterozygotes and alleles, compute het/hom ratio 5
VCFleftAlign: Left-align indels and complex variants in VCF dataset 5
VCFprimers: Extract flanking sequences for each VCF record 5
VCFrandomSample: Randomly sample sites from VCF dataset 5
VCFselectsamples: Select samples from a VCF dataset 5
VCFsort: Sort VCF dataset by coordinate 6
VCFtoTab-delimited: Convert VCF data into TAB-delimited format 6
vectorstrip Strips out DNA between a pair of vector sequences 5
velvetg Velvet sequence assembler for very short reads De-novo assembly Formatting 6
velveth Prepare a dataset for the Velvet velvetg Assembler De-novo assembly Formatting 6
VelvetOptimiser Automatically optimize Velvet assemblies 5
Venn Generate Venn diagrams for groups 5
VINA Docking tool to perform protein-ligand docking with Autodock Vina 5
Visualisation of compounds 5
Visualize the optimal scoring subnetwork 2
Visualize with Krona Visualise any hierarchical data 1
Volcano Plot create a volcano plot Visualisation 5
VSearch alignment Chimera detection Sequence clustering Sequence masking 6
VSearch chimera detection Chimera detection Sequence clustering Sequence masking 6
VSearch clustering Chimera detection Sequence clustering Sequence masking 7
VSearch dereplication Chimera detection Sequence clustering Sequence masking 7
VSearch masking Chimera detection Sequence clustering Sequence masking 6
VSearch search Chimera detection Sequence clustering Sequence masking 6
VSearch shuffling Chimera detection Sequence clustering Sequence masking 6
VSearch sorting Chimera detection Sequence clustering Sequence masking 7
W4m Data Subset Filter W4m data by values or metadata 9
Walnut PeCAn-based Peptide Detection Directly from Data-Independent Acquisition (DIA) MS/MS Data 3
water Smith-Waterman local alignment 5
Wavelet variance using Discrete Wavelet Transfoms 3
wig-to-bigwig converter 1
Wig/BedGraph-to-bigWig converter 2
Wiggle to Interval 2
Wiggle-to-Interval converter 2
wobble Wobble base plot 5
wordcount Counts words of a specified size in a DNA sequence 5
wordmatch Finds all exact matches of a given size between 2 sequences 5
X!Tandem MSMS Search Run an X!Tandem Search 2
xcms adjustRtime (retcor) Retention Time Correction 7
xcms fillChromPeaks (fillPeaks) Integrate areas of missing peaks 7
xcms findChromPeaks (xcmsSet) Chromatographic peak detection 8
xcms findChromPeaks Merger Merge xcms findChromPeaks RData into a unique file to be used by group 5
xcms get a sampleMetadata file which need to be filled with extra information 3
xcms groupChromPeaks (group) Perform the correspondence, the grouping of chromatographic peaks within and between samples. 8
xcms plot chromatogram Plots base peak intensity chromatogram (BPI) and total ion current chromatogram (TIC) from MSnbase or xcms experiment(s) 5
xcms process history Create a summary of XCMS analysis 8
XMLValidator Validates XML files against an XSD schema. 4
XTandemAdapter Annotates MS/MS spectra using X! Tandem. 5
xtc-to-dcd converter 1
YAHS yet another HI-C scaffolding tool 4