Bioinformatics Sequence Processing Protocols#

Reference Database Generation & Curation#

  • MitoPilot
    An R package for scalable mitogenome assembly and annotation, uses Nexflow and includes (Shiny) web app for project management and curation of results. Currently supports fish and starfish datasets, but will be expanded to other taxa soon. Developed as a joint effort by the NOAA National Systematics Lab and the Smithsonian Ocean DNA Initiative
  • rCRUX Reference Database Generating Tool
    This R package generates reference databases for eDNA metabarcoding from user-supplied primer sequences. The package employs iterative Basic Alignment Search Tool (BLAST) searches paired with quality control to produce a final reference library.
  • Mitohelper
    This Python-based tool is designed to assist with preparing eDNA reference databases for fishes. It can determine whether a reference sequence exists for a specific species/group and visualize the available sequenced regions of target genes.

Metabarcoding#

Population Genomics#

  • SEDNA Workflows
    List of scripts, pipelines, and tools for a variety of analyses including whole genome sequencing workflows, SNP discovery, microhaplotype genotyping, population assignment, historical demography. All written specifically for the NOAA NMFS SEDNA bioinformatics computing cluster.
  • strataG: An R package for manipulating, summarizing and analysing population genetic data
    R package for exploring multi-locus genetic datasets, calculating population genetic summary statistics, and demographic and population structure analyses.

Data Visualization#