SeqScrub: a web tool for automatic cleaning and annotation of FASTA file headers for bioinformatic applications

Abstract

Data consistency is necessary for effective bioinformatic analysis. SeqScrub is a web tool that parses and maintains consistent information about protein and DNA sequences in FASTA file format, checks if records are current, and adds taxonomic information by matching identifiers against entries in authoritative biological sequence databases.

Publication
BioTechniques
Pre-inference