Skip to content
BY-NC-ND 4.0 license Open Access Published by De Gruyter October 18, 2016

Key2Ann: a tool to process sequence sets by replacing database identifiers with a human-readable annotation

  • Andreas Pürzer , Felix Grassmann , Dietmar Birzer and Rainer Merkl EMAIL logo

Summary

Deducing common properties or degrees of phylogenetic relationship by analyzing a grouping or clustering of sequence sets is a frequently used technique in computational biology. If interpreted by means of visual inspection, the conclusions depend for many of these applications on meaningful names for the input data. In accordance with the aim of the analysis, the sequences should be provided with names indicating the function of the genes or gene-products, the phylogenetic position or other properties characterizing the contributing species. However, sequences extracted from databases are most often annotated with identifiers which only implicitly contain the desired information. To solve this problem, we have designed and implemented a tool named Key2Ann, which replaces in multiple fasta files the database keys with short terms indicating the taxonomic position or other features like the gene name or the EC-number. In addition, properties like habitat, growth temperature or the degree of pathogenicity can be coded for microbial species. To allow for highest flexibility, the user can control the composition of the names by means of command line parameters. Key2Ann is written in Java and can be downloaded via http://www-bioinf.uni-regensburg.de/downl/Key2Ann.zip. We demonstrate the usage of Key2Ann by discussing three typical examples of phylogenetic analysis.

Published Online: 2016-10-18
Published in Print: 2011-3-1

© 2011 The Author(s). Published by Journal of Integrative Bioinformatics.

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Downloaded on 28.3.2024 from https://www.degruyter.com/document/doi/10.2390/biecoll-jib-2011-153/html
Scroll to top button