Journal of Integrative Bioinformatics

Editor-in-Chief: Schreiber, Falk / Hofestädt, Ralf

Managing Editor: Sommer, Björn

Edited by: Baumbach, Jan / Chen, Ming / Orlov, Yuriy / Allmer, Jens

Scientific Advisory Board: Giorgetti, Alejandro / Harrison, Andrew / Kochetov, Aleksey / Krüger, Jens / Ma, Qi / Matsuno, Hiroshi / Mitra, Chanchal K. / Pauling, Josch K. / Rawlings, Chris / Fdez-Riverola, Florentino / Romano, Paolo / Röttger, Richard / Shoshi, Alban / Soares, Siomar de Castro / Taubert, Jan / Tauch, Andreas / Yousef, Malik / Weise, Stephan / Hassani-Pak, Keywan

CiteScore 2018: 0.90

SCImago Journal Rank (SJR) 2018: 0.315

Open Access
Volume 8, Issue 2


Putting Encyclopaedia Knowledge into Structural Form: Finite State Transducers Approach

Vesna Pajić
Published online: 18.10.2016 | DOI: https://doi.org/10.1515/jib-2011-164


In biology, and in functional genomics in particular, understanding the dependence and interplay between the genomic and ecological characteristics of organisms is a very challenging problem. Some public databases combine this kind of information, but much more information about microbes and other organisms still resides in unstructured and semi-structured documents, such as encyclopaedias. In this paper we present a two-phase method, based on finite state transducers, for extracting information from semi-structured resources such as encyclopaedias. The first phase relies strongly on an analysis of the document structure and is used to locate records of data in the text. The second phase uses finite state transducers to extract particular characteristics from the text; the transducers can be modified to achieve the preferred efficiency. We show how the two-phase method is applied to the text of the encyclopaedia “Systematic Bacteriology”. A fully structured database of genotype and phenotype characteristics of organisms has been created from the encyclopaedia's unstructured descriptions.
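The two phases described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the sample text, the heading pattern used to locate records, the transition table, and all names (`locate_records`, `transduce`, `build_table`, `gram_stain`, `temperature_optimum_c`) are assumptions made for illustration only. Phase 1 splits the text into records using a structural cue alone; phase 2 runs a small hand-written finite state transducer over the tokens of each description and emits field values.

```python
import re

# Hypothetical sample text. The layout (a species name alone on a line,
# followed by a free-text description) is an assumption for illustration.
SAMPLE = """\
Escherichia coli
Cells are straight rods. Gram negative. Temperature optimum 37 C.

Bacillus subtilis
Cells are rods forming endospores. Gram positive. Temperature optimum 30 C.
"""

# Assumed structural cue: a line made of a capitalised word and a lowercase word.
HEADING = re.compile(r"^[A-Z][a-z]+ [a-z]+$")


def locate_records(text):
    """Phase 1: split the text into (name, description) records by structure."""
    records, name, body = [], None, []
    for line in text.splitlines():
        stripped = line.strip()
        if HEADING.match(stripped):
            if name is not None:
                records.append((name, " ".join(body)))
            name, body = stripped, []
        elif stripped and name is not None:
            body.append(stripped)
    if name is not None:
        records.append((name, " ".join(body)))
    return records


# Transducer: (state, input symbol) -> (next state, output).
# An output is (field, value); value None means "emit the matched token".
TRANSITIONS = {
    ("start", "gram"): ("gram", None),
    ("gram", "negative"): ("start", ("gram_stain", "negative")),
    ("gram", "positive"): ("start", ("gram_stain", "positive")),
    ("start", "optimum"): ("optimum", None),
    ("optimum", "<num>"): ("start", ("temperature_optimum_c", None)),
}


def transduce(description):
    """Phase 2: run the transducer over the tokens and collect emitted fields."""
    fields, state = {}, "start"
    for token in re.findall(r"[a-z]+|\d+", description.lower()):
        symbol = "<num>" if token.isdigit() else token
        step = TRANSITIONS.get((state, symbol))
        if step is None:
            # No transition from the current state: retry from the start state.
            step = TRANSITIONS.get(("start", symbol), ("start", None))
        state, output = step
        if output is not None:
            field, value = output
            fields[field] = token if value is None else value
    return fields


def build_table(text):
    """Combine both phases into a list of structured rows."""
    return [{"organism": name, **transduce(desc)}
            for name, desc in locate_records(text)]


for row in build_table(SAMPLE):
    print(row)
```

Keeping the transitions in a table means a characteristic can be added or refined by editing `TRANSITIONS` alone, which loosely mirrors the abstract's remark that the transducers can be modified to reach the preferred efficiency.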


Published in print: 01.06.2011

Citation: Journal of Integrative Bioinformatics, Volume 8, Issue 2, Pages 115–129, ISSN (Online) 1613-4516, DOI: https://doi.org/10.1515/jib-2011-164.

© 2011 The Author(s). Published by Journal of Integrative Bioinformatics. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License (BY-NC-ND 4.0).
