EnsMart: A Generic System for Fast and Flexible Access to Biological Data

  1. Arek Kasprzyk1,3,
  2. Damian Keefe1,
  3. Damian Smedley1,
  4. Darin London1,
  5. William Spooner2,
  6. Craig Melsopp1,
  7. Martin Hammond1,
  8. Philippe Rocca-Serra1,
  9. Tony Cox2, and
  10. Ewan Birney1
  1. 1 European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SH, UK
  2. 2 The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SH, UK

Abstract

The EnsMart system (www.ensembl.org/EnsMart) provides a generic data warehousing solution for fast and flexible querying of large biological data sets and integration with third-party data and tools. The system consists of a query-optimized database and interactive, user-friendly interfaces. EnsMart has been applied to Ensembl, where it extends its genomic browser capabilities, facilitating rapid retrieval of customized data sets. A wide variety of complex queries, on various types of annotations, for numerous species are supported. These can be applied to many research problems, ranging from SNP selection for candidate gene screening, through cross-species evolutionary comparisons, to microarray annotation. Users can group and refine biological data according to many criteria, including cross-species analyses, disease links, sequence variations, and expression patterns. Both tabulated list data and biological sequence output can be generated dynamically, in HTML, text, Microsoft Excel, and compressed formats. A wide range of sequence types, such as cDNA, peptides, coding regions, UTRs, and exons, with additional upstream and downstream regions, can be retrieved. The EnsMart database can be accessed via a public Web site, or through a Java application suite. Both implementations and the database are freely available for local installation, and can be extended or adapted to `non-Ensembl' data sets.

Footnotes

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1645104.

  • 3 Corresponding author. E-MAIL arek{at}ebi.ac.uk; FAX 44-1223-494468.

    • Accepted October 17, 2003.
    • Received June 11, 2003.
| Table of Contents

Preprint Server