Learn to use BioMart, a publicly available, open-source tool for managing and querying many types of biological data. It is used by projects around the world. BioMart is a component of the GMOD (Generic Model Organism Database) suite of tools, and many projects and individuals contribute to its development.

This material focuses mainly on the BioMart v0.7 Portal and also touches on other versions and installations of BioMart. The query interface lets users assemble customized, complex queries of the underlying data sets that are accessible from this site.

Besides the Portal, researchers will find BioMart installed at many project sites. Its solid and flexible foundation supports a huge range of species and data types. Sites and versions may differ in their look or their data types, but understanding the basic functions of BioMart will serve users well at any site and in third-party tools developed around BioMart.

You will learn:

  • to perform effective, complex, and customized searches using BioMart datasets and filters
  • to obtain results with specified attributes and interact with the output
  • to identify BioMart installations and versions at other sites
  • ways to access the BioMart framework from other tools and third-party software
  • to interact with the new version of BioMart associated with the International Cancer Genome Consortium (ICGC) project
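
As a rough illustration of how the pieces named above (a dataset, filters, and attributes) fit together when BioMart is accessed from other software, the sketch below builds a BioMart 0.7-style XML query in Python. It is a minimal sketch, not the tutorial's own method: the helper name build_biomart_query is hypothetical, and the dataset, filter, and attribute names are example values that are assumed to exist on the target installation. The code only constructs the query text. Sending it to a site's web service endpoint is left out because the URL differs from one installation to another.

```python
import xml.etree.ElementTree as ET


def build_biomart_query(dataset, filters, attributes):
    """Build a BioMart 0.7-style XML query string (hypothetical helper).

    dataset    -- name of the dataset to search
    filters    -- dict mapping filter names to values (restricts the rows)
    attributes -- list of attribute names (chooses the output columns)
    """
    query = ET.Element("Query", {
        "virtualSchemaName": "default",
        "formatter": "TSV",      # tab-separated output
        "header": "0",           # no header row
        "uniqueRows": "1",       # ask for de-duplicated rows
        "datasetConfigVersion": "0.6",
    })
    ds = ET.SubElement(query, "Dataset",
                       {"name": dataset, "interface": "default"})
    for name, value in filters.items():
        ET.SubElement(ds, "Filter", {"name": name, "value": str(value)})
    for name in attributes:
        ET.SubElement(ds, "Attribute", {"name": name})
    return ET.tostring(query, encoding="unicode")


# Example values only; each installation publishes its own names.
query_xml = build_biomart_query(
    "hsapiens_gene_ensembl",
    {"chromosome_name": "21"},
    ["ensembl_gene_id", "external_gene_name"],
)
print(query_xml)
```

The same three choices (dataset, filters, attributes) are what the web interface asks for, so a query assembled in the browser and one assembled in code describe the same search.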


This tutorial is a part of the tutorial group Advanced Analysis and Queries. You might find the other tutorials in the group interesting:

UniProt: UniProt, Universal Protein Resource

DBTSS: Database of Transcriptional Start Sites


Galaxy: Analysis tools for researchers


Algorithms and Analysis : This category contains various tools that may help perform analysis of different genomics data types. This may include sequence alignment, large-scale or complex queries, motif finding, or comparative assessments.

EBI : This category includes all resources maintained at the European Bioinformatics Institute (EBI).

Genome Databases (euk) : Genomic databases or repositories primarily aimed at eukaryotic organisms. Some may contain prokaryotic data as well.



