How many species are supported?
We support ALL species (>14K) available from NCBI and Ensembl. See more.
How frequent are the gene annotation data updated?
Gene annotation data are regularly updated once a week to keep it up-to-date. The most updated data information can be accessed here. [See more]( <http://docs.mygene.info/en/latest/doc/data.html#data-sources).
Want to suggest more gene annotation data?
File an issue here.
Can I get XML output instead of JSON output?
We currently do not provide XML output format. There exist many ready-to-use third-party libraries/packages to convert JSON format to XML format, e.g. Python packages dicttoxml and json2xml. They will be essentially the same packages we would use if decide to provide XML output format. For this reason and also for reducing the development complexity, we decided not to provide XML format.
What's behind MyGene.info?
MyGene.info is built on ElasticSearch, a document-based database and powerful query engine. Unlike more commonly used relational database systems (e.g., Oracle, MySQL), data are stored as "key-document" pairs. The "document" is a JSON-formatted gene annotation object, while the "key" is a gene ID (Entrez or Ensembl). The hierarchical structure of gene annotation data can be represented naturally in this key-document model. This simple object structure in ElasticSearch greatly simplified both data loading and data queries, and also gains impressive query performance and flexibility.
On top of ElasticSearch, we use tornado, a lightweighted and fast web framework in python, to build our application layer. And then Nginx is used as the front-end to serve outside requests.
See also: MyGene.info talk at BOSC2013.
Is this project open-sourced?
Yes. See "Source code" section at About page.
How to cite MyGene.info?
see our citation page.
Who is using MyGene.info?
How to reach us?