Ensembl mobile site help

Things to know when navigating the Ensembl mobile site

Search box

Use the search box at the top right of all Ensembl views to search for a gene, phenotype, sequence variant, and more.

Top navigation

Touch MENU button to open the main menu and touch again to close.

Touch MENU

Left hand side menu

Touch the left menu icon () or swipe right to open the side menu and touch anywhere outside the menu or touch the cross icon or swipe left to close.

The ? icon

Touch the icon to get help

And don't forget to send us your comments using the feedback link inside the main menu.

EnsemblEnsembl Home

Orangutan assembly and gene annotation


This site presents the 6X whole genome shotgun assembly from a female Sumatran orangutan (Pongo pygmaeus abelii) named Susie, housed at the Gladys Porter Zoo (Brownsville, TX). The primary donor-derived reads were assembled using PCAP (Huang, 2006) using stringent parameters; by aligning the orangutan genome against the human genome, it was possible to identify interchromosomal cross-overs and thus eliminate global mis-assemblies larger than 50kb.

Of the 3.09Gb of total sequence, 3.08Gb are ordered and oriented along the chromosomes. Gap sizes between supercontigs were estimated based on their size in human, with a maximum allowed gap size of 30kb.

Gene annotation

Due to the high sequence similarity to the human genome, the Orangutan genebuild was based on a projection of human gene structures. The projections were made through chained whole genome BLASTz alignments. These projected genes were combined with orangutan-specific proteins, and additional human genes were added using exonerate where the projection was unable to make satisfactory gene models. UTRs were added using orangutan-specific ESTs and cDNAs as well as human cDNAs.

More information

General information about this species can be found in Wikipedia.



AssemblyPPYG2, Sep 2007
Base Pairs3,446,771,396
Golden Path Length3,446,771,396
Annotation providerEnsembl
Annotation methodProjection build
Genebuild startedOct 2007
Genebuild releasedMar 2008
Genebuild last updated/patchedAug 2012
Database version104.1

Gene counts

Coding genes20,424
Non coding genes6,996
Small non coding genes5,796
Misc non coding genes1,200
Gene transcripts29,447


Genscan gene predictions53,999
Short Variants10,004,323

About this species