APPRIS is a system that deploys a range of computational methods to provide value to the annotations of the human genome. APPRIS also selects one of the CDS for each gene as the constitutive isoform.
APPRIS defines the principal variant by combining protein structural and functional information and information from the conservation of related species.
The server is being used in the context of the scale up of the ENCODE project to annotate the human genome (20,700 protein-coding genes and 84,408 distinct alternative transcripts from Gencode7, Ensembl62).
The National Human Genome Research Institute (NHGRI) launched ENCODE, the Encyclopedia Of DNA Elements, in September 2003, to carry out a project to identify all functional elements in the human genome sequence.
The aim of GENCODE as a sub-project of the ENCODE is the high accuracy annotation of all evidence-based gene features. The consortium is genarating a set of annotations including protein-coding loci, alternatively transcribed variants, transcribed non-coding loci, and pseudogenes. The annotation process involves manual curation, computational analysis and targeted experimental approaches.