This section provides a summary of the annotation and transcript record from a number of public domain Databases. The entries depend on the species to which the probe set is associated.
This section provides a summary of the annotation and transcript record from a number of public domain Databases. The entries depend on the species to which the probe set is associated.
Gene Title – The gene name is usually extracted from the Gene or UniGene databases. In some cases, specialty databases (such as WormBase, etc.) may provide the gene name.
Gene Symbol – Gene symbols are derived by different organizations for different species. Affymetrix data comes from the UniGene record for UniGene based arrays such as human, mouse, and rat. For arrays that are not based on the UniGene database, Affymetrix obtains the gene symbol from various sources including: FlyBase, WormBase, and Saccharomyses Genome Database.
Chromosomal location – The cytoband location of the Gene derived from the UniGene record, as available. This location may vary from the Genomic location given for the Consensus/Exemplar Sequence given elsewhere on the page.
EC Number – Derived from the NCBI or ENSEMBL entry, the Enzyme Commission (EC) family number describes enzymatic activity of the gene. The EC number is a hierarchical description of enzymatic activity with up to four levels in this format: A.B.C.D.
- The first level (A) describes the substrate class for the enzyme.
- The second level (B) describes the chemical donor the enzyme uses.
- The third level (C) describes the chemical acceptor the enzyme uses.
- The fourth level (D) describes the specific family of enzymes.
For example, the three numbers in the EC designation of 1.1.5, respectively, describe an Oxidoreductase, acting on CH-OH donor groups with a quinine or similar compound as an acceptor.
The full description of the EC number can be found at the Expasy.org site.
OMIM – A link to the gene?s description in Online Mendelian Inheritance in Man, a hand-curated database of disease and genetic disorders, biomedical and biochemical information, and phenotypes associated with known human genes. OMIM indexes give the NetAffx user access to detailed descriptions of biomedical research associated with their genes of interest. Only available probe sets to human genes.
MeSH terms (not currently included ) – Medical Subject Headings are controlled vocabulary of biomedical and heath terms linked the NCBI transcripts. Examples of MeSH terms are: Arteriosclerosis, Osteosarcoma, Coronary Disease, Inflammation, Leukemia, and Bipolar Disorder. Although MeSH terms are too numerous to display on the probe set details page, they are searchable in the All Descriptions field in the Standard Query.
Transcript Accessions
Transcripts from various sources may appear in this section, varying depending on the species origin of the transcripts that the probe set detects.
AGI ID – A uniform, gene nomenclature system for Arabidopsis created by the Arabidopsis Genome Initiative (AGI). AGI is an international effort to sequence the complete Arabidopsis genome.
AGI ID's are based on the following format:
At = organism 1, 2, 3, 4, 5 = chromosome g = gene 00010 = gene id.
Ensembl ID – A transcript identifier from the ENSEMBL project.
FLYBASE – A locus name from FlyBase: A database of the drosophila genome.
MGI ID – A locus identifier from the Mouse Genome Informatics (MGI) database.
RefSeq – Reference Sequences (RefSeq) are obtained from NCBI?s nonredundant and comprehensive sequence collection.
RGD ID – A locus from the Rat Genome Database (RGD).
SGD ID – A locus from the Saccharomyces Genome Database (SGD™).
SWISS–PROT – SWISS-PROT (sometimes known as SWALL) accession numbers of the peptide sequences corresponding to the mRNA?s in the UniGene cluster represented by the probe set.
UniGene ID – The UniGene collection of sequences.
WORMBASE – A locus name from Wormbase, a database of the genome and biology of C. elegans.
XDB – Xenopus Gene Database provides mappings between XGD IDs and Affymetrix probe set IDs.
|