Gene ontology analysis

Gene ontologies are controlled vocabularies used to describe genes. Read more abou gene ontologies at www.geneontology.org

Files needed for the analysis

Analysis

Before doing any analysis the GO terms must be linked to the dataset.

There are two types of analysis you can do:

Outputting results

Press F1 for more help on this component.

 

Mapping

  1. Click the ( Gene Ontology Mapping ) button or select from the methods menu.
  2. In the Mapping File combo box, select the correct mapping file.
  3. J-Express can use one of two columns from the mapping file. If Use Synonyms is selected column 11 will be used, and if it is not selected column 3 will be used. Open the mapping file in Excel to see what type of ids it has in column 3 and 11, or see the README file for your organism. When you close the file again, do not save changes if you get this question. Excel sometimes does strange things to files.
  4. In the Data Identifier Column combo box, select the column from your J-Express data set containing the same gene identifiers that is used in column 3 or 11 in the mapping file. If none of these identifiers are not present in your J-Express dataset, use the Annotation Manager to add it.
  5. Click Map Data set

 

Browsing

  1. It is often a good idea to have a Gene Graph open when browsing GO terms, so that you can see the gene profiles of the gene terms you select. Click the ( Gene Graph ) button or selecting Methods | Charts | Gene Graph Viewer from the J-Express menu bar.
  2. Move the GO TREE window and the Gene Graph window so that you can see both at the same time.
  3. In the Gene Graph window: click the ( Shadow Unselected ) button. All gene profiles should now become faint. When making selections in other components you can see the expression profiles of the selected genes in this window.
  4. In the GO TREE window: open the different Gene Ontology terms (by double clicking or clicking the open handle) and browse the trees. See how the gene graph profiles change as you select a node (if the number of genes in the object are more than 1 and less than the Maximum Members, default 100).

 

Overrepresentation Analysis

After having found some interesting genes, e.g. from differential expression analysis, GO overrepresentation analysis can be used to see if particular GO terms are enriched in the gene list. To decide whether there is an overrepresentation of some GO terms, the number of genes in the gene list linked to particular terms must be compared with the expected number of genes linked to that term. This can be calculated from a reference dataset. Follow the steps below to see how you can do this in J-Express.

  1. Select the dataset you want to use as reference dataset. This will typically be the dataset you used for differential expression analysis. It should contain all eg. all of the genes on the array or all genes that are considered to be expressed in the system you are looking at.
  2. Open the GO component and follow the steps described under the Mapping section
  3. Select the dataset containing only the gene list of interesting genes that you want to check for overrepresentation
  4. Open another GO component and follow the steps described under the Mapping section
  5. In the GO component for the gene list, select Statistical comparison to other GO components from the View menu.
  6. A pop-up window will tell you that it found the GO component for the reference dataset. Click Ok.
  7. A new window will appear containing a list of sorted GO terms, with the most enriched terms at the top of the list. Select a GO term from the top of the list. This term will then be highlighted in the GO tree windows where you can see where it belongs in the hierarchy, and if you have a Gene Graph open, you can see the gene profiles of the genes liked to this GO term.

 

Outputting results

The File and View menues contain different ways of output results form the GO component.