MANUAL: apt-midas (apt-1.10.1)

apt-midas is a program for computing an alternative splicing score based on the MiDAS method described in the Affymetrix Alternative Transcript Analysis Methods for Exon Arrays whitepaper.

Quick Start

Most users will just want to compute MiDAS p-values using:

apt-midas -c celfiles.txt -g gene-plier.summary.txt -e exon-plier.summary.txt -m meta-probeset.full.txt -o apt-midas-results

Where -c specifies a cel group file, -g specifies a gene level signal estimate file, -e specifies an exon level signal estimate file, -m specifies a meta-probeset file to match the exon level data with the gene level data, and -o specifies an output directory.

Options:

midas - Microarray Detection of Alternative Splicing.
Usage:
   midas --cel-files cels.txt -g gene.summary.txt -e exon.summary.txt -m metaprobeset.txt \
            -o out -pvalues -fstats -normalized -stabilize 8.0 -no-log-transform

options:
   -h, --help                       [default 'false'] 
     --cel-files File defining mapping between cel_files and
                      experimental group_id. [default ''] 
   -g, --genedata File containing gene-level summary data.
                      [default ''] 
   -e, --exondata File containing exon-level summary data.
                      [default ''] 
   -m, --metaprobeset File containing mapping (meta-probeset file)
                      between exon probesets and gene level 
                      probesets. [default ''] 
   -o, --out-dir Directory which will contain the midas output.
                      [default ''] 
   -pv, --pvalues Output p values. [default 'true'] 
   -f, --fstats Output F statistics. [default 'false'] 
   -n, --normalized Output normalized exon intensities. [default
                      'false'] 
   -s, --stabilize Stabilization factor for log data transforms.
                      [default '8.0'] 
   -nol, --no-logtrans Do not log transform gene and exon level 
                      summary data. [default 'false'] 
     --version Display version information. [default 'false'] 
     --keep-path Keep cel file path. [default 'false'] 

Group File

The CEL group file is used to define which results are to be grouped together for the MiDAS ANOVA test. The file is a text file with at least two columns: cel_files, group_id. The cel_files column should contain values that correspond to the headers in the summary files. The group_id is an arbitrary value where rows with the same group_id value are grouped together. Here is an example:

cel_files	group_id
brain-rep1.cel	brain
brain-rep2.cel	brain
brain-rep3.cel	brain
heart-rep1.cel	heart
heart-rep2.cel	heart
heart-rep3.cel	heart

Frequently Asked Questions

Q. What meta-probeset file(s) should I use.

A. A good starting point is to use the core or extended meta probeset file along with the IterPlier method to generate gene level signal estimates. You can then use the full meta probeset file with MiDAS. In other words, use the more conservative meta probeset list to generate gene level estimates, but then compute MiDAS p-values for all exons associated with the transcript clusters in your gene level signal estimate file regardless of how speculative the content is.

Q. What is the Log Stabilization Factor For?

A. This factor (default 8) acts to stabilize the variance and helps to prevent inflated p-values for low or un-expressed probesets. The default is based on using PLIER signal summaries. You may want to use a lower value, or not log transform, if you are using RMA signal summaries.

Q. When would I want to not log transform?

A. The default is to log transform the signal estimates. This works well for PLIER input which is not already log transformed. If you are using RMA signal estimates as input, then you probably want to turn off the log transformation. The Log Stabilization Factor is ignored when log transformation is turned off.

Q. What are the normalized intensities?

A. These are the exon by sample fitted values (ie values for samples in the same group for the same exon are identical). The values are also scaled to a max of 0 by subtracting the largest observed fitted value for a given exon over all the sample groups.

Q. In MIDAS result are the p values after multiple test correction?

A. No. No multiple testing correction is applied.


Generated on Mon Nov 3 12:21:42 2008 for Affymetrix Power Tools by  doxygen 1.5.3