apt-midas is a program for computing an alternative splicing score based on the MiDAS method described in the Affymetrix Alternative Transcript Analysis Methods for Exon Arrays whitepaper.
apt-midas -c celfiles.txt -g gene-plier.summary.txt -e exon-plier.summary.txt -m meta-probeset.full.txt -o apt-midas-results
Where -c specifies a cel group file, -g specifies a gene level signal estimate file, -e specifies an exon level signal estimate file, -m specifies a meta-probeset file to match the exon level data with the gene level data, and -o specifies an output directory.
midas - Microarray Detection of Alternative Splicing.
Usage:
midas --cel-files cels.txt -g gene.summary.txt -e exon.summary.txt -m metaprobeset.txt \
-o out -pvalues -fstats -normalized -stabilize 8.0 -no-log-transform
options:
-h, --help [default 'false']
--cel-files File defining mapping between cel_files and
experimental group_id. [default '']
-g, --genedata File containing gene-level summary data.
[default '']
-e, --exondata File containing exon-level summary data.
[default '']
-m, --metaprobeset File containing mapping (meta-probeset file)
between exon probesets and gene level
probesets. [default '']
-o, --out-dir Directory which will contain the midas output.
[default '']
-pv, --pvalues Output p values. [default 'true']
-f, --fstats Output F statistics. [default 'false']
-n, --normalized Output normalized exon intensities. [default
'false']
-s, --stabilize Stabilization factor for log data transforms.
[default '8.0']
-nol, --no-logtrans Do not log transform gene and exon level
summary data. [default 'false']
--version Display version information. [default 'false']
--keep-path Keep cel file path. [default 'false']
cel_files group_id brain-rep1.cel brain brain-rep2.cel brain brain-rep3.cel brain heart-rep1.cel heart heart-rep2.cel heart heart-rep3.cel heart
A. A good starting point is to use the core or extended meta probeset file along with the IterPlier method to generate gene level signal estimates. You can then use the full meta probeset file with MiDAS. In other words, use the more conservative meta probeset list to generate gene level estimates, but then compute MiDAS p-values for all exons associated with the transcript clusters in your gene level signal estimate file regardless of how speculative the content is.
Q. What is the Log Stabilization Factor For?
A. This factor (default 8) acts to stabilize the variance and helps to prevent inflated p-values for low or un-expressed probesets. The default is based on using PLIER signal summaries. You may want to use a lower value, or not log transform, if you are using RMA signal summaries.
Q. When would I want to not log transform?
A. The default is to log transform the signal estimates. This works well for PLIER input which is not already log transformed. If you are using RMA signal estimates as input, then you probably want to turn off the log transformation. The Log Stabilization Factor is ignored when log transformation is turned off.
Q. What are the normalized intensities?
A. These are the exon by sample fitted values (ie values for samples in the same group for the same exon are identical). The values are also scaled to a max of 0 by subtracting the largest observed fitted value for a given exon over all the sample groups.
Q. In MIDAS result are the p values after multiple test correction?
A. No. No multiple testing correction is applied.
1.5.3