apt-cel-extract extracts probe level intensities from cel files. The program memory maps every cel file from which intensities are extracted. As a result, there is an upper limit on the number of cel files that can be extracted in a single run. The practical limit appears to be 25 cel files on a 32-bit Windows XP computer with 2 GB of memory.
apt-cel-extract -o out.txt [-c chip.clf -p chip.pgf] [-d chip.cdf] \
--probeset-ids=norm-exon.txt --probeset-ids=norm-intron.txt \
--probe-ids=antigenomic.bgp *.cel
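Because every cel file is memory mapped, a large experiment may need to be split across several runs. The following Python sketch shows one way to do that. It is an illustration, not part of the tool: the function names, the output naming scheme, and the use of 25 as the batch size (the practical limit quoted above for one specific machine) are all assumptions.

```python
from typing import Iterator, List, Sequence

# Assumption: 25 is the practical limit quoted above for a 32-bit
# Windows XP machine with 2 GB of memory; other systems may differ.
MAX_CELS_PER_RUN = 25


def batch_cel_files(cel_files: Sequence[str],
                    max_per_run: int = MAX_CELS_PER_RUN) -> Iterator[List[str]]:
    """Yield consecutive groups of at most max_per_run cel file names."""
    if max_per_run < 1:
        raise ValueError("max_per_run must be at least 1")
    for start in range(0, len(cel_files), max_per_run):
        yield list(cel_files[start:start + max_per_run])


def build_commands(cel_files: Sequence[str],
                   out_prefix: str = "intensities") -> List[List[str]]:
    """Build one apt-cel-extract argument list per batch.

    The numbered output names (intensities.001.txt, ...) are a
    hypothetical convention chosen for this sketch.
    """
    commands = []
    for index, batch in enumerate(batch_cel_files(cel_files), start=1):
        out_file = f"{out_prefix}.{index:03d}.txt"
        commands.append(["apt-cel-extract", "-o", out_file, *batch])
    return commands


if __name__ == "__main__":
    # Sixty made-up file names, only to show the batching.
    names = [f"sample{i:02d}.CEL" for i in range(60)]
    for argv in build_commands(names):
        print(argv[2], len(argv) - 3, "cel files")
```

Each argument list could then be handed to subprocess.run. The per-batch outputs would still have to be merged afterwards, which this sketch does not attempt.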
apt-cel-extract - Extract probe level intensities from cel files.
Usage:
apt-cel-extract -o intensities.txt \
[-c chip.clf -p chip.pgf] | [-d chip.cdf] \
[--probeset-ids norm-exon.txt] \
[--probe-ids probelist.txt] \
[--analysis analysis-string] \
[--pm-only] | [--pm-with-mm-only] \
[--report-background] \
[--target-sketch-file sketchfile.txt] \
[--block-size num-probes-per-iteration] \
[--cel-files celfiles.txt] | [ file1.CEL ... ]
Synopsis:
Simple extraction with no library files:
apt-cel-extract -o intensities.txt *.CEL
Simple extraction of myprobes.txt from WT-based expression array:
apt-cel-extract -o intensities.txt -c chip.clf \
-p chip.pgf --probeset-ids myprobes.txt *.CEL
Simple extraction from CDF-based expression array:
apt-cel-extract -o intensities.txt -d chip.cdf *.CEL
Extract quantile(sketch) normalized PM with GCBG values:
apt-cel-extract -o intensities.txt -c chip.clf \
-p chip.pgf -b chip.bgp -a quant-norm,pm-gcbg \
--report-background --cel-files celfiles.txt
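The examples above can also be assembled from a script. The Python sketch below writes a cel list file and builds the argument list for a run. Several things here are assumptions and should be checked against the installed version: the list file is written with a "cel_files" header line followed by one file name per line (the convention used by other APT tools), -d is treated as mutually exclusive with -c/-p as the usage line suggests, and the function names are hypothetical.

```python
from typing import List, Optional, Sequence


def write_cel_list(cel_files: Sequence[str], path: str,
                   header: str = "cel_files") -> str:
    """Write a cel list file: a header line, then one file name per line.

    Assumption: the 'cel_files' header is the format expected by
    --cel-files; the help text above does not spell the format out.
    """
    with open(path, "w", newline="\n") as handle:
        handle.write(header + "\n")
        for name in cel_files:
            handle.write(name + "\n")
    return path


def build_extract_argv(out_file: str, cel_list: str,
                       clf: Optional[str] = None,
                       pgf: Optional[str] = None,
                       cdf: Optional[str] = None,
                       probeset_ids: Sequence[str] = ()) -> List[str]:
    """Build an apt-cel-extract argument list using only documented flags."""
    if cdf and (clf or pgf):
        raise ValueError("give either -d or -c/-p, not both")
    if bool(clf) != bool(pgf):
        raise ValueError("-c and -p must be given together")
    argv = ["apt-cel-extract", "-o", out_file]
    if cdf:
        argv += ["-d", cdf]
    elif clf:
        argv += ["-c", clf, "-p", pgf]
    # --probeset-ids may be specified multiple times.
    for ids_file in probeset_ids:
        argv += ["--probeset-ids", ids_file]
    argv += ["--cel-files", cel_list]
    return argv
```

A run with no library files is still allowed by this sketch, matching the first example above; the layout check only rejects a partial or mixed set of library files.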
options:
--probeset-ids File containing probeset ids to extract
probe level data for. If no probeset-ids
and no probe-ids file is provided,
information will be extracted for all
probes. May be specified multiple times.
[default '']
--probe-ids File containing probe ids to extract probe
level data for. May be specified multiple
times. [default '']
-c, --clf-file The cel layout file, describing where a
probe is within the cel file. [default '']
-p, --pgf-file The probe group file, describing what
probes are included in what probe sets.
[default '']
-d, --cdf-file Alternate method for describing probe sets.
Either -d or both -c and -p is required.
[default '']
-s, --spf-file Use simple probe file (SPF) for chip
layout. [Experimental] [default '']
-b, --bgp-file The background probes file, describing what
probes are to be used for gc background
adjustment. Required if --pm-gcbg
extraction is requested. [default '']
--cel-files Optional file containing the names of cel
files to extract data from. [default '']
-o, --out-file Output file to contain the extraction
output. If not provided, the output will go
to stdout (the console). [default '']
--target-sketch-file A target sketch to normalize to when using
quant-norm. [default '']
-a, --analysis An analysis string (no quant method) to use
to transform intensities. [EXPERIMENTAL]
[default '']
-f, --force Override sanity checks, for instance,
requiring the same lib file set/version in
pgf, clf, and bgp files. [default 'false']
--pm-only Only report PM probes. Requires chip layout
information. [default 'false']
--pm-with-mm-only Only report PM probes which have an MM. Note
only the PM probes are reported. Requires
chip layout information. [default 'false']
--report-background Report the background value associated with
each probe. Requires an analysis string and
chip layout information. [EXPERIMENTAL]
[default 'false']
--version Display version information. [default
'false']
-h, --help Print help message. [default 'false']
-v, --verbose How verbose to be with status messages 0 -
quiet, 1 - usual messages, 2 - more
messages. [default '1']
--block-size How many probes to process at once, useful
when memory is limited. There is no attempt
to guess a reasonable block size if set to
0. When set to zero, the value is set to
the number of features on the array (i.e. one
iteration). [default '0']
--use-disk Use disk based representation to avoid
excessive RAM use. [default 'true']
--disk-dir Directory for temporary files when working
off disk. Using network mounted drives is
not advised. When not set, the output
folder will be used. [default '']
--disk-cache Size of memory cache when working off disk
in megabytes. [default '50']
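The --block-size behaviour described above can be pictured with a short Python sketch. It only illustrates the documented semantics (0 means a single iteration over all features); it is not the tool's implementation, and the function name and the half-open index ranges are assumptions made for the example.

```python
from typing import Iterator, Tuple


def block_ranges(num_features: int,
                 block_size: int = 0) -> Iterator[Tuple[int, int]]:
    """Yield half-open (start, end) index ranges, one per iteration.

    block_size == 0 mirrors the documented default: the block size
    becomes the number of features, so there is one iteration.
    """
    if num_features < 0 or block_size < 0:
        raise ValueError("num_features and block_size must be non-negative")
    if block_size == 0:
        block_size = num_features
    start = 0
    while start < num_features:
        end = min(start + block_size, num_features)
        yield (start, end)
        start = end


if __name__ == "__main__":
    print(list(block_ranges(10)))      # one block covering everything
    print(list(block_ranges(10, 4)))   # several smaller blocks
```

A smaller block size means more iterations, each touching fewer probes, which is the trade-off the option offers when memory is limited.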
A. See the FAQ item on probe IDs for more info.
1.5.3