The APT implementation of canary requires files containing prior information on how the probe intensity summaries of individual CNV regions will cluster. For an implementation that does not use priors, see the Birdsuite release from the Broad Institute: www.broad.mit.edu/mpg/birdsuite/.
To run canary with the above input files, use GenomeWideSNP_6.cdf as the CDF file. The CEL files must be compatible with this CDF.
apt-canary \
  --out-dir canary-results \
  --cdf-file ../regression-data/data/lib/GenomeWideSNP_6/GenomeWideSNP_6.cdf \
  --cnv-region-file inputs/GenomeWideSNP_6.canary-v1.region \
  --cnv-normalization-file inputs/GenomeWideSNP_6.canary-v1.normalization \
  --cnp-prior-file inputs/GenomeWideSNP_6.canary-v1.prior \
  --cnv-map-file inputs/GenomeWideSNP_6.canary-v1.bed \
  --cel-files inputs/celfiles.txt
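The --cel-files option (see Input Options) expects a text file whose first line is 'cel_files', followed by one CEL file path per line. A minimal sketch of building such a file is given here. The function name write_cel_list, the .CEL suffix, and the directory layout are assumptions for illustration and are not part of APT.

```shell
#!/bin/sh
# Sketch: write the list file that --cel-files expects.
# write_cel_list is a hypothetical helper, not an APT command.
# Usage: write_cel_list <directory with .CEL files> <output list file>
write_cel_list() {
    cel_dir=$1
    list_file=$2

    # Make sure the directory for the list file exists.
    mkdir -p "$(dirname "$list_file")"

    {
        # The first line must be the literal header 'cel_files'.
        printf 'cel_files\n'

        # Then one CEL path per line. Assumes files end in .CEL.
        for f in "$cel_dir"/*.CEL; do
            # Skip the unexpanded pattern when no file matches.
            if [ -f "$f" ]; then
                printf '%s\n' "$f"
            fi
        done
    } > "$list_file"
}
```

Calling `write_cel_list cel inputs/celfiles.txt` (with a hypothetical cel directory) would produce a list file at the path given to --cel-files in the command above.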
apt-canary - Call copy number states for defined regions using the canary algorithm
options:
Basic Info and Control Options
  -h, --help                This message. [default 'false']
  --explain                 Explain a particular operation (i.e.
                            --explain canary). [default '']
  --verbose                 How verbose to be with status messages 0 -
                            quiet, 1 - usual messages, 2 - more
                            messages. [default '1']
  --version                 Output program version and quit. [default
                            'false']
  --force                   Disable various checks including chip types
                            and map file versions. Consider using
                            --chip-type option rather than --force.
                            [default 'false']
Input Options
  --cel-files               Text file specifying cel files to process,
                            one per line with the first line being
                            'cel_files'. [default '']
  --cdf-file                File defining probe sets. Use either
                            --cdf-file or --spf-file [default '']
  --spf-file                File defining probe sets in spf (simple
                            probe format) which is like a text cdf
                            file. [default '']
  --cnv-region-file         File defining CNV regions and what
                            probesets to use for each CNV region.
                            [default '']
  --cnp-prior-file          File defining the canary priors for a given
                            CNV regions file. [default '']
  --cnv-map-file            File (bed format) used for visualizing CNV
                            regions in other applications. This arg
                            causes the map file name to be included in
                            the CHP meta info. [default '']
  --cnv-normalization-file  File containing probesets to use
                            (restricted to) for doing probe level
                            normalization. [default '']
  --chip-type               Chip types to check library and CEL files
                            against. Can be specified multiple times.
                            The first one is propagated as the chip
                            type in the output files. Warning, use of
                            this option will override the usual check
                            between chip types found in the library
                            files and cel files. You should use this
                            option instead of --force when possible.
                            [default '']
Output Options
  --out-dir                 Directory to write result files into. Any
                            previous results in directory will be
                            overwritten. [default '.']
  --table-output            Output matching matrices of tab delimited
                            genotype calls and confidences. [default
                            'true']
  --cc-chp-output           Output resulting calls in binary CHP
                            format. This makes one AGCC Multi Data CHP
                            file per cel file analyzed. [default
                            'false']
Analysis Options
  --apt-summarize-analysis  String representing analysis parameters for
                            the apt-probeset-summarize step a.k.a.
                            pre-canary [default '']
  --apt-canary-analysis     String representing analysis parameters for
                            canary. See --explain canary for more info.
                            [default '']
Execution Control Options
  --block-size              How many probesets to process at once,
                            useful when memory is limited. If set to 0
                            program attempts to guess available RAM and
                            set appropriately. [default '0']
  --precision               Precision after decimal place [default '4']
  --analysis-name           Set the name of the analysis. [default '']
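
The help text recommends --chip-type over --force, and notes that --out-dir overwrites previous results. A sketch of the example run with --chip-type and --cc-chp-output added, printed as a dry run before anything is executed, might look like the following. The chip type string GenomeWideSNP_6, the bare-flag form of --cc-chp-output, and the RUN_CANARY switch are assumptions for illustration and are not taken from the help text.

```shell
#!/bin/sh
# Sketch: the example run with --chip-type (instead of --force) and CHP output.
# Assumption: the chip type string matches the CDF name.
chip_type="GenomeWideSNP_6"

# Collect the arguments as positional parameters so they can be
# printed and then passed on unchanged.
# Assumption: --cc-chp-output can be given as a bare flag.
set -- \
    --out-dir canary-results \
    --chip-type "$chip_type" \
    --cc-chp-output \
    --cdf-file ../regression-data/data/lib/GenomeWideSNP_6/GenomeWideSNP_6.cdf \
    --cnv-region-file inputs/GenomeWideSNP_6.canary-v1.region \
    --cnv-normalization-file inputs/GenomeWideSNP_6.canary-v1.normalization \
    --cnp-prior-file inputs/GenomeWideSNP_6.canary-v1.prior \
    --cnv-map-file inputs/GenomeWideSNP_6.canary-v1.bed \
    --cel-files inputs/celfiles.txt

# Dry run: show the full command so it can be checked first.
canary_cmd="apt-canary $*"
printf '%s\n' "$canary_cmd"

# Run only when explicitly requested (RUN_CANARY is a hypothetical
# switch) and when the apt-canary binary is on PATH.
if [ "${RUN_CANARY:-0}" = "1" ] && command -v apt-canary >/dev/null 2>&1; then
    apt-canary "$@"
fi
```

Printing the command first is a cautious choice here, since any previous results in the output directory are overwritten by a real run.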
1.5.3