bioconductor v3.9.0 Illuminaio
Tools for parsing Illumina's microarray output files, including IDAT.
Link to this section Summary
Functions
Low-level function to read BGX files
Low-level function to read BPM files
Low-level function to read IDAT files
Link to this section Functions
readBGX()
Low-level function to read BGX files
Description
Reads BGX files (array design information) from Illumina expression arrays.
Usage
readBGX(file)
Arguments
Argument | Description |
---|---|
file | character string specifying BGX file to be read in |
Details
Illumina provide probe annotation for expression arrays as either tab separated text files or as files with the extension .bgx. BGX files are infact just gzipped versions of the text files. This function reads such files and returns a list with two entries, the first containing the target-probe information and the second containing details of the control probes.
Value
A list.
Author
Mike Smith. Based losely on code developed by Mark Dunning available in beadarray ~1.16.0.
readBPM()
Low-level function to read BPM files
Description
Reads BPM files (array design information) from Illumina arrays.
list("WARNING: This function is deprecated since illuminaio 0.1.4 ", " (January 2016), because its parser is invalid/incorrect. (Issue #6)")
Usage
readBPM(file)
Arguments
Argument | Description |
---|---|
file | character string specifying BPM file to be read in |
Value
A list.
Author
Originally created by Keith Baggerly in 2008. Later included in the crlmm package.
readIDAT()
Low-level function to read IDAT files
Description
Reads intensity information for each bead type from a single IDAT file for Infinium II, Infinium HD and expression platforms.
Usage
readIDAT(file, what = c("all", "IlluminaID", "nSNPsRead"))
Arguments
Argument | Description |
---|---|
file | character string specifying IDAT file to be read in. |
what | This allows the return of parts of the IDAT file, see Value. This argument is only supported for non-encrypted IDAT files; setting it to be different from default on an encrypted file throws an error. |
Details
This function returns a list containing summarised intensities and other information extracted from a single IDAT file.
Thanks to Keith Baggerly for providing the first version of this code.
There are two commonly encounted versions of IDAT. One version uses encryption to store data and is the format usually produced by Illumina expression arrays. The other version is unencrypted files and is found in data from SNP arrays and DNA methylation arrays.
This function is able to handle data in either format, scanning the file header and calling the appropriate reading routine internally.
The function supports reading gzipped, unencrypted IDATs.
The use of the what
argument is for package writers.
Value
The exact data returned will vary depending upon whether the input file was encrypted or not. In either case, everything that can be extract from the file will be returned in the form of a list.
For both file types the most important entry in the returned list is the
item Quants
. When reading unencrypted files this contains average intensity ( Mean
),
number of beads ( NBeads
) and a measure of variability
( SD
) for each bead type on the array. For expression arrays
in addition to these some additional information is available,
including median and trimmed-mean estimates of average intensity,
averaged local background intensities and the number of beads present
before outliers where excluded.
For both file types details of scanning and decoding dates are
included in the RunInfo
slot.
A further list entry, Extra
, may also be present. Occassionally an field
in the IDAT file will have a number of data points inconsistent with the remainder
of the file and as such is not returned in the Quants
entry. Such
fields will be placed here. So far only the HumanHap550 v1 BeadChip has exhibited
this property.
In case what
is set to nSNPsRead
the function returns an
integer equal to this field. In case what
is set to
IlluminaID
the function returns a character vector containing
the IDs.
Author
Unencrypted: Keith Baggerly, with modifications by Matt Ritchie, Henrik Bengtsson and Kasper Daniel Hansen. Encrypted: Mike Smith
References
Ritchie ME, Carvalho BS, Hetrick KN, Tavar'e S, Irizarry RA. R/Bioconductor software for Illumina's Infinium whole-genome genotyping BeadChips. Bioinformatics. 2009 Oct 1;25(19):2621-3.
Examples
if(require(IlluminaDataTestFiles)) {
idatFile <- system.file("extdata", "idat", "4019585376_B_Red.idat",
package = "IlluminaDataTestFiles")
idat <- readIDAT(idatFile)
names(idat)
idat$Quants[1:5,]
}