bioconductor v3.9.0 Illuminaio

Tools for parsing Illumina's microarray output files, including IDAT.

Summary

Functions

Low-level function to read BGX files

Low-level function to read BPM files

Low-level function to read IDAT files

Functions

Low-level function to read BGX files

Description

Reads BGX files (array design information) from Illumina expression arrays.

Usage

readBGX(file)

Arguments

Argument    Description
file        character string specifying the BGX file to be read in

Details

Illumina provide probe annotation for expression arrays either as tab-separated text files or as files with the extension .bgx. BGX files are in fact just gzipped versions of the text files. This function reads such files and returns a list with two entries: the first contains the target-probe information and the second contains details of the control probes.
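
Since a BGX file is described above as gzipped text, base R's gzfile() connection can read such content without a separate unzip step. The sketch below is an illustration only: it writes a small, made-up gzipped tab-separated file and reads it back. The column names and values are hypothetical, and a real BGX file has more structure than this toy.

```r
# Toy illustration only: the columns below are made up and do not
# reproduce the layout of a real BGX file.
toy <- data.frame(Probe_Id = c("P1", "P2", "P3"),
                  Symbol   = c("GENE_A", "GENE_B", "GENE_C"),
                  stringsAsFactors = FALSE)

# Write the table as gzipped, tab-separated text.
toyFile <- tempfile(fileext = ".bgx")
con <- gzfile(toyFile, open = "wt")
write.table(toy, con, sep = "\t", quote = FALSE, row.names = FALSE)
close(con)

# Read it back through a gzfile() connection; no manual unzip is needed.
con <- gzfile(toyFile, open = "rt")
readBack <- read.delim(con, stringsAsFactors = FALSE)
close(con)

readBack
```

For real files, readBGX(file) remains the intended entry point, since it also separates the target probes from the control probes.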

Value

A list with two entries: the target-probe information and the details of the control probes.

Author

Mike Smith. Based loosely on code developed by Mark Dunning, available in beadarray ~1.16.0.

Low-level function to read BPM files

Description

Reads BPM files (array design information) from Illumina arrays.

WARNING: This function is deprecated since illuminaio 0.1.4 (January 2016), because its parser is invalid/incorrect. (Issue #6)

Usage

readBPM(file)

Arguments

Argument    Description
file        character string specifying the BPM file to be read in

Value

A list.

Author

Originally created by Keith Baggerly in 2008. Later included in the crlmm package.

Low-level function to read IDAT files

Description

Reads intensity information for each bead type from a single IDAT file for Infinium II, Infinium HD and expression platforms.

Usage

readIDAT(file, what = c("all", "IlluminaID", "nSNPsRead"))

Arguments

Argument    Description
file        character string specifying the IDAT file to be read in.
what        Selects which parts of the IDAT file to return; see Value. This argument is only supported for non-encrypted IDAT files; setting it to anything other than the default on an encrypted file throws an error.

Details

This function returns a list containing summarised intensities and other information extracted from a single IDAT file.

Thanks to Keith Baggerly for providing the first version of this code.

There are two commonly encountered versions of IDAT. One version uses encryption to store data and is the format usually produced by Illumina expression arrays. The other version is unencrypted and is found in data from SNP arrays and DNA methylation arrays.

This function is able to handle data in either format, scanning the file header and calling the appropriate reading routine internally.

The function supports reading gzipped, unencrypted IDATs.

The what argument is intended for package writers.

Value

The exact data returned will vary depending upon whether the input file was encrypted or not. In either case, everything that can be extracted from the file will be returned in the form of a list.

For both file types the most important entry in the returned list is the item Quants. When reading unencrypted files this contains the average intensity (Mean), number of beads (NBeads) and a measure of variability (SD) for each bead type on the array. For expression arrays some additional information is available, including median and trimmed-mean estimates of average intensity, averaged local background intensities and the number of beads present before outliers were excluded.
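
As a rough sketch of working with the Quants entry, the code below builds a small stand-in object with the Mean, SD and NBeads columns named above and keeps only bead types measured on a minimum number of beads. The values, the bead-type IDs, the minBeads threshold, and the assumption that Quants is a matrix with bead-type IDs as row names are all illustrative; check the structure of the object returned for your own file.

```r
# Hypothetical stand-in for idat$Quants from an unencrypted file;
# the values and bead-type IDs are invented.
quants <- matrix(c(1200, 150, 18,
                    300,  40,  4,
                   5100, 600, 22),
                 ncol = 3, byrow = TRUE,
                 dimnames = list(c("10001", "10002", "10003"),
                                 c("Mean", "SD", "NBeads")))

# Keep bead types measured on at least minBeads beads
# (minBeads is an arbitrary example threshold).
minBeads <- 5
kept <- quants[quants[, "NBeads"] >= minBeads, , drop = FALSE]

# Mean intensities of the retained bead types, named by bead-type ID.
kept[, "Mean"]
```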

For both file types details of scanning and decoding dates are included in the RunInfo slot.

A further list entry, Extra, may also be present. Occasionally a field in the IDAT file will have a number of data points inconsistent with the remainder of the file and as such is not returned in the Quants entry. Such fields will be placed here. So far only the HumanHap550 v1 BeadChip has exhibited this property.

If what is set to "nSNPsRead", the function returns an integer equal to this field. If what is set to "IlluminaID", the function returns a character vector containing the IDs.
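
The partial-return behaviour of what can be sketched as follows. This assumes that the illuminaio and IlluminaDataTestFiles packages are installed and that the test file used in the Examples section is unencrypted; if it were encrypted, the documentation above says a non-default what throws an error. The variable names n and ids are illustrative only.

```r
# n and ids stay NULL when the required packages are not installed.
n <- NULL
ids <- NULL

if (requireNamespace("illuminaio", quietly = TRUE) &&
    requireNamespace("IlluminaDataTestFiles", quietly = TRUE)) {
  idatFile <- system.file("extdata", "idat", "4019585376_B_Red.idat",
                          package = "IlluminaDataTestFiles")

  # Only the nSNPsRead field, returned as a single integer.
  n <- illuminaio::readIDAT(idatFile, what = "nSNPsRead")

  # Only the Illumina IDs, returned as a vector.
  ids <- illuminaio::readIDAT(idatFile, what = "IlluminaID")
}
```

Requesting a single field in this way avoids building the full list when only the IDs or the count are needed, which is why the argument is aimed at package writers.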

Author

Unencrypted: Keith Baggerly, with modifications by Matt Ritchie, Henrik Bengtsson and Kasper Daniel Hansen. Encrypted: Mike Smith.

References

Ritchie ME, Carvalho BS, Hetrick KN, Tavaré S, Irizarry RA. R/Bioconductor software for Illumina's Infinium whole-genome genotyping BeadChips. Bioinformatics. 2009 Oct 1;25(19):2621-3.

Examples

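if (require(IlluminaDataTestFiles)) {
  # Locate an example IDAT file shipped with IlluminaDataTestFiles
  idatFile <- system.file("extdata", "idat", "4019585376_B_Red.idat",
                          package = "IlluminaDataTestFiles")
  idat <- readIDAT(idatFile)
  names(idat)          # entries of the returned list
  idat$Quants[1:5, ]   # first five rows of the Quants entry
}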