bioconductor v3.9.0 AnnotationForge

Provides code for generating Annotation packages and

Link to this section Summary

Functions

Create an SQLite-based annotation package

available.db0pkgs

Generates the seqnames.db package and database

Read a data file describing the probe sequences on an Affymetrix genechip

Read a 1lq file for an Affymetrix genechip

Making a chip package from annotations available from data.frame of probes mapped to gene IDs and an existing org package.

Builds an individual DB from online files at inparanoid that is in turn meant for generating Inparanoid 8 based objects.

Making an organism package from annotations available from a set of named data.frames.

Make an organism package from annotations available from NCBI.

Make a package with probe sequence related data for microarrays

Creates a sqlite database, and then makes an annotation package with it

Populates an SQLite DB with and produces a schema definition

Wrap up all the Base Databases into Packages for distribution

Link to this section Functions

Link to this function

AnnDbPkg_maker()

Create an SQLite-based annotation package

Description

Create an SQLite-based annotation package from an SQLite file.

Usage

makeAnnDbPkg(x, dbfile, dest_dir=".", no.man=FALSE, ...)
  loadAnnDbPkgIndex(file)

Arguments

ArgumentDescription
xA AnnDbPkgSeed object, a list, a string or a regular expression.
dbfileThe path to the SQLite containing the annotation data for the package to build.
dest_dirThe directory where the package will be created.
fileThe path to a DCF file containing the list of annotation packages to build.
no.manIf TRUE then no man page is included in the package.
...Extra args used for extra filtering.

Seealso

AnnDbPkg-checker

Examples

## With a "AnnDbPkgSeed" object:
seed <- new("AnnDbPkgSeed",
Package="hgu133a2.db",
Version="0.0.99",
PkgTemplate="HUMANCHIP.DB",
AnnObjPrefix="hgu133a2"
)
if (FALSE)
makeAnnDbPkg(seed, "path/to/hgu133a2.sqlite")

## With package names:
## (Note that in this case makeAnnDbPkg() will use the package descriptions
## found in the master index file ANNDBPKG-INDEX.TXT located in the
## AnnotationDbi package.)
if (FALSE)
makeAnnDbPkg(c("hgu95av2.db", "hgu133a2.db"))

## A character vector of length 1 is treated as a regular expression:
if (FALSE)
makeAnnDbPkg("hgu.*")
## To make all the packages described in the master index:
if (FALSE)
makeAnnDbPkg("")
## Extra args can be used to narrow down the roaster of packages to make:
if (FALSE) {
makeAnnDbPkg("", PkgTemplate="HUMANCHIP.DB", manufacturer="Affymetrix")
makeAnnDbPkg(".*[3k]\.db", species=c("Mouse", "Rat"))
}

## The master index file ANNDBPKG-INDEX.TXT can be loaded with:
loadAnnDbPkgIndex()
Link to this function

availabledb0pkgs()

available.db0pkgs

Description

Get the list of intermediate annotation data packages (.db0 data packages) that are currently available on the Bioconductor repositories for your version of R/Bioconductor.

Or get a list of schemas supported by AnnotationDbi.

Usage

available.db0pkgs()
  available.dbschemas()
  available.chipdbschemas()

Details

The SQLForge code uses a series of intermediate database packages that are necessary to build updated custom annotation packages. These packages must be installed or updated if you want to make a custom annotation package for a particular organism. These special intermediate packages contain the latest freeze of the data needed to build custom annotation data packages and are easily identified by the fact that they end with the special ".db0" suffix. This function will list all such packages that are available for a specific version of bioconductor.

The available.dbschemas() and available.chipdbschemas() functions allow you to get a list of the schema names that are available similar to how you can list the available ".db0" packages by using available.db0pkgs(). This list of shemas is useful (for example) when you want to build a new package and need to know the name of the schema you want to use.

Value

A character vector containing the names of the available ".db0" data packages. Or a a character vector listing the names of the available schemas.

Author

Hervé Pagès and Marc Carlson

Examples

# Get the list of BSgenome data packages currently available:
available.db0pkgs()

# Make your choice and install like this:
library(BiocManager)
install("human.db0")

# Get the list of chip DB schemas:
available.chipdbschemas()

# Get the list of ALL DB schemas:
available.dbschemas()
Link to this function

generateSeqnames_db()

Generates the seqnames.db package and database

Description

This function is used to generate the seqnames.db package and it's database from the csv files contained in the template for this package within AnnotationForge. The csv files are converted into database tables, and the DB is packaged into a new seqnames.db package.

Usage

generateSeqnames.db(version, outdir=".")

Arguments

ArgumentDescription
versionCharacter. Version number for the final package.
outdirCharacter. Path to output directory where the package is to be placed. By default the current working directory will be used.

Details

The generateSeqnames.db function allows users to regenerate the seqnames.db package from csv sources contained in the currently installed AnnotationForge package. It is expected that the typical user will not need to use this at all, but in case they do, we have made it available. We expect that the more common use case is someone who wants to make some new chromosome conventions available for the world. It is expected that this person will more typicalyl be charitable and want to share their conventions, so they could share their .csv files with us and we would add them to AnnotationForge, install the updated package and then run this function to make a new package.

The .csv files need to be formatted the same as the ones that are currently in the template in AnnotationForge. Examples of these .csv files can be found in AnnotationForge in the "inst/seqnames-template/inst/extdata/dataFiles/" directory. Each file must be named after it's corresponding genus and species with an underscore separator and a .csv file extension. The 1st line of each file defines columns that are the names of the corresponding naming conventions. And the chromosome names are then listed below this header line such that the equivalent names for the different styles share the same row.

So for example the 1st four rows of Mus_musculus look like this (but with only one newline at the end of each row):

UCSC,NCBI,ensembl

chr1,1,1

chr2,2,2

chr3,3,3

etc.

Once you have your file ready your only need to place it in the same dir in AnnoationDbi (with the other files), install AnnotationForge, and then run this function to generate a new seqnames.db package. Of course, if you have a useful set of conventions or species to contribute, it would be best if you gave your .csv files to the Bioconductor core team so that we can add these files to the official version of AnnotationForge and so that they can occur in the official seqnames.db package.

Value

A new seqnames.db package, complete with all the latest data stored in the dataFiles subdirectory

Author

Marc Carlson

Examples

generateSeqnames.db(version="1.0.0")
Link to this function

getProbeDataAffy()

Read a data file describing the probe sequences on an Affymetrix genechip

Description

Read a data file describing the probe sequences on an Affymetrix genechip

Usage

getProbeDataAffy(arraytype, datafile, pkgname = NULL, comparewithcdf = FALSE)

Arguments

ArgumentDescription
arraytypeCharacter. Array type (e.g. 'HG-U133A')
datafileCharacter with the filename of the input data file, or a connection (see example). If omitted a default name is constructed from arraytype (for details you will need to consult this function's source code).
pkgnameCharacter. Package name. If NULL the name is derived from arraytype .
comparewithcdfLogical. If TRUE, run a consistency check against a CDF package of the same name (what used to be Laurent's "extraparanoia".)

Details

This function serves as an interface between the (1) representation of array probe information data in the packages that are generated by makeProbePackage and (2) the vendor- and possibly version-specific way the data are represented in datafile .

datafile is a tabulator-separated file with one row per probe, and column names 'Probe X' , 'Probe Y' , 'Probe Sequence' , and 'Probe.Set.Name' . See the vignette for an example.

Value

A list with three components

*

Seealso

makeProbePackage

Examples

## Please refer to the vignette
Link to this function

getProbeData_1lq()

Read a 1lq file for an Affymetrix genechip

Description

Read a 1lq file for an Affymetrix genechip

Usage

getProbeData_1lq(arraytype, datafile, pkgname = NULL)

Arguments

ArgumentDescription
arraytypeCharacter. Array type (e.g. 'Scerevisiaetiling)
datafileCharacter. The filename of the input data file. If omitted a default name is constructed from arraytype (see this function's source code).
pkgnameCharacter. Package name. If NULL the name is derived from arraytype .

Details

This function serves as an interface between the (1) representation of array probe information data in the packages that are generated by makeProbePackage and (2) the vendor- and possibly version-specific way the data are represented in datafile .

Value

A list with three components

*

Seealso

makeProbePackage

Examples

## makeProbePackage(
##    arraytype = "Scerevisiaetiling",
##    maintainer= "Wolfgang Huber <huber@ebi.ac.uk>",
##    version   = "1.1.0",
##    datafile  = "S.cerevisiae_tiling.1lq",
##    importfun = "getProbeData_1lq")
Link to this function

makeChipPackageFromDataFrames()

Making a chip package from annotations available from data.frame of probes mapped to gene IDs and an existing org package.

Description

The makeChipPackage function allows the user to make an chip package from a data.frame that has two columns to define a set of probes and the gene IDs that they map to as well as an org package that contains data about those gene IDs (where the gene IDs can be used as a foreign key).

makeChipPackage is intended to be compatible with any org packages that are generated by makeOrgPackage as well as most of the older legacy org packages that were based on more popular model organisms. The one exception is the legacy org package for yeast org.Sc.sgd.db since its internal schema is just too different from everything else.

Packages produced in this way can not support the older bimap objects unless they are pointing to an older legacy org package. All packages should support select().

Usage

makeChipPackage(prefix,
                  probeFrame, 
                  orgPkgName, 
                  version,
                  maintainer,
                  author,
                  outputDir = ".",
                  tax_id,
                  genus,
                  species,
                  optionalAccessionsFrame=NULL)

Arguments

ArgumentDescription
prefixThe package name
probeFramedata.frame with two columns. The 1st column are the probes and the second column are genes (gene IDs). The gene IDs must be the main ID type for the org package that is the named in the 3rd argument.
orgPkgNameThe name of the org package that you want to make the chip package to go with
versionWhat is the version number for this package? format: 'x.y.z'
maintainerWho is the package maintainer? (must include email to be valid)
authorWho is the creator of this package?
outputDirA path where the package source should be assembled.
tax_idThe Taxonomy ID that represents your organism. (NCBI has a nice online browser for finding the one you need)
genusSingle string indicating the genus.
speciesSingle string indicating the species.
optionalAccessionsFrameIf you want to also include accessions for your probes (not used for mapping them) then you can include those here.

Value

The path to the package that just created. This is useful for calling install.packages as the next step.

Author

M. Carlson

Examples

if(interactive()){
## 1st lets list some authentic entrez gene IDs
geneIds <- c("1","10","100","1000","10000","100008586")
probeNames <- paste("probe", 1:length(geneIds), sep="")
probeFrame <- data.frame(probes=probeNames, genes=geneIds)

makeChipPackage(prefix='testChip',
probeFrame=probeFrame,
orgPkgName='org.Hs.eg.db',
version='0.99.1',
maintainer='Some One <so@someplace.org>',
author='Some One <so@someplace.org>',
outputDir='.',
tax_id='59729',
genus='Homo',
species='sapiens')

## then you can call install.packages based on the return value
install.packages('./testChip.db', repos=NULL)

}

Builds an individual DB from online files at inparanoid that is in turn meant for generating Inparanoid 8 based objects.

Description

This is really meant to be used by AnnotationHubData for eventual exposure through the AnnotationHub. Users who are interested in Inparanoid 8 should look for the relevant objects to be in AnnotationHub. This function is just a database builder that allows us to put the data up there. So this is not really intended for use by end users.

Usage

makeInpDb(dir, dataDir)

Arguments

ArgumentDescription
dirthe full path to the online Inparanpoid 8 resource.
dataDirdirectory where the source inparanoid.sqlite DB is

Value

A SQLite DB is produced but this is not returned from the function. loadDb is expected to be used by AnnotationHub to convert these into objects after the sqlite DB is downloaded from the hub server.

Author

M. Carlson

Examples

## There are paths to resource data for each set of files at Inparanoid.
## This is how you can turn those files inta sqlite DB.
if(interactive()){
db <- makeInpDb(
dir="http://inparanoid.sbc.su.se/download/current/Orthologs/A.aegypti/",
dataDir=tempdir())
}
Link to this function

makeOrgPackage()

Making an organism package from annotations available from a set of named data.frames.

Description

The makeOrgPackage function allows the user to make an organism package from any collection of data frames that are united by a common gene ID.

Usage

makeOrgPackage(...,
                 version,
                 maintainer,
                 author,
                 outputDir=getwd(),
                 tax_id,
                 genus=NULL,
                 species=NULL,
                 goTable=NULL,
                 verbose=TRUE)

Arguments

ArgumentDescription
...A set of data.frames containing annotation data. Each of these arguments must be named. Those names will become the names of the tables in the final database. Also, there are no rownames for these data.frames, and the colnames are the names that will be used as extractable fields in the final package. In other words they will be what comes back when you call columns and keytypes Finally, the 1st column of every data.frame must be labeled GID, and correspond to a gene ID that is universal for the entire set of data.frames. The GID is how the different tables will be joined internally
versionWhat is the version number for this package? format: 'x.y.z'
maintainerWho is the package maintainer? (must include email to be valid)
authorWho is the creator of this package?
outputDirA path where the package source should be assembled.
tax_idThe Taxonomy ID that represents your organism. (NCBI has a nice online browser for finding the one you need)
genusSingle string indicating the genus.
speciesSingle string indicating the species.
goTableBy default, this is NULL, but if one of your '...' data.frames has GO annotations, then this name will be the name of that argument. When you specify this, makeOrgPackage will process that data.frame to remove extra GO terms (that are too new for the current GO.db) and also will generate a table for GOALL data (based on ancestor terms for each mapping from GO.db). This table will also be checked to make sure that it has exactly THREE columns, that must be named GID, GO and EVIDENCE. These must correspond to the gene IDs, GO IDs and evidence codes respectively. GO IDs should be formatted like this to work with other DBs in the project: 'GO:XXXXXXX'.
verboseWhen TRUE progress messages are displayed.

Value

The path to the package that just created. This is useful for calling install.packages as the next step.

Author

M. Carlson

Examples

if(interactive()){

## Makes an organism package for Zebra Finch data.frames:
finchFile <- system.file("extdata","finch_info.txt",package="AnnotationForge")
finch <- read.table(finchFile,sep=" ")

## not that this is how it should always be, but that it *could* be this way.
fSym <- finch[,c(2,3,9)]
fSym <- fSym[fSym[,2]!="-",]
fSym <- fSym[fSym[,3]!="-",]
colnames(fSym) <- c("GID","SYMBOL","GENENAME")

fChr <- finch[,c(2,7)]
fChr <- fChr[fChr[,2]!="-",]
colnames(fChr) <- c("GID","CHROMOSOME")

finchGOFile <- system.file("extdata","GO_finch.txt",package="AnnotationForge")
fGO <- read.table(finchGOFile,sep=" ")
fGO <- fGO[fGO[,2]!="",]
fGO <- fGO[fGO[,3]!="",]
colnames(fGO) <- c("GID","GO","EVIDENCE")

makeOrgPackage(gene_info=fSym, chromosome=fChr, go=fGO,
version="0.1",
maintainer="Some One <so@someplace.org>",
author="Some One <so@someplace.org>",
outputDir = ".",
tax_id="59729",
genus="Taeniopygia",
species="guttata",
goTable="go")

## then you can call install.packages based on the return value
install.packages("./org.Tguttata.eg.db", repos=NULL)

}
Link to this function

makeOrgPackageFromNCBI()

Make an organism package from annotations available from NCBI.

Description

The makeOrgPackageFromNCBI function allows the user to make an organism package from NCBI annotations available from the NCBI.

Usage

makeOrgPackageFromNCBI(
    version=,
    maintainer,
    author,
    outputDir=getwd(),
    tax_id,
    genus=NULL,
    species=NULL,
    NCBIFilesDir=getwd(),
    databaseOnly=FALSE,
    useDeprecatedStyle=FALSE,
    rebuildCache=TRUE,
    verbose=TRUE)

Arguments

ArgumentDescription
versionPackage version in 'x.y.z' format.
maintainerPackage maintainer followed by email
authorCreator of package.
outputDirPath where the package source should be assembled.
tax_idThe Taxonomy ID that represents the organism.
genusSingle string indicating the genus.
speciesSingle string indicating the species.
NCBIFilesDirWhen a path is given, the files used to create the DB are saved locally.
databaseOnlyWhen TRUE, a DB is created without the package infrastructure. Used for OrgDb packages hosted on AnnotationHub.
useDeprecatedStyleLegacy support for older package style with bimaps.
rebuildCacheWhen TRUE, the files used to create the DB are refreshed (i.e., re-downloaded) if the timestamp is greater than 24 hours old. When FALSE, the temporary NCBI.sqlite DB and final package are re-generated from local files in outputDir . Used internally and for testing.
verboseWhen TRUE, status messages are printed.

Details

makeOrgPackageFromNCBI downloads multiple files and assembles a 12 GB database in NCBIFilesDir . The first time the function is run it may take over an hour; subsequent calls reuse files from the cache and are much faster. The default behavior of makeOrgPackageFromNCBI attempts to refresh the cached files each day (suppress with rebuildCache = FALSE ).

Value

Nothing returned to the R session. Just creates an organism annotation package.

Author

M. Carlson

Examples

## Makes an organism package for Zebra Finch from NCBI:

makeOrgPackageFromNCBI(version = "0.1",
author = "Some One <so@someplace.org>",
maintainer = "Some One <so@someplace.org>",
outputDir = ".",
tax_id = "59729",
genus = "Taeniopygia",
species = "guttata")
Link to this function

makeProbePackage()

Make a package with probe sequence related data for microarrays

Description

Make a package with probe sequence related data for microarrays

Usage

makeProbePackage(arraytype,
    importfun = "getProbeDataAffy",
    maintainer,
    version,
    species,
    pkgname = NULL,
    outdir  = ".",
    quiet = FALSE,
    check = TRUE, build = TRUE, unlink = TRUE, list())

Arguments

ArgumentDescription
arraytypeCharacter. Name of array type (typically a vendor's name like "HG-U133A").
importfunCharacter. Name of a function that can read the probe sequence data e.g. from a file. See getProbeDataAffy for an example.
maintainerCharacter. Name and email address of the maintainer.
versionCharacter. Version number for the package.
speciesCharacter. Species name in the format Genus_species (e.g., Homo_sapiens)
pkgnameCharacter. Name of the package. If missing, a name is created from arraytype .
outdirCharacter. Path where the package is to be written.
quietLogical. If TRUE do not print statements on progress on the console
checkLogical. If TRUE call R CMD check on the package
buildLogical. If TRUE call R CMD build on the package
unlinkLogical. If TRUE unlink (remove) the check directory (only relevant if check=TRUE )
list()Further arguments that get passed along to importfun

Details

See vignette.

Important note for list("Windows") users: Building and checking packages requires some tools outside of list() (e.g. a Perl interpreter). While these tools are standard with practically every Unix, they do not come with MS-Windows and need to be installed separately on your computer. See http://www.murdoch-sutherland.com/Rtools . If you just want to build probe packages, you will not need the compilers, and the "Windows help" stuff is optional.

Examples

filename <- system.file("extdata", "HG-U95Av2_probe_tab.gz",
package="AnnotationDbi")
outdir   <- tempdir()
me       <- "Wolfgang Huber <huber@ebi.ac.uk>"
makeProbePackage("HG-U95Av2",
datafile   = gzfile(filename, open="r"),
outdir     = outdir,
maintainer = me,
version    = "0.0.1",
species    = "Homo_sapiens",
check      = FALSE)
dir(outdir)
Link to this function

sqlForge_makeCHIPDB()

Creates a sqlite database, and then makes an annotation package with it

Description

This function 1st creates a SQLite file useful for making a SQLite based annotation package by using the correct popXXXCHIP_DB function. Next, this function produces an annotation package featuring the sqlite database produced. All makeXXXXChip_DB functions REQUIRE that you previously have installed the appropriate XXXX.db0 package. Call the function available.db0pkgs() to see what your options are, and then install the appropriate package with BiocManager::install().

Usage

makeDBPackage(schema, ...)
#  usage case with required arguments
#  makeDBPackage(schema, affy, prefix, fileName, baseMapType, version)
#  usage case with all arguments
#  makeDBPackage(schema, affy, prefix, fileName, otherSrc, chipMapSrc,
#  chipSrc, baseMapType, outputDir, version, manufacturer, chipName,
#  manufacturerUrl, author, maintainer)

Arguments

ArgumentDescription
schemaString listing the schema that you want to use to make the DB. You can list schemas with available.dbschemas()
affyBoolean to indicate if this is starting from an affy csv file or not. If it is, then that will be parsed to make the sqlite file, if not, then you can feed a tab delimited file with IDs as was done before with AnnBuilder.
prefixprefix is the first part of the eventual desired package name. (ie. "prefix.db")
fileNameThe path and filename for the file to be parsed. This can either be an affy csv file or it can be a more classic file type.
otherSrcThe path and filenames to any other lists of IDs which might add information about how a probe will map.
chipMapSrcThe path and filename to the intermediate database containing the mapping data for allowed ID types and how these IDs relate to each other. If not provided, then the appropriate source DB from the most current .db0 package will be used instead.
chipSrcThe path and filename to the intermediate database containing the annotation data for the sqlite to build. If not provided, then the appropriate source DB from the most current .db0 package will be used instead.
baseMapTypeThe type of ID that is used for the initial base mapping. If using a classic base mapping file, this should be the ID type present in the fileName. This can be any of the following values: "gb" = for genbank IDs "ug" = unigene IDs "eg" = Entrez Gene IDs "refseq" = refseq IDs "gbNRef" = mixture of genbank and refseq IDs
outputDirWhere you would like the output files to be placed.
versionWhat is the version number for the desired package.
manufacturerWho made the chip being described.
chipNameWhat is the name of the chip.
manufacturerUrlURL for manufacturers website.
authorList of authors involved in making the package.
maintainerList of package maintainers with email addresses for contact purposes.
...Just used so we can have a wrapper function. Ignore this argument.

Examples

##Build the hgu95av2.db package
makeDBPackage(
"HUMANCHIP_DB",
affy = TRUE,
prefix = "hgu95av2",
fileName = "~/proj/mcarlson/sqliteGen/srcFiles/hgu95av2/HG_U95Av2_annot.csv.070824",
otherSrc = c(
EA="~/proj/mcarlson/sqliteGen/srcFiles/hgu95av2/hgu95av2.EA.txt",
UMICH="~/proj/mcarlson/sqliteGen/srcFiles/hgu95av2/hgu95av2_UMICH.txt"),
baseMapType = "gbNRef",
version = "1.0.0",
manufacturer = "Affymetrix",
chipName = "hgu95av2",
manufacturerUrl = "http://www.affymetrix.com")
Link to this function

sqlForge_popDB()

Populates an SQLite DB with and produces a schema definition

Description

Creates SQLite file useful for making a SQLite based annotation package. Also produces the schema file which details the schema for the database produced.

Usage

populateDB(schema, ...)
#  usage case with required arguments
#  populateDB(schema, prefix, chipSrc, metaDataSrc)
#  usage case with all possible arguments
#  populateDB(schema, affy, prefix, fileName, chipMapSrc, chipSrc,
#  metaDataSrc, otherSrc, baseMapType, outputDir, printSchema)

Arguments

ArgumentDescription
schemaString listing the schema that you want to use to make the DB. You can list schemas with available.dbschemas()
affyBoolean to indicate if this is starting from an affy csv file or not. If it is, then that will be parsed to make the sqlite file, if not, then you can feed a tab delimited file with IDs as was done before with AnnBuilder.
prefixprefix is the first part of the eventual desired package name. (ie. "prefix.sqlite")
fileNameThe path and filename for the mapping file to be parsed. This can either be an affy csv file or it can be a more classic file type. This is only needed when making chip packages.
chipMapSrcThe path and filename to the intermediate database containing the mapping data for allowed ID types and how these IDs relate to each other. If not provided, then the appropriate source DB from the most current .db0 package will be used instead.
chipSrcThe path and filename to the intermediate database containing the annotation data for the sqlite to build. If not provided, then the appropriate source DB from the most current .db0 package will be used instead.
metaDataSrcEither a named character vector containing pertinent information about the metadata OR the path and filename to the intermediate database containing the metadata information for the package. If this is a custom package, then it must be a named vector with the following fields: metaDataSrc <- c( DBSCHEMA="the DB schema", ORGANISM="the organism", SPECIES="the species", MANUFACTURER="the manufacturer", CHIPNAME="the chipName", MANUFACTURERURL="the manufacturerUrl")
otherSrcThe path and filenames to any other lists of IDs which might add information about how a probe will map.
baseMapTypeThe type of ID that is used for the initial base mapping. If using a classic base mapping file, this should be the ID type present in the fileName. This can be any of the following values: "gb" = for genbank IDs "ug" = unigene IDs "eg" = Entrez Gene IDs "refseq" = refseq IDs "gbNRef" = mixture of genbank and refseq IDs
outputDirWhere you would like the output files to be placed.
printSchemaBoolean to indicate whether or not to produce an output of the schema (default is FALSE).
...Just used so we can have a wrapper function. Ignore this argument.

Examples

##Set up the metadata
my_metaDataSrc <- c( DBSCHEMA="the DB schema",
ORGANISM="the organism",
SPECIES="the species",
MANUFACTURER="the manufacturer",
CHIPNAME="the chipName",
MANUFACTURERURL="the manufacturerUrl")

##Builds the org.Hs.eg sqlite:
populateDB(
"HUMAN_DB",
prefix="org.Hs.eg",
chipSrc = "~/proj/mcarlson/sqliteGen/annosrc/db/chipsrc_human.sqlite",
metaDataSrc = my_metaDataSrc,
printSchema=TRUE)


##Builds hgu95av2.sqlite:
populateDB(
"HUMANCHIP_DB",
affy=TRUE,
prefix="hgu95av2",
fileName="~/proj/mcarlson/sqliteGen/srcFiles/hgu95av2/HG_U95Av2.na27.annot.csv",
metaDataSrc=my_metaDataSrc,
baseMapType="gbNRef")


##Builds the ag.sqlite:
populateDB("ARABIDOPSISCHIP_DB",
affy=TRUE,
prefix="ag",
metaDataSrc=my_metaDataSrc)


##Builds yeast2.sqlite:
populateDB(
"YEASTCHIP_DB",
affy=TRUE,
prefix="yeast2",
fileName="~/proj/mcarlson/sqliteGen/srcFiles/yeast2/Yeast_2.na27.annot.csv",
metaDataSrc=metaDataSrc)
Link to this function

sqlForge_wrapDBPackages()

Wrap up all the Base Databases into Packages for distribution

Description

Creates extremely simple packages from the base database files for distribution. This is a convenience function for wrapping up these packages in a consistent way each time.

Usage

wrapBaseDBPackages(dbPath, destDir, version)

Arguments

ArgumentDescription
dbPathdbPath is just the path to the location of the latest intermediate sqlite source files. These files are then used to make base DB packages.
destDirdestination path for the newly minted packages.
versionversion number to stamp onto these newly minted packages.

Examples

##Make all of the intermediate DBs and place the new packages right here.
wrapBaseDBPackages(dbPath, destDir = ".")