tximport: import and summarize transcript-level estimates for gene-level analysis

https://github.com/mikelove/tximport/blob/master/vignettes/tximport.md

kallisto

First, read in some kallisto example files:

library(tximportData)

dir <- system.file("extdata", package="tximportData")

list.files(dir)

[6] "samples.txt" "tmp"

samples <- read.table(file.path(dir, "samples.txt"), header=TRUE)

files <- file.path(dir, "kallisto", samples$run, "abundance.tsv")

names(files) <- paste0("sample", 1:6)

Transcripts need to be associated with gene IDs for gene-level summarization. If the quantification files already contain that information, this step can be skipped. The kallisto and Salmon output files, however, provide only the transcript ID, so we first make a data.frame with two columns: gene ID (column 1) and transcript ID (column 2). The column names do not matter, but this column order must be used. Such a table can be built from a TxDb object with the select function.
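A minimal sketch of the TxDb approach mentioned above. It assumes the Bioconductor annotation package TxDb.Hsapiens.UCSC.hg19.knownGene is installed and uses it only as a stand-in: the TxDb has to come from the same annotation as the transcriptome used for quantification, otherwise the transcript IDs will not match those in the abundance files. The name gene2tx is chosen here for illustration. The column order follows the description above (gene ID first). Released versions of tximport document a tx2gene argument with the transcript ID in the first column, so it is worth checking the help page of the installed version before relying on this order.

```r
# Assumption: this TxDb package is only a stand-in; use the TxDb that matches
# the annotation your transcripts were quantified against.
library(TxDb.Hsapiens.UCSC.hg19.knownGene)

txdb <- TxDb.Hsapiens.UCSC.hg19.knownGene

# All gene IDs known to the TxDb
k <- keys(txdb, keytype = "GENEID")

# One row per gene/transcript pair (a gene can map to many transcripts)
gene2tx <- AnnotationDbi::select(txdb, keys = k,
                                 keytype = "GENEID", columns = "TXNAME")

# Make the column order explicit: gene ID (column 1), transcript ID (column 2)
gene2tx <- gene2tx[, c("GENEID", "TXNAME")]

head(gene2tx)
```

To get the transcript ID in the first column instead, swap the two names in the last subsetting step.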



Written by M.