Contents

1 Installation

Install the most recent stable version from Bioconductor:

if (!requireNamespace("BiocManager", quietly=TRUE))
    install.packages("BiocManager")
BiocManager::install("nanotubes")

And load nanotubes:

library(nanotubes)

Alternatively, you can install the development version directly from GitHub using devtools:

devtools::install_github("MalteThodberg/nanotubes")

2 Citation

If you use nanotubes, please cite the following article:

citation("nanotubes")
## To cite package 'nanotubes' in publications use:
## 
##   Bornholdt et al. Identification of Gene Transcription Start Sites and
##   Enhancers Responding to Pulmonary Carbon Nanotube Exposure in Vivo,
##   ACS Nano (2017)
## 
## A BibTeX entry for LaTeX users is
## 
##   @Article{,
##     title = {Identification of Gene Transcription Start Sites and Enhancers Responding to Pulmonary Carbon Nanotube Exposure in Vivo},
##     author = {Jette Bornholdt and Anne Thoustrup Saber and Berit Lilje and Mette Boyd and Mette Jørgensen and Yun Chen and Morana Vitezic and Nicklas Raun Jacobsen and Sarah Søs Poulsen and Trine Berthing and Simon Bressendorff and Kristoffer Vitting-Seerup and Robin Andersson and Karin Sørig Hougaard and Carole L. Yauk and Sabina Halappanavar and Håkan Wallin and Ulla Vogel and Albin Sandelin},
##     year = {2017},
##     journal = {ACS Nano},
##     doi = {10.1021/acsnano.6b07533},
##     volume = {11},
##     number = {4},
##     pages = {3597-3613},
##   }

3 Getting help

For general questions about the usage of nanotubes, use the official Bioconductor support forum and tag your question “nanotubes”. We strive to answer questions as quickly as possible.

For technical questions, bug reports and suggestions for new features, we refer to the nanotubes github page

4 Data description

CAGE Transcription Start Sites (CTSSs), the number of CAGE tag 5’-end mapping to each genomic location data was obtained from the authors of the original study. Remaining data used in the study can be downloaded from GSE70386.

One technical replicate was not included.

5 Accessing study design and BigWig-Files

Load the data into R to get an overview of the included samples:

data("nanotubes")
knitr::kable(nanotubes)
Class Name BigWigPlus BigWigMinus
C547 Ctrl C547 mm9.CAGE_7J7P_NANO_KON_547.plus.bw mm9.CAGE_7J7P_NANO_KON_547.minus.bw
C548 Ctrl C548 mm9.CAGE_ULAC_NANO_KON_548.plus.bw mm9.CAGE_ULAC_NANO_KON_548.minus.bw
C549 Ctrl C549 mm9.CAGE_YM4F_Nano_KON_549.plus.bw mm9.CAGE_YM4F_Nano_KON_549.minus.bw
C559 Ctrl C559 mm9.CAGE_RSAM_NANO_559.plus.bw mm9.CAGE_RSAM_NANO_559.minus.bw
C560 Ctrl C560 mm9.CAGE_CCLF_NANO_560.plus.bw mm9.CAGE_CCLF_NANO_560.minus.bw
N13 Nano N13 mm9.CAGE_KTRA_Nano_13.plus.bw mm9.CAGE_KTRA_Nano_13.minus.bw
N14 Nano N14 mm9.CAGE_RSAM_NANO_14.plus.bw mm9.CAGE_RSAM_NANO_14.minus.bw
N15 Nano N15 mm9.CAGE_RFQS_Nano_15.plus.bw mm9.CAGE_RFQS_Nano_15.minus.bw
N16 Nano N16 mm9.CAGE_CCLF_NANO_16.plus.bw mm9.CAGE_CCLF_NANO_16.minus.bw
N17 Nano N17 mm9.CAGE_RSAM_NANO_17.plus.bw mm9.CAGE_RSAM_NANO_17.minus.bw
N18 Nano N18 mm9.CAGE_CCLF_NANO_18.plus.bw mm9.CAGE_CCLF_NANO_18.minus.bw

Load a data from a BigWig-file into R using the rtracklayer package:

library(rtracklayer)
bw_fname <- system.file("extdata", nanotubes$BigWigPlus[1], 
                        package = "nanotubes", 
                        mustWork = TRUE)
import(bw_fname)
## GRanges object with 1055261 ranges and 1 metadata column:
##                seqnames    ranges strand |     score
##                   <Rle> <IRanges>  <Rle> | <numeric>
##         [1]        chr1   3297935      * |         1
##         [2]        chr1   3405982      * |         1
##         [3]        chr1   3575580      * |         1
##         [4]        chr1   3612051      * |         1
##         [5]        chr1   3638567      * |         2
##         ...         ...       ...    ... .       ...
##   [1055257] chrY_random  52340197      * |         1
##   [1055258] chrY_random  55206004      * |         1
##   [1055259] chrY_random  55312786      * |         1
##   [1055260] chrY_random  56729517      * |         1
##   [1055261] chrY_random  56966442      * |         1
##   -------
##   seqinfo: 29 sequences from an unspecified genome

6 Importing into CAGEfightR

The data contained in the package is correctly formatted for analysis with the CAGEfightR package:

library(CAGEfightR)

# Setup paths
bw_plus <- system.file("extdata", nanotubes$BigWigPlus, 
                        package = "nanotubes", 
                        mustWork = TRUE)
bw_minus <- system.file("extdata", nanotubes$BigWigMinus, 
                        package = "nanotubes", 
                        mustWork = TRUE)

# Save as named BigWigFileList
bw_plus <- BigWigFileList(bw_plus)
bw_minus <- BigWigFileList(bw_minus)
names(bw_plus) <- names(bw_minus) <- nanotubes$Name

# Quantify
CTSSs <- quantifyCTSSs(bw_plus, bw_minus, design=nanotubes)

See the CAGEfightR vignette for more details.

7 Session info

sessionInfo()
## R version 4.4.0 beta (2024-04-15 r86425)
## Platform: x86_64-pc-linux-gnu
## Running under: Ubuntu 22.04.4 LTS
## 
## Matrix products: default
## BLAS:   /home/biocbuild/bbs-3.19-bioc/R/lib/libRblas.so 
## LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_GB              LC_COLLATE=C              
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## time zone: America/New_York
## tzcode source: system (glibc)
## 
## attached base packages:
## [1] stats4    stats     graphics  grDevices utils     datasets  methods  
## [8] base     
## 
## other attached packages:
## [1] rtracklayer_1.64.0   GenomicRanges_1.56.0 GenomeInfoDb_1.40.0 
## [4] IRanges_2.38.0       S4Vectors_0.42.0     BiocGenerics_0.50.0 
## [7] nanotubes_1.20.0     BiocStyle_2.32.0    
## 
## loaded via a namespace (and not attached):
##  [1] sass_0.4.9                  SparseArray_1.4.0          
##  [3] bitops_1.0-7                lattice_0.22-6             
##  [5] digest_0.6.35               evaluate_0.23              
##  [7] grid_4.4.0                  bookdown_0.39              
##  [9] fastmap_1.1.1               jsonlite_1.8.8             
## [11] Matrix_1.7-0                restfulr_0.0.15            
## [13] BiocManager_1.30.22         httr_1.4.7                 
## [15] UCSC.utils_1.0.0            XML_3.99-0.16.1            
## [17] Biostrings_2.72.0           codetools_0.2-20           
## [19] jquerylib_0.1.4             abind_1.4-5                
## [21] cli_3.6.2                   rlang_1.1.3                
## [23] crayon_1.5.2                XVector_0.44.0             
## [25] Biobase_2.64.0              cachem_1.0.8               
## [27] DelayedArray_0.30.0         yaml_2.3.8                 
## [29] S4Arrays_1.4.0              tools_4.4.0                
## [31] parallel_4.4.0              BiocParallel_1.38.0        
## [33] GenomeInfoDbData_1.2.12     Rsamtools_2.20.0           
## [35] SummarizedExperiment_1.34.0 curl_5.2.1                 
## [37] R6_2.5.1                    BiocIO_1.14.0              
## [39] matrixStats_1.3.0           lifecycle_1.0.4            
## [41] zlibbioc_1.50.0             bslib_0.7.0                
## [43] xfun_0.43                   GenomicAlignments_1.40.0   
## [45] MatrixGenerics_1.16.0       knitr_1.46                 
## [47] rjson_0.2.21                htmltools_0.5.8.1          
## [49] rmarkdown_2.26              compiler_4.4.0             
## [51] RCurl_1.98-1.14