Contents

The scanMiRData package contains KdModel collections corresponding to all human, mouse and rat mirbase miRNAs. These represent the miRNA binding affinities (or more exactly, dissociation rates) predicted using the CNN from McGeary, Lin et al. (2019). To know more about the KdModel and KdModelList classes, see the corresponding vignette in the scanMiR package.

1 Loading the collections

The objects can be loaded to the environment using data:

library(scanMiR)
data("mmu", package="scanMiRData")
summary(mmu)
## A `KdModelList` object created on 2020-11-22,
##  containing binding affinity models from 1978 miRNAs.
## Mus musculus, miRBase release 22.1
## 
##               Low-confidence             Poorly conserved 
##                         1215                          346 
##     Conserved across mammals Conserved across vertebrates 
##                          134                          203 
##                         <NA> 
##                           80
head(mmu)
## An object of class "KdModelList"
## [[1]]
## A `KdModel` for mmu-let-7a-1-3p (Low-confidence)
##   Sequence: CUAUACAAUCUACUGUCUUUCC
##   Canonical target seed: TTGTATA(A)
## [[2]]
## A `KdModel` for mmu-let-7a-2-3p (Low-confidence)
##   Sequence: CUGUACAGCCUCCUAGCUUUC
##   Canonical target seed: CTGTACA(A)
## [[3]]
## A `KdModel` for mmu-let-7a-5p (Conserved across vertebrates)
##   Sequence: UGAGGUAGUAGGUUGUAUAGUU
##   Canonical target seed: CTACCTC(A)
## [[4]]
## A `KdModel` for mmu-let-7b-3p (Low-confidence)
##   Sequence: CUAUACAACCUACUGCCUUCCC
##   Canonical target seed: TTGTATA(A)
## [[5]]
## A `KdModel` for mmu-let-7b-5p (Conserved across vertebrates)
##   Sequence: UGAGGUAGUAGGUUGUGUGGUU
##   Canonical target seed: CTACCTC(A)
## [[6]]
## A `KdModel` for mmu-let-7c-1-3p (Low-confidence)
##   Sequence: CUGUACAACCUUCUAGCUUUCC
##   Canonical target seed: TTGTACA(A)

Alternatively, they can also be loaded (and filtered) through a convenient function:

library(scanMiRData)
mmu <- getKdModels("mmu", categories=c("Conserved across vertebrates", 
                                       "Conserved across mammals"))
summary(mmu)
## A `KdModelList` object containing binding affinity models from 337 miRNAs.
## 
##               Low-confidence             Poorly conserved 
##                            0                            0 
##     Conserved across mammals Conserved across vertebrates 
##                          134                          203

Summary of the other two collections:

summary(getKdModels("hsa"))
## A `KdModelList` object containing binding affinity models from 2656 miRNAs.
## 
##               Low-confidence             Poorly conserved 
##                         1913                          290 
##     Conserved across mammals Conserved across vertebrates 
##                          136                          192 
##                         <NA> 
##                          125
summary(getKdModels("rno"))
## A `KdModelList` object containing binding affinity models from 764 miRNAs.
## 
##               Low-confidence             Poorly conserved 
##                          482                            2 
##     Conserved across mammals Conserved across vertebrates 
##                          112                          165 
##                         <NA> 
##                            3



Session info

## R version 4.4.0 beta (2024-04-15 r86425)
## Platform: x86_64-pc-linux-gnu
## Running under: Ubuntu 22.04.4 LTS
## 
## Matrix products: default
## BLAS:   /home/biocbuild/bbs-3.19-bioc/R/lib/libRblas.so 
## LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_GB              LC_COLLATE=C              
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## time zone: America/New_York
## tzcode source: system (glibc)
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] scanMiRData_1.10.0 scanMiR_1.10.0     BiocStyle_2.32.0  
## 
## loaded via a namespace (and not attached):
##  [1] sass_0.4.9              utf8_1.2.4              generics_0.1.3         
##  [4] stringi_1.8.3           digest_0.6.35           magrittr_2.0.3         
##  [7] evaluate_0.23           grid_4.4.0              bookdown_0.39          
## [10] fastmap_1.1.1           jsonlite_1.8.8          GenomeInfoDb_1.40.0    
## [13] BiocManager_1.30.22     httr_1.4.7              fansi_1.0.6            
## [16] UCSC.utils_1.0.0        scales_1.3.0            Biostrings_2.72.0      
## [19] codetools_0.2-20        jquerylib_0.1.4         cli_3.6.2              
## [22] rlang_1.1.3             crayon_1.5.2            XVector_0.44.0         
## [25] cowplot_1.1.3           munsell_0.5.1           cachem_1.0.8           
## [28] yaml_2.3.8              tools_4.4.0             parallel_4.4.0         
## [31] BiocParallel_1.38.0     seqLogo_1.70.0          dplyr_1.1.4            
## [34] colorspace_2.1-0        ggplot2_3.5.1           GenomeInfoDbData_1.2.12
## [37] BiocGenerics_0.50.0     vctrs_0.6.5             R6_2.5.1               
## [40] stats4_4.4.0            lifecycle_1.0.4         pwalign_1.0.0          
## [43] zlibbioc_1.50.0         S4Vectors_0.42.0        IRanges_2.38.0         
## [46] pkgconfig_2.0.3         pillar_1.9.0            bslib_0.7.0            
## [49] gtable_0.3.5            glue_1.7.0              data.table_1.15.4      
## [52] xfun_0.43               tibble_3.2.1            GenomicRanges_1.56.0   
## [55] tidyselect_1.2.1        knitr_1.46              htmltools_0.5.8.1      
## [58] rmarkdown_2.26          compiler_4.4.0