Compiled date: 2023-10-24

Last edited: 2018-03-08

License: MIT + file LICENSE

1 Background

iSEE coordinates the coloration in every plot via the ExperimentColorMap class (Rue-Albrecht et al. 2018). Colors for samples or features are defined from column or row metadata or assay values using “colormaps”. Each colormap is a function that takes a single integer argument and returns that number of distinct colors. The ExperimentColorMap is a container that stores these functions for use within the iSEE() function. Users can define their own colormaps to customize coloration for specific assays or covariates.

2 Defining colormaps

2.1 Colormaps for continuous variables

For continuous variables, the function will be asked to generate a number of colors (21, by default). Interpolation will then be performed internally to generate a color gradient. Users can use existing color scales like viridis::viridis or heat.colors:

# Coloring for log-counts:
logcounts_color_fun <- viridis::viridis

It is also possible to use a function that completely ignores any arguments, and simply returns a fixed number of interpolation points:

# Coloring for FPKMs:
fpkm_color_fun <- function(n){
    c("black","brown","red","orange","yellow")
}

2.2 Colormaps for categorical variables

For categorical variables, the function should accept the number of levels and return a color per level. Colors are automatically assigned to factor levels in the specified order of the levels.

# Coloring for the 'driver' metadata variable.
driver_color_fun <- function(n){
    RColorBrewer::brewer.pal(n, "Set2")
}

Alternatively, the function can ignore its arguments and simply return a named vector of colors if users want to specify the color for each level explicitly It is the user’s responsibility to ensure that all levels are accounted for1 Needless to say, these functions should not be used as shared or global colormaps.. For instance, the following colormap function will only be compatible with factors of two levels, namely "Y" and "N":

# Coloring for the QC metadata variable:
qc_color_fun <- function(n){
    qc_colors <- c("forestgreen", "firebrick1")
    names(qc_colors) <- c("Y", "N")
    qc_colors
}

3 The colormap hierarchy

3.1 Specific and shared colormaps

Colormaps can be defined by users at three different levels:

  • Each individual assay, column data field, and row data field can be assigned its own distinct colormap. Those colormaps are stored as named lists of functions in the assays, colData, and rowData slots, respectively, of the ExperimentColorMap. This can be useful to easily remember which assay is currently shown; to apply different color scale limits to assays that vary on different ranges of values; or display boolean information in an intuitive way, among many other scenarios.
  • Shared colormaps can be defined for all assays, all column data, and all row data. These colormaps are stored in the all_discrete and all_continuous slots of the ExperimentColorMap, as lists of functions named assays, colData, and rowData.
  • Global colormaps can be defined for all categorical or continuous data. Those two colormaps are stored in the global_discrete and global_continuous slots of the ExperimentColorMap.

3.2 Searching for colors

When queried for a specific colormap of any type (assay, column data, or row data), the following process takes place:

  • A specific individual colormap is looked up in the appropriate slot of the ExperimentColorMap.
  • If it is not found, the shared colormap of the appropriate slot is looked up, according to whether the data are categorical or continuous.
  • If it is not found, the global colormap is looked up, according to whether the data are categorical or continuous.
  • If none of the above colormaps were defined, the ExperimentColorMap will revert to the default colormaps.

By default, viridis is used as the default continuous colormap, and hcl is used as the default categorical colormap.

4 Creating the ExperimentColorMap

We store the set of colormap functions in an instance of the ExperimentColorMap class. Named functions passed as assays, colData, or rowData arguments will be used for coloring data in those slots, respectively.

library(iSEE)
ecm <- ExperimentColorMap(
    assays = list(
        counts = heat.colors,
        logcounts = logcounts_color_fun,
        cufflinks_fpkm = fpkm_color_fun
    ),
    colData = list(
        passes_qc_checks_s = qc_color_fun,
        driver_1_s = driver_color_fun
    ),
    all_continuous = list(
        assays = viridis::plasma
    )
)
ecm
#> Class: ExperimentColorMap
#> assays(3): counts logcounts cufflinks_fpkm
#> colData(2): passes_qc_checks_s driver_1_s
#> rowData(0):
#> all_discrete(0):
#> all_continuous(1): assays

Users can change the defaults for all assays or column data by modifying the shared colormaps. Similarly, users can modify the defaults for all continuous or categorical data by modifying the global colormaps. This is demonstrated below for the continuous variables:

ExperimentColorMap(
    all_continuous=list( # shared
        assays=viridis::plasma,
        colData=viridis::inferno
    ),
    global_continuous=viridis::magma # global
)
#> Class: ExperimentColorMap
#> assays(0):
#> colData(0):
#> rowData(0):
#> all_discrete(0):
#> all_continuous(2): assays colData
#> global_continuous(1)

5 Benefits

The ExperimentColorMap class offers the following major features:

  • A single place to define flexible and lightweight sets of colormaps, that may be saved and reused across sessions and projects outside the app, to apply consistent coloring schemes across entire projects
  • A simple interface through accessors colDataColorMap(colormap, "coldata_name") and setters assayColorMap(colormap, "assay_name") <- colormap_function
  • An elegant fallback mechanism to consistently return a colormap, even for undefined covariates, including a default categorical and continuous colormap, respectively.
  • Three levels of colormaps override: individual, shared within slot (i.e., assays, colData, rowData), or shared globally between all categorical or continuous data scales.

Detailed examples on the use of ExperimentColorMap objects are available in the documentation ?ExperimentColorMap, as well as below.

6 Demonstration

Here, we use the allen single-cell RNA-seq data set to demonstrate the use of the ExperimentColorMap class. Using the sce object that we created previously, we create an iSEE app with the SingleCellExperiment object and the colormap generated above.

app <- iSEE(sce, colormap = ecm)

We run this using runApp to open the app on our browser.

shiny::runApp(app)

Now, choose to color cells by Column data and select passes_qc_checks_s. We will see that all cells that passed QC (Y) are colored “forestgreen”, while the ones that didn’t pass are colored firebrick.

If we color any plot by gene expression, we see that use of counts follows the heat.colors coloring scheme; use of log-counts follows the viridis coloring scheme; and use of FPKMs follows the black-to-yellow scheme we defined in fpkm_color_fun.

Session Info

sessionInfo()
#> R version 4.3.1 (2023-06-16)
#> Platform: x86_64-pc-linux-gnu (64-bit)
#> Running under: Ubuntu 22.04.3 LTS
#> 
#> Matrix products: default
#> BLAS:   /home/biocbuild/bbs-3.18-bioc/R/lib/libRblas.so 
#> LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
#> 
#> locale:
#>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
#>  [3] LC_TIME=en_GB              LC_COLLATE=C              
#>  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
#>  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
#>  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
#> 
#> time zone: America/New_York
#> tzcode source: system (glibc)
#> 
#> attached base packages:
#> [1] stats4    stats     graphics  grDevices utils     datasets  methods  
#> [8] base     
#> 
#> other attached packages:
#>  [1] TENxPBMCData_1.19.0         HDF5Array_1.30.0           
#>  [3] rhdf5_2.46.0                DelayedArray_0.28.0        
#>  [5] SparseArray_1.2.0           S4Arrays_1.2.0             
#>  [7] abind_1.4-5                 Matrix_1.6-1.1             
#>  [9] scater_1.30.0               ggplot2_3.4.4              
#> [11] scuttle_1.12.0              scRNAseq_2.15.0            
#> [13] iSEE_2.14.0                 SingleCellExperiment_1.24.0
#> [15] SummarizedExperiment_1.32.0 Biobase_2.62.0             
#> [17] GenomicRanges_1.54.0        GenomeInfoDb_1.38.0        
#> [19] IRanges_2.36.0              S4Vectors_0.40.0           
#> [21] BiocGenerics_0.48.0         MatrixGenerics_1.14.0      
#> [23] matrixStats_1.0.0           BiocStyle_2.30.0           
#> 
#> loaded via a namespace (and not attached):
#>   [1] splines_4.3.1                 later_1.3.1                  
#>   [3] BiocIO_1.12.0                 bitops_1.0-7                 
#>   [5] filelock_1.0.2                tibble_3.2.1                 
#>   [7] XML_3.99-0.14                 lifecycle_1.0.3              
#>   [9] doParallel_1.0.17             lattice_0.22-5               
#>  [11] ensembldb_2.26.0              magrittr_2.0.3               
#>  [13] sass_0.4.7                    rmarkdown_2.25               
#>  [15] jquerylib_0.1.4               yaml_2.3.7                   
#>  [17] httpuv_1.6.12                 DBI_1.1.3                    
#>  [19] RColorBrewer_1.1-3            zlibbioc_1.48.0              
#>  [21] Rtsne_0.16                    purrr_1.0.2                  
#>  [23] AnnotationFilter_1.26.0       RCurl_1.98-1.12              
#>  [25] rappdirs_0.3.3                circlize_0.4.15              
#>  [27] GenomeInfoDbData_1.2.11       ggrepel_0.9.4                
#>  [29] irlba_2.3.5.1                 DelayedMatrixStats_1.24.0    
#>  [31] codetools_0.2-19              DT_0.30                      
#>  [33] xml2_1.3.5                    tidyselect_1.2.0             
#>  [35] shape_1.4.6                   ScaledMatrix_1.10.0          
#>  [37] viridis_0.6.4                 shinyWidgets_0.8.0           
#>  [39] BiocFileCache_2.10.0          GenomicAlignments_1.38.0     
#>  [41] jsonlite_1.8.7                GetoptLong_1.0.5             
#>  [43] BiocNeighbors_1.20.0          ellipsis_0.3.2               
#>  [45] iterators_1.0.14              foreach_1.5.2                
#>  [47] tools_4.3.1                   progress_1.2.2               
#>  [49] Rcpp_1.0.11                   glue_1.6.2                   
#>  [51] gridExtra_2.3                 xfun_0.40                    
#>  [53] mgcv_1.9-0                    dplyr_1.1.3                  
#>  [55] shinydashboard_0.7.2          withr_2.5.1                  
#>  [57] BiocManager_1.30.22           fastmap_1.1.1                
#>  [59] rhdf5filters_1.14.0           fansi_1.0.5                  
#>  [61] shinyjs_2.1.0                 digest_0.6.33                
#>  [63] rsvd_1.0.5                    R6_2.5.1                     
#>  [65] mime_0.12                     colorspace_2.1-0             
#>  [67] biomaRt_2.58.0                RSQLite_2.3.1                
#>  [69] utf8_1.2.4                    generics_0.1.3               
#>  [71] rtracklayer_1.62.0            prettyunits_1.2.0            
#>  [73] httr_1.4.7                    htmlwidgets_1.6.2            
#>  [75] pkgconfig_2.0.3               gtable_0.3.4                 
#>  [77] blob_1.2.4                    ComplexHeatmap_2.18.0        
#>  [79] XVector_0.42.0                htmltools_0.5.6.1            
#>  [81] bookdown_0.36                 ProtGenerics_1.34.0          
#>  [83] rintrojs_0.3.2                clue_0.3-65                  
#>  [85] scales_1.2.1                  png_0.1-8                    
#>  [87] knitr_1.44                    rjson_0.2.21                 
#>  [89] nlme_3.1-163                  curl_5.1.0                   
#>  [91] shinyAce_0.4.2                cachem_1.0.8                 
#>  [93] GlobalOptions_0.1.2           stringr_1.5.0                
#>  [95] BiocVersion_3.18.0            parallel_4.3.1               
#>  [97] miniUI_0.1.1.1                vipor_0.4.5                  
#>  [99] AnnotationDbi_1.64.0          restfulr_0.0.15              
#> [101] pillar_1.9.0                  grid_4.3.1                   
#> [103] vctrs_0.6.4                   promises_1.2.1               
#> [105] BiocSingular_1.18.0           dbplyr_2.3.4                 
#> [107] beachmat_2.18.0               xtable_1.8-4                 
#> [109] cluster_2.1.4                 beeswarm_0.4.0               
#> [111] evaluate_0.22                 GenomicFeatures_1.54.0       
#> [113] cli_3.6.1                     compiler_4.3.1               
#> [115] Rsamtools_2.18.0              rlang_1.1.1                  
#> [117] crayon_1.5.2                  ggbeeswarm_0.7.2             
#> [119] stringi_1.7.12                viridisLite_0.4.2            
#> [121] BiocParallel_1.36.0           munsell_0.5.0                
#> [123] Biostrings_2.70.0             lazyeval_0.2.2               
#> [125] colourpicker_1.3.0            ExperimentHub_2.10.0         
#> [127] hms_1.1.3                     sparseMatrixStats_1.14.0     
#> [129] bit64_4.0.5                   Rhdf5lib_1.24.0              
#> [131] KEGGREST_1.42.0               shiny_1.7.5.1                
#> [133] interactiveDisplayBase_1.40.0 AnnotationHub_3.10.0         
#> [135] fontawesome_0.5.2             igraph_1.5.1                 
#> [137] memoise_2.0.1                 bslib_0.5.1                  
#> [139] bit_4.0.5
# devtools::session_info()

References

Rue-Albrecht, K., F. Marini, C. Soneson, and A. T. L. Lun. 2018. “ISEE: Interactive Summarizedexperiment Explorer.” F1000Research 7 (June): 741.