iSEEu 1.12.0
The iSEE package (Rue-Albrecht et al. 2018) provides a general and flexible framework for interactively exploring SummarizedExperiment
objects.
However, in many cases, more specialized panels are required for effective visualization of specific data types.
The iSEEu package implements a collection of such dedicated panel classes that work directly in the iSEE
application and can smoothly interact with other panels.
This allows users to quickly parametrize bespoke apps for their data to address scientific questions of interest.
We first load in the package:
library(iSEEu)
All the panels described in this document can be deployed by simply passing them into the iSEE()
function via the initial=
argument, as shown in the following examples.
To demonstrate the use of these panels,
we will perform a differential expression analysis on the airway dataset with the edgeR package.
We store the resulting statistics in the rowData
of the SummarizedExperiment
so that it can be accessed by iSEE
panels.
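library(airway)
data(airway)
library(edgeR)
# Build a DGEList from the count matrix and the sample annotation.
y <- DGEList(assay(airway), samples=colData(airway))
# Remove lowly expressed genes, then compute normalization factors.
y <- y[filterByExpr(y, group=y$samples$dex),]
y <- calcNormFactors(y)
# Fit a quasi-likelihood model and test for the dex effect (second coefficient).
design <- model.matrix(~dex, y$samples)
y <- estimateDisp(y, design)
fit <- glmQLFit(y, design)
res <- glmQLFTest(fit, coef=2)
# Store the statistics in the rowData; genes removed by filtering get NA values.
tab <- topTags(res, n=Inf)$table
rowData(airway) <- cbind(rowData(airway), tab[rownames(airway),])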
The MAPlot
class creates an MA plot, i.e., a scatter plot with the log-fold change on the y-axis and the average expression on the x-axis.
Features with significant differences in each direction are highlighted and counted on the legend.
Users can vary the significance threshold and apply ad hoc filters on the log-fold change.
This is a subclass of the RowDataPlot
so points can be transmitted to other panels as multiple row selections.
Instances of this class are created like:
ma.panel <- MAPlot(PanelWidth=6L)
app <- iSEE(airway, initial=list(ma.panel))
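The highlighting and counting described above can be illustrated with a minimal base R sketch. This is not the code used by the panel: the toy statistics, the classify_de() helper and its default thresholds are assumptions made purely for illustration, and only the column names (logFC, logCPM, PValue) follow the edgeR output used earlier.

```r
# Toy DE statistics; the values are made up for illustration.
stats <- data.frame(
    logFC  = c(2.5, -3.1, 0.2, 1.8, -0.4, -2.2),
    logCPM = c(5, 7, 3, 9, 4, 6),
    PValue = c(0.001, 0.0005, 0.6, 0.2, 0.01, 0.003)
)

# Hypothetical helper: label each feature as "up", "down" or "none" given a
# significance threshold and an ad hoc filter on the absolute log-fold change.
classify_de <- function(logfc, pvalue, p.threshold = 0.05, lfc.threshold = 0) {
    sig <- pvalue <= p.threshold & abs(logfc) >= lfc.threshold
    out <- rep("none", length(logfc))
    out[sig & logfc > 0] <- "up"
    out[sig & logfc < 0] <- "down"
    factor(out, levels = c("down", "none", "up"))
}

stats$direction <- classify_de(stats$logFC, stats$PValue, lfc.threshold = 1)

# Counts per direction, similar in spirit to what is shown on the legend.
counts <- table(stats$direction)
print(counts)

# MA plot: average expression on the x-axis, log-fold change on the y-axis.
if (interactive()) {
    cols <- c(down = "blue", none = "grey", up = "red")
    plot(stats$logCPM, stats$logFC, pch = 16,
        col = cols[as.character(stats$direction)],
        xlab = "Average expression (logCPM)", ylab = "Log-fold change")
}
```

Raising lfc.threshold or lowering p.threshold in this sketch moves features from the "up" and "down" groups into "none", which mirrors the effect of changing the corresponding settings in the panel.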
The VolcanoPlot
class creates a volcano plot with the log-fold change on the x-axis and the negative log-p-value on the y-axis.
Features with significant differences in each direction are highlighted and counted on the legend.
Users can vary the significance threshold and apply ad hoc filters on the log-fold change.
This is a subclass of the RowDataPlot
so points can be transmitted to other panels as multiple row selections.
Instances of this class are created like:
vol.panel <- VolcanoPlot(PanelWidth=6L)
app <- iSEE(airway, initial=list(vol.panel))
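As a small illustration of the y-axis used in a volcano plot, the sketch below converts a few made-up p-values to the negative log10 scale and locates the horizontal cut-off that corresponds to a chosen significance threshold. The values and the threshold of 0.05 are assumptions for illustration only, not defaults of the panel.

```r
# Made-up p-values, ordered from least to most significant.
pvalues <- c(0.5, 0.05, 0.001, 1e-6)

# Smaller p-values map to larger values on the y-axis.
neg.log.p <- -log10(pvalues)

# A significance threshold becomes a horizontal line on the same scale.
threshold <- 0.05
cutoff <- -log10(threshold)

# Features at or above the line pass the threshold.
above <- neg.log.p >= cutoff
print(data.frame(pvalues, neg.log.p, above))
```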
The LogFCLogFCPlot
class creates a scatter plot of two log-fold changes from different DE comparisons.
This allows us to compare DE results on the same dataset - or even from different datasets, as long as the row names are shared.
Users can vary the significance threshold used to identify DE genes in either or both comparisons.
This is a subclass of the RowDataPlot
so points can be transmitted to other panels as multiple row selections.
Instances of this class are created like:
# Creating another comparison, this time by blocking on the cell line
design.alt <- model.matrix(~cell + dex, y$samples)
y.alt <- estimateDisp(y, design.alt)
fit.alt <- glmQLFit(y.alt, design.alt)
res.alt <- glmQLFTest(fit.alt, coef=2)
tab.alt <- topTags(res.alt, n=Inf)$table
rowData(airway) <- cbind(rowData(airway), alt=tab.alt[rownames(airway),])
lfc.panel <- LogFCLogFCPlot(PanelWidth=6L, YAxis="alt.logFC",
YPValueField="alt.PValue")
app <- iSEE(airway, initial=list(lfc.panel))
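The idea of comparing two sets of DE results through shared row names can be sketched in base R as follows. The two toy tables, the 0.05 threshold and the status labels are assumptions for illustration and do not reflect the internals of the panel.

```r
# Two hypothetical result tables with partly overlapping row names.
tab1 <- data.frame(
    logFC = c(2, -1.5, 0.1, 3),
    PValue = c(0.001, 0.02, 0.8, 0.2),
    row.names = c("geneA", "geneB", "geneC", "geneD")
)
tab2 <- data.frame(
    logFC = c(1.8, -0.3, 0.2, -2.5),
    PValue = c(0.004, 0.4, 0.5, 0.01),
    row.names = c("geneA", "geneB", "geneC", "geneE")
)

# Only features present in both comparisons can be placed on the scatter plot.
shared <- intersect(rownames(tab1), rownames(tab2))

both <- data.frame(
    x.logFC = tab1[shared, "logFC"],
    y.logFC = tab2[shared, "logFC"],
    x.sig = tab1[shared, "PValue"] <= 0.05,
    y.sig = tab2[shared, "PValue"] <= 0.05,
    row.names = shared
)

# Label each feature by the comparison(s) in which it is significant.
both$status <- ifelse(both$x.sig & both$y.sig, "both",
    ifelse(both$x.sig, "x only",
        ifelse(both$y.sig, "y only", "neither")))
print(both)
```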
To demonstrate, we will perform a quick analysis of a small dataset from the scRNAseq package. This involves computing normalized expression values and low-dimensional results using the scater package.
library(scRNAseq)
sce <- ReprocessedAllenData(assays="tophat_counts")
library(scater)
sce <- logNormCounts(sce, exprs_values="tophat_counts")
sce <- runPCA(sce, ncomponents=4)
sce <- runTSNE(sce)
The DynamicReducedDimensionPlot
class creates a scatter plot with a dimensionality reduction result, namely principal components analysis (PCA), \(t\)-distributed stochastic neighbor embedding (\(t\)-SNE) or uniform manifold approximation and projection (UMAP).
It does so dynamically on the subset of points that are selected in a transmitting panel,
allowing users to focus on finer structure when dealing with a heterogeneous population.
Calculations are performed using relevant functions from the scater package.
# Receives a selection from a reduced dimension plot.
dyn.panel <- DynamicReducedDimensionPlot(Type="UMAP", Assay="logcounts",
ColumnSelectionSource="ReducedDimensionPlot1", PanelWidth=6L)
# NOTE: users do not have to manually create this, just
# copy it from the "Panel Settings" of an already open app.
red.panel <- ReducedDimensionPlot(PanelId=1L, PanelWidth=6L,
BrushData = list(
xmin = -45.943, xmax = -15.399, ymin = -58.560,
ymax = 49.701, coords_css = list(xmin = 51.009,
xmax = 165.009, ymin = 39.009,
ymax = 422.009), coords_img = list(xmin = 66.313,
xmax = 214.514, ymin = 50.712,
ymax = 548.612), img_css_ratio = list(x = 1.300,
y = 1.299), mapping = list(x = "X", y = "Y"),
domain = list(left = -49.101, right = 57.228,
bottom = -70.389, top = 53.519),
range = list(left = 50.986, right = 566.922,
bottom = 603.013, top = 33.155),
log = list(x = NULL, y = NULL), direction = "xy",
brushId = "ReducedDimensionPlot1_Brush",
outputId = "ReducedDimensionPlot1"
)
)
app <- iSEE(sce, initial=list(red.panel, dyn.panel))
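To make the idea of recomputing an embedding on a selected subset concrete, here is a minimal sketch that uses base R's prcomp() as a stand-in for the scater functions used by the panel. The simulated matrix and the choice of selected cells are assumptions for illustration.

```r
set.seed(42)

# Simulated log-expression matrix: 50 genes (rows) by 30 cells (columns).
logcounts <- matrix(rnorm(50 * 30), nrow = 50,
    dimnames = list(paste0("gene", 1:50), paste0("cell", 1:30)))

# Pretend that a brush in the transmitting panel selected these cells.
selected <- paste0("cell", 1:12)

# prcomp() expects observations in rows, so the subset is transposed.
pca.sub <- prcomp(t(logcounts[, selected]), rank. = 2)

# Coordinates of the selected cells only, recomputed from scratch.
coords <- pca.sub$x
print(head(coords))
```

Because the decomposition only sees the selected cells, the leading components describe variation within that subset rather than the differences that separate it from the rest of the population.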
The DynamicMarkerTable
class dynamically computes basic differential statistics comparing assay values across groups of multiple selections in a transmitting panel.
If only the active selection exists in the transmitting panel, a comparison is performed between the points in that selection and all unselected points.
If saved selections are present, pairwise comparisons between the active selection and each saved selection are performed and the results are combined into a single table using the findMarkers()
function from scran.
diff.panel <- DynamicMarkerTable(PanelWidth=8L, Assay="logcounts",
ColumnSelectionSource="ReducedDimensionPlot1")
# Recycling the reduced dimension panel above, adding a saved selection to
# compare to the active selection.
red.panel[["SelectionHistory"]] <- list(
BrushData = list(
xmin = 15.143, xmax = 57.228, ymin = -40.752,
ymax = 25.674, coords_css = list(xmin = 279.009,
xmax = 436.089, ymin = 124.009,
ymax = 359.009), coords_img = list(xmin = 362.716,
xmax = 566.922, ymin = 161.212,
ymax = 466.712), img_css_ratio = list(x = 1.300,
y = 1.299), mapping = list(x = "X", y = "Y"),
domain = list(left = -49.101, right = 57.228,
bottom = -70.389, top = 53.519),
range = list(left = 50.986, right = 566.922,
bottom = 603.013, top = 33.155),
log = list(x = NULL, y = NULL), direction = "xy",
brushId = "ReducedDimensionPlot1_Brush",
outputId = "ReducedDimensionPlot1"
)
)
red.panel[["PanelWidth"]] <- 4L # To fit onto one line.
app <- iSEE(sce, initial=list(red.panel, diff.panel))
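The simplest case described above, an active selection compared against all unselected points, can be sketched in base R. This uses a per-gene Welch t-test as a stand-in and is not what findMarkers() does internally; the simulated data, the planted marker gene1 and the column names of the output are assumptions for illustration.

```r
set.seed(100)

# Simulated log-expression matrix: 20 genes (rows) by 10 cells (columns).
logcounts <- matrix(rnorm(20 * 10), nrow = 20,
    dimnames = list(paste0("gene", 1:20), paste0("cell", 1:10)))

# Plant a known marker: gene1 is much higher in cells 1 to 4.
logcounts["gene1", ] <- c(10.1, 9.9, 10.2, 9.8, 0.1, -0.1, 0.2, -0.2, 0, 0.05)

# Cells in the (pretend) active selection; all others count as unselected.
active <- colnames(logcounts) %in% paste0("cell", 1:4)

# Per-gene difference in mean log-expression and Welch t-test p-value.
per.gene <- t(apply(logcounts, 1, function(v) {
    c(logFC = mean(v[active]) - mean(v[!active]),
      p.value = t.test(v[active], v[!active])$p.value)
}))

marker.tab <- as.data.frame(per.gene)
marker.tab$FDR <- p.adjust(marker.tab$p.value, method = "BH")

# Rank genes so that the strongest candidates appear at the top of the table.
marker.tab <- marker.tab[order(marker.tab$p.value), ]
print(head(marker.tab, 3))
```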
The FeatureSetTable class is unusual in that its rows do not correspond to any dimension of the SummarizedExperiment.
Rather, each row is a feature set (e.g., from GO or KEGG) that, upon click, transmits a multiple row selection to other panels.
The multiple selection consists of all rows in the chosen feature set,
allowing users to identify the positions of all genes in a pathway of interest on, say, a volcano plot.
This is also a rare example of a panel that only transmits and does not receive any selections from other panels.
setFeatureSetCommands(createGeneSetCommands(identifier="ENSEMBL"))
gset.tab <- FeatureSetTable(Selected="GO:0002576",
Search="platelet", PanelWidth=6L)
# This volcano plot will highlight the genes in the selected gene set.
vol.panel <- VolcanoPlot(RowSelectionSource="FeatureSetTable1",
ColorBy="Row selection", PanelWidth=6L)
app <- iSEE(airway, initial=list(gset.tab, vol.panel))
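Conceptually, clicking on a row of the table turns a feature set into a multiple row selection. A minimal base R sketch of that mapping is shown below; the gene names and the set identifiers are hypothetical and the code is not taken from the package.

```r
# Row names of a hypothetical dataset.
all.genes <- paste0("gene", 1:8)

# Hypothetical feature sets; gene9 is not present in the dataset.
feature.sets <- list(
    "SET:0001" = c("gene1", "gene3", "gene5"),
    "SET:0002" = c("gene2", "gene3", "gene9")
)

# The set chosen by clicking on a row of the table.
chosen <- "SET:0002"

# The transmitted selection contains the dataset rows that belong to the set.
in.set <- all.genes %in% feature.sets[[chosen]]
selected.rows <- all.genes[in.set]
print(selected.rows)
```

A receiving panel, such as the volcano plot above, can then use this selection to color or restrict its points.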
iSEEu contains a number of “modes” that allow users to conveniently load an iSEE instance in one of several common configurations:
modeEmpty() will launch an empty app, i.e., with no panels. This is occasionally useful to jump to the landing page where a user can then upload a SummarizedExperiment object.
modeGating() will launch an app with multiple feature assay panels that are linked to each other. This is useful for applying sequential restrictions on the data, equivalent to gating in a flow cytometry experiment.
modeReducedDim() will launch an app with multiple reduced dimension plots. This is useful for examining different views of large high-dimensional datasets (e.g., single-cell studies).
If you want to contribute to the development of the iSEEu package, here is a quick step-by-step guide:
Fork the iSEEu repository from GitHub (https://github.com/iSEE/iSEEu) and clone it locally.
git clone https://github.com/[your_github_username]/iSEEu.git
Add the desired new files - start from the R
folder, then document via roxygen2
- and push to your fork.
For examples of how things are supposed to work, see the modes already defined in the R/ directory.
A typical contribution could include, e.g., a function defining an iSEE mode, named modeXXX, where XXX
Please place each mode in a file of its own, with the same name as the function.
The function should be documented (including an example), and any required packages should be added to the DESCRIPTION
file.
Once your mode/function is done, consider adding some supporting material to the package.
Examples include a screenshot of the mode in action (to be placed in the folder inst/modes_img) and a well-documented example use case (maybe an entry in the vignettes folder). Also add yourself as a contributor (ctb) to the DESCRIPTION file.
Make a pull request to the original repo - the GitHub site offers a practical framework to do so, enabling comments, code reviews, and other goodies. The iSEE core team will evaluate the contribution and get back to you!
That’s pretty much it!
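As a rough sketch of what such a contribution might look like, the block below defines a mode with roxygen2 documentation. The function name modeMAVolcano and its plot_width argument are hypothetical and chosen only for illustration; the sketch assumes that iSEE and iSEEu are installed and that the input already carries DE statistics in its rowData, as prepared earlier in this document.

```r
#' App pre-configured with an MA plot and a volcano plot
#'
#' Hypothetical example of a mode: launch an app that shows an MA plot and a
#' volcano plot side by side.
#'
#' @param se A SummarizedExperiment with DE statistics in its rowData.
#' @param plot_width Integer scalar, the width of each panel.
#'
#' @return A Shiny app object.
#'
#' @examples
#' # 'airway' must already carry the edgeR statistics in its rowData.
#' \dontrun{
#' app <- modeMAVolcano(airway)
#' }
#'
#' @export
modeMAVolcano <- function(se, plot_width = 6L) {
    stopifnot(is.numeric(plot_width), length(plot_width) == 1L)
    plot_width <- as.integer(plot_width)

    # Same constructors and 'initial=' argument as used earlier in this document.
    initial <- list(
        iSEEu::MAPlot(PanelWidth = plot_width),
        iSEEu::VolcanoPlot(PanelWidth = plot_width)
    )
    iSEE::iSEE(se, initial = initial)
}
```

Following the guidelines above, this function would live in its own file named after the function (here, R/modeMAVolcano.R), and iSEE would need to be listed in the DESCRIPTION file if it is not already.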
Example data sets can often be obtained from an ExperimentHub package (e.g. from the scRNAseq package for single-cell RNA-sequencing data), and should not be added to the iSEEu package.
testthat framework
We do follow some guidelines regarding the names given to variables; please abide by these for consistency with the rest of the codebase. Here are a few pointers:
git diff operations easier to check.
camelCase for modes and other functions.
function_name for internals.
PanelClassName for panels.
genericFunction for the API.
scope1.scope2.name for variable names in the cached info.
If you want to understand the internals of the iSEE framework in more depth, consider checking out the bookdown resource we put together at https://isee.github.io/iSEE-book/
Many of the “global” variables that are used in several places in iSEE are defined in the constants.R script in iSEE. We suggest referring to those constants by their actual values rather than their internal variable names in downstream panel code. Both the constant variable names and their values may change at any time, but we will only announce changes to the constant values.
sessionInfo()
## R version 4.3.0 RC (2023-04-13 r84269)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 22.04.2 LTS
##
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.17-bioc/R/lib/libRblas.so
## LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_GB LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## time zone: America/New_York
## tzcode source: system (glibc)
##
## attached base packages:
## [1] stats4 stats graphics grDevices utils datasets methods
## [8] base
##
## other attached packages:
## [1] scater_1.28.0 ggplot2_3.4.2
## [3] scuttle_1.10.0 scRNAseq_2.13.0
## [5] edgeR_3.42.0 limma_3.56.0
## [7] airway_1.19.1 iSEEu_1.12.0
## [9] iSEEhex_1.2.0 iSEE_2.12.0
## [11] SingleCellExperiment_1.22.0 SummarizedExperiment_1.30.0
## [13] Biobase_2.60.0 GenomicRanges_1.52.0
## [15] GenomeInfoDb_1.36.0 IRanges_2.34.0
## [17] S4Vectors_0.38.0 BiocGenerics_0.46.0
## [19] MatrixGenerics_1.12.0 matrixStats_0.63.0
## [21] BiocStyle_2.28.0
##
## loaded via a namespace (and not attached):
## [1] splines_4.3.0 later_1.3.0
## [3] BiocIO_1.10.0 bitops_1.0-7
## [5] filelock_1.0.2 tibble_3.2.1
## [7] XML_3.99-0.14 lifecycle_1.0.3
## [9] doParallel_1.0.17 lattice_0.21-8
## [11] ensembldb_2.24.0 magrittr_2.0.3
## [13] sass_0.4.5 rmarkdown_2.21
## [15] jquerylib_0.1.4 yaml_2.3.7
## [17] httpuv_1.6.9 DBI_1.1.3
## [19] RColorBrewer_1.1-3 zlibbioc_1.46.0
## [21] Rtsne_0.16 purrr_1.0.1
## [23] AnnotationFilter_1.24.0 RCurl_1.98-1.12
## [25] rappdirs_0.3.3 circlize_0.4.15
## [27] GenomeInfoDbData_1.2.10 ggrepel_0.9.3
## [29] irlba_2.3.5.1 DelayedMatrixStats_1.22.0
## [31] codetools_0.2-19 DelayedArray_0.26.0
## [33] DT_0.27 xml2_1.3.3
## [35] tidyselect_1.2.0 shape_1.4.6
## [37] viridis_0.6.2 ScaledMatrix_1.8.0
## [39] shinyWidgets_0.7.6 BiocFileCache_2.8.0
## [41] GenomicAlignments_1.36.0 jsonlite_1.8.4
## [43] GetoptLong_1.0.5 BiocNeighbors_1.18.0
## [45] ellipsis_0.3.2 iterators_1.0.14
## [47] foreach_1.5.2 tools_4.3.0
## [49] progress_1.2.2 Rcpp_1.0.10
## [51] glue_1.6.2 gridExtra_2.3
## [53] xfun_0.39 mgcv_1.8-42
## [55] dplyr_1.1.2 shinydashboard_0.7.2
## [57] withr_2.5.0 BiocManager_1.30.20
## [59] fastmap_1.1.1 fansi_1.0.4
## [61] shinyjs_2.1.0 digest_0.6.31
## [63] rsvd_1.0.5 R6_2.5.1
## [65] mime_0.12 colorspace_2.1-0
## [67] biomaRt_2.56.0 RSQLite_2.3.1
## [69] utf8_1.2.3 generics_0.1.3
## [71] hexbin_1.28.3 rtracklayer_1.60.0
## [73] prettyunits_1.1.1 httr_1.4.5
## [75] htmlwidgets_1.6.2 pkgconfig_2.0.3
## [77] gtable_0.3.3 blob_1.2.4
## [79] ComplexHeatmap_2.16.0 XVector_0.40.0
## [81] htmltools_0.5.5 bookdown_0.33
## [83] ProtGenerics_1.32.0 rintrojs_0.3.2
## [85] clue_0.3-64 scales_1.2.1
## [87] png_0.1-8 knitr_1.42
## [89] rjson_0.2.21 nlme_3.1-162
## [91] curl_5.0.0 shinyAce_0.4.2
## [93] cachem_1.0.7 GlobalOptions_0.1.2
## [95] stringr_1.5.0 BiocVersion_3.17.1
## [97] parallel_4.3.0 miniUI_0.1.1.1
## [99] vipor_0.4.5 AnnotationDbi_1.62.0
## [101] restfulr_0.0.15 pillar_1.9.0
## [103] grid_4.3.0 vctrs_0.6.2
## [105] promises_1.2.0.1 BiocSingular_1.16.0
## [107] dbplyr_2.3.2 beachmat_2.16.0
## [109] xtable_1.8-4 cluster_2.1.4
## [111] beeswarm_0.4.0 evaluate_0.20
## [113] GenomicFeatures_1.52.0 cli_3.6.1
## [115] locfit_1.5-9.7 compiler_4.3.0
## [117] Rsamtools_2.16.0 rlang_1.1.0
## [119] crayon_1.5.2 ggbeeswarm_0.7.1
## [121] stringi_1.7.12 viridisLite_0.4.1
## [123] BiocParallel_1.34.0 munsell_0.5.0
## [125] Biostrings_2.68.0 lazyeval_0.2.2
## [127] colourpicker_1.2.0 Matrix_1.5-4
## [129] ExperimentHub_2.8.0 hms_1.1.3
## [131] sparseMatrixStats_1.12.0 bit64_4.0.5
## [133] KEGGREST_1.40.0 shiny_1.7.4
## [135] interactiveDisplayBase_1.38.0 highr_0.10
## [137] AnnotationHub_3.8.0 fontawesome_0.5.1
## [139] igraph_1.4.2 memoise_2.0.1
## [141] bslib_0.4.2 bit_4.0.5
Rue-Albrecht, Kevin, Federico Marini, Charlotte Soneson, and Aaron T. L. Lun. 2018. “iSEE: Interactive SummarizedExperiment Explorer.” F1000Research 7 (June): 741. https://doi.org/10.12688/f1000research.14966.1.