SPIAT 1.0.4
library(SPIAT)
Cell colocalisation metrics allow capturing a dominant spatial pattern in an image. However, patterns are unlikely to be distributed evenly in a tissue, but rather there will be spatial heterogeneity of patterns. To measure this, SPIAT splits the image into smaller images (either using a grid or concentric circles around a reference cell population), followed by calculation of a spatial metric of a pattern of interest (e.g. cell colocalisation, entropy), and then measures the Prevalence and Distinctiveness of the pattern.
In this vignette we will use an inForm data file that’s already been
formatted for SPIAT with format_image_to_spe()
, which we can load with
data()
. We will use define_celltypes()
to define the cells with certain
combinations of markers.
data("simulated_image")
# define cell types
formatted_image <- define_celltypes(
simulated_image,
categories = c("Tumour_marker","Immune_marker1,Immune_marker2",
"Immune_marker1,Immune_marker3",
"Immune_marker1,Immune_marker2,Immune_marker4", "OTHER"),
category_colname = "Phenotype",
names = c("Tumour", "Immune1", "Immune2", "Immune3", "Others"),
new_colname = "Cell.Type")
Entropy in spatial analysis refers to the balance in the number of cells of distinct populations. An entropy score can be obtained for an entire image. However, the entropy of one image does not provide us spatial information of the image.
calculate_entropy(formatted_image, cell_types_of_interest = c("Immune1","Immune2"),
feature_colname = "Cell.Type")
## [1] 0.9294873
We therefore propose the concept of Localised Entropy which calculates an entropy score for a predefined local region. These local regions can be calculated as defined in the next two sections.
One approach to calculate localised metric is to split the image into
fishnet grid squares. For each grid square, grid_metrics()
calculates
the metric for that square and visualise the raster image. Users can
choose any metric as the localised metric. Here we use entropy as an
example.
For cases where the localised metric is not symmetrical (requires
specifying a target and reference cell type), the first item in the
vector used for cell_types_of_interest
marks the reference cells and
the second item the target cells. Here we are using Entropy, which is
symmetrical, so we can use any order of cell types in the input.
data("defined_image")
grid <- grid_metrics(defined_image, FUN = calculate_entropy, n_split = 20,
cell_types_of_interest=c("Tumour","Immune3"),
feature_colname = "Cell.Type")
After calculating the localised entropy for each of the grid squares, we can apply metrics like percentages of grid squares with patterns (Prevalence) and Moran’s I (Distinctiveness).
For the Prevalence, we need to select a threshold over which grid squares are considered ‘positive’ for the pattern. The selection of threshold depends on the pattern and metric the user chooses to find the localised pattern. Here we chose 0.75 for entropy because 0.75 is roughly the entropy of two cell types when their ratio is 1:5 or 5:1.
calculate_percentage_of_grids(grid, threshold = 0.75, above = TRUE)
## [1] 13
calculate_spatial_autocorrelation(grid, metric = "globalmoran")
## [1] 0.09446964
We can use the compute_gradient()
function to calculate metrics (entropy, mixing
score, percentage of cells within radius, marker intensity) for a range
of radii from reference cells. Here, an increasing circle is drawn
around each cell of the reference cell type and the desired score is
calculated for cells within each circle.
The first item in the vector used for cell_types_of_interest
marks the
reference cells and the second item the target cells. Here, Immune1
cells are reference cells and Immune2 are target cells.
gradient_positions <- c(30, 50, 100)
gradient_entropy <-
compute_gradient(defined_image, radii = gradient_positions,
FUN = calculate_entropy, cell_types_of_interest = c("Immune1","Immune2"),
feature_colname = "Cell.Type")
length(gradient_entropy)
## [1] 3
head(gradient_entropy[[1]])
## Cell.X.Position Cell.Y.Position Immune1 Immune2 total Immune1_log2
## Cell_15 109.67027 17.12956 1 0 1 0
## Cell_25 153.22795 128.29915 1 0 1 0
## Cell_30 57.29037 49.88533 1 0 1 0
## Cell_48 83.47798 295.75058 2 0 2 1
## Cell_56 35.24227 242.84862 1 0 1 0
## Cell_61 156.39943 349.08154 2 0 2 1
## Immune2_log2 total_log2 Immune1ratio Immune1_entropy Immune2ratio
## Cell_15 0 0 1 0 0
## Cell_25 0 0 1 0 0
## Cell_30 0 0 1 0 0
## Cell_48 0 1 1 0 0
## Cell_56 0 0 1 0 0
## Cell_61 0 1 1 0 0
## Immune2_entropy entropy
## Cell_15 0 0
## Cell_25 0 0
## Cell_30 0 0
## Cell_48 0 0
## Cell_56 0 0
## Cell_61 0 0
The compute_gradient()
function outputs the numbers cells within each
radii for each reference cell. The output is formatted as a list of
data.frames, one for each specified radii. In each data.frame, the rows
show the reference cells. The last column of the data.frame is the
entropy calculated for cells in the circle of the reference cell. Users
can then an average score or another aggregation metric to report the
results.
An alternative approach is to combine the results of all the circles
(rather than have one for each individual reference cell). Here, for
each radii, we simultaneously identify all the cells in the circles
surrounding each reference cell, and calculate a single entropy score.
We have created a specific function for this -
entropy_gradient_aggregated()
. The output of this function is an overall
entropy score for each radii.
gradient_pos <- seq(50, 500, 50) ##radii
gradient_results <- entropy_gradient_aggregated(defined_image, cell_types_of_interest = c("Immune3","Tumour"),
feature_colname = "Cell.Type", radii = gradient_pos)
# plot the results
plot(1:10,gradient_results$gradient_df[1, 3:12])
sessionInfo()
## R version 4.2.2 (2022-10-31)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 20.04.5 LTS
##
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.16-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.16-bioc/R/lib/libRlapack.so
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_GB LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## attached base packages:
## [1] stats4 stats graphics grDevices utils datasets methods
## [8] base
##
## other attached packages:
## [1] SPIAT_1.0.4 SpatialExperiment_1.8.0
## [3] SingleCellExperiment_1.20.0 SummarizedExperiment_1.28.0
## [5] Biobase_2.58.0 GenomicRanges_1.50.2
## [7] GenomeInfoDb_1.34.4 IRanges_2.32.0
## [9] S4Vectors_0.36.1 BiocGenerics_0.44.0
## [11] MatrixGenerics_1.10.0 matrixStats_0.63.0
## [13] BiocStyle_2.26.0
##
## loaded via a namespace (and not attached):
## [1] circlize_0.4.15 plyr_1.8.8
## [3] lazyeval_0.2.2 sp_1.5-1
## [5] crosstalk_1.2.0 BiocParallel_1.32.4
## [7] elsa_1.1-28 ggplot2_3.4.0
## [9] digest_0.6.31 foreach_1.5.2
## [11] htmltools_0.5.4 magick_2.7.3
## [13] fansi_1.0.3 magrittr_2.0.3
## [15] tensor_1.5 cluster_2.1.4
## [17] doParallel_1.0.17 tzdb_0.3.0
## [19] limma_3.54.0 ComplexHeatmap_2.14.0
## [21] R.utils_2.12.2 vroom_1.6.0
## [23] spatstat.sparse_3.0-0 colorspace_2.0-3
## [25] ggrepel_0.9.2 xfun_0.35
## [27] dplyr_1.0.10 crayon_1.5.2
## [29] RCurl_1.98-1.9 jsonlite_1.8.4
## [31] spatstat.data_3.0-0 iterators_1.0.14
## [33] glue_1.6.2 polyclip_1.10-4
## [35] gtable_0.3.1 zlibbioc_1.44.0
## [37] XVector_0.38.0 GetoptLong_1.0.5
## [39] DelayedArray_0.24.0 DropletUtils_1.18.1
## [41] Rhdf5lib_1.20.0 shape_1.4.6
## [43] HDF5Array_1.26.0 apcluster_1.4.10
## [45] abind_1.4-5 scales_1.2.1
## [47] pheatmap_1.0.12 DBI_1.1.3
## [49] edgeR_3.40.1 spatstat.random_3.0-1
## [51] Rcpp_1.0.9 viridisLite_0.4.1
## [53] clue_0.3-63 archive_1.1.5
## [55] dqrng_0.3.0 bit_4.0.5
## [57] httr_1.4.4 htmlwidgets_1.6.0
## [59] dittoSeq_1.10.0 RColorBrewer_1.1-3
## [61] pkgconfig_2.0.3 R.methodsS3_1.8.2
## [63] farver_2.1.1 scuttle_1.8.3
## [65] sass_0.4.4 deldir_1.0-6
## [67] locfit_1.5-9.6 utf8_1.2.2
## [69] tidyselect_1.2.0 labeling_0.4.2
## [71] rlang_1.0.6 reshape2_1.4.4
## [73] munsell_0.5.0 tools_4.2.2
## [75] cachem_1.0.6 cli_3.5.0
## [77] dbscan_1.1-11 generics_0.1.3
## [79] mmand_1.6.2 ggridges_0.5.4
## [81] evaluate_0.19 stringr_1.5.0
## [83] fastmap_1.1.0 yaml_2.3.6
## [85] goftest_1.2-3 knitr_1.41
## [87] bit64_4.0.5 purrr_1.0.0
## [89] RANN_2.6.1 nlme_3.1-161
## [91] sparseMatrixStats_1.10.0 R.oo_1.25.0
## [93] pracma_2.4.2 compiler_4.2.2
## [95] plotly_4.10.1 png_0.1-8
## [97] spatstat.utils_3.0-1 tibble_3.1.8
## [99] bslib_0.4.2 stringi_1.7.8
## [101] highr_0.9 lattice_0.20-45
## [103] Matrix_1.5-3 vctrs_0.5.1
## [105] pillar_1.8.1 lifecycle_1.0.3
## [107] rhdf5filters_1.10.0 BiocManager_1.30.19
## [109] spatstat.geom_3.0-3 jquerylib_0.1.4
## [111] GlobalOptions_0.1.2 data.table_1.14.6
## [113] cowplot_1.1.1 bitops_1.0-7
## [115] raster_3.6-11 R6_2.5.1
## [117] bookdown_0.31 gridExtra_2.3
## [119] codetools_0.2-18 gtools_3.9.4
## [121] assertthat_0.2.1 rhdf5_2.42.0
## [123] rjson_0.2.21 withr_2.5.0
## [125] GenomeInfoDbData_1.2.9 parallel_4.2.2
## [127] terra_1.6-47 grid_4.2.2
## [129] beachmat_2.14.0 tidyr_1.2.1
## [131] rmarkdown_2.19 DelayedMatrixStats_1.20.0
## [133] Cairo_1.6-0 Rtsne_0.16
## [135] spatstat.explore_3.0-5