This guide provides an overview of the CCPlotR package, a small R package for visualising results from tools that predict cell-cell interactions from scRNA-seq data.
Predicting cell-cell interactions from scRNA-seq data is now a common component of a single-cell analysis workflow (Armingol et al. (2021), Almet et al. (2021)). There have been many tools published in recent years specifically for this purpose (Dimitrov et al. (2022), Efremova et al. (2020), Hou et al. (2020), Jin et al. (2021)). These tools typically return a table of predicted interactions that depicts the sending and receiving cell type, the ligand and receptor genes and some sort of score for ranking the interactions. These tables can be quite large and depending on the amount of cell types included in the analysis, this data can get pretty complex and challenging to visualise. Many tools for predicting cell-cell interactions have built-in visualisation methods but some don’t and it can be time consuming and impractical to install some of these methods just for the purpose of visualising the results if you’re using a different tool to predict the interactions. For these reasons, we have developed a lightweight R package that allows the user to easily generate visualisations of cell-cell interactions with minimal code, regardless of which tool was used for predicting the interactions.
if (!require("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("CCPlotR")
## or for development version:
devtools::install_github("Sarah145/CCPlotR")
CCPlotR makes generic plots that can be used to visualise results from multiple tools such as Liana, CellPhoneDB, NATMI etc. All it requires as input is a dataframe with columns source
, target
, ligand
, receptor
and score
. It should look something like this:
source | target | ligand | receptor | score |
---|---|---|---|---|
B | CD8 T | HLA-DQA1 | LAG3 | 7.22 |
B | CD8 T | HLA-DRA | LAG3 | 5.59 |
CD8 T | NK | B2M | KIR2DL3 | 5.52 |
B | CD8 T | HLA-DQA2 | LAG3 | 5.41 |
NK | B | LGALS1 | CD69 | 4.15 |
B | CD8 T | ICAM3 | ITGAL | 2.34 |
For some of the plots, there is an option to also show the expression of the ligands and receptors in each cell type. For those plots, a second dataframe is required, which holds the mean expression values for each gene in each cell type and should look something like this:
cell_type | gene | mean_exp |
---|---|---|
B | ACTR2 | 0.363 |
B | ADA | 0.0170 |
B | ADAM10 | 0.0833 |
B | ADAM28 | 0.487 |
B | ADCY7 | 0.0336 |
B | ADRB2 | 0.0178 |
The package comes with toy datasets (toy_data
, toy_exp
) which you can see for examples of input data.
library(CCPlotR)
data(toy_data, toy_exp, package = 'CCPlotR')
head(toy_data)
#> # A tibble: 6 × 5
#> source target ligand receptor score
#> <chr> <chr> <chr> <chr> <dbl>
#> 1 B CD8 T HLA-DQA1 LAG3 7.22
#> 2 B CD8 T HLA-DRA LAG3 5.59
#> 3 B CD8 T HLA-DQB1 LAG3 5.59
#> 4 B CD8 T HLA-DQA2 LAG3 5.41
#> 5 B CD8 T HLA-DRB5 LAG3 5.26
#> 6 B CD8 T HLA-DRB1 LAG3 5.12
head(toy_exp)
#> # A tibble: 6 × 3
#> # Groups: cell_type [1]
#> cell_type gene mean_exp
#> <chr> <chr> <dbl>
#> 1 B ACTR2 0.363
#> 2 B ADA 0.0170
#> 3 B ADAM10 0.0833
#> 4 B ADAM28 0.487
#> 5 B ADCY7 0.0336
#> 6 B ADRB2 0.0178
The package contains functions for making six types of plots: cc_heatmap
, cc_dotplot
, cc_network
, cc_circos
, cc_arrow
and cc_sigmoid
. Below are some examples of each plot type.
The cc_heatmap
function can generate heatmaps in four different styles. Option A just displays the total number of interactions between each pair of cell types and option B shows the ligands, receptors and cell types involved in each interaction as well as their score. For option B, only a small portion of top interactions are shown to avoid cluttering the plot. There is also an option to generate heatmaps in the style of popular cell-cell interaction prediction tools such as CellPhoneDB and Liana.
cc_heatmap(toy_data)
cc_heatmap(toy_data, option = "B", n_top_ints = 10)
cc_heatmap(toy_data, option = "CellPhoneDB")
The cc_dotplot
function can generate dotplots in four different styles. Option A just displays the total number of interactions between each pair of cell types and option B shows the ligands, receptors and cell types involved in each interaction as well as their score. For option B, only a small portion of top interactions are shown to avoid cluttering the plot. There is also an option to generate dotplots in the style of popular cell-cell interaction prediction tools such as CellPhoneDB and Liana.
cc_dotplot(toy_data)
cc_dotplot(toy_data, option = "B", n_top_ints = 10)
cc_dotplot(toy_data, option = "Liana", n_top_ints = 15)
The cc_network
function can generate two different types of network plots. In option A, the nodes are cell types and the weight of the edges corresponds to the total number of interactions between a given pair of cell types. In option B, the nodes are ligand and receptor genes, coloured by which cell type is expressing them. For option B, only a small portion of top interactions are shown to avoid cluttering the plot.
cc_network(toy_data)
cc_network(toy_data, colours = c("orange", "cornflowerblue", "hotpink"), option = "B")
The cc_circos
function can generate three different types of circos plots. Option A generates a circos plot where the width of the links represents the total number of interactions between each pair of cell types. Option B generates a circos plot showing the ligands, receptors and cell types involved in the top portion of interactions. Option C expands on option B by also showing the mean expression of the ligand and receptor genes in each cell type. In options B and C, the weight of the links represents the score of the interaction.
cc_circos(toy_data)
cc_circos(toy_data, option = "B", n_top_ints = 10, cex = 0.75)
#> Warning: 'gap.degree' can only be modified before `circos.initialize`, or maybe
#> you forgot to call `circos.clear` in your last plot.
cc_circos(toy_data, option = "C", n_top_ints = 15, exp_df = toy_exp, cell_cols = c(`B` = "hotpink", `NK` = "orange", `CD8 T` = "cornflowerblue"), palette = "PuRd", cex = 0.75)
The cc_arrow
function generates plots showing the interactions between a given pair of cell types. Option A just shows which ligands/receptors are interacting between a pair of cell types and option B also shows the expression of the ligand/receptor genes in each cell type. In both options, the weight of the arrow represents the score of the interaction.
cc_arrow(toy_data, cell_types = c("B", "CD8 T"), colours = c(`B` = "hotpink", `CD8 T` = "orange"))
cc_arrow(toy_data, cell_types = c("NK", "CD8 T"), option = "B", exp_df = toy_exp, n_top_ints = 10, palette = "OrRd")
The cc_sigmoid
function plots a portion of interactions using the geom_sigmoid
function from the ggbump
R package to connect ligands in sender cells to receptors in receiver cells.
cc_sigmoid(toy_data)
By default, CCPlotR uses a colour palette that contains 15 colourblind-friendly colours.
scales::show_col(paletteMartin())
Because all plotting functions (with the exception of cc_circos
) are built on top of ggplot2, it’s easy to customise and modify the plots according to the users preferences. Colours, fonts and themes can easily be updated using the usual ggplot2 syntax.
library(ggplot2)
library(patchwork)
p1 <- cc_network(toy_data, option = "B") + labs(title = "Default")
p2 <- cc_network(toy_data, option = "B") + scale_fill_manual(values = rainbow(3)) + labs(title = "Update colours")
#> Scale for fill is already present.
#> Adding another scale for fill, which will replace the existing scale.
p3 <- cc_network(toy_data, option = "B") + theme_grey() + labs(title = "Update theme")
p4 <- cc_network(toy_data, option = "B") + theme(legend.position = "top") + labs(title = "Update legend position")
p1 + p2 + p3 + p4
The cc_circos
function uses the circlize package to generate to the plots so more information about them can be found here.
sessionInfo()
#> R version 4.3.1 (2023-06-16)
#> Platform: x86_64-pc-linux-gnu (64-bit)
#> Running under: Ubuntu 22.04.3 LTS
#>
#> Matrix products: default
#> BLAS: /home/biocbuild/bbs-3.18-bioc/R/lib/libRblas.so
#> LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
#>
#> locale:
#> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
#> [3] LC_TIME=en_GB LC_COLLATE=C
#> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
#> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
#> [9] LC_ADDRESS=C LC_TELEPHONE=C
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
#>
#> time zone: America/New_York
#> tzcode source: system (glibc)
#>
#> attached base packages:
#> [1] stats graphics grDevices utils datasets methods base
#>
#> other attached packages:
#> [1] patchwork_1.1.3 ggplot2_3.4.4 CCPlotR_1.0.0 BiocStyle_2.30.0
#>
#> loaded via a namespace (and not attached):
#> [1] tidyselect_1.2.0 viridisLite_0.4.2 dplyr_1.1.3
#> [4] farver_2.1.1 viridis_0.6.4 ggraph_2.1.0
#> [7] fastmap_1.1.1 tweenr_2.0.2 digest_0.6.33
#> [10] lifecycle_1.0.3 cluster_2.1.4 magrittr_2.0.3
#> [13] compiler_4.3.1 rlang_1.1.1 sass_0.4.7
#> [16] tools_4.3.1 igraph_1.5.1 utf8_1.2.4
#> [19] yaml_2.3.7 knitr_1.44 labeling_0.4.3
#> [22] graphlayouts_1.0.1 plyr_1.8.9 xml2_1.3.5
#> [25] RColorBrewer_1.1-3 withr_2.5.1 purrr_1.0.2
#> [28] BiocGenerics_0.48.0 grid_4.3.1 polyclip_1.10-6
#> [31] ggh4x_0.2.6 stats4_4.3.1 fansi_1.0.5
#> [34] colorspace_2.1-0 scales_1.2.1 iterators_1.0.14
#> [37] MASS_7.3-60 cli_3.6.1 rmarkdown_2.25
#> [40] crayon_1.5.2 generics_0.1.3 rjson_0.2.21
#> [43] commonmark_1.9.0 cachem_1.0.8 ggforce_0.4.1
#> [46] stringr_1.5.0 parallel_4.3.1 BiocManager_1.30.22
#> [49] matrixStats_1.0.0 vctrs_0.6.4 jsonlite_1.8.7
#> [52] bookdown_0.36 IRanges_2.36.0 GetoptLong_1.0.5
#> [55] S4Vectors_0.40.0 ggrepel_0.9.4 clue_0.3-65
#> [58] magick_2.8.1 foreach_1.5.2 tidyr_1.3.0
#> [61] jquerylib_0.1.4 glue_1.6.2 codetools_0.2-19
#> [64] ggtext_0.1.2 stringi_1.7.12 shape_1.4.6
#> [67] gtable_0.3.4 ComplexHeatmap_2.18.0 munsell_0.5.0
#> [70] tibble_3.2.1 pillar_1.9.0 htmltools_0.5.6.1
#> [73] circlize_0.4.15 R6_2.5.1 doParallel_1.0.17
#> [76] tidygraph_1.2.3 evaluate_0.22 markdown_1.11
#> [79] png_0.1-8 gridtext_0.1.5 ggbump_0.1.0
#> [82] ggfun_0.1.3 bslib_0.5.1 Rcpp_1.0.11
#> [85] gridExtra_2.3 scatterpie_0.2.1 xfun_0.40
#> [88] forcats_1.0.0 pkgconfig_2.0.3 GlobalOptions_0.1.2
Almet, Axel A., Zixuan Cang, Suoqin Jin, and Qing Nie. 2021. “The Landscape of Cell-Cell Communication Through Single-Cell Transcriptomics.” Current Opinion in Systems Biology 26 (June): 12–23. https://doi.org/10.1016/j.coisb.2021.03.007.
Armingol, Erick, Adam Officer, Olivier Harismendy, and Nathan E. Lewis. 2021. “Deciphering Cell–Cell Interactions and Communication from Gene Expression.” Nature Reviews Genetics 22 (2): 71–88. https://doi.org/10.1038/s41576-020-00292-x.
Dimitrov, Daniel, Dénes Türei, Martin Garrido-Rodriguez, Paul L. Burmedi, James S. Nagai, Charlotte Boys, Ricardo O. Ramirez Flores, et al. 2022. “Comparison of Methods and Resources for Cell-Cell Communication Inference from Single-Cell RNA-Seq Data.” Nature Communications 13 (1): 3224. https://doi.org/10.1038/s41467-022-30755-0.
Efremova, Mirjana, Miquel Vento-Tormo, Sarah A. Teichmann, and Roser Vento-Tormo. 2020. “CellPhoneDB: Inferring Cell–Cell Communication from Combined Expression of Multi-Subunit Ligand–Receptor Complexes.” Nature Protocols 15 (4): 1484–1506. https://doi.org/10.1038/s41596-020-0292-x.
Hou, Rui, Elena Denisenko, Huan Ting Ong, Jordan A. Ramilowski, and Alistair R. R. Forrest. 2020. “Predicting Cell-to-Cell Communication Networks Using NATMI.” Nature Communications 11 (1): 5011. https://doi.org/10.1038/s41467-020-18873-z.
Jin, Suoqin, Christian F. Guerrero-Juarez, Lihua Zhang, Ivan Chang, Raul Ramos, Chen-Hsiang Kuan, Peggy Myung, Maksim V. Plikus, and Qing Nie. 2021. “Inference and Analysis of Cell-Cell Communication Using CellChat.” Nature Communications 12 (1): 1088. https://doi.org/10.1038/s41467-021-21246-9.