Disease Ontology (DO)1 aims to provide an open source ontology for the integration of biomedical data that is associated with human disease. We developed DOSE2 package to promote the investigation of diseases. DOSE provides five methods including Resnik, Lin, Jiang, Rel and Wang for measuring semantic similarities among DO terms and gene products; Hypergeometric model and Gene Set Enrichment Analysis (GSEA) were also implemented for extracting disease association insight from genome wide expression profiles.
If you use DOSE in published research, please cite G. Yu (2015).
G Yu, LG Wang, GR Yan, QY He. DOSE: an R/Bioconductor package for Disease Ontology Semantic and Enrichment analysis. Bioinformatics 2015, 31(4):608-609. http://dx.doi.org/10.1093/bioinformatics/btu684.
DOSE provides five methods for measureing semantic similarity among DO terms and genes. It implemented over-representation analysis to associate disease with gene list (e.g. differential expressed genes) and gene set enrichment analysis to associate disease with genome wide expression profiles. The enrichment analyses support Disease Ontology (DO)1, Network of Cancer Gene (NCG)3 and DisGeNET4. In addition, several visualization methods were developed to help interpreting semantic and enrichment results.
If you have questions/issues, please visit DOSE homepage first. Your problems are mostly documented. If you think you found a bug, please follow the guide and provide a reproducible example to be posted on github issue tracker. For questions, please post to Bioconductor support site and tag your post with DOSE.
For Chinese user, you can follow me on WeChat (微信).
Here is the output of sessionInfo()
on the system on which this document was compiled:
## R version 3.5.0 (2018-04-23)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 16.04.4 LTS
##
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.7-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.7-bioc/R/lib/libRlapack.so
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] DOSE_3.6.1 BiocStyle_2.8.2
##
## loaded via a namespace (and not attached):
## [1] Rcpp_0.12.17 pillar_1.2.3 plyr_1.8.4
## [4] compiler_3.5.0 tools_3.5.0 digest_0.6.15
## [7] bit_1.1-14 lattice_0.20-35 tibble_1.4.2
## [10] RSQLite_2.1.1 evaluate_0.10.1 memoise_1.1.0
## [13] gtable_0.2.0 rlang_0.2.1 pkgconfig_2.0.1
## [16] Matrix_1.2-14 fastmatch_1.1-0 DBI_1.0.0
## [19] yaml_2.1.19 parallel_3.5.0 xfun_0.2
## [22] gridExtra_2.3 fgsea_1.6.0 stringr_1.3.1
## [25] knitr_1.20 S4Vectors_0.18.3 IRanges_2.14.10
## [28] stats4_3.5.0 rprojroot_1.3-2 bit64_0.9-7
## [31] grid_3.5.0 qvalue_2.12.0 Biobase_2.40.0
## [34] data.table_1.11.4 AnnotationDbi_1.42.1 BiocParallel_1.14.1
## [37] GOSemSim_2.6.0 rmarkdown_1.10 bookdown_0.7
## [40] reshape2_1.4.3 GO.db_3.6.0 DO.db_2.9
## [43] ggplot2_2.2.1 blob_1.1.1 magrittr_1.5
## [46] splines_3.5.0 scales_0.5.0 backports_1.1.2
## [49] htmltools_0.3.6 BiocGenerics_0.26.0 colorspace_1.3-2
## [52] stringi_1.2.3 lazyeval_0.2.1 munsell_0.5.0
1. Schriml, L. M. et al. Disease ontology: A backbone for disease semantic integration. Nucleic Acids Research 40, D940–D946 (2011).
2. Yu, G., Wang, L.-G., Yan, G.-R. & He, Q.-Y. DOSE: An r/bioconductor package for disease ontology semantic and enrichment analysis. Bioinformatics 31, 608–609 (2015).
3. A., O., D., G. M., M., T. P. & C., F. D. NCG 5.0: Updates of a manually curated repository of cancer genes and associated properties from cancer mutational screenings. Nucleic Acids Research 44, D992–D999 (2016).
4. Janet, P. et al. DisGeNET: A discovery platform for the dynamical exploration of human diseases and their genes. Database 2015, bav028 (2015).