1 Introduction

Illumina Infinium HumanMethylation 450K BeadChip assay has become a standard tool to analyse methylation in human samples. Developed in 2011, it has already been used in projects such as The Cancer Genome Atlas (TCGA). Their 450.000 probes provide a good overall image of the methylation state of the genome, being one of the reasons of its success.

Given its complex design11 More information can be found at this minfi tutorial, many Bioconductor packages have been developed to assess normalization and pre-processing issues (e.g. minfi (Aryee et al. 2014) or lumi (Du, Kibbe, and Lin 2008)). In addition, these packages can detect differentially methylated probes (DMPs) and differentially methylated regions (DMRs). However, the interfaces are not very intuitive and several scripting steps are usually required.

MEAL aims to facilitate the analysis of Illumina Methylation 450K chips. We have included two methods to analyze DMPs (Differentially Methylated Probes), that test differences in means (limma) or differences in variance (DiffVar). We have included three DMRs (Differentially Methylated Regions) detection algorithms (bumphunter, blockFinder and DMRcate) and a new method to test differences in methylation in a target region (RDA). Finally, we have prepared plots for all these analyses as well as a wrapper to run all the analyses in the same dataset.

2 Input data

MEAL is meant to analyze methylation data already preprocessed. All our functions accept a GenomicRatioSet as input, which is a class from minfi package designed to manage preprocessed methylation data. Users willing to preprocess their own data are encouraged to take a look to minfi’s vignette

In this vignette, we will use methylation data from minfiData package.

library(MEAL)
library(MultiDataSet)
library(minfiData)
library(minfi)
library(ggplot2)

data("MsetEx")

MsetEx is a MethylationRatioSet that contains measurements for 485512 CpGs and 6 samples, as well as some phenotypic variables such as age or sex. The first step will be to convert it to a GenomicRatioSet. Then, we will add some extra features annotation. Finally, we will remove probes not measuring methylation, with SNPs or with NAs:

meth <- mapToGenome(ratioConvert(MsetEx))
rowData(meth) <- getAnnotation(meth)[, -c(1:3)]

## Remove probes measuring SNPs
meth <- dropMethylationLoci(meth)

## Remove probes with SNPs
meth <- dropLociWithSnps(meth)

## Remove probes with NAs
meth <- meth[!apply(getBeta(meth), 1, function(x) any(is.na(x))), ]

3 Analyzing Methylation data

3.1 Pipeline

The function runPipeline run all methods included in MEAL to the same dataset. We only need to pass to this function a GenomicRatioSet and the name of our variable of interest. In our case, we will analyze the effect of cancer on methylation:

res <- runPipeline(set = meth, variable_names = "status")

runPipeline includes several parameters to customize the analyses. The most important parameters are covariable_names, betas and sva. covariable_names is used to include covariates in our models. betas allows the user choosing between running the analyis with beta (TRUE) or M-values (FALSE). If sva is TRUE, Surrogate Variable Analysis is run and surrogate variables are included in the models. Finally, some parameters modify the behaviour of the methods included in the wrapper and they will be covered later on. More information about the parameters can be found in the documentation (by typing ?runPipeline).

We will run a new analysis including age as covariate:

resAdj <- runPipeline(set = meth, variable_names = "status", 
                      covariable_names = "age")
resAdj

## Object of class 'ResultSet'
##  . created with: runPipeline 
##  . sva:  no 
##  . #results: 5 ( error: 0 )
##  . featureData: 464876 probes x 35 variables

3.2 Managing the results

runPipeline generates a ResultSet object. ResultSet is a class designed to encapsulate different results from the same dataset. It contains the results of the different methods, the feature data and other data required to get tables or plots. We can examine the analyses included in a ResultSet with the function names:

names(resAdj)

## [1] "DiffMean"    "DiffVar"     "bumphunter"  "blockFinder" "dmrcate"

Both objects contains five analyses. DiffMean is an analysis of difference of means performed with limma while the others are named with the method name (DiffVar, bumphunter, blockFinder and dmrcate).

We can use the function getAssociation to get a data.frame with the results, independent of the original method. This function has two main arguments: object and rid. object is the ResultSet with our data and rid is the name or the index of the analysis we want to extract.

head(getAssociation(resAdj, "DiffMean"))

##                 logFC       CI.L       CI.R   AveExpr         t      P.Value
## cg09383816 -0.5938196 -0.6310585 -0.5565807 0.4885486 -42.94192 7.147556e-07
## cg27651090  0.5433331  0.5073130  0.5793532 0.5411453  40.62051 9.092184e-07
## cg21938148 -0.6659934 -0.7114994 -0.6204875 0.4712352 -39.41178 1.036252e-06
## cg25104555 -0.5254995 -0.5614327 -0.4895664 0.3906114 -39.38227 1.039618e-06
## cg25937714 -0.5906359 -0.6311714 -0.5501004 0.4350042 -39.23815 1.056248e-06
## cg15732851 -0.5760397 -0.6165580 -0.5355214 0.3869865 -38.28468 1.174927e-06
##             adj.P.Val        B         SE
## cg09383816 0.01808345 5.578902 0.03360340
## cg27651090 0.01808345 5.494733 0.01992266
## cg21938148 0.01808345 5.446147 0.03810597
## cg25104555 0.01808345 5.444916 0.02470939
## cg25937714 0.01808345 5.438874 0.01417408
## cg15732851 0.01808345 5.397542 0.02840324

head(getAssociation(resAdj, "DiffVar"))

##                logFC      CI.L      CI.R  AveExpr         t      P.Value
## cg02939019 -2.166225 -2.927983 -1.404467 1.203729 -6.824129 0.0003375482
## cg11847929 -2.327910 -3.149902 -1.505918 1.465679 -6.796091 0.0003457268
## cg22976979 -1.910704 -2.585431 -1.235977 1.173032 -6.795573 0.0003458801
## cg25385529 -2.223012 -3.010252 -1.435771 1.347720 -6.776338 0.0003516238
## cg22676401 -1.914225 -2.596416 -1.232034 1.233126 -6.733607 0.0003647752
## cg27402591  2.710765  1.744380  3.677151 1.706821  6.731349 0.0003654855
##            adj.P.Val         B         SE
## cg02939019 0.1789425 0.2046014 0.16205491
## cg11847929 0.1789425 0.1896636 0.09848762
## cg22976979 0.1789425 0.1893866 0.15402291
## cg25385529 0.1789425 0.1790827 0.11296652
## cg22676401 0.1789425 0.1560305 0.11488471
## cg27402591 0.1789425 0.1548062 0.10400866

head(getAssociation(resAdj, "bumphunter"))

##         chr     start       end      value     area cluster indexStart
## 86177  chr6 133561649 133562776 -0.4137316 15.30807  161402     173009
## 72799 chr10 118030848 118034357 -0.4244696 12.73409   27119     255180
## 85444  chr6  29520698  29521803 -0.3009453 11.73687  155567     151996
## 75309 chr13  78492568  78493590 -0.4345396 11.73257   53247     318486
## 81792 chr20  61050560  61051915 -0.4558366 11.39591  116792     439865
## 80368  chr2  63279693  63285365 -0.3692168 10.33807  102613      54191
##       indexEnd  L clusterL
## 86177   173045 37       41
## 72799   255209 30       30
## 85444   152034 39       40
## 75309   318512 27       49
## 81792   439889 25       26
## 80368    54218 28       41

head(getAssociation(resAdj, "blockFinder"))

##        chr     start       end     value     area cluster indexStart
## 423   chr2 217468708 219051445 0.1725776 28.47531     107      34602
## 2194 chr15  25145254  26095690 0.1317540 21.34416     699     158363
## 1237  chr7  40098119  42760642 0.1744860 21.11281     378      87342
## 119   chr1 152053201 153177304 0.1724642 19.14353      35      13139
## 505   chr3  54169983  56448270 0.1642120 18.06332     125      41752
## 1860 chr11 131278641 132728088 0.1503282 16.83676     588     134392
##      indexEnd   L clusterL
## 423     34801 165      500
## 2194   158548 162      623
## 1237    87490 121     1575
## 119     13278 111     1545
## 505     41884 110     1630
## 1860   134513 112     1659

head(getAssociation(resAdj, "dmrcate"))

##                          coord no.cpgs        minfdr     Stouffer  maxbetafc
## 3470    chr6:33130696-33148812     135  1.093719e-73 1.844138e-46  0.4429870
## 3630  chr6:133561368-133564578      51  0.000000e+00 2.905878e-31 -0.6600767
## 1574   chr16:51183363-51190201      47 2.603037e-189 2.562262e-30 -0.5757717
## 475  chr10:118030292-118034357      31  0.000000e+00 4.965712e-30 -0.6980408
## 3386    chr6:29520527-29521803      40 5.746100e-205 3.535914e-28 -0.5406367
## 1579   chr16:54964677-54973966      42 5.649098e-112 5.549090e-26 -0.6023573
##      meanbetafc
## 3470  0.2055207
## 3630 -0.3596354
## 1574 -0.3019712
## 475  -0.4162689
## 3386 -0.2956325
## 1579 -0.2411854

DiffMean and DiffVar are internally stored as a MArrayLM, the class from limma results. This class allows testing different constrasts or evaluating different variables simultaneously. The function getProbeResults helps the user performing these operations. It also has the arguments object and rid from getAssociation. coef is a numeric with the index of the coefficient from which we want the results. If we did not pass a custom model to runPipeline, the first coefficient (coef = 1) is the intercept and the second coefficient (coef = 2) is the first variable that we included in variable_names. We can evaluate different coefficients simultaneously by passing a vector to coef. contrast is a matrix with the contrasts that we want to evaluate. This option is useful when our variable of interest is a factor with several levels and we want to do all the different comparisons. Finally, the argument fNames is used to select the variables from features annotation that will be added to the tables.

To exemplify the use of this function, we will evaluate our whole adjusted model, including age coefficient. We will also add some annotation of the CpGs:

head(getProbeResults(resAdj, rid = 1, coef = 2:3, 
                     fNames = c("chromosome", "start")))

##            statusnormal           age   AveExpr        F      P.Value
## cg09383816   -0.5938196 -0.0026657333 0.4885486 930.0807 1.937243e-06
## cg27651090    0.5433331 -0.0009235097 0.5411453 826.0491 2.504222e-06
## cg25104555   -0.5254995 -0.0031493548 0.3906114 787.5881 2.776367e-06
## cg21938148   -0.6659934 -0.0028869823 0.4712352 782.9876 2.811782e-06
## cg25937714   -0.5906359  0.0009181122 0.4350042 770.6247 2.910287e-06
## cg15732851   -0.5760397 -0.0050629750 0.3869865 757.4667 3.020764e-06
##             adj.P.Val SE.statusnormal       SE.age chromosome     start
## cg09383816 0.04782436    0.0336034035 0.0016117775       chr8  67344556
## cg27651090 0.04782436    0.0199226569 0.0009555845      chr13 109270071
## cg25104555 0.04782436    0.0381059727 0.0018277420      chr10  16562998
## cg21938148 0.04782436    0.0247093863 0.0011851786      chr13 110958977
## cg25937714 0.04782436    0.0141740814 0.0006798557       chr3  44036492
## cg15732851 0.04782436    0.0284032366 0.0013623530       chr1 203598761

When more than one coefficient is evaluated, a estimate for each coefficient is returned and the t-statistic is substituted by a F-statistic. More information about linear models, including a detailed section of how to create a constrast matrix can be found in limma users’ guide.

Finally, we can obtain the results of CpGs mapped to some genes with the function getGeneVals. This function accepts the same arguments than getProbeResults but includes the arguments gene and genecol to pass the names of the genes to be selected and the column name of feature data containing gene names.

We will retrieve the difference in variance results for all CpGs mapped to ARMS2. We can see in the rowData of meth that gene names are in the column ‘UCSC_RefGene_Name’:

getGeneVals(resAdj, "ARMS2", genecol = "UCSC_RefGene_Name", fNames = c("chromosome", "start"))

##                logFC         CI.L      CI.R   AveExpr        t     P.Value
## cg24296920 0.3258748  0.159072418 0.4926771 0.5876988 5.261053 0.004972695
## cg24884230 0.1925476  0.066951396 0.3181439 0.6064401 4.128436 0.012262156
## cg00676728 0.1106853  0.012546532 0.2088240 0.8206485 3.037200 0.034639784
## cg03623097 0.1255012 -0.008866243 0.2598686 0.5342317 2.515231 0.060880693
## cg18222240 0.1874701 -0.015810610 0.3907509 0.5432686 2.483476 0.063093712
## cg13265583 0.1911058 -0.059969571 0.4421812 0.6897652 2.049717 0.104248164
##             adj.P.Val         B         SE chromosome     start
## cg24296920 0.06757389 -1.904354 0.03225175      chr10 124214120
## cg24884230 0.10906357 -2.942678 0.06287739      chr10 124216658
## cg00676728 0.19474309 -4.128563 0.06953081      chr10 124213760
## cg03623097 0.26596849 -4.759333 0.01297918      chr10 124213466
## cg18222240 0.27137500 -4.798762 0.01276563      chr10 124213527
## cg13265583 0.35287264 -5.344456 0.04044509      chr10 124214151
##            UCSC_RefGene_Name
## cg24296920             ARMS2
## cg24884230             ARMS2
## cg00676728             ARMS2
## cg03623097             ARMS2
## cg18222240             ARMS2
## cg13265583             ARMS2

3.3 Plotting the results

We can easily get Manhattan plots, Volcano plots and QQ-plots for the probes results (DiffMean and DiffVar) using plot method. Our extension of plot method to ResultSet includes the arguments rid or coef that were already present in getProbeResult. In addition, the argument type allows choosing between a Manhattan plot (“manhattan”), a Volcano plot (“volcano”) or a qq-plot (“qq”).

3.3.1 Manhattan plot

We can customize different aspects of a Manhattan plot. We can highlight the CpGs of a target region by passing a GenomicRanges to the argument highlight. Similarly, we can get a Manhattan plot with only the CpGs of our target region passing a GenomicRanges to the argument subset. It should be noticed that the GenomicRange should have the chromosome as a number (1-24).

We will show these capabilities by highlighting and subsetting a region of ten Mb in chromosome X:

targetRange <- GRanges("23:13000000-23000000")
plot(resAdj, rid = "DiffMean", type = "manhattan", highlight = targetRange)

plot(resAdj, rid = "DiffMean", type = "manhattan", subset = targetRange)

We can also change the height of lines marking different levels of significance. Height of blue line can be set with suggestiveline parameter and red line with genomewideline parameter. It should be noticed that these values are expressed as -log10 of p-value. Finally, as our Manhattan plot is done with base framework, we can customize the plot using base plotting functions such as points, lines or text or arguments of plot function like main:

plot(resAdj, rid = "DiffMean", type = "manhattan", suggestiveline = 3, 
     genomewideline = 6, main = "My custom Manhattan")
abline(h = 13, col = "yellow")

3.3.2 Volcano plot

In our Volcano plot, we can also customize the thresholds for statistical significance and magnitude of the effect using the arguments tPV and tFC. As in the previous case, tPV is expressed as -log10 of p-value. On the other hand, tFC units will change depending if we used beta or M-values. show.labels can turn on and turn off the labelling of significant features. Finally, Volcano plot is based on ggplot2 so we can further customize the plot adding new layers:

plot(resAdj, rid = "DiffMean", type = "volcano", tPV = 14, tFC = 0.4, 
     show.labels = FALSE) + ggtitle("My custom Volcano")

3.3.3 QQplot

Our QQplot include the computation of the lambda, a measure of the inflation of the p-values. We can remove this value with the parameter show.lambda.

Our qqplot is also based on ggplot2 so we will add a title to customize it:

plot(resAdj, rid = "DiffMean", type = "qq") + ggtitle("My custom QQplot")

3.3.4 Features

MEAL incorporates the function plotFeature to plot the beta values distribution of a CpG. plotFeature has three main arguments. set is the GenomicRatioSet with the methylation data. feat is the index or name of our target CpG. variables is a character vector with the names of the variables used in the plot. We can include two variables in our plot.

In the next line, we will plot a CpG with high difference in means between male and female (cg17547524) and a CpG with high difference in variance (cg02939019) vs sex. As plotFeature is based on ggplot2, we can customize it:

plotFeature(set = meth, feat = "cg17547524", variables = "sex") + 
  ggtitle("Diff Means")

plotFeature(set = meth, feat = "cg02939019", variables = "sex") + 
  ggtitle("Diff Vars")

3.3.5 Regional plotting

We can simultaneously plot the different results in a target region along with gene and CpG annotation with the function plotRegion. This function has two main arguments. rset is the ResultSet and range is a GenomicRanges with our target region.

We will plot a region of 1 Mb in chromosome X:

targetRange <- GRanges("chrX:13000000-14000000")
plotRegion(resAdj, targetRange)

Our plot has three main parts. The top contains the annotation of the regional genes and the CpGs included in the analysis. The middle part contains the results of the DMR detection methods (Bumphunter, blockFinder and DMRcate). The bottom part contains the results of the single probe analyses (differential mean and differential variance). Each analysis has two parts: the coefficients and the p-values. The line in the p-values plot marks the significance threshold.

By default,plotRegion includes all analyses run in the plot. However, we can plot only few analyses with the parameter results. We can also modify the height of the p-value line with the parameter tPV (units are -log10 of p-value):

plotRegion(resAdj, targetRange, results = c("DiffMean", "bumphunter"), 
           tPV = 10)

3.4 Methods wrappers

MEAL includes wrappers to run the different methods of the pipeline individually. All these functions accept a GenomicRatioSet as input and can return the results in a ResultSet. Consequently, functionalities described in the above section for the results of the pipeline also apply for the results of a single method.

3.4.1 Differences of mean analysis

We can test if a phenotype causes changes in methylation means using the runDiffMeanAnalysis. This function is a wrapper of lmFit function from limma and requires two arguments: set and model. set contains the methylation data, either in a GenomicRatioSet or a matrix. model can be a matrix with the linear model or a formula indicating the model. In the former case, set must be a GenomicRatioSet and the variables included in the model must be present in the colData of our set.

We exemplify the use of this function by running the same linear model than in our pipeline:

resDM <- runDiffMeanAnalysis(set = meth, model = ~ status)

runDiffMeanAnalysis also has other parameters to customize the analysis. If set is a GenomicRatioSet, the parameter betas allows us choosing between betas (TRUE) and M-values (FALSE). We can also run a robust linear model changing the parameter method to “robust”. Finally, resultSet indicates if the function will return a ResultSet (TRUE) or a MArrayLM (FALSE).

All these parameters can be set in the runPipeline function with the argument DiffMean_params.

3.4.2 Differences of Variance analysis

We can test if a phenotype causes changes in methylation variance using the runDiffVarAnalysis. This function is a wrapper of varFit function from missMethyl and requires three arguments: set, model and coefficient. set contains the methylation data in a GenomicRatioSet. model can be a matrix with the linear model or a formula indicating the model. In the former case, the variables included in the model must be present in the colData of our set. coefficient indicates the variables of the linear model for which the difference of variance will be computed. By default, all discrete variables will be included.

We exemplify the use of this function by running the same model than in our pipeline:

resDV <- runDiffVarAnalysis(set = meth, model = ~ status, coefficient = 2)

runDiffVarAnalysis also has the parameter resultSet that allows returning a MArrayLM object instead of a ResultSet. Finally, we can change other parameters of varFit function using the ... argument. These parameters can also be set in the runPipeline function passing them to the argument DiffVar_params.

3.4.3 Bumphunter

We can detect DMRs using Bumphunter from minfi with the function runBumphunter. This function requires three arguments: set, model and coefficient. set contains the methylation data in a GenomicRatioSet. model can be a matrix with the linear model or a formula indicating the model. In the former case, the variables included in the model must be present in the colData of our set. coefficient indicates the variable used to detect the DMRs.

We exemplify the use of this function by running bumphunter as in the pipeline:

resBH <- runBumphunter(set = meth, model = ~ status, coefficient = 2)

runBumphunter also has other parameters to customize the analysis. The parameter betas allows us choosing between betas (TRUE) and M-values (FALSE). bumphunter_cutoff specifies the minimum beta change to include a probe in a bump. num_permutations indicates the number of permutations run to compute bumps p-values (by default is 0 so no permutations are run and no p-values are returned). resultSet allows returning a data.frame object instead of a ResultSet. Finally, we can change other parameters of bumphunter function using the ... argument. These parameters can also be set in the runPipeline function passing them to the argument bumphunter_params.

3.4.4 Blockfinder

blockFinder is an adaptation of Bumphunter to detect DMRs from open sea probes. The function runBlockFinder has essentially the same arguments than runBumphunter.

We exemplify the use of this function by running blockFinder as in the pipeline:

resBF <- runBlockFinder(set = meth, model = ~ status, coefficient = 2)

To change the parameters in the runPipeline function, we can pass them to the argument blockFinder_params.

3.4.5 DMRcate

We can detect DMRs using DMRcate with the function runDMRcate. This function only has four parameters. set is the GenomicRatioSet, model is the linear model or a formula, coefficient is the variable used to detect the DMRs and resultSet to change the class of the output.

We exemplify the use of this function by running DMRcate as in the pipeline:

resDC <- runDMRcate(set = meth, model = ~ status, coefficient = 2)

We can change other parameters of DMRcate functions (cpg.annotate and dmrcate) passing them to the ... argument. These parameters can also be set in the runPipeline function passing them to the argument dmrcate_params.

3.4.6 RDA

We can determine if a genomic region is differentially methylated with RDA (Redundancy Analysis). This analysis can be run with the function runRDA that requires three arguments: set, model and range. As in the previous functions, set is a GenomicRatioSet with the methylation data and model contains the linear model either in a matrix or in a formula. range is a GenomicRanges with the coordinates of our target region.

We will exemplify the use of this function by running RDA in a region of chromosome X:

targetRange <- GRanges("chrX:13000000-23000000")
resRDA <- runRDA(set = meth, model = ~ status, range = targetRange)

runRDA also has other parameters to customize the analysis. The parameter betas allows us choosing between betas (TRUE) and M-values (FALSE). num_vars selects the number of columns in model matrix considered as variables. The remaining columns will be considered as covariates. num_permutations indicates the number of permutations run to compute p-values. resultSet allows returning a rda object from vegan package instead of a ResultSet.

We can run RDA in our pipeline when we are a priori interested in a target genomic range. In this case, we will pass our target region to the argument range of runPipeline. We can pass other parameters of runRDA using the argument rda_params.

3.4.6.1 Managing and plotting RDA results

We can retrieve RDA results using the function getAssociation:

getAssociation(resRDA, rid = "RDA")

## Call: rda(X = t(mat), Y = varsmodel, Z = covarsmodel)
## 
##                 Inertia Proportion Rank
## Total         1.385e+03  1.000e+00     
## Constrained   1.248e+02  9.011e-02    1
## Unconstrained 1.260e+03  9.099e-01    4
## Inertia is variance 
## Some constraints were aliased because they were collinear (redundant)
## 
## Eigenvalues for constrained axes:
##   RDA1 
## 124.77 
## 
## Eigenvalues for unconstrained axes:
##    PC1    PC2    PC3    PC4 
## 1081.7   93.3   61.0   23.8

RDA results are encapsulated in a rda object from vegan package. We can get a summary of RDA results with the function getRDAresults:

getRDAresults(resRDA)

##          R2        pval   global.R2 global.pval 
##  0.09011053  0.50000000  0.42268975  0.98970103

This function returns four values: R2, pval, global.R2 and global.pval. R2 is the ammount of variance that the model explains in our target region. pval is the probability of finding this ammount of variance of higher by change. global.R2 is the ammount of variance that our model explains in the whole genome. global.pval is the probability of finding a region with the same number of probes explaining the same or more variance than our target region. With these values, we can determine if our target region is differentially methylated and if this phenomena is local or global.

The function topRDAhits returns a data.frame with features associated to first two RDA components. This functions computes a Pearson correlation test between the methylation values and the RDA components. Only CpGs with a p-value lower than tPV parameter (by default 0.05) with any of the components are included in the data.frame:

topRDAhits(resRDA)

## [1] feat      RDA       cor       P.Value   adj.P.Val
## <0 rows> (or 0-length row.names)

Finally, we can plot the first two dimensions of our RDA with the function plotRDA. This function makes a biplot of samples and features. We can color the samples using categorical variables by passing in a data.frame to argument pheno.

We will plot RDA using status variable of our sets colData:

plotRDA(object = resRDA, pheno = colData(meth)[, "status", drop = FALSE])

The RDA plot prints a label at the center of each group and the summary of RDA results (R² and p-value) in the legend. plotRDA has two additional arguments. main is a character vector with the plot’s title. n_feat is a numeric with the number of feats that will have a label in the text. Only the n_feat features most associated to each of the components will be displayed.

plotRDA relies on base paradigm, so we can add layers using functions from this infrastructure (e.g. lines, points…):

plotRDA(object = resRDA, pheno = colData(meth)[, "status", drop = FALSE])
abline(h = -1)

4 Session Info

sessionInfo()

## R version 3.5.1 Patched (2018-07-12 r74967)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 16.04.5 LTS
## 
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.8-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.8-bioc/R/lib/libRlapack.so
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=C              
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## attached base packages:
## [1] stats4    parallel  stats     graphics  grDevices utils     datasets 
## [8] methods   base     
## 
## other attached packages:
##  [1] doRNG_1.7.1                                       
##  [2] rngtools_1.3.1                                    
##  [3] pkgmaker_0.27                                     
##  [4] registry_0.5                                      
##  [5] ggplot2_3.1.0                                     
##  [6] minfiData_0.27.0                                  
##  [7] IlluminaHumanMethylation450kanno.ilmn12.hg19_0.6.0
##  [8] IlluminaHumanMethylation450kmanifest_0.4.0        
##  [9] minfi_1.28.0                                      
## [10] bumphunter_1.24.0                                 
## [11] locfit_1.5-9.1                                    
## [12] iterators_1.0.10                                  
## [13] foreach_1.4.4                                     
## [14] Biostrings_2.50.0                                 
## [15] XVector_0.22.0                                    
## [16] SummarizedExperiment_1.12.0                       
## [17] DelayedArray_0.8.0                                
## [18] BiocParallel_1.16.0                               
## [19] matrixStats_0.54.0                                
## [20] GenomicRanges_1.34.0                              
## [21] GenomeInfoDb_1.18.0                               
## [22] IRanges_2.16.0                                    
## [23] S4Vectors_0.20.0                                  
## [24] MEAL_1.12.0                                       
## [25] MultiDataSet_1.10.0                               
## [26] Biobase_2.42.0                                    
## [27] BiocGenerics_0.28.0                               
## [28] BiocStyle_2.10.0                                  
## 
## loaded via a namespace (and not attached):
##   [1] R.utils_2.7.0                                      
##   [2] tidyselect_0.2.5                                   
##   [3] RSQLite_2.1.1                                      
##   [4] AnnotationDbi_1.44.0                               
##   [5] htmlwidgets_1.3                                    
##   [6] grid_3.5.1                                         
##   [7] munsell_0.5.0                                      
##   [8] codetools_0.2-15                                   
##   [9] preprocessCore_1.44.0                              
##  [10] statmod_1.4.30                                     
##  [11] withr_2.1.2                                        
##  [12] colorspace_1.3-2                                   
##  [13] knitr_1.20                                         
##  [14] rstudioapi_0.8                                     
##  [15] labeling_0.3                                       
##  [16] GenomeInfoDbData_1.2.0                             
##  [17] bit64_0.9-7                                        
##  [18] rhdf5_2.26.0                                       
##  [19] rprojroot_1.3-2                                    
##  [20] xfun_0.4                                           
##  [21] qqman_0.1.4                                        
##  [22] biovizBase_1.30.0                                  
##  [23] R6_2.3.0                                           
##  [24] illuminaio_0.24.0                                  
##  [25] AnnotationFilter_1.6.0                             
##  [26] bitops_1.0-6                                       
##  [27] reshape_0.8.8                                      
##  [28] assertthat_0.2.0                                   
##  [29] scales_1.0.0                                       
##  [30] bsseq_1.18.0                                       
##  [31] nnet_7.3-12                                        
##  [32] gtable_0.2.0                                       
##  [33] methylumi_2.28.0                                   
##  [34] ensembldb_2.6.0                                    
##  [35] rlang_0.3.0.1                                      
##  [36] genefilter_1.64.0                                  
##  [37] calibrate_1.7.2                                    
##  [38] splines_3.5.1                                      
##  [39] rtracklayer_1.42.0                                 
##  [40] lazyeval_0.2.1                                     
##  [41] acepack_1.4.1                                      
##  [42] DSS_2.30.0                                         
##  [43] GEOquery_2.50.0                                    
##  [44] dichromat_2.0-0                                    
##  [45] checkmate_1.8.5                                    
##  [46] BiocManager_1.30.3                                 
##  [47] yaml_2.2.0                                         
##  [48] GenomicFeatures_1.34.0                             
##  [49] backports_1.1.2                                    
##  [50] Hmisc_4.1-1                                        
##  [51] tools_3.5.1                                        
##  [52] bookdown_0.7                                       
##  [53] nor1mix_1.2-3                                      
##  [54] RColorBrewer_1.1-2                                 
##  [55] siggenes_1.56.0                                    
##  [56] Rcpp_0.12.19                                       
##  [57] plyr_1.8.4                                         
##  [58] base64enc_0.1-3                                    
##  [59] progress_1.2.0                                     
##  [60] zlibbioc_1.28.0                                    
##  [61] purrr_0.2.5                                        
##  [62] RCurl_1.95-4.11                                    
##  [63] BiasedUrn_1.07                                     
##  [64] prettyunits_1.0.2                                  
##  [65] rpart_4.1-13                                       
##  [66] openssl_1.0.2                                      
##  [67] IlluminaHumanMethylationEPICmanifest_0.3.0         
##  [68] cluster_2.0.7-1                                    
##  [69] magrittr_1.5                                       
##  [70] data.table_1.11.8                                  
##  [71] ProtGenerics_1.14.0                                
##  [72] IlluminaHumanMethylationEPICanno.ilm10b4.hg19_0.6.0
##  [73] missMethyl_1.16.0                                  
##  [74] hms_0.4.2                                          
##  [75] evaluate_0.12                                      
##  [76] xtable_1.8-3                                       
##  [77] XML_3.98-1.16                                      
##  [78] mclust_5.4.1                                       
##  [79] gridExtra_2.3                                      
##  [80] compiler_3.5.1                                     
##  [81] biomaRt_2.38.0                                     
##  [82] tibble_1.4.2                                       
##  [83] crayon_1.3.4                                       
##  [84] R.oo_1.22.0                                        
##  [85] htmltools_0.3.6                                    
##  [86] mgcv_1.8-25                                        
##  [87] Formula_1.2-3                                      
##  [88] tidyr_0.8.2                                        
##  [89] DBI_1.0.0                                          
##  [90] MASS_7.3-51                                        
##  [91] DMRcate_1.18.0                                     
##  [92] Matrix_1.2-14                                      
##  [93] readr_1.1.1                                        
##  [94] permute_0.9-4                                      
##  [95] quadprog_1.5-5                                     
##  [96] R.methodsS3_1.7.1                                  
##  [97] Gviz_1.26.0                                        
##  [98] bindr_0.1.1                                        
##  [99] pkgconfig_2.0.2                                    
## [100] GenomicAlignments_1.18.0                           
## [101] foreign_0.8-71                                     
## [102] xml2_1.2.0                                         
## [103] annotate_1.60.0                                    
## [104] multtest_2.38.0                                    
## [105] beanplot_1.2                                       
## [106] ruv_0.9.7                                          
## [107] bibtex_0.4.2                                       
## [108] stringr_1.3.1                                      
## [109] VariantAnnotation_1.28.0                           
## [110] digest_0.6.18                                      
## [111] vegan_2.5-3                                        
## [112] rmarkdown_1.10                                     
## [113] base64_2.0                                         
## [114] htmlTable_1.12                                     
## [115] DelayedMatrixStats_1.4.0                           
## [116] curl_3.2                                           
## [117] Rsamtools_1.34.0                                   
## [118] gtools_3.8.1                                       
## [119] nlme_3.1-137                                       
## [120] bindrcpp_0.2.2                                     
## [121] Rhdf5lib_1.4.0                                     
## [122] limma_3.38.0                                       
## [123] BSgenome_1.50.0                                    
## [124] pillar_1.3.0                                       
## [125] lattice_0.20-35                                    
## [126] httr_1.3.1                                         
## [127] survival_2.43-1                                    
## [128] GO.db_3.7.0                                        
## [129] glue_1.3.0                                         
## [130] DMRcatedata_1.17.0                                 
## [131] bit_1.1-14                                         
## [132] stringi_1.2.4                                      
## [133] HDF5Array_1.10.0                                   
## [134] blob_1.1.1                                         
## [135] org.Hs.eg.db_3.7.0                                 
## [136] latticeExtra_0.6-28                                
## [137] memoise_1.1.0                                      
## [138] dplyr_0.7.7

References

Aryee, Martin J, Andrew E Jaffe, Hector Corrada-Bravo, Christine Ladd-Acosta, Andrew P Feinberg, Kasper D Hansen, and Rafael A Irizarry. 2014. “Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays.” Bioinformatics (Oxford, England) 30 (10):1363–9. https://doi.org/10.1093/bioinformatics/btu049.

Du, Pan, Warren A Kibbe, and Simon M Lin. 2008. “lumi: a pipeline for processing Illumina microarray.” Bioinformatics (Oxford, England) 24 (13):1547–8. https://doi.org/10.1093/bioinformatics/btn224.

Methylation Analysis with MEAL

30 October 2018

Package