Contents

1 Overview

The SCATEData is an extensive data resources containing the location of genomic regions bins, the mean and standard deviation of normalized bulk DNase-seq signals across ENCODE samples, and the clustering of the genomic bins based on bulk DNase-seq signals. These data have been formatted as GenomicRanges Bioconductor objects and hosted on Bioconductor’s ExperimentHub platform.

SCATEData provides data resources for the package SCATE to extract and enhance the sparse and discrete Single-cell ATAC-seq Signal.

For more details, see our paper describing the SCATE package:

2 Datasets

The package contains the following datasets, which can be used by SCATE to extract and enhance the sparse Single-cell ATAC-seq signals. They are grouped into datasets useful for (i) running SCATE, and (ii) providing examples.

These .bam files are Single-cell ATAC-seq reads and the corresponding .bam.bai files are for indexing and searching for the reads in these files. For more extensive documentation, please refer to the metadata or help files for the objects.

3 How to load data

Please refer to SCATE tutorial for the loading and use of the data.

4 Citation

If the SCATEData package is useful in your work, please cite the following paper: - Zhicheng Ji, Weiqiang Zhou, Wenpin Hou, Hongkai Ji, Single-cell ATAC-seq Signal Extraction and Enhancement with SCATE, Genome Biol 21, 161 (2020).