ipdDb

Steffen Klasberg

2018-10-30

Database package of HLA and KIR alleles from the IPD IMGT/HLA and IPD KIR databases (both are also available on GitHub).

Reference:

Robinson J, Maccari G, Marsh SGE, Walter L, Blokhuis J, Bimber B, Parham P, De Groot NG, Bontrop RE, Guethlein LA, and Hammond JA
KIR Nomenclature in non-human species
Immunogenetics (2018), in preparation

This package holds the allele information from the IPD databases. For HLA, it is limited to the loci HLA-A, -B, -C, -DPB1, -DQB1 and -DRB1.

For alleles whose full-length sequence is not known, the closest full-length allele is stored as well.

The data is stored in an SQLite database that is compatible with Bioconductor’s conventions for database packages, i.e. it supports the select method together with columns, keytypes and keys.
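A minimal sketch of how this select interface might be used. It assumes that the object returned by loadHlaData() can be passed directly to the AnnotationDbi generics columns, keytypes, keys and select; that is an assumption, not something this text confirms. Keytype and column names are queried from the database rather than hard-coded, since they are not listed here.

```r
library(ipdDb)
library(AnnotationDbi)

hla <- loadHlaData()

## Assumption: `hla` itself is accepted by the AnnotationDbi generics.
cols <- columns(hla)    # columns that can be retrieved
kts  <- keytypes(hla)   # columns that can be used as lookup keys

## Use the first reported keytype and a handful of its keys
kt <- kts[1]
ks <- head(keys(hla, keytype = kt), 3)

## Retrieve the first two available columns for these keys
res <- select(hla, keys = ks, columns = head(cols, 2), keytype = kt)
head(res)
```

Inspecting columns(hla) and keytypes(hla) first is the usual way to find out which names a given database package accepts before writing a specific query.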

Additionally, helper functions are provided to fetch all loci in the database, all alleles of a locus, the sequences of alleles and, for a given allele, the sequence of the closest allele that is available in full length.

library(ipdDb)

hla <- loadHlaData()
## get all loci stored in the db
available_loci <- hla$getLoci()

## get all alleles of a locus
alleles <- hla$getAlleles(available_loci[1])
alleles <- hla$getAlleles("HLA-A")

## get all sequences of a bunch of alleles as DNAStringSet
sequences <- hla$getReference(alleles)
sequences <- hla$getReference(c("HLA-A*01:01:01:01", "HLA-A*01:01:01:03"))

## get the closest complete reference for ONE allele as DNAStringSet
closest_complete <- hla$getClosestComplete(alleles[1])
closest_complete <- hla$getClosestComplete("HLA-A*01:01:01:01")

## Get the gene structure for a bunch of alleles as GRanges object
structures <- hla$getStructure(alleles)
structures <- hla$getStructure(c("HLA-A*01:01:01:01", "HLA-A*01:01:01:03"))
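
The objects returned by the helpers are standard Bioconductor containers, so they can be inspected and exported with the usual Biostrings and GenomicRanges tools. The following is a sketch only; the variable names and the temporary output file are chosen for illustration.

```r
library(ipdDb)
library(Biostrings)
library(GenomicRanges)

hla <- loadHlaData()
allele_names <- c("HLA-A*01:01:01:01", "HLA-A*01:01:01:03")

## Sequences come back as a DNAStringSet: record the length of each one
sequences <- hla$getReference(allele_names)
seq_lengths <- setNames(width(sequences), names(sequences))

## Write the sequences to a FASTA file (a temporary file in this sketch)
fasta_file <- tempfile(fileext = ".fasta")
writeXStringSet(sequences, fasta_file)

## Gene structures come back as a GRanges object: view them as a table.
## Names are dropped first so they cannot clash as data frame row names.
structures <- hla$getStructure(allele_names)
structure_table <- as.data.frame(unname(structures))
head(structure_table)
```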