Introduction

SpliceWiz is a graphical interface for differential alternative splicing and visualization in R. It differs from other alternative splicing tools as it is designed for users with basic bioinformatic skills to analyze datasets containing up to hundreds of samples! SpliceWiz contains a number of innovations including:

  • Super-fast handling of alignment BAM files using ompBAM, our developer resource for multi-threaded BAM processing,
  • Alternative splicing event (ASE) filters to remove problematic ASEs from analysis
  • Group-averaged coverage plots: publication-ready figures to clearly visualize differential alternative splicing between biological / experimental conditions
  • Interactive figures, including scatter and volcano plots, gene ontology (GO) analysis, heatmaps, and scrollable coverage plots, powered using the shinyDashboard interface

This vignette is a runnable working example of the SpliceWiz workflow. The purpose is to quickly demonstrate the basic functionalities of SpliceWiz.

We provide here a brief outline of the workflow for users to get started as quickly as possible. However, we also provide more details for those wishing to know more. Many sections will contain extra information that can be displayed when clicked on, such as these:

Click on me for more details
In most sections, we offer more details about each step of the workflow, that can be revealed in text segments like this one. Be sure to click on buttons like these, where available.


FAQ

What are the system memory requirements for running SpliceWiz
We recommend the following memory requirements (RAM) for running various steps of SpliceWiz:

buildRef()

  • Building human / mouse SpliceWiz reference: 8 gigabytes

processBAM()

  • Processing small alignment (BAM) files (~20 million paired end reads): 8 gigabytes
  • Processing large BAM files (~100 million paired end reads): 16 gigabytes

collateData()

  • Collating routine experiments (e.g. 3 replicates, 2 conditions): 8-16 gigabytes
  • Collating large experiments (20+ samples, using lowMemoryMode=TRUE): 32 gigabytes
  • Collating large experiments (20+ samples, using lowMemoryMode=FALSE): 8 gigabytes per thread

Differential analysis

  • Differential analysis (routine experiments): 8 gigabytes
  • Differential analysis (large experiments - 20+ samples): 16 gigabytes
  • DESeq2-based differential analysis of large experiments: 32 gigabytes


How does SpliceWiz measure alternative splicing?

SpliceWiz defines alternative splicing events (ASEs) as binary events between two possibilities, the included and excluded isoform. It detects and measures: skipped (casette) exons (SE), mutually-exclusive exons (MXE), alternative 5’/3’ splice site usage (A5SS / A3SS), alternate first / last exon usage (AFE / ALE), and retained introns (IR or RI).

SpliceWiz uses splice-specific read counts to measure ASEs. Namely, these are junction reads (reads that align across splice sites). The exception is intron retention (IR) whereby the (trimmed) mean read depth across the intron is measured (identical to the method used in IRFinder).

SpliceWiz provides two metrics:

  • Percent spliced in (PSI): is the expression of the included isoform as a proportion of both included/excluded isoform. PSIs are measured for all types of alternative splicing, including annotated retained introns (RI)
  • IR-ratio: For introns, we also measure IR-ratios, which is the expression of IR-transcript as a proportion of IR- and spliced-transcripts. Spliced transcript expression is measured using either SpliceOver or SpliceMax method (the latter is identical to that used in IRFinder)


Does SpliceWiz detect novel splicing events?
Novel splicing events are those in which at least one isoform is not an annotated transcript in the given gene annotation. SpliceWiz DOES detect novel splicing events.

It detects novel events by using novel junctions, using pairs of junctions that originate from or terminate at a common coordinate (novel alternate splice site usage).

Additionally, SpliceWiz detects “tandem junction reads”. These are reads that span across two or more splice junctions. The region between splice junctions can then be annotated as novel exons (if they are not identical to annotated exons). These novel exons can then be used to measure novel casette exon usage.


Workflow from a glance

The basic steps of SpliceWiz are as follows:

  • Building the SpliceWiz reference
  • Process BAM files using SpliceWiz
  • Collate results of individual samples into an experiment
  • Importing the collated experiment as an NxtSE object
  • Alternative splicing event filtering
  • Differential ASE analysis
  • Visualization

Quick-Start

Installation

To install SpliceWiz, start R (devel version) and enter:

if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

BiocManager::install("SpliceWiz")

Setting up a Conda environment to use SpliceWiz
For those wishing to set up a self-contained environment with SpliceWiz installed (e.g. on a high performance cluster), we recommend using miniconda. For installation instructions, see the documentation on how to install miniconda

After installing miniconda, create a conda environment as follows:

conda create -n envSpliceWiz python=3.9

After following the prompts, activate the environment:

conda activate envSpliceWiz

Next, install R 4.2.1 as follows:

conda install -c conda-forge r-base=4.2.1

NB: We have not been able to successfuly use r-base=4.3, so we recommend using r-base=4.2.1 (until further notice).

Many of SpliceWiz’s dependencies are up-to-date from the conda-forge channel, so they are best installed via conda:

conda install -c conda-forge r-devtools r-essentials r-xml r-biocmanager \
  r-fst r-plotly r-rsqlite r-rcurl

After this is done, the remainder of the packages need to be installed from the R terminal. This is because most Bioconductor packages are from the bioconda channel and appear not to be routinely updated.

So, lets enter the R terminal from the command line:

R

Set up Bioconductor 3.16 (which is the latest version compatible with R 4.2):

BiocManager::install(version = "3.16")

Again, follow the prompts to update any necessary packages.

Once this is done, install SpliceWiz (devel) from github:

# BiocManager::install("SpliceWiz")
devtools::install_github("alexchwong/SpliceWiz")

The last step will install any remaining dependencies, taking approximately 20-30 minutes depending on your system.


Enabling OpenMP (multi-threading) for MacOS users (Optional)
For MacOS users, make sure OpenMP libraries are installed correctly. We recommend users follow this guide, but the quickest way to get started is to install libomp via brew:

brew install libomp


Installing statistical package dependencies (Optional)
SpliceWiz uses established statistical tools to perform alternative splicing differential analysis:

  • limma: models included and excluded counts as log-normal distributions
  • DESeq2: models included and excluded counts as negative binomial distributions
  • edgeR: models included and excluded counts as negative binomial distributions. SpliceWiz uses the quasi-likelihood method which deals better with zero-counts.
  • DoubleExpSeq: models included and excluded counts using beta binomial distributions

To install all of these packages:

install.packages("DoubleExpSeq")

BiocManager::install(c("DESeq2", "limma", "edgeR"))


Loading SpliceWiz

library(SpliceWiz)
Details
The SpliceWiz package loads the NxtIRFdata data package. This data package contains the example “chrZ” genome / annotations and 6 example BAM files that are used in this working example. Also, NxtIRFdata provides pre-generated mappability exclusion annotations for building human and mouse SpliceWiz references


The SpliceWiz Graphics User Interface (GUI)

SpliceWiz offers a graphical user interface (GUI) for interactive users, e.g. in the RStudio environment. To start using SpliceWiz GUI:

if(interactive()) {
    spliceWiz(demo = TRUE)
}


Building the SpliceWiz reference

Why do we need the SpliceWiz reference?
SpliceWiz first needs to generate a set of reference files. The SpliceWiz reference is used to quantitate alternative splicing in BAM files, as well as in downstream collation, differential analysis and visualisation.

SpliceWiz generates a reference from a user-provided genome FASTA and genome annotation GTF file, and is optimised for Ensembl references but can accept other reference GTF files. Alternatively, SpliceWiz accepts AnnotationHub resources, using the record names of AnnotationHub records as input.


Using the example FASTA and GTF files, use the buildRef() function to build the SpliceWiz reference:

ref_path <- file.path(tempdir(), "Reference")
buildRef(
    reference_path = ref_path,
    fasta = chrZ_genome(),
    gtf = chrZ_gtf(),
    ontologySpecies = "Homo sapiens"
)

The SpliceWiz reference can be viewed as data frames using various getter functions. For example, to view the annotated alternative splicing events (ASE):

df <- viewASE(ref_path)

See ?View-Reference-methods for a comprehensive list of getter functions

Using the GUI
After starting the SpliceWiz GUI in demo mode, click the Reference tab from the menu side bar. The following interface will be shown:

Building the SpliceWiz reference using the GUI

Building the SpliceWiz reference using the GUI

  1. The first step to building a SpliceWiz reference is to select a directory in which to create the reference.
  2. SpliceWiz provides an interface to retrieve the genome sequence (FASTA) and transcriptome annotation (GTF) files from the Ensembl FTP server, by first selecting the “Release” and then “Species” from the drop-down boxes.
  3. Alternatively, users can provide their own FASTA and GTF files.
  4. Human (hg38, hg19) and mouse genomes (mm10, mm9) have the option of further refining IR analysis using built-in mappability exclusion annotations, allowing SpliceWiz to ignore intronic regions of low mappability.
For now, to continue with the demo and create the reference using the GUI, click on the Load Demo FASTA/GTF (5), and then click Build Reference (6)

Where did the FASTA and GTF files come from?
The helper functions chrZ_genome() and chrZ_gtf() returns the paths to the example genome (FASTA) and transcriptome (GTF) file included with the NxtIRFdata package that contains the working example used by SpliceWiz:

# Provides the path to the example genome:
chrZ_genome()
#> [1] "/home/biocbuild/bbs-3.19-bioc/R/site-library/NxtIRFdata/extdata/genome.fa"

# Provides the path to the example gene annotation:
chrZ_gtf()
#> [1] "/home/biocbuild/bbs-3.19-bioc/R/site-library/NxtIRFdata/extdata/transcripts.gtf"

What is the chrZ genome?
For the purpose of generating a running example to demonstrate SpliceWiz, we created an artificial genome / gene annotation. This was created using 7 human genes (SRSF1, SRSF2, SRSF3, TRA2A, TRA2B, TP53 and NSUN5). The SRSF and TRA family of genes all contain poison exons flanked by retained introns. Additionally, NSUN5 contains an annotated IR event in its terminal intron. Sequences from these 7 genes were aligned into one sequence to create an artificial chromosome Z (chrZ). The gene annotations were modified to only contain the 7 genes with the modified genomic coordinates.

What is the gene ontology species?
SpliceWiz supports gene ontology analysis. To enable this capability, we first need to generate the gene ontology annotations for the appropriate species.

To see a list of supported species:

getAvailableGO()
#>    [1] "Triticum aestivum"                                         
#>    [2] "Triticum aestivum_subsp._aestivum"                         
#>    [3] "Triticum vulgare"                                          
#>    [4] "Brassica napus"                                            
#>    [5] "Arachis hypogaea"                                          
#>    [6] "Hibiscus syriacus"                                         
#>    [7] "Acridium cancellatum"                                      
#>    [8] "Schistocerca cancellata"                                   
#>    [9] "Triticum dicoccoides"                                      
#>   [10] "Triticum turgidum_subsp._dicoccoides"                      
#>   [11] "Triticum turgidum_var._dicoccoides"                        
#>   [12] "Dendrohyas sarda"                                          
#>   [13] "Hyla arborea_sarda"                                        
#>   [14] "Hyla sarda"                                                
#>   [15] "Locusta gregaria"                                          
#>   [16] "Schistocerca gregaria"                                     
#>   [17] "Gossypium hirsutum"                                        
#>   [18] "Gossypium hirsutum_subsp._mexicanum"                       
#>   [19] "Gossypium lanceolatum"                                     
#>   [20] "Gossypium purpurascens"                                    
#>   [21] "Camelina sativa"                                           
#>   [22] "Carassius auratus_gibelio"                                 
#>   [23] "Carassius gibelio_gibelio"                                 
#>   [24] "Carassius gibelio"                                         
#>   [25] "Carassius gibelio_subsp._gibelio"                          
#>   [26] "Cyprinus gibelio"                                          
#>   [27] "Schistocerca piceifrons"                                   
#>   [28] "Papaver somniferum"                                        
#>   [29] "Zingiber officinale"                                       
#>   [30] "Trichomonas vaginalis_G3"                                  
#>   [31] "Trichomonas vaginalis_strain_G3"                           
#>   [32] "Carassius auratus"                                         
#>   [33] "Carassius carassius_auratus"                               
#>   [34] "Cyprinus auratus"                                          
#>   [35] "Helianthus annuus"                                         
#>   [36] "Schistocerca americana"                                    
#>   [37] "Acipenser ruthenus"                                        
#>   [38] "Schistocerca serialis_cubense"                             
#>   [39] "Panicum virgatum"                                          
#>   [40] "Nicotiana tabacum"                                         
#>   [41] "Oncorhynchus mykiss"                                       
#>   [42] "Oncorhynchus nerka_mykiss"                                 
#>   [43] "Parasalmo mykiss"                                          
#>   [44] "Salmo mykiss"                                              
#>   [45] "Schistocerca nitens"                                       
#>   [46] "Schistocerca vaga"                                         
#>   [47] "Salvia splendens"                                          
#>   [48] "Carassius carassius"                                       
#>   [49] "Cyprinus carassius"                                        
#>   [50] "Vicia villosa"                                             
#>   [51] "Camellia sinensis"                                         
#>   [52] "Thea sinensis"                                             
#>   [53] "Oncorhynchus keta"                                         
#>   [54] "Salmo keta"                                                
#>   [55] "Pisum sativum"                                             
#>   [56] "Salmo salar"                                               
#>   [57] "Raphanus sativus"                                          
#>   [58] "Oncorhynchus kisutch"                                      
#>   [59] "Oncorhyncus kisutch"                                       
#>   [60] "Salmo kisatch"                                             
#>   [61] "Lolium rigidum"                                            
#>   [62] "Aegilops squarrosa_subsp._squarrosa"                       
#>   [63] "Aegilops squarrosa"                                        
#>   [64] "Aegilops tauschii"                                         
#>   [65] "Patropyrum tauschii_subsp._tauschii"                       
#>   [66] "Patropyrum tauschii"                                       
#>   [67] "Triticum aegilops"                                         
#>   [68] "Triticum tauschii"                                         
#>   [69] "Salmo trutta"                                              
#>   [70] "Cryptomeria japonica"                                      
#>   [71] "Coregonus clupeaformis"                                    
#>   [72] "Salmo clupeaformis"                                        
#>   [73] "Oncorhynchus gorbuscha"                                    
#>   [74] "Salmo gorbuscha"                                           
#>   [75] "Cyprinus carpio"                                           
#>   [76] "Glycine max_subsp._soja"                                   
#>   [77] "Glycine soja"                                              
#>   [78] "Salmo fontinalis"                                          
#>   [79] "Salvelinus fontinalis"                                     
#>   [80] "Glycine max"                                               
#>   [81] "Phaseolus max"                                             
#>   [82] "Chenopodium quinoa"                                        
#>   [83] "Hordeum sativum"                                           
#>   [84] "Hordeum vulgare_subsp._vulgare"                            
#>   [85] "Hordeum vulgare_var._nudum"                                
#>   [86] "Hordeum vulgare_var._vulgare"                              
#>   [87] "Festuca perennis_(L.)_Columbus_&_J.P.Sm.,_2010"            
#>   [88] "Festuca perennis"                                          
#>   [89] "Lolium perenne"                                            
#>   [90] "Lolium vulgare"                                            
#>   [91] "Coffea arabica"                                            
#>   [92] "Barbus grahami"                                            
#>   [93] "Sinocyclocheilus grahami"                                  
#>   [94] "Sinocyclocheilus rhinocerous"                              
#>   [95] "Gossypium arboreum"                                        
#>   [96] "Brassica oleracea"                                         
#>   [97] "Malus sylvestris"                                          
#>   [98] "Pyrus malus_var._sylvestris"                               
#>   [99] "Astyanax mexicanus"                                        
#>  [100] "Tetragonopterus mexicanus"                                 
#>  [101] "Arachis stenosperma"                                       
#>  [102] "Prosopis alba"                                             
#>  [103] "Sinocyclocheilus anshuiensis"                              
#>  [104] "Brassica rapa"                                             
#>  [105] "Lactuca sativa"                                            
#>  [106] "Dreissena polymorpha"                                      
#>  [107] "Mytilus polymorphus"                                       
#>  [108] "Hydractinia symbiolongicarpus"                             
#>  [109] "Hevea brasiliensis"                                        
#>  [110] "Oncorhynchus tschawytscha"                                 
#>  [111] "Oncorhynchus tshawytscha"                                  
#>  [112] "Salmo tshawytscha"                                         
#>  [113] "Arachis ipaensis"                                          
#>  [114] "Zea mays"                                                  
#>  [115] "Zea mays_var._japonica"                                    
#>  [116] "Salmo namaycush"                                           
#>  [117] "Salvelinus namaycush"                                      
#>  [118] "Capsicum annuum"                                           
#>  [119] "Brienomyrus brachyistius"                                  
#>  [120] "Marcusenius brachyistius"                                  
#>  [121] "Convolvulus nil"                                           
#>  [122] "Ipomoea nil"                                               
#>  [123] "Pharbitis nil"                                             
#>  [124] "Olea europaea_subsp._europaea_var._sylvestris"             
#>  [125] "Olea europaea_var._oleaster"                               
#>  [126] "Olea europaea_var._sylvestris"                             
#>  [127] "Olea europea_subsp._sylvestris"                            
#>  [128] "Alosa sapidissima"                                         
#>  [129] "Clupea sapidissima"                                        
#>  [130] "Carpiodes asiaticus"                                       
#>  [131] "Myxocyprinus asiaticus"                                    
#>  [132] "Actinidia eriantha"                                        
#>  [133] "Gossypium raimondii"                                       
#>  [134] "Salmo alpinus"                                             
#>  [135] "Salvelinus alpinus"                                        
#>  [136] "Catostomus texanus"                                        
#>  [137] "Xyrauchen texanus"                                         
#>  [138] "Doryrhamphus excisus"                                      
#>  [139] "Quercus lobata"                                            
#>  [140] "Malus communis"                                            
#>  [141] "Malus domestica"                                           
#>  [142] "Malus pumila_auct."                                        
#>  [143] "Malus pumila_var._domestica"                               
#>  [144] "Malus sylvestris_var._domestica"                           
#>  [145] "Malus x_domestica"                                         
#>  [146] "Pyrus malus"                                               
#>  [147] "Pyrus malus_var._domestica"                                
#>  [148] "Quercus suber"                                             
#>  [149] "Oncorhynchus nerka"                                        
#>  [150] "Salmo nerka"                                               
#>  [151] "Nicotiana tomentosiformis"                                 
#>  [152] "Carya illinoensis"                                         
#>  [153] "Carya illinoinensis"                                       
#>  [154] "Mercenaria mercenaria"                                     
#>  [155] "Venus mercenaria"                                          
#>  [156] "Quercus robur"                                             
#>  [157] "Durio zibethinus"                                          
#>  [158] "Pongo abelii"                                              
#>  [159] "Pongo pygmaeus_abelii"                                     
#>  [160] "Pongo pygmaeus_abeli"                                      
#>  [161] "Mya arenaria"                                              
#>  [162] "Arachis duranensis"                                        
#>  [163] "Arachis spegazzinii"                                       
#>  [164] "Pyrus x_bretschneideri"                                    
#>  [165] "Trifolium pratense"                                        
#>  [166] "Gorilla gorilla_gorilla"                                   
#>  [167] "Cobitis anguillicaudata"                                   
#>  [168] "Misgurnus anguillicaudatus"                                
#>  [169] "Scaphiopus bombifrons"                                     
#>  [170] "Spea bombifrons"                                           
#>  [171] "Haliotis rufenscens"                                       
#>  [172] "Haliotis rufescens"                                        
#>  [173] "Oreochromis nilotica"                                      
#>  [174] "Oreochromis niloticus"                                     
#>  [175] "Perca nilotica"                                            
#>  [176] "Tilapia nilotica"                                          
#>  [177] "Acropora convexa"                                          
#>  [178] "Acropora millepora"                                        
#>  [179] "Acropora singularis"                                       
#>  [180] "Cebus apella"                                              
#>  [181] "Sapajus apella"                                            
#>  [182] "Simia apella"                                              
#>  [183] "Eucalyptus grandis"                                        
#>  [184] "Dasypus novemcinctus"                                      
#>  [185] "Callithrix jacchus_jacchus"                                
#>  [186] "Callithrix jacchus"                                        
#>  [187] "Simia jacchus"                                             
#>  [188] "Pistacia vera"                                             
#>  [189] "greater Indian_fruit_bat"                                  
#>  [190] "Pteropus giganteus"                                        
#>  [191] "Pteropus medius"                                           
#>  [192] "Salvia miltiorhiza"                                        
#>  [193] "Salvia miltiorrhiza"                                       
#>  [194] "Daphnia pulicaria"                                         
#>  [195] "Magnolia sinica"                                           
#>  [196] "Manglietia sinica"                                         
#>  [197] "Manglietiastrum sinicum"                                   
#>  [198] "Pachylarnax sinica_(Y.W.Law)_N.H.Xia_&_C.Y.Wu"             
#>  [199] "Rosa chinensis"                                            
#>  [200] "Rosa indica_auct.,_non_L."                                 
#>  [201] "Mytilus californianus"                                     
#>  [202] "Pteropus vampyrus"                                         
#>  [203] "Vespertilio vampyrus"                                      
#>  [204] "Chinemys reevesii"                                         
#>  [205] "Chinemys reevesi"                                          
#>  [206] "Emys reevesii"                                             
#>  [207] "Geoclemys reevesii"                                        
#>  [208] "Geoclemys reevessi"                                        
#>  [209] "Mauremys reevesii"                                         
#>  [210] "Mauremys reevesi"                                          
#>  [211] "Choloepus brasiliensis_Fitzinger_1871"                     
#>  [212] "Choloepus brasiliensis"                                    
#>  [213] "Choloepus didactylus"                                      
#>  [214] "Macaca nemestrina"                                         
#>  [215] "Simia nemestrina"                                          
#>  [216] "Bubalus arnee_carabanensis"                                
#>  [217] "Bubalus bubalis_carabanesis"                               
#>  [218] "Bubalus carabanensis_carabanensis"                         
#>  [219] "Bubalus carabanensis"                                      
#>  [220] "Lotus corniculatus_var._japonicus"                         
#>  [221] "Lotus japonicus"                                           
#>  [222] "Nicotiana sylvestris"                                      
#>  [223] "Tupaia belangeri_chinensis"                                
#>  [224] "Tupaia chinensis"                                          
#>  [225] "Clarias gariepinus"                                        
#>  [226] "Clarias lazera"                                            
#>  [227] "Silurus gariepinus"                                        
#>  [228] "Canis dingo"                                               
#>  [229] "Canis familiaris_dingo"                                    
#>  [230] "Canis lupus_dingo"                                         
#>  [231] "Barbus tetrazona"                                          
#>  [232] "Capoeta tetrazona"                                         
#>  [233] "Puntigrus tetrazona"                                       
#>  [234] "Puntius tetrazona"                                         
#>  [235] "Systomus tetrazona"                                        
#>  [236] "Lycium ferocissimum"                                       
#>  [237] "Nicotiana attenuata"                                       
#>  [238] "Denticeps clupeoides"                                      
#>  [239] "Octodon degus"                                             
#>  [240] "Haliotis rubra"                                            
#>  [241] "Aedes albopictus"                                          
#>  [242] "Stegomyia albopicta"                                       
#>  [243] "Spinacia oleracea"                                         
#>  [244] "Paramecium aurelia_syngen_4"                               
#>  [245] "Paramecium tetraurelia"                                    
#>  [246] "Salvia hispanica"                                          
#>  [247] "Medicago truncatula"                                       
#>  [248] "Crassostrea virginica"                                     
#>  [249] "Ostrea virginica"                                          
#>  [250] "Felis catus"                                               
#>  [251] "Felis domesticus"                                          
#>  [252] "Felis silvestris_catus"                                    
#>  [253] "Anubis baboon"                                             
#>  [254] "Papio anubis"                                              
#>  [255] "Papio cynocephalus_anubis"                                 
#>  [256] "Papio doguera"                                             
#>  [257] "Papio hamadryas_anubis"                                    
#>  [258] "Papio hamadryas_doguera"                                   
#>  [259] "Pongo pygmaeus"                                            
#>  [260] "Simia pygmaeus"                                            
#>  [261] "Sorex etruscus"                                            
#>  [262] "Suncus etruscus"                                           
#>  [263] "Prosopis cineraria"                                        
#>  [264] "Nycticebus coucang"                                        
#>  [265] "Tardigradus coucang"                                       
#>  [266] "Rhododendron vialii"                                       
#>  [267] "Pan paniscus"                                              
#>  [268] "Nematostella vectensis"                                    
#>  [269] "Ixodes dammini"                                            
#>  [270] "Ixodes scapularis"                                         
#>  [271] "Lupinus angustifolius"                                     
#>  [272] "Ipomoea triloba"                                           
#>  [273] "Equus asinus"                                              
#>  [274] "Emiliania huxleyi_CCMP1516"                                
#>  [275] "Emiliania huxleyi_CCMP2090"                                
#>  [276] "Mangifera indica"                                          
#>  [277] "Pteropus alecto"                                           
#>  [278] "Rana temporaria"                                           
#>  [279] "Crassostrea gigas"                                         
#>  [280] "Ostrea gigas"                                              
#>  [281] "Crassostrea angulata"                                      
#>  [282] "Etheostoma spectabile"                                     
#>  [283] "Poecilichthys spectabilis"                                 
#>  [284] "Macadamia integrifolia"                                    
#>  [285] "Megalobrama amblycephala"                                  
#>  [286] "Halichoerus grypus"                                        
#>  [287] "Phoca grypus"                                              
#>  [288] "Juglans regia"                                             
#>  [289] "Selaginella moellendorffii"                                
#>  [290] "Selaginella moellendorfii"                                 
#>  [291] "Pleuronectes platessa"                                     
#>  [292] "Presbytis francoisi"                                       
#>  [293] "Trachypithecus francoisi"                                  
#>  [294] "Tripterygium wilfordii"                                    
#>  [295] "Argiope bruennichi"                                        
#>  [296] "Lepus cuniculus"                                           
#>  [297] "Oryctolagus cuniculus"                                     
#>  [298] "Huro salmoides"                                            
#>  [299] "Labrus salmoides"                                          
#>  [300] "Labrus salmonides"                                         
#>  [301] "Micropterus nigricans"                                     
#>  [302] "Micropterus salmoides"                                     
#>  [303] "Solanum stenotomum"                                        
#>  [304] "Heterocephalus glaber"                                     
#>  [305] "Neosciurus carolinensis"                                   
#>  [306] "Sciurus carolinensis"                                      
#>  [307] "Cervus elaphus"                                            
#>  [308] "Polyodon spathula"                                         
#>  [309] "Squalus spathula"                                          
#>  [310] "Gadus chalcogrammus"                                       
#>  [311] "Theragra chalcogramma_finnmarchica"                        
#>  [312] "Theragra chalcogramma"                                     
#>  [313] "Theragra finnmarchica"                                     
#>  [314] "Nothobranchius furzeri"                                    
#>  [315] "Bos bubalis"                                               
#>  [316] "Bubalus arnee_bubalis"                                     
#>  [317] "Bubalus bubalis"                                           
#>  [318] "Pleuronectes solea"                                        
#>  [319] "Solea solea"                                               
#>  [320] "Solea vulgaris"                                            
#>  [321] "Mastomys coucha"                                           
#>  [322] "Praomys coucha"                                            
#>  [323] "Impatiens glandulifera"                                    
#>  [324] "Dermacentor andersoni"                                     
#>  [325] "Felis nebulosa"                                            
#>  [326] "Neofelis nebulosa"                                         
#>  [327] "Pteropus egyptiacus"                                       
#>  [328] "Rousettus aegyptiacus"                                     
#>  [329] "Rousettus aegypticus"                                      
#>  [330] "Rousettus egyptiacus"                                      
#>  [331] "Phoenix dactylifera"                                       
#>  [332] "Pimephales promelas"                                       
#>  [333] "Ostrea edulis"                                             
#>  [334] "Cebus capucinus_imitator"                                  
#>  [335] "Cebus imitator"                                            
#>  [336] "Peromyscus maniculatus_bairdii"                            
#>  [337] "Gasterosteus pungitius"                                    
#>  [338] "Pungitius pungitius"                                       
#>  [339] "Populus alba"                                              
#>  [340] "Cricetus auratus"                                          
#>  [341] "Golden hamsters"                                           
#>  [342] "Mesocricetus auratus"                                      
#>  [343] "Syrian hamsters"                                           
#>  [344] "Chromis aureus"                                            
#>  [345] "Oreochromis aurea"                                         
#>  [346] "Oreochromis aureus"                                        
#>  [347] "Daucus carota_subsp._sativus"                              
#>  [348] "Daucus carota_var._sativus"                                
#>  [349] "Dermacentor silvarum"                                      
#>  [350] "Hylobates syndactylus"                                     
#>  [351] "Simia syndactyla"                                          
#>  [352] "Symphalangus syndactylus"                                  
#>  [353] "Saccharolobus solfataricus"                                
#>  [354] "Sulfolobus solfataricus"                                   
#>  [355] "Felis geoffroyi"                                           
#>  [356] "Leopardus geoffroyi"                                       
#>  [357] "Oncifelis geoffroyi"                                       
#>  [358] "Felis yagouaroundi"                                        
#>  [359] "Herpailurus yagouaroundi"                                  
#>  [360] "Herpailurus yaguarondi"                                    
#>  [361] "Puma yagouaroundii"                                        
#>  [362] "Puma yagouaroundi"                                         
#>  [363] "Cervus canadensis"                                         
#>  [364] "Populus diversifolia"                                      
#>  [365] "Populus euphratica"                                        
#>  [366] "Cucurbita pepo_subsp._pepo"                                
#>  [367] "Cucurbita pepo_var._medullosa"                             
#>  [368] "Cucurbita pepo_var._pepo"                                  
#>  [369] "Macaca cynomolgus"                                         
#>  [370] "Macaca fascicularis"                                       
#>  [371] "Macaca irus"                                               
#>  [372] "Simia fascicularis"                                        
#>  [373] "Emys muticus"                                              
#>  [374] "Geoclemmys mutica"                                         
#>  [375] "Mauremys mutica"                                           
#>  [376] "Suricata suricatta"                                        
#>  [377] "Viverra suricatta"                                         
#>  [378] "Hylobates moloch"                                          
#>  [379] "Simia moloch"                                              
#>  [380] "Solanum dulcamara"                                         
#>  [381] "Cucurbita moschata"                                        
#>  [382] "Coffea eugeniodes"                                         
#>  [383] "Coffea eugenioides"                                        
#>  [384] "Cucurbita maxima"                                          
#>  [385] "Colobus tephrosceles"                                      
#>  [386] "Piliocolobus tephrosceles"                                 
#>  [387] "Procolobus badius_tephrosceles"                            
#>  [388] "Procolobus rufomitratus_tephrosceles"                      
#>  [389] "Labrus bergylta"                                           
#>  [390] "Centropristis striata"                                     
#>  [391] "Labrus striatus"                                           
#>  [392] "Oryza sativa_(japonica_cultivar-group)"                    
#>  [393] "Oryza sativa_Japonica_Group"                               
#>  [394] "Oryza sativa_subsp._japonica"                              
#>  [395] "Jaculus jaculus"                                           
#>  [396] "Mus jaculus"                                               
#>  [397] "Dioscorea cayenensis_subsp._rotundata"                     
#>  [398] "Dioscorea rotundata"                                       
#>  [399] "Cercopithecus aethiops_sabaeus"                            
#>  [400] "Cercopithecus sabaeus"                                     
#>  [401] "Cercopithecus sabeus"                                      
#>  [402] "Chlorocebus aethiops_sabaeus"                              
#>  [403] "Chlorocebus aethiops_sabeus"                               
#>  [404] "Chlorocebus sabaeus"                                       
#>  [405] "Chlorocebus sabeus"                                        
#>  [406] "Simia sabaea"                                              
#>  [407] "Marmota monax"                                             
#>  [408] "Mus monax"                                                 
#>  [409] "Pygathrix roxellana"                                       
#>  [410] "Rhinopithecus roxellana"                                   
#>  [411] "Semnopithecus roxellana"                                   
#>  [412] "Callorhinus ursinus"                                       
#>  [413] "Callorhynus ursius"                                        
#>  [414] "Phoca ursina"                                              
#>  [415] "Cricetulus barabensis_griseus"                             
#>  [416] "Cricetulus griseus"                                        
#>  [417] "Elephantulus edwardii"                                     
#>  [418] "Macroscelides edwardii"                                    
#>  [419] "Cobitis heteroclita"                                       
#>  [420] "Fundulus heteroclitus"                                     
#>  [421] "Neothunnus macropterus"                                    
#>  [422] "Scomber albacares"                                         
#>  [423] "Thunnus albacares"                                         
#>  [424] "Telopea speciosissima"                                     
#>  [425] "Danio aesculapii"                                          
#>  [426] "Marmota marmota_marmota"                                   
#>  [427] "Apodemus sylvaticus"                                       
#>  [428] "Mus sylvaticus"                                            
#>  [429] "Sylvaemus sylvaticus"                                      
#>  [430] "Populus balsamifera_subsp._trichocarpa"                    
#>  [431] "Populus trichocarpa"                                       
#>  [432] "Mercurialis annua"                                         
#>  [433] "Syzygium oleosum"                                          
#>  [434] "Citellus tridecemlineatus"                                 
#>  [435] "Ictidomys tridecemlineatus"                                
#>  [436] "Spermophilus tridecemlineatus"                             
#>  [437] "Ovis ammon_aries"                                          
#>  [438] "Ovis aries"                                                
#>  [439] "Ovis orientalis_aries"                                     
#>  [440] "Ovis ovis"                                                 
#>  [441] "Solanum verrucosum"                                        
#>  [442] "Leo pardus"                                                
#>  [443] "Panthera pardus"                                           
#>  [444] "Microtus oregoni"                                          
#>  [445] "Arabidopsis lyrata_subsp._lyrata"                          
#>  [446] "Arabis lyrata_subsp._lyrata"                               
#>  [447] "Arabis lyrata"                                             
#>  [448] "Cardaminopsis lyrata"                                      
#>  [449] "Manihot esculenta"                                         
#>  [450] "Manihot utilissima"                                        
#>  [451] "Mustela erminea"                                           
#>  [452] "Phaseolus unguiculatus"                                    
#>  [453] "Vigna unguiculata"                                         
#>  [454] "Lycopersicon pennellii_(Correll)_D'Arcy,_1982"             
#>  [455] "Solanum pennellii_Correll,_1958"                           
#>  [456] "Solanum pennellii"                                         
#>  [457] "Setaria viridis"                                           
#>  [458] "Musa AA_Group"                                             
#>  [459] "Musa acuminata_AA_Group"                                   
#>  [460] "Musa acuminata"                                            
#>  [461] "Musa nana"                                                 
#>  [462] "Gymnostomus macrolepis"                                    
#>  [463] "Onychostoma macrolepis"                                    
#>  [464] "Scaphesthes macrolepis"                                    
#>  [465] "Varicorhinus macrolepis"                                   
#>  [466] "Varicorhinus (Scaphesthes)_macrolepis"                     
#>  [467] "Oryza glaberrima"                                          
#>  [468] "Pelteobagrus fulvidraco"                                   
#>  [469] "Pimelodus fulvidraco"                                      
#>  [470] "Pseudobagrus fulvidraco"                                   
#>  [471] "Tachysurus fulvidraco"                                     
#>  [472] "Hylobates concolor_leucogenys"                             
#>  [473] "Hylobates concolor_leucogyneus"                            
#>  [474] "Hylobates leucogenys_leucogenys"                           
#>  [475] "Hylobates leucogenys"                                      
#>  [476] "Nomascus leucogenys_leucogenys"                            
#>  [477] "Nomascus leucogenys"                                       
#>  [478] "Nomascus leukogenys"                                       
#>  [479] "Nannospalax ehrenbergi_galili"                             
#>  [480] "Nannospalax galili"                                        
#>  [481] "Spalax galili"                                             
#>  [482] "Equus caballus"                                            
#>  [483] "Equus przewalskii_f._caballus"                             
#>  [484] "Equus przewalskii_forma_caballus"                          
#>  [485] "Thunnus maccoyii"                                          
#>  [486] "Thynnus maccoyii"                                          
#>  [487] "Chromis diagramma"                                         
#>  [488] "Simochromis diagramma"                                     
#>  [489] "Diplophysa dalaica"                                        
#>  [490] "Triplophysa dalaica"                                       
#>  [491] "Panthera tigris"                                           
#>  [492] "Strongylocentrotus purpuratus"                             
#>  [493] "Lucioperca lucioperca"                                     
#>  [494] "Perca lucioperca"                                          
#>  [495] "Sander lucioperca"                                         
#>  [496] "Stizostedion lucioperca"                                   
#>  [497] "Dipodomys spectabilis"                                     
#>  [498] "Acinonyx jubatus"                                          
#>  [499] "Felis jubata"                                              
#>  [500] "Conyza canadensis"                                         
#>  [501] "Erigeron canadensis"                                       
#>  [502] "Mustela lutreola"                                          
#>  [503] "Camelus bactrianus_ferus"                                  
#>  [504] "Camelus ferus"                                             
#>  [505] "Cajanus cajan"                                             
#>  [506] "Didelphys domestica"                                       
#>  [507] "Monodelphis domestica"                                     
#>  [508] "Pygathrix bieti"                                           
#>  [509] "Rhinopithecus bieti"                                       
#>  [510] "Saimiri boliviensis"                                       
#>  [511] "Hesperomys eremicus"                                       
#>  [512] "Peromyscus eremicus"                                       
#>  [513] "Arabidopsis salsuginea"                                    
#>  [514] "Eutrema salsugineum"                                       
#>  [515] "Hesperis salsuginea"                                       
#>  [516] "Sisymbrium salsugineum"                                    
#>  [517] "Stenophragma salsugineum"                                  
#>  [518] "Thellungiella salsuginea"                                  
#>  [519] "Thelypodium salsugineum"                                   
#>  [520] "Coetomys damarensis"                                       
#>  [521] "Cryptomys damarensis"                                      
#>  [522] "Fukomys damarensis"                                        
#>  [523] "Gadus morhua"                                              
#>  [524] "Leptonychotes weddellii"                                   
#>  [525] "Leptonychotes weddelli"                                    
#>  [526] "Otaria weddellii"                                          
#>  [527] "Grammomys dolichurus_surdaster"                            
#>  [528] "Grammomys surdaster"                                       
#>  [529] "Thamnomys surdaster"                                       
#>  [530] "Solanum tuberosum"                                         
#>  [531] "Andropogon sorghum"                                        
#>  [532] "Sorghum bicolor"                                           
#>  [533] "Sorghum bicolor_subsp._bicolor"                            
#>  [534] "Sorghum nervosum"                                          
#>  [535] "Sorghum saccharatum"                                       
#>  [536] "Sorghum vulgare"                                           
#>  [537] "Holocentrus calcarifer"                                    
#>  [538] "Lates calcarifer"                                          
#>  [539] "Hippopotamus amphibius_kiboko"                             
#>  [540] "Ixodes sanguineus"                                         
#>  [541] "Rhipicephalus sanguineus"                                  
#>  [542] "Clupea harengus_harengus"                                  
#>  [543] "Clupea harengus"                                           
#>  [544] "Bos indicus_x_Bos_taurus"                                  
#>  [545] "Bos primigenius_indicus_x_Bos_primigenius_taurus"          
#>  [546] "Bos taurus_indicus_x_Bos_taurus_taurus"                    
#>  [547] "Bos taurus_x_Bos_indicus"                                  
#>  [548] "Chrysochloris asiatica"                                    
#>  [549] "Talpa asiatica"                                            
#>  [550] "Macacus gelada"                                            
#>  [551] "Theropithecus gelada"                                      
#>  [552] "Bufo bufo"                                                 
#>  [553] "Rana bufo"                                                 
#>  [554] "Maylandia zebra"                                           
#>  [555] "Metriaclima zebra"                                         
#>  [556] "Pseudotropheus sp._'Pseudotropheus_zebra_complex'"         
#>  [557] "Pseudotropheus zebra"                                      
#>  [558] "Otaria californiana"                                       
#>  [559] "Zalophus californianus"                                    
#>  [560] "Ictalurus punctatus"                                       
#>  [561] "Silurus punctatus"                                         
#>  [562] "Mus caroli"                                                
#>  [563] "Mus formosanus"                                            
#>  [564] "Oryx dammah"                                               
#>  [565] "Camelus dromedarius"                                       
#>  [566] "Asparagus officinalis"                                     
#>  [567] "Amaranthus gangeticus"                                     
#>  [568] "Amaranthus mangostanus"                                    
#>  [569] "Amaranthus tricolor"                                       
#>  [570] "Pneumatophorus japonicus"                                  
#>  [571] "Scomber japonicus"                                         
#>  [572] "Lutra lutra"                                               
#>  [573] "Peromyscus leucopus"                                       
#>  [574] "Perca fluviatilis"                                         
#>  [575] "Pagothenia bernacchii"                                     
#>  [576] "Pseudotrematomus bernacchii"                               
#>  [577] "Trematomus bernacchii"                                     
#>  [578] "Trematomus bernacchi"                                      
#>  [579] "Phascolarctos cinereus"                                    
#>  [580] "Mustela furo"                                              
#>  [581] "Mustela putorius_furo"                                     
#>  [582] "Chaetochloa italica"                                       
#>  [583] "Panicum italicum"                                          
#>  [584] "Pennisetum macrochaetum"                                   
#>  [585] "Setaria italica"                                           
#>  [586] "Setaria viridis_subsp._italica"                            
#>  [587] "Elaeis guineensis"                                         
#>  [588] "Mus rattus"                                                
#>  [589] "Rattus rattoides"                                          
#>  [590] "Rattus rattus"                                             
#>  [591] "Rattus wroughtoni"                                         
#>  [592] "Acropora digitifera"                                       
#>  [593] "Madrepora digitifera"                                      
#>  [594] "Leo leo"                                                   
#>  [595] "Panthera leo"                                              
#>  [596] "Echinops telfairii"                                        
#>  [597] "Echinops telfairi"                                         
#>  [598] "Madrepora verrucosa"                                       
#>  [599] "Pocillopora danae"                                         
#>  [600] "Pocillopora verrucosa"                                     
#>  [601] "Myotis daubentonii"                                        
#>  [602] "Myotis daubentoni"                                         
#>  [603] "Vespertilio daubentonii"                                   
#>  [604] "Limia formosa"                                             
#>  [605] "Mollienesia formosa"                                       
#>  [606] "Poecilia formosa"                                          
#>  [607] "Phyllostomus discolor"                                     
#>  [608] "Aurata aurata"                                             
#>  [609] "Sparus aurata"                                             
#>  [610] "Sparus auratus"                                            
#>  [611] "Microcebus murinus"                                        
#>  [612] "Peromyscus californicus_insignis"                          
#>  [613] "Peromyscus californicus_subsp._insignis"                   
#>  [614] "Galago garnettii"                                          
#>  [615] "Galago garnetti"                                           
#>  [616] "Otolemur garnettii"                                        
#>  [617] "Arvicanthis niloticus"                                     
#>  [618] "Didelphis ursina"                                          
#>  [619] "Vombatus ursinus"                                          
#>  [620] "Phaseolus angularis"                                       
#>  [621] "Vigna angularis"                                           
#>  [622] "Haitia acuta"                                              
#>  [623] "Physa acuta"                                               
#>  [624] "Physa heterostropha"                                       
#>  [625] "Physa integra"                                             
#>  [626] "Physella acuta"                                            
#>  [627] "Physella heterostropha"                                    
#>  [628] "Physella integra"                                          
#>  [629] "Ctenopharyngodon idella"                                   
#>  [630] "Ctenopharyngodon idellus"                                  
#>  [631] "Leuciscus idella"                                          
#>  [632] "Thalassophryne amazonica"                                  
#>  [633] "Cyprinus rohita"                                           
#>  [634] "Labeo rohita"                                              
#>  [635] "Talpa occidentalis"                                        
#>  [636] "Bombina bombina"                                           
#>  [637] "Rana bombina"                                              
#>  [638] "Cavia aperea_porcellus"                                    
#>  [639] "Cavia cobaya"                                              
#>  [640] "Cavia porcellus"                                           
#>  [641] "Mus porcellus"                                             
#>  [642] "Odocoileus virginianus"                                    
#>  [643] "Amphibalanus amphitrite"                                   
#>  [644] "Balanus amphitrite"                                        
#>  [645] "Panicum hallii"                                            
#>  [646] "Angill angill"                                             
#>  [647] "Anguilla anguilla_anguilla"                                
#>  [648] "Anguilla anguilla"                                         
#>  [649] "Muraena anguilla"                                          
#>  [650] "Orcinus orca"                                              
#>  [651] "Cannabis sativa"                                           
#>  [652] "Penaeus bubulus"                                           
#>  [653] "Penaeus carinatus"                                         
#>  [654] "Penaeus durbani"                                           
#>  [655] "Penaeus monodon"                                           
#>  [656] "Penaeus (Penaeus)_monodon"                                 
#>  [657] "Didelphis vulpecula"                                       
#>  [658] "Trichosurus vulpecula"                                     
#>  [659] "Myotis lucifugus"                                          
#>  [660] "Vespertilio lucifugus"                                     
#>  [661] "Brachypodium distachyon"                                   
#>  [662] "Aotus nancymaae"                                           
#>  [663] "Aotus nancymai"                                            
#>  [664] "Astatotilapia calliptera"                                  
#>  [665] "Chromis callipterus"                                       
#>  [666] "Ctenochromis callipterus"                                  
#>  [667] "Haplochromis calliptera"                                   
#>  [668] "Haplochromis callipterus"                                  
#>  [669] "Rhamnus zizyphus"                                          
#>  [670] "Ziziphus jujuba"                                           
#>  [671] "Ailuropoda melanoleuca"                                    
#>  [672] "Micropterus dolomieu"                                      
#>  [673] "Micropterus velox"                                         
#>  [674] "Lycopersicon esculentum"                                   
#>  [675] "Lycopersicon esculentum_var._esculentum"                   
#>  [676] "Solanum esculentum"                                        
#>  [677] "Solanum lycopersicum"                                      
#>  [678] "Solanum lycopersicum_var._humboldtii"                      
#>  [679] "Poecilia mexicana"                                         
#>  [680] "Manis pentadactyla"                                        
#>  [681] "Meles meles"                                               
#>  [682] "Ursus meles"                                               
#>  [683] "Ornithorhynchus anatinus"                                  
#>  [684] "Platypus anatinus"                                         
#>  [685] "Felis uncia"                                               
#>  [686] "Panthera uncia"                                            
#>  [687] "Uncia uncia"                                               
#>  [688] "Alligator mississippiensis"                                
#>  [689] "Crocodilus mississipiensis"                                
#>  [690] "Myrmecophaga aculeata"                                     
#>  [691] "Tachyglossus aculeatus"                                    
#>  [692] "Colossoma macropomum"                                      
#>  [693] "Myletes macropomus"                                        
#>  [694] "Cordylus capensis"                                         
#>  [695] "Cordylus (Hemicordylus)_capensis"                          
#>  [696] "Hemicordylus capensis"                                     
#>  [697] "Pseudocordylus capensis"                                   
#>  [698] "Zonurus capensis"                                          
#>  [699] "Eptesicus fuscus"                                          
#>  [700] "Vespertilio fuscus"                                        
#>  [701] "Dromiciops australis"                                      
#>  [702] "Dromiciops gliroides"                                      
#>  [703] "Camelus pacos"                                             
#>  [704] "Lama guanicoe_pacos"                                       
#>  [705] "Lama pacos"                                                
#>  [706] "Vicugna pacos"                                             
#>  [707] "Mollienesia latipinna"                                     
#>  [708] "Poecilia latipinna"                                        
#>  [709] "Elephas maximus_indicus"                                   
#>  [710] "Corylus avellana"                                          
#>  [711] "Ostrea maxima"                                             
#>  [712] "Pecten maximus"                                            
#>  [713] "Felis viverrina"                                           
#>  [714] "Prionailurus viverrinus"                                   
#>  [715] "Gymnodraco acuticeps"                                      
#>  [716] "Thalarctos maritimus"                                      
#>  [717] "Ursus maritimus"                                           
#>  [718] "Lemur catta"                                               
#>  [719] "Myotis myotis"                                             
#>  [720] "Vespertilio myotis"                                        
#>  [721] "Lytechinus pictus"                                         
#>  [722] "Litopenaeus vannamei"                                      
#>  [723] "Penaeus (Litopenaeus)_vannamei"                            
#>  [724] "Penaeus vannamei"                                          
#>  [725] "Ursus arctos"                                              
#>  [726] "Vitis riparia"                                             
#>  [727] "Felis bengalensis"                                         
#>  [728] "Prionailurus bengalensis"                                  
#>  [729] "Clethrionomys glareolus"                                   
#>  [730] "Mus glareolus"                                             
#>  [731] "Myodes glareolus"                                          
#>  [732] "Mustela nigripes"                                          
#>  [733] "Putorius nigripes"                                         
#>  [734] "Pygocentrus nattereri"                                     
#>  [735] "Serrasalmus nattereri"                                     
#>  [736] "Alopex lagopus"                                            
#>  [737] "Canis lagopus"                                             
#>  [738] "Vulpes lagopus"                                            
#>  [739] "Cercocebus atys"                                           
#>  [740] "Cercocebus torquatus_atys"                                 
#>  [741] "Simia atys"                                                
#>  [742] "Lepidosiren annectens"                                     
#>  [743] "Protopterus annectens"                                     
#>  [744] "Rhinocryptis annectens"                                    
#>  [745] "Cerasus avium"                                             
#>  [746] "Prunus avium"                                              
#>  [747] "Prunus cerasus_var._avium"                                 
#>  [748] "Procambarus clarkii"                                       
#>  [749] "Sorex fumeus"                                              
#>  [750] "Macrorhinus angustirostris"                                
#>  [751] "Mirounga angustirostris"                                   
#>  [752] "Beta vulgaris_subsp._vulgaris"                             
#>  [753] "Beta vulgaris_subsp._vulgaris_var._altissima"              
#>  [754] "Beta vulgaris_Sugar_Beet_Group"                            
#>  [755] "Beta vulgaris_var._altissima"                              
#>  [756] "Eumetopias jubatus"                                        
#>  [757] "Phoca jubata"                                              
#>  [758] "Centruroides sculpturatus"                                 
#>  [759] "Diceros bicornis_minor"                                    
#>  [760] "Cicer arietinum"                                           
#>  [761] "Cleome hassleriana_Chodat,_1898"                           
#>  [762] "Tarenaya hassleriana"                                      
#>  [763] "Sebastes umbrosus"                                         
#>  [764] "Sebastichthys umbrosus"                                    
#>  [765] "Eriocheir chinensis"                                       
#>  [766] "Eriocheir japonica_sinensis"                               
#>  [767] "Eriocheir sinensis"                                        
#>  [768] "Dicentrarchus labrax"                                      
#>  [769] "Labrax labrax"                                             
#>  [770] "Morone labrax"                                             
#>  [771] "Perca labrax"                                              
#>  [772] "Roccus labrax"                                             
#>  [773] "Sciaena labrax"                                            
#>  [774] "Acanthopagrus latus"                                       
#>  [775] "Sparus latus"                                              
#>  [776] "Xiphophorus hellerii"                                      
#>  [777] "Xiphophorus helleri"                                       
#>  [778] "Acanthochromis polyacanthus"                               
#>  [779] "Acanthochromis polyacathus"                                
#>  [780] "Dascyllus polyacanthus"                                    
#>  [781] "Mustela vison"                                             
#>  [782] "Neogale vison"                                             
#>  [783] "Neovison vison"                                            
#>  [784] "Lingula anatina"                                           
#>  [785] "Lingula lingua"                                            
#>  [786] "Lingula nipponica"                                         
#>  [787] "Lingula unguis"                                            
#>  [788] "Madrepora faveolata"                                       
#>  [789] "Montastraea faveolata"                                     
#>  [790] "Montastrea faveolata"                                      
#>  [791] "Orbicella faveolata"                                       
#>  [792] "Chinchilla lanigera"                                       
#>  [793] "Chinchilla velligera"                                      
#>  [794] "Chinchilla villidera"                                      
#>  [795] "Mirounga leonina"                                          
#>  [796] "Phoca leonina"                                             
#>  [797] "Perognathus longimembris_pacificus"                        
#>  [798] "Cynocephalus variegatus"                                   
#>  [799] "Galeopithecus variegatus"                                  
#>  [800] "Galeopterus variegatus"                                    
#>  [801] "Vigna radiata"                                             
#>  [802] "Vitis vinifera"                                            
#>  [803] "Vitis vinifera_subsp._vinifera"                            
#>  [804] "Characodon multiradiatus"                                  
#>  [805] "Girardinichthys multiradiatus"                             
#>  [806] "Marmota flaviventris"                                      
#>  [807] "Phaseolus calcaratus"                                      
#>  [808] "Phaseolus chrysanthos"                                     
#>  [809] "Phaseolus chrysanthus"                                     
#>  [810] "Vigna calcarata"                                           
#>  [811] "Vigna umbellata"                                           
#>  [812] "Balaenoptera acutorostrata"                                
#>  [813] "Canis procyonoides"                                        
#>  [814] "Nyctereutes procyonoides"                                  
#>  [815] "Amphioxus floridae"                                        
#>  [816] "Branchiostoma floridae"                                    
#>  [817] "Moschus berezovskii"                                       
#>  [818] "Erythranthe guttata"                                       
#>  [819] "Mimulus guttatus_subsp._guttatus"                          
#>  [820] "Mimulus guttatus"                                          
#>  [821] "Camelus bactrianus"                                        
#>  [822] "Desmodus rotundus"                                         
#>  [823] "Phyllostoma rotundum"                                      
#>  [824] "Octopus sinensis"                                          
#>  [825] "Physeter catodon"                                          
#>  [826] "Physeter macrocephalus"                                    
#>  [827] "Alexandromys fortis"                                       
#>  [828] "Microtus fortis"                                           
#>  [829] "Apogon orbicularis"                                        
#>  [830] "Sphaeramia orbicularis"                                    
#>  [831] "Dendronephthya gigantea"                                   
#>  [832] "Canis hyaena"                                              
#>  [833] "Hyaena hyaena"                                             
#>  [834] "Helicophagus hypophthalmus"                                
#>  [835] "Pangasianodon hypophthalmus"                               
#>  [836] "Pangasius hypophthalmus"                                   
#>  [837] "Pangasius sutchi"                                          
#>  [838] "Castor canadensis"                                         
#>  [839] "Coelomys parahi"                                           
#>  [840] "Mus pahari"                                                
#>  [841] "Pseudochaenichthys georgianus"                             
#>  [842] "Capsella rubella"                                          
#>  [843] "Perkinsus marinus_ATCC_50983"                              
#>  [844] "Holocentrus leopardus"                                     
#>  [845] "Plectropomus leopardus"                                    
#>  [846] "Hippocampus zosterae"                                      
#>  [847] "Seriola dorsalis"                                          
#>  [848] "Seriola lalandi_dorsalis"                                  
#>  [849] "Felis canadensis"                                          
#>  [850] "Lynx canadensis"                                           
#>  [851] "Artibeus jamaicensis"                                      
#>  [852] "Citrus sinensis"                                           
#>  [853] "Citrus x_sinensis"                                         
#>  [854] "Punica granatum"                                           
#>  [855] "Abrus cyaneus"                                             
#>  [856] "Abrus precatorius"                                         
#>  [857] "Polypterus senegalus"                                      
#>  [858] "Acomys russatus"                                           
#>  [859] "Hemibagrus wyckioides"                                     
#>  [860] "Macrones wyckioides"                                       
#>  [861] "Mystus wyckioides"                                         
#>  [862] "Melanotaenia boesemani"                                    
#>  [863] "Sturnira hondurensis"                                      
#>  [864] "Amphilophus centrarchus"                                   
#>  [865] "Archocentrus centrarchus"                                  
#>  [866] "Cichlasoma centrarchus"                                    
#>  [867] "Heros centrarchus"                                         
#>  [868] "Delphinus melas"                                           
#>  [869] "Globicephala melaena"                                      
#>  [870] "Globicephala melas"                                        
#>  [871] "Manis javanica"                                            
#>  [872] "Phyllostomus hastatus"                                     
#>  [873] "Vespertilio hastatus"                                      
#>  [874] "Scyliorhinus canicula"                                     
#>  [875] "Silurana tropicalis"                                       
#>  [876] "Xenopus laevis_tropicalis"                                 
#>  [877] "Xenopus (Silurana)_tropicalis"                             
#>  [878] "Xenopus tropicalis"                                        
#>  [879] "Pipistrellus kuhlii"                                       
#>  [880] "Pipistrellus kuhli"                                        
#>  [881] "Vespertilio kuhlii"                                        
#>  [882] "Solea senegalensis"                                        
#>  [883] "Blennius fasciatus"                                        
#>  [884] "Salarias fasciatus"                                        
#>  [885] "Mugil cephalotus"                                          
#>  [886] "Mugil cephalus"                                            
#>  [887] "Mugil galapagensis"                                        
#>  [888] "Mugil japonicus"                                           
#>  [889] "Siphostoma scovelli"                                       
#>  [890] "Syngnathus scovelli"                                       
#>  [891] "Canis vulpes"                                              
#>  [892] "Vulpes vulpes"                                             
#>  [893] "Capra aegagrus_hircus"                                     
#>  [894] "Capra hircus"                                              
#>  [895] "Poeciliopsis prolifica"                                    
#>  [896] "Gopherus flavomarginatus"                                  
#>  [897] "Lontra canadensis"                                         
#>  [898] "Lutra canadensis"                                          
#>  [899] "Hesperomys torridus"                                       
#>  [900] "Onychomys torridus"                                        
#>  [901] "Elephas africanus"                                         
#>  [902] "Loxodonta africana_africana"                               
#>  [903] "Loxodonta africana"                                        
#>  [904] "Limia couchiana"                                           
#>  [905] "Xiphophorus couchianus"                                    
#>  [906] "Boophilus microplus"                                       
#>  [907] "Rhipicephalus (Boophilus)_microplus"                       
#>  [908] "Rhipicephalus microplus"                                   
#>  [909] "Betta splendens"                                           
#>  [910] "Molossus molossus"                                         
#>  [911] "Vespertilio molossus"                                      
#>  [912] "Lagenorhynchus obliquidens"                                
#>  [913] "Delphinus truncatus"                                       
#>  [914] "Tursiops truncatus"                                        
#>  [915] "Morone flavescens"                                         
#>  [916] "Perca flavescens"                                          
#>  [917] "Euarctos americanus"                                       
#>  [918] "Ursus americanus"                                          
#>  [919] "Arvicola nivalis"                                          
#>  [920] "Chionomys nivalis"                                         
#>  [921] "Microtus nivalis"                                          
#>  [922] "Felis rufus"                                               
#>  [923] "Lynx rufus"                                                
#>  [924] "Myotis brandtii"                                           
#>  [925] "Vespertilio brandtii"                                      
#>  [926] "Astatotilapia burtoni"                                     
#>  [927] "Chromis burtoni"                                           
#>  [928] "Haplochromis burtoni"                                      
#>  [929] "Sorex araneus"                                             
#>  [930] "Aplocheilus melastigmus"                                   
#>  [931] "Oryzias melastigma"                                        
#>  [932] "Silurus meridionalis"                                      
#>  [933] "Silurus soldatovi_meridionalis"                            
#>  [934] "Cucumis melo"                                              
#>  [935] "Hydra attenuata"                                           
#>  [936] "Hydra carnea"                                              
#>  [937] "Hydra littoralis"                                          
#>  [938] "Hydra magnipapillata"                                      
#>  [939] "Hydra vulgaris"                                            
#>  [940] "Anoplopoma fimbria"                                        
#>  [941] "Gadus fimbria"                                             
#>  [942] "Alosa alosa"                                               
#>  [943] "Clupea alosa"                                              
#>  [944] "Chelonia mydas"                                            
#>  [945] "Testudo mydas"                                             
#>  [946] "Ctenocephalides felis"                                     
#>  [947] "Brienomyrus kingsleyae"                                    
#>  [948] "Brienomyrus sp._CAB"                                       
#>  [949] "Mormyrus kingsleyae"                                       
#>  [950] "Paramormyrops kingsleyae"                                  
#>  [951] "Pollimyrus kingsleyae"                                     
#>  [952] "Stylophora pistillata"                                     
#>  [953] "Cyrtodiopsis dalmanii"                                     
#>  [954] "Diopsis dalmanni"                                          
#>  [955] "Teleopsis dalmanni"                                        
#>  [956] "Rhagoletis zephyria"                                       
#>  [957] "Rhodamnia argentea"                                        
#>  [958] "Gasterosteus aculeatus"                                    
#>  [959] "Labrus celidotus"                                          
#>  [960] "Notolabrus celidotus"                                      
#>  [961] "Budorcas taxicolor"                                        
#>  [962] "Nelumbo nucifera"                                          
#>  [963] "Amphiprion ocellaris"                                      
#>  [964] "Arvicola amphibius"                                        
#>  [965] "Arvicola terrestris_(Linnaeus,_1758)"                      
#>  [966] "Mus amphibius"                                             
#>  [967] "Daphnia magna"                                             
#>  [968] "Phaseolus vulgaris"                                        
#>  [969] "Psammomys obesus"                                          
#>  [970] "Carlito syrichta"                                          
#>  [971] "Simia syrichta"                                            
#>  [972] "Tarsius syrichta"                                          
#>  [973] "Cyprinodon tularosa"                                       
#>  [974] "Gouania willdenowi"                                        
#>  [975] "Lepadogaster willdenowi"                                   
#>  [976] "Ochotona princeps"                                         
#>  [977] "Phytophthora sojae"                                        
#>  [978] "Equus caballus_przewalskii"                                
#>  [979] "Equus ferus_przewalskii"                                   
#>  [980] "Equus przewalskii"                                         
#>  [981] "Phoca vitulina"                                            
#>  [982] "Coecilia bivitatum"                                        
#>  [983] "Rhinatrema bivitattum"                                     
#>  [984] "Rhinatrema bivittatum"                                     
#>  [985] "Gambusia affinis"                                          
#>  [986] "Heterandria affinis"                                       
#>  [987] "Lagomys curzoniae"                                         
#>  [988] "Ochotona curzonae"                                         
#>  [989] "Ochotona curzoniae"                                        
#>  [990] "Kogia breviceps"                                           
#>  [991] "Physeter breviceps"                                        
#>  [992] "Ambassis ranga"                                            
#>  [993] "Chanda ranga"                                              
#>  [994] "Parambassis ranga"                                         
#>  [995] "Pseudambassis ranga"                                       
#>  [996] "Clupea cyprinoides"                                        
#>  [997] "Megalops cyprinoides"                                      
#>  [998] "Diospyros lotus"                                           
#>  [999] "Hippoglossus stenolepis"                                   
#> [1000] "Phacochoerus africanus"                                    
#> [1001] "Corythoichthys intestinalis"                               
#> [1002] "Syngnatus intestinalis"                                    
#> [1003] "Mandrillus leucophaeus"                                    
#> [1004] "Papio leucophaeus"                                         
#> [1005] "Simia leucophaea"                                          
#> [1006] "Epinephelus fuscoguttatus"                                 
#> [1007] "Perca summana_fuscoguttata"                                
#> [1008] "Asterina miniata"                                          
#> [1009] "Patiria miniata"                                           
#> [1010] "Rhinolophus rouxii_sinicus"                                
#> [1011] "Rhinolophus sinicus"                                       
#> [1012] "Lampris incognitus"                                        
#> [1013] "Monachus schauinslandi"                                    
#> [1014] "Neomonachus schauinslandi"                                 
#> [1015] "Hippoglossus hippoglossus"                                 
#> [1016] "Pleuronectes hippoglossus"                                 
#> [1017] "Andrographis paniculata"                                   
#> [1018] "Etheostoma cragini"                                        
#> [1019] "Perca chuatsi"                                             
#> [1020] "Siniperca chuatsi"                                         
#> [1021] "Meriones unguiculatus"                                     
#> [1022] "Colobus angolensis_palliatus"                              
#> [1023] "Notothenia coriiceps"                                      
#> [1024] "Hypomesus transpacificus"                                  
#> [1025] "Dermochelys coriacea"                                      
#> [1026] "Testudo coriacea"                                          
#> [1027] "Bufo bufo_gargarizans"                                     
#> [1028] "Bufo gargarizans"                                          
#> [1029] "Bufo japonicus_gargarizans"                                
#> [1030] "Delphinapterus leucas"                                     
#> [1031] "Delphinus leucas"                                          
#> [1032] "Fugu flavidus"                                             
#> [1033] "Takifugu flavidus"                                         
#> [1034] "Pteronotus mesoamericanus"                                 
#> [1035] "Pteronotus parnellii_mesoamericanus"                       
#> [1036] "Citrus clementina"                                         
#> [1037] "Citrus deliciosa_x_Citrus_sinensis"                        
#> [1038] "Fugu rubripes"                                             
#> [1039] "Sphaeroides rubripes"                                      
#> [1040] "Takifugu rubripes"                                         
#> [1041] "Tetraodon rubripes"                                        
#> [1042] "Homarus americanus"                                        
#> [1043] "Osteoglossum formosum"                                     
#> [1044] "Scleropages formosus"                                      
#> [1045] "Larimichthys crocea"                                       
#> [1046] "Pseudosciaena amblyceps"                                   
#> [1047] "Pseudosciaena crocea"                                      
#> [1048] "Sciaena crocea"                                            
#> [1049] "Fragaria vesca"                                            
#> [1050] "Folsomia candida"                                          
#> [1051] "Limulus polyphemus"                                        
#> [1052] "Doryrhamphus dactyliophorus"                               
#> [1053] "Dunckerocampus dactyliophorus"                             
#> [1054] "Syngnathus dactyliophorus"                                 
#> [1055] "Epinephelus lanceolatus"                                   
#> [1056] "Holocentrus lanceolatus"                                   
#> [1057] "Promicrops lanceolatus"                                    
#> [1058] "Mizuhopecten yessoensis"                                   
#> [1059] "Patinopecten yessoensis"                                   
#> [1060] "Patiopecten yessoensis"                                    
#> [1061] "Pecten yessoensis"                                         
#> [1062] "Calamoichthys calabaricus"                                 
#> [1063] "Erpetoichthys calabaricus"                                 
#> [1064] "Platypoecilus maculatus"                                   
#> [1065] "Xiphophorus maculatus"                                     
#> [1066] "Echeneis naucrates"                                        
#> [1067] "Triplophysa rosa"                                          
#> [1068] "Antechinus flavipes"                                       
#> [1069] "Phascogale flavipes"                                       
#> [1070] "Balaena musculus"                                          
#> [1071] "Balaenoptera musculus"                                     
#> [1072] "Rhinolophus ferrumequinum"                                 
#> [1073] "Vespertilio ferrumequinum"                                 
#> [1074] "Oryza brachyantha"                                         
#> [1075] "Chrysemys picta"                                           
#> [1076] "Testudo picta"                                             
#> [1077] "Trachemys picta"                                           
#> [1078] "Tetrahymena thermophila_SB210"                             
#> [1079] "Myripristis murdjan"                                       
#> [1080] "Ostichthys murdjan"                                        
#> [1081] "Perca murdjan"                                             
#> [1082] "Sciaena murdjan"                                           
#> [1083] "Amphiprion testudineus"                                    
#> [1084] "Anabas testudineus"                                        
#> [1085] "Anthias testudineus"                                       
#> [1086] "Antias testudineus"                                        
#> [1087] "Amygdalus communis"                                        
#> [1088] "Prunus amygdalus"                                          
#> [1089] "Prunus communis"                                           
#> [1090] "Prunus dulcis"                                             
#> [1091] "Prunus dulcis_var._sativa"                                 
#> [1092] "Oryzias latipes"                                           
#> [1093] "Poecilia latipes"                                          
#> [1094] "Sarcophilus harrisii"                                      
#> [1095] "Sarcophilus laniarius_(Owen,_1838)"                        
#> [1096] "Sarcophilus laniarius"                                     
#> [1097] "Ursinus harrisii"                                          
#> [1098] "Ictalurus furcatus"                                        
#> [1099] "Pimelodus furcatus"                                        
#> [1100] "Branchiostoma belcheri"                                    
#> [1101] "Gigantopelta aegis"                                        
#> [1102] "Lytechinus variegatus"                                     
#> [1103] "Diaphorina citri"                                          
#> [1104] "Epinephelus moara"                                         
#> [1105] "Serranus moara"                                            
#> [1106] "Stegodyphus dumicola"                                      
#> [1107] "Boleophthalmus pectinirostris"                             
#> [1108] "Gobius pectinirostris"                                     
#> [1109] "Lacerrta muralis"                                          
#> [1110] "Podarcis muralis"                                          
#> [1111] "Seps muralis"                                              
#> [1112] "Austrofundulus limnaeus"                                   
#> [1113] "Columba livia_domestica"                                   
#> [1114] "Columba livia"                                             
#> [1115] "Citellus parryii"                                          
#> [1116] "Spermophilus parryii"                                      
#> [1117] "Spermophilus parryi"                                       
#> [1118] "Urocitellus parryii"                                       
#> [1119] "Latimeria chalumnae"                                       
#> [1120] "Pleuronectes maximus"                                      
#> [1121] "Psetta maxima"                                             
#> [1122] "Rhombus maximus"                                           
#> [1123] "Scophthalmus maximus"                                      
#> [1124] "Sesamum indicum"                                           
#> [1125] "Sesamum orientale"                                         
#> [1126] "Cyclopterus lumpus"                                        
#> [1127] "Armeniaca mume"                                            
#> [1128] "Prunus mume"                                               
#> [1129] "Myotis davidii"                                            
#> [1130] "Vespertilio Davidii"                                       
#> [1131] "Didelphys agilis"                                          
#> [1132] "Gracilinanus agilis"                                       
#> [1133] "Phocoena sinus"                                            
#> [1134] "Acanthophacelus reticulata"                                
#> [1135] "Poecilia (Acanthophacelus)_reticulata"                     
#> [1136] "Poecilia latipinna_reticulata"                             
#> [1137] "Poecilia reticulata"                                       
#> [1138] "Gopherus evgoodei"                                         
#> [1139] "Australorbis glabratus"                                    
#> [1140] "Biomphalaria glabrata"                                     
#> [1141] "Planorbis glabratus"                                       
#> [1142] "Hypudaeus ochrogaster"                                     
#> [1143] "Microtus ochrogaster"                                      
#> [1144] "Amygdalus persica"                                         
#> [1145] "Persica vulgaris"                                          
#> [1146] "Prunus persica"                                            
#> [1147] "Prunus persica_var._densa"                                 
#> [1148] "Chiloscyllium plagiosum"                                   
#> [1149] "Scyllium plagiosum"                                        
#> [1150] "Cheilinus undulatus"                                       
#> [1151] "Phodopus roborovskii"                                      
#> [1152] "Caenorhabditis remanei"                                    
#> [1153] "Caenorhabditis vulgaris"                                   
#> [1154] "Lamprologus brichardi"                                     
#> [1155] "Neolamprologus brichardi"                                  
#> [1156] "Gymnopis unicolor"                                         
#> [1157] "Microcaecilia unicolor"                                    
#> [1158] "Rhinatrema unicolor"                                       
#> [1159] "Rhizophagus irregularis_DAOM_181602=DAOM_197198"           
#> [1160] "Sciaena jaculatrix"                                        
#> [1161] "Toxotes jaculatrix"                                        
#> [1162] "Bos indicus"                                               
#> [1163] "Bos primigenius_indicus"                                   
#> [1164] "Bos taurus_indicus"                                        
#> [1165] "Lacerta sicula_raffonei"                                   
#> [1166] "Podarcis raffoneae"                                        
#> [1167] "Podarcis raffonei"                                         
#> [1168] "Podarcis wagleriana_raffonei"                              
#> [1169] "Benincasa cerifera"                                        
#> [1170] "Benincasa hispida"                                         
#> [1171] "Benincasa pruriens"                                        
#> [1172] "Cucurbita hispida"                                         
#> [1173] "Lagenaria siceraria_var._hispida"                          
#> [1174] "Dendrobium catenatum"                                      
#> [1175] "Marsupenaeus japonicus"                                    
#> [1176] "Penaeus japonicus"                                         
#> [1177] "Penaeus (Marsupenaeus)_japonicus"                          
#> [1178] "Penaeus (Melicertus)_japonicus"                            
#> [1179] "Chaetodon argus"                                           
#> [1180] "Scatophagus argus"                                         
#> [1181] "Chanos chanos"                                             
#> [1182] "Mugil chanos"                                              
#> [1183] "Bison bison_bison"                                         
#> [1184] "Bos bison_bison"                                           
#> [1185] "Amblyraja radiata"                                         
#> [1186] "Raja radiata"                                              
#> [1187] "Amphimedon queenslandica"                                  
#> [1188] "Electrophorus electricus"                                  
#> [1189] "Gymnotus electricus"                                       
#> [1190] "Hippocampus comes"                                         
#> [1191] "Hipposideros armiger"                                      
#> [1192] "Rhinolophus armiger"                                       
#> [1193] "Monodon monoceros"                                         
#> [1194] "Cynoglossus (Arelia)_semilaevis"                           
#> [1195] "Cynoglossus semilaevis"                                    
#> [1196] "Anneissia japonica"                                        
#> [1197] "Oxycomanthus japonicus"                                    
#> [1198] "Ananas comosus"                                            
#> [1199] "Ananas comosus_var._comosus"                               
#> [1200] "Ananas lucidus"                                            
#> [1201] "Bromelia comosa"                                           
#> [1202] "Callionymus splendidus"                                    
#> [1203] "Pterosynchiropus splendidus"                               
#> [1204] "Synchiropus splendidus"                                    
#> [1205] "Neophocaena asiaeorientalis_asiaeorientalis"               
#> [1206] "Coluber guttatus"                                          
#> [1207] "Elaphe guttata"                                            
#> [1208] "Pantherophis guttatus"                                     
#> [1209] "Pollicipes cornucopia"                                     
#> [1210] "Pollicipes pollicipes"                                     
#> [1211] "Pseudoliparis swirei"                                      
#> [1212] "Chelonoidis abingdonii"                                    
#> [1213] "Chelonoidis abingdoni"                                     
#> [1214] "Chelonoidis nigra_abingdonii"                              
#> [1215] "Geochelone nigra_abigdonii"                                
#> [1216] "Geochelone nigra_abingdoni"                                
#> [1217] "Geochelone nigra_ephippium"                                
#> [1218] "Testudo abingdonii"                                        
#> [1219] "Rhincodon typus"                                           
#> [1220] "Aphritis gobio"                                            
#> [1221] "Cottoperca gobio"                                          
#> [1222] "Ricinus communis"                                          
#> [1223] "Ricinus sanguineus"                                        
#> [1224] "Malania oleifera"                                          
#> [1225] "Ceratotherium simum_simum"                                 
#> [1226] "Kryptolebias marmoratus"                                   
#> [1227] "Rivulus marmoratus"                                        
#> [1228] "Patella vulgata"                                           
#> [1229] "Rhagoletis pomonella"                                      
#> [1230] "Trypanosoma cruzi"                                         
#> [1231] "Cistudo triunguis"                                         
#> [1232] "Terrapene carolina_triunguis"                              
#> [1233] "Terrapene mexicana_triunguis"                              
#> [1234] "Terrapene triunguis"                                       
#> [1235] "Odobenus rosmarus_divergens"                               
#> [1236] "Trichechus manatus_latirostris"                            
#> [1237] "Carcharodon carcharias"                                    
#> [1238] "Squalus carcharias"                                        
#> [1239] "Macrognathus armatus"                                      
#> [1240] "Mastacembelus armatus"                                     
#> [1241] "Anas boschas"                                              
#> [1242] "Anas domesticus"                                           
#> [1243] "Anas platyrhynchos_f._domestica"                           
#> [1244] "Anas platyrhynchos"                                        
#> [1245] "Theobroma cacao"                                           
#> [1246] "Diabrotica virgifera_virgifera"                            
#> [1247] "Actinia diaphana"                                          
#> [1248] "Aiptasia pallida"                                          
#> [1249] "Aiptasia pulchella"                                        
#> [1250] "Dysactis pallida"                                          
#> [1251] "Exaiptasia diaphana"                                       
#> [1252] "Exaiptasia pallida"                                        
#> [1253] "Syngnathus acus_rubescens"                                 
#> [1254] "Syngnathus acus"                                           
#> [1255] "Syngnathus rubescens"                                      
#> [1256] "Caretta caretta"                                           
#> [1257] "Testudo caretta"                                           
#> [1258] "Guillardia theta_CCMP2712"                                 
#> [1259] "Anarrhichthys ocellatus"                                   
#> [1260] "Pelodiscus sinensis"                                       
#> [1261] "Trionyx sinensis"                                          
#> [1262] "Hippoglossus olivaceus"                                    
#> [1263] "Paralichthys olivaceus"                                    
#> [1264] "Xiphias gladius"                                           
#> [1265] "Cyprinodon variegatus"                                     
#> [1266] "Bos grunniens_mutus"                                       
#> [1267] "Bos mutus"                                                 
#> [1268] "Poephagus mutus"                                           
#> [1269] "Alligator sinensis"                                        
#> [1270] "Morus notabilis"                                           
#> [1271] "Nymphaea colorata"                                         
#> [1272] "Photinus pyralis"                                          
#> [1273] "Periophthalmus magnuspinnatus"                             
#> [1274] "Meleagris gallopavo"                                       
#> [1275] "Pomacea canaliculata"                                      
#> [1276] "Haplochromis nyererei"                                     
#> [1277] "Pundamilia nyererei"                                       
#> [1278] "Cyanistes caeruleus"                                       
#> [1279] "Parus caeruleus"                                           
#> [1280] "Caranx dumerili"                                           
#> [1281] "Seriola dumerili"                                          
#> [1282] "Macrosteles (Macrosteles)_quadrilineatus"                  
#> [1283] "Macrosteles quadrilineatus"                                
#> [1284] "Enhydra lutris_kenyoni"                                    
#> [1285] "Fluta alba"                                                
#> [1286] "Monopterus albus"                                          
#> [1287] "Muraena alba"                                              
#> [1288] "Caecilia seraphini"                                        
#> [1289] "Caecilia Seraphini"                                        
#> [1290] "Geotrypetes seraphini"                                     
#> [1291] "Hypogeophis seraphini"                                     
#> [1292] "Chaetodon rostratus"                                       
#> [1293] "Chelmon rostratus"                                         
#> [1294] "Cucumis sativus"                                           
#> [1295] "Cyrtodactylus macularius"                                  
#> [1296] "Eublepharis macularius"                                    
#> [1297] "Felis concolor"                                            
#> [1298] "Panthera concolor"                                         
#> [1299] "Puma concolor"                                             
#> [1300] "Fenneropenaeus chinensis"                                  
#> [1301] "Penaeus chinensis"                                         
#> [1302] "Pomacentrus partitus"                                      
#> [1303] "Stegastes partitus"                                        
#> [1304] "Phascum patens"                                            
#> [1305] "Physcomitrella patens_subsp._patens"                       
#> [1306] "Physcomitrella patens"                                     
#> [1307] "Physcomitrium patens"                                      
#> [1308] "Anas jamaicensis"                                          
#> [1309] "Oxyura jamaicensis"                                        
#> [1310] "Drosophila miranda"                                        
#> [1311] "Lottia gigantea"                                           
#> [1312] "Eurytemora affinis"                                        
#> [1313] "Crotalus tigris"                                           
#> [1314] "Argentina anserina_subsp._anserina"                        
#> [1315] "Argentina anserina"                                        
#> [1316] "Potentilla anserina"                                       
#> [1317] "Achaearanea tepidariorum"                                  
#> [1318] "Parasteatoda tepidariorum"                                 
#> [1319] "Theridion tepidariorum"                                    
#> [1320] "Uranotaenia lowii"                                         
#> [1321] "Cynolebias whitei"                                         
#> [1322] "Nematolebias whitei"                                       
#> [1323] "Simpsonichthys whitei"                                     
#> [1324] "Sceloporus undulatus"                                      
#> [1325] "Stellio undulatus"                                         
#> [1326] "Helobdella robusta"                                        
#> [1327] "Styela clava"                                              
#> [1328] "Orycteropus afer_afer"                                     
#> [1329] "Dipodomys ordii"                                           
#> [1330] "Leucoraja erinacea"                                        
#> [1331] "Raja erinacea"                                             
#> [1332] "Raja erinaceus"                                            
#> [1333] "Raja erinacia"                                             
#> [1334] "Phytophthora parasitica_INRA-310"                          
#> [1335] "Anas olor"                                                 
#> [1336] "Cygnus olor"                                               
#> [1337] "Lacerta agilis"                                            
#> [1338] "Naja scutata"                                              
#> [1339] "Notechis scutatus"                                         
#> [1340] "Millepora damicornis"                                      
#> [1341] "Pocillopora caespitosa_laysanensis"                        
#> [1342] "Pocillopora damicornis_laysanensis"                        
#> [1343] "Pocillopora damicornis"                                    
#> [1344] "Morone saxatilis"                                          
#> [1345] "Perca saxatilis"                                           
#> [1346] "Miniopterus natalensis"                                    
#> [1347] "Miniopterus schreibersii_natalensis"                       
#> [1348] "Vespertilio natalensis"                                    
#> [1349] "Anas cygnoid"                                              
#> [1350] "Anser cygnoides"                                           
#> [1351] "Actinia tenebrosa"                                         
#> [1352] "Neptunus trituberculatus"                                  
#> [1353] "Portunus (Portunus)_trituberculatus"                       
#> [1354] "Portunus trituberculatus"                                  
#> [1355] "Lacerta vivipara"                                          
#> [1356] "Zootoca vivipara"                                          
#> [1357] "Propithecus coquereli"                                     
#> [1358] "Propithecus verreauxi_coquereli"                           
#> [1359] "Erinaceus europaeus"                                       
#> [1360] "Jatropha curcas"                                           
#> [1361] "Caenorhabditis briggsae"                                   
#> [1362] "Cherax quadricarinatus"                                    
#> [1363] "Homalodisca coagulata"                                     
#> [1364] "Homalodisca vitripennis"                                   
#> [1365] "Tettigonia coagulata"                                      
#> [1366] "Tettigonia vitripennis"                                    
#> [1367] "Furina textilis"                                           
#> [1368] "Pseudonaja textilis"                                       
#> [1369] "Anolis carolinensis"                                       
#> [1370] "Python bivittatus"                                         
#> [1371] "Python molurus_bivittatus"                                 
#> [1372] "Chrysemys scripta_elegans"                                 
#> [1373] "Emys elegans"                                              
#> [1374] "Pseudemys scripta_elegans"                                 
#> [1375] "Trachemys scripta_elegans"                                 
#> [1376] "Protobothrops mucrosquamatus"                              
#> [1377] "Trigonocephalus mucrosquamatus"                            
#> [1378] "Trimeresurus mucrosquamatus"                               
#> [1379] "Daphnia pulex"                                             
#> [1380] "Paramacrobiotus metropolitanus"                            
#> [1381] "Lipotes vexillifer"                                        
#> [1382] "Petromyzon marinus"                                        
#> [1383] "Poephila guttata"                                          
#> [1384] "Taeniopygia guttata"                                       
#> [1385] "Taenopygia guttata"                                        
#> [1386] "Amphibolurus vitticeps"                                    
#> [1387] "Pogona vitticeps"                                          
#> [1388] "Aplysia californica"                                       
#> [1389] "Phalaenopsis equestris"                                    
#> [1390] "Saccoglossus kowalevskii"                                  
#> [1391] "Saccoglossus kowalevskyi"                                  
#> [1392] "Numida meleagris"                                          
#> [1393] "Phasianus meleagris"                                       
#> [1394] "Momordica charantia"                                       
#> [1395] "Callorhinchus milii"                                       
#> [1396] "Sphaerodactylus townsendi"                                 
#> [1397] "Eutainia elegans"                                          
#> [1398] "Thamnophis elegans"                                        
#> [1399] "Corvus hawaiiensis"                                        
#> [1400] "Manacus candei"                                            
#> [1401] "Pipra candei"                                              
#> [1402] "Euleptes europaea"                                         
#> [1403] "Euleptes europea"                                          
#> [1404] "Phyllodactylus europaea"                                   
#> [1405] "Phyllodactylus europaeus"                                  
#> [1406] "Ptyodactylus caudivolvolus"                                
#> [1407] "Lepisosteus oculatus"                                      
#> [1408] "Altirana parkeri"                                          
#> [1409] "Nanorana parkeri"                                          
#> [1410] "Aquila chrysaetos_chrysaetos"                              
#> [1411] "Ahaetulla prasina"                                         
#> [1412] "Fusarium oxysporum_f._sp._lycopersici_4287"                
#> [1413] "Heteropelma chrysocephalum"                                
#> [1414] "Neopelma chrysocephalum"                                   
#> [1415] "Musca domestica"                                           
#> [1416] "Pristis pectinata"                                         
#> [1417] "Ischnura elegans"                                          
#> [1418] "Vidua chalybeata"                                          
#> [1419] "Coturnix coturnix_japanica"                                
#> [1420] "Coturnix coturnix_japonica"                                
#> [1421] "Coturnix coturnix_Japonicus"                               
#> [1422] "Coturnix japonica_japonica"                                
#> [1423] "Coturnix japonica"                                         
#> [1424] "Gekko japonicus"                                           
#> [1425] "Platydactylus japonicus"                                   
#> [1426] "Nilaparvata lugens"                                        
#> [1427] "Ardea americana"                                           
#> [1428] "Grus americana"                                            
#> [1429] "Grus americanus"                                           
#> [1430] "Harpia harpyja"                                            
#> [1431] "Vultur harpyja"                                            
#> [1432] "Corvus moneduloides"                                       
#> [1433] "Pipra filicauda"                                           
#> [1434] "Herrania umbratica"                                        
#> [1435] "Ilyonectria robusta"                                       
#> [1436] "Tympanuchus pallidicinctus"                                
#> [1437] "Topomyia yanbarensis"                                      
#> [1438] "Parus atricapillus"                                        
#> [1439] "Poecile atricapilla"                                       
#> [1440] "Poecile atricapillus"                                      
#> [1441] "Corapipo altera"                                           
#> [1442] "Acyrthosiphon pisum"                                       
#> [1443] "Acyrthosiphum pisum"                                       
#> [1444] "Varanus komodoensis"                                       
#> [1445] "Saprolegnia parasitica_CBS_223.65"                         
#> [1446] "Vidua macroura"                                            
#> [1447] "Carica papaya"                                             
#> [1448] "Chiroxiphia lanceolata"                                    
#> [1449] "Pipra lanceolata"                                          
#> [1450] "Octopus bimaculoides"                                      
#> [1451] "Lagopus muta"                                              
#> [1452] "Tetrao mutus"                                              
#> [1453] "Bradysia coprophila"                                       
#> [1454] "Sciara coprophila"                                         
#> [1455] "Coluber sirtalis"                                          
#> [1456] "Thamnophis sirtalis"                                       
#> [1457] "Falco peregrinus"                                          
#> [1458] "Falco cherrug"                                             
#> [1459] "Asteracanthion distichum"                                  
#> [1460] "Asterias attenuata"                                        
#> [1461] "Asterias clathrata"                                        
#> [1462] "Asterias disticha"                                         
#> [1463] "Asterias gigantea"                                         
#> [1464] "Asterias pallida"                                          
#> [1465] "Asterias rubens"                                           
#> [1466] "Asterias stimpsoni"                                        
#> [1467] "Asterias vulgaris"                                         
#> [1468] "Manduca sexta"                                             
#> [1469] "Sphinx sexta"                                              
#> [1470] "Condylura cristata"                                        
#> [1471] "Sorex cristatus"                                           
#> [1472] "Cuculus canorus"                                           
#> [1473] "Pezoporus wallicus"                                        
#> [1474] "Aedes aegypti"                                             
#> [1475] "Aedes (Stegomyia)_aegypti"                                 
#> [1476] "Stegomyia aegypti"                                         
#> [1477] "Falco naumanni"                                            
#> [1478] "Corvus kubaryi"                                            
#> [1479] "Necator americanus"                                        
#> [1480] "Larus tridactylus"                                         
#> [1481] "Rissa tridactyla"                                          
#> [1482] "Aphanomyces astaci"                                        
#> [1483] "Culex (Culex)_pipiens_pallens"                             
#> [1484] "Culex pipiens_pallens"                                     
#> [1485] "Catharus ustulatus"                                        
#> [1486] "Turdus ustulatus"                                          
#> [1487] "Accipiter gentilis"                                        
#> [1488] "Accipiter gentillis"                                       
#> [1489] "Falco gentilis"                                            
#> [1490] "Crocodylus porosus"                                        
#> [1491] "Amborella trichopoda"                                      
#> [1492] "Falco biarmicus"                                           
#> [1493] "Lagopus leucura"                                           
#> [1494] "Lagopus leucurus"                                          
#> [1495] "Falco rusticolus"                                          
#> [1496] "Phasianus colchicus"                                       
#> [1497] "Strigops habroptila"                                       
#> [1498] "Strigops habroptilis"                                      
#> [1499] "Strigops habroptilus"                                      
#> [1500] "Corvus brachyrhynchos"                                     
#> [1501] "Uloborus diversus"                                         
#> [1502] "Phytophthora infestans_strain_T30-4"                       
#> [1503] "Phytophthora infestans_T30-4"                              
#> [1504] "Empidonax traillii"                                        
#> [1505] "Muscicapa traillii"                                        
#> [1506] "Strix alba"                                                
#> [1507] "Tyto alba"                                                 
#> [1508] "Parus major"                                               
#> [1509] "Lepeophtheirus salmonis"                                   
#> [1510] "Gavialis gangeticus"                                       
#> [1511] "Lacerta gangetica"                                         
#> [1512] "Casuarius novaehollandiae"                                 
#> [1513] "Dromaius novaehollandiae"                                  
#> [1514] "Dromaius novae-hollandiae"                                 
#> [1515] "Lepidothrix coronata"                                      
#> [1516] "Pipra coronata"                                            
#> [1517] "Daphnia carinata"                                          
#> [1518] "Aphis gossypii"                                            
#> [1519] "Hyalella azteca"                                           
#> [1520] "Hyalella knickerbockeri"                                   
#> [1521] "Colletotrichum lupini"                                     
#> [1522] "Gloeosporium lupini"                                       
#> [1523] "Sphaeroforma arctica_JP610"                                
#> [1524] "Suillus fuscotomentosus"                                   
#> [1525] "Mollisia scopiformis"                                      
#> [1526] "Phialocephala scopiformis"                                 
#> [1527] "Myiozetetes cayanensis"                                    
#> [1528] "Hyaloscypha bicolor_E"                                     
#> [1529] "Lonchura domestica"                                        
#> [1530] "Lonchura striata_domestica"                                
#> [1531] "Melopsittacus undulatus"                                   
#> [1532] "Psittacus undulatus"                                       
#> [1533] "Apteryx australis_mantelli"                                
#> [1534] "Apteryx mantelli_mantelli"                                 
#> [1535] "Fringilla montana"                                         
#> [1536] "Passer montanus"                                           
#> [1537] "Coccinella axyridis"                                       
#> [1538] "Harmonia axyridis"                                         
#> [1539] "Calidris pugnax"                                           
#> [1540] "Machetes pugnax"                                           
#> [1541] "Pavoncella pugnax"                                         
#> [1542] "Philomachus pugnax"                                        
#> [1543] "Tringa pugnax"                                             
#> [1544] "Aimophila crissalis"                                       
#> [1545] "Kieneria crissalis"                                        
#> [1546] "Kieneria crissalis_(Vigors,_1839)"                         
#> [1547] "Melozone crissalis"                                        
#> [1548] "Pipilo crissalis"                                          
#> [1549] "Pipilo fuscus_crissalis"                                   
#> [1550] "Stomoxis calcitrans"                                       
#> [1551] "Stomoxys calcitrans"                                       
#> [1552] "Anas atrata"                                               
#> [1553] "Cygnus atratus"                                            
#> [1554] "Culex fatigans"                                            
#> [1555] "Culex pipiens_fatigans"                                    
#> [1556] "Culex pipiens_quinquefasciatus"                            
#> [1557] "Culex quinquefasciatus"                                    
#> [1558] "Hirundo rustica"                                           
#> [1559] "Acanthaster planci"                                        
#> [1560] "Molothrus ater"                                            
#> [1561] "Oriolus ater"                                              
#> [1562] "Laccaria bicolor_S238N-H82"                                
#> [1563] "Apteryx australis_rowii"                                   
#> [1564] "Apteryx rowii"                                             
#> [1565] "Apteryx rowi"                                              
#> [1566] "Anastrepha obliqua"                                        
#> [1567] "Grapholitha glycinivorella"                                
#> [1568] "Leguminivora glycinivorella"                               
#> [1569] "Crypturus perdicarius"                                     
#> [1570] "Nothoprocta perdicaria"                                    
#> [1571] "Ammospiza nelsoni"                                         
#> [1572] "Nylanderia fulva"                                          
#> [1573] "Paratrechina fulva"                                        
#> [1574] "Agelaius phoeniceus"                                       
#> [1575] "Agelaius phoniceus"                                        
#> [1576] "Oriolus phoeniceus"                                        
#> [1577] "Colletotrichum fructicola"                                 
#> [1578] "Colletotrichum ignotum"                                    
#> [1579] "Euthrips occidentalis"                                     
#> [1580] "Frankliniella brunnescens"                                 
#> [1581] "Frankliniella californica"                                 
#> [1582] "Frankliniella occidentalis_brunnescens"                    
#> [1583] "Frankliniella occidentalis"                                
#> [1584] "Motacilla alba_alba"                                       
#> [1585] "Fusarium solani_(Mart.)_Sacc.,_1881"                       
#> [1586] "Fusarium solani"                                           
#> [1587] "Fusisporium solani"                                        
#> [1588] "Neocosmospora solani"                                      
#> [1589] "Sitophilus oryzae"                                         
#> [1590] "Corvus cornix_cornix"                                      
#> [1591] "Fringilla canaria_Linnaeus,_1758"                          
#> [1592] "Serinus canaria"                                           
#> [1593] "Serinus canarius"                                          
#> [1594] "Drosophila subpulchrella"                                  
#> [1595] "Chlamydomonas reinhardtii"                                 
#> [1596] "Chlamydomonas smithii"                                     
#> [1597] "Puccinia striiformis_f._sp._tritici"                       
#> [1598] "Bactrocera cucurbitae"                                     
#> [1599] "Bactrocera (Zeugodacus)_cucurbitae"                        
#> [1600] "Zeugodacus cucurbitae"                                     
#> [1601] "Zeugodacus (Zeugodacus)_cucurbitae"                        
#> [1602] "Antrodia serialis"                                         
#> [1603] "Neoantrodia serialis"                                      
#> [1604] "Drosophila suzukii"                                        
#> [1605] "Wyeomyia smithii"                                          
#> [1606] "Montifringilla ruficollis"                                 
#> [1607] "Pyrgilauda ruficollis"                                     
#> [1608] "Gymnogyps californianus"                                   
#> [1609] "Vultur californianus"                                      
#> [1610] "Bactrocera (Bactrocera)_dorsalis"                          
#> [1611] "Bactrocera (Bactrocera)_invadens"                          
#> [1612] "Bactrocera dorsalis"                                       
#> [1613] "Bactrocera invadens"                                       
#> [1614] "Bactrocera papayae"                                        
#> [1615] "Bactrocera philippinensis"                                 
#> [1616] "Trichoplusia ni"                                           
#> [1617] "Leptothorax curvispinosus"                                 
#> [1618] "Temnothorax curvispinosus"                                 
#> [1619] "Saprolegnia declina_VS20"                                  
#> [1620] "Saprolegnia diclina_VS20"                                  
#> [1621] "Zonotrichia albicollis"                                    
#> [1622] "Bactrocera neohumeralis"                                   
#> [1623] "Sphaeria pertusa"                                          
#> [1624] "Trematosphaeria pertusa"                                   
#> [1625] "Fusarium oxysporum_var._redolens"                          
#> [1626] "Fusarium redolens"                                         
#> [1627] "Anastrepha ludens"                                         
#> [1628] "Cantharellus anzutake"                                     
#> [1629] "Malurus melanocephalus"                                    
#> [1630] "Muscicapa melanocephala"                                   
#> [1631] "Melitaea cinxia"                                           
#> [1632] "Papilio cinxia"                                            
#> [1633] "Maniola jurtina"                                           
#> [1634] "Papilio jurtina"                                           
#> [1635] "Anas fuligula"                                             
#> [1636] "Aythya fuligula"                                           
#> [1637] "Bombyx mori"                                               
#> [1638] "Phalaena mori"                                             
#> [1639] "Botys furnacalis"                                          
#> [1640] "Ostrinia furnacalis"                                       
#> [1641] "Priapula caudata"                                          
#> [1642] "Priapulus caudatus"                                        
#> [1643] "Apanteles glomeratus"                                      
#> [1644] "Cotesia glomerata"                                         
#> [1645] "Centrocercus urophasianus"                                 
#> [1646] "Centrocerus urophasianus"                                  
#> [1647] "Tetrao urophasianus"                                       
#> [1648] "Montifringilla taczanowskii_(Przewalski,_1876)"            
#> [1649] "Onychostruthus taczanowskii"                               
#> [1650] "Monomorium pharaonis"                                      
#> [1651] "Daktulosphaira vitifoliae"                                 
#> [1652] "Pemphigus vitifoliae"                                      
#> [1653] "Viteus vitifoliae"                                         
#> [1654] "Helicoverpa armigera"                                      
#> [1655] "Heliothis armigera"                                        
#> [1656] "Heliothis (Helicoverpa)_armigera"                          
#> [1657] "Noctua armigera"                                           
#> [1658] "Drosophila biarmipes"                                      
#> [1659] "Myzus (Nectarosiphon)_persicae"                            
#> [1660] "Myzus persicae"                                            
#> [1661] "Lucilia sericata"                                          
#> [1662] "Phaenicia sericata"                                        
#> [1663] "Tinamus guttatus"                                          
#> [1664] "Solenopsis invicta"                                        
#> [1665] "Fringilla georgiana"                                       
#> [1666] "Melospiza georgiana"                                       
#> [1667] "Helicoverpa zea"                                           
#> [1668] "Heliothis zea"                                             
#> [1669] "Phalaena zea"                                              
#> [1670] "Drosophila ananassae"                                      
#> [1671] "Drosophila annanassae"                                     
#> [1672] "Fusarium odoratissimum_NRRL_54006"                         
#> [1673] "Coccinella 7-punctata"                                     
#> [1674] "Coccinella septempunctata"                                 
#> [1675] "Spodoptera frugiperda"                                     
#> [1676] "Tigriopus californicus"                                    
#> [1677] "Ficedula albicollis"                                       
#> [1678] "Muscicapa albicollis"                                      
#> [1679] "Drosophila pseudoobscura"                                  
#> [1680] "Mytilidion resinicola"                                     
#> [1681] "Mytilinidion resinicola"                                   
#> [1682] "Halyomorpha halys"                                         
#> [1683] "Phycomyces blakesleeanus_NRRL_1555(-)"                     
#> [1684] "Camarhynchus parvulus"                                     
#> [1685] "Geospiza parvula"                                          
#> [1686] "Drosophila willistoni"                                     
#> [1687] "Monoraphidium neglectum"                                   
#> [1688] "Sturnus vulgaris"                                          
#> [1689] "Bactrocera tryoni"                                         
#> [1690] "Apus apus"                                                 
#> [1691] "Hirundo apus"                                              
#> [1692] "Suillus paluster"                                          
#> [1693] "Naegleria gruberi"                                         
#> [1694] "Suillus discolor"                                          
#> [1695] "Suillus tomentosus_var._discolor"                          
#> [1696] "Manacus vitellinus"                                        
#> [1697] "Pipra vitellina"                                           
#> [1698] "Trichinella spiralis"                                      
#> [1699] "Onthophagus taurus"                                        
#> [1700] "Epistrophe balteatus"                                      
#> [1701] "Episyrphus balteatus"                                      
#> [1702] "Episyrphus (Episyrphus)_balteatus"                         
#> [1703] "Leptinotarsa decemlineata"                                 
#> [1704] "Leptinotarsa decimlineata"                                 
#> [1705] "Stilodes decemlineata"                                     
#> [1706] "Struthio australis"                                        
#> [1707] "Struthio camelus_australis"                                
#> [1708] "Boletus plorans"                                           
#> [1709] "Suillus plorans"                                           
#> [1710] "Dryobates pubescens"                                       
#> [1711] "Picoides pubescens_(Linnaeus,_1766)"                       
#> [1712] "Picoides pubescens"                                        
#> [1713] "Fusarium proliferatum_ET1"                                 
#> [1714] "Fusarium oxysporum_Fo47"                                   
#> [1715] "Drosophila sechellia"                                      
#> [1716] "Schizophyllum commune_H4-8"                                
#> [1717] "Depressaria gossypiella"                                   
#> [1718] "Pectinophora gossypiella"                                  
#> [1719] "Parus humilis"                                             
#> [1720] "Podoces humilis"                                           
#> [1721] "Pseudopoces humilis"                                       
#> [1722] "Pseudopodoces humilis"                                     
#> [1723] "Ascidia intestinalis"                                      
#> [1724] "Ciona intestinalis"                                        
#> [1725] "Opisthorchis viverrini"                                    
#> [1726] "Puccinia graminis_f._sp._tritici_CRL_75-36-700-3"          
#> [1727] "Plutella xylostella"                                       
#> [1728] "Melampsora larici-populina_98AG31"                         
#> [1729] "Drosophila obscura"                                        
#> [1730] "Fusarium verticillioides_7600"                             
#> [1731] "Anoplophora glabripennis"                                  
#> [1732] "Anoplophora nobilis"                                       
#> [1733] "Cerosterna glabripennis"                                   
#> [1734] "Melanauster nobilis"                                       
#> [1735] "Calypte anna"                                              
#> [1736] "Ornismya anna"                                             
#> [1737] "Microdochium trichocladiopsis"                             
#> [1738] "Anopheles merus"                                           
#> [1739] "Bactrocera (Daculus)_oleae"                                
#> [1740] "Bactrocera (Dacus)_oleae"                                  
#> [1741] "Bactrocera oleae"                                          
#> [1742] "Dacus oleae"                                               
#> [1743] "Fusarium mangiferae"                                       
#> [1744] "Drosophila yakuba"                                         
#> [1745] "Contarinia nasturtii"                                      
#> [1746] "Parastagonospora nodorum_SN15"                             
#> [1747] "Drosophila virilis"                                        
#> [1748] "Zasmidium cellare_ATCC_36951"                              
#> [1749] "Drosophila mauritiana"                                     
#> [1750] "Geospiza fortis"                                           
#> [1751] "Eupeodes corollae"                                         
#> [1752] "Eupeodes (Eupeodes)_corollae"                              
#> [1753] "Metasyrphus corollae"                                      
#> [1754] "Spodoptera litura"                                         
#> [1755] "Sitodiplosis mosellana"                                    
#> [1756] "Microgaster mediator"                                      
#> [1757] "Microplitis medianus"                                      
#> [1758] "Microplitis mediator"                                      
#> [1759] "Drosophila kikkawai"                                       
#> [1760] "Diaporthe citri"                                           
#> [1761] "Phomopsis citri"                                           
#> [1762] "Mesites unicolor"                                          
#> [1763] "Mesitornis unicolor"                                       
#> [1764] "Suillus subaureus"                                         
#> [1765] "Colletotrichum capsici"                                    
#> [1766] "Colletotrichum dematium_f._truncatum_(Schwein.)_Arx,_1957" 
#> [1767] "Colletotrichum truncatum"                                  
#> [1768] "Glomerella glycines"                                       
#> [1769] "Vermicularia capsici"                                      
#> [1770] "Vermicularia truncata"                                     
#> [1771] "Drosophila simulans"                                       
#> [1772] "Anisochrysa carnea"                                        
#> [1773] "Chrysopa carnea"                                           
#> [1774] "Chrysoperla carnea"                                        
#> [1775] "Drosophila takahashii"                                     
#> [1776] "Lucilia cuprina"                                           
#> [1777] "Drosophila persimilis"                                     
#> [1778] "Falco albicilla"                                           
#> [1779] "Haliaeetus albicilla"                                      
#> [1780] "Antrostomus carolinensis"                                  
#> [1781] "Caprimulgus carolinensis"                                  
#> [1782] "Nasonia vitripennis"                                       
#> [1783] "Athene cunicularia"                                        
#> [1784] "Speotyto cunicularia"                                      
#> [1785] "Strix cunicularia"                                         
#> [1786] "Colias crocea"                                             
#> [1787] "Colias croceus"                                            
#> [1788] "Papilio croceus"                                           
#> [1789] "Leptidea sinapis"                                          
#> [1790] "Papilio sinapis"                                           
#> [1791] "Anopheles arabiensis"                                      
#> [1792] "Drosophila ficusphila"                                     
#> [1793] "Vollenhovia emeryi"                                        
#> [1794] "Hermetia illucens"                                         
#> [1795] "Fusarium vanettenii_77-13-4"                               
#> [1796] "Nectria haematococca_mpVI_77-13-4"                         
#> [1797] "Thrips palmi"                                              
#> [1798] "Falco leucocephalus"                                       
#> [1799] "Haliaeetus leucocephalus"                                  
#> [1800] "Malaya genurostris"                                        
#> [1801] "Colletotrichum gloeosporioides_(Penz.)_Penz._&_Sacc.,_1884"
#> [1802] "Colletotrichum gloeosporioides"                            
#> [1803] "Glomerella cingulata"                                      
#> [1804] "Glomerella rufomaculans-vaccinii"                          
#> [1805] "Gnomoniopsis cingulata"                                    
#> [1806] "Vermicularia gloeosporioides"                              
#> [1807] "Acanthamoeba castellanii_Neff_strain"                      
#> [1808] "Acanthamoeba castellanii_strain_Neff"                      
#> [1809] "Acanthamoeba castellanii_str._Neff"                        
#> [1810] "Drosophila albomicans"                                     
#> [1811] "Drosophila nasuta_albomicans"                              
#> [1812] "Diaporthe amygdali"                                        
#> [1813] "Fusicoccum amygdali"                                       
#> [1814] "Phomopsis amygdali"                                        
#> [1815] "Pelecanus crispus"                                         
#> [1816] "Pelecanus philippensis_crispus"                            
#> [1817] "Drosophila rhopaloa"                                       
#> [1818] "Aphantopus hyperantus"                                     
#> [1819] "Maniola hyperantus"                                        
#> [1820] "Papilio hyperantus"                                        
#> [1821] "Drosophila serrata"                                        
#> [1822] "Leptopilina heterotoma"                                    
#> [1823] "Peronospora halstedii"                                     
#> [1824] "Plasmopara halstedii"                                      
#> [1825] "Cuculus discolor"                                          
#> [1826] "Leptosomus discolor"                                       
#> [1827] "Aphanomyces invadans"                                      
#> [1828] "Drosophila santomea"                                       
#> [1829] "Sipha flava"                                               
#> [1830] "Drosophila teissieri"                                      
#> [1831] "Aptenodytes forsteri"                                      
#> [1832] "Phaethon lepturus"                                         
#> [1833] "Drosophila bipectinata"                                    
#> [1834] "Fulmaris glacialis"                                        
#> [1835] "Fulmarus glacialis"                                        
#> [1836] "Procellaria glacialis"                                     
#> [1837] "Ardea garzetta"                                            
#> [1838] "Egretta garzetta"                                          
#> [1839] "Anopheles mysorensis"                                      
#> [1840] "Anopheles stephensi_mysorensis"                            
#> [1841] "Anopheles stephensi"                                       
#> [1842] "Anopheles stephensi_var._mysorensis"                       
#> [1843] "Neocellia intermedia_Rothwell,_1907"                       
#> [1844] "Neocellia intermedia"                                      
#> [1845] "Cryptotermes secundus"                                     
#> [1846] "Pestalotiopsis fici_W106-1"                                
#> [1847] "Aricia agestis"                                            
#> [1848] "Papilio agestis"                                           
#> [1849] "Polyommatus agestis"                                       
#> [1850] "Artogeia napi"                                             
#> [1851] "Papilio napi"                                              
#> [1852] "Pieris napi"                                               
#> [1853] "Drosophila eugracilis"                                     
#> [1854] "Wasmannia auropunctata"                                    
#> [1855] "Oppia nitens"                                              
#> [1856] "Adelges cooleyi"                                           
#> [1857] "Chermes cooleyi"                                           
#> [1858] "Gilletteella cooleyi"                                      
#> [1859] "Acanthisitta chloris"                                      
#> [1860] "Sitta chloris"                                             
#> [1861] "Agrilus feretrius"                                         
#> [1862] "Agrilus marcopoli"                                         
#> [1863] "Agrilus planipennis"                                       
#> [1864] "Drosophila elegans"                                        
#> [1865] "Hyposmocoma kahamanoa"                                     
#> [1866] "Cariama cristata"                                          
#> [1867] "Palamedea cristata"                                        
#> [1868] "Aleurodes tabaci"                                          
#> [1869] "Aleyrodes tabaci"                                          
#> [1870] "Bemisia tabaci"                                            
#> [1871] "Ibis nippon"                                               
#> [1872] "Nipponia nippon"                                           
#> [1873] "Balearica gibbericeps"                                     
#> [1874] "Balearica pavonina_gibbericeps"                            
#> [1875] "Balearica regulorum_gibbericepse"                          
#> [1876] "Balearica regulorum_gibbericeps"                           
#> [1877] "Bombus affinis"                                            
#> [1878] "Varroa jacobsoni"                                          
#> [1879] "Drosophila gunungcola"                                     
#> [1880] "Colius striatus"                                           
#> [1881] "Tauraco erythrolophus"                                     
#> [1882] "Colletotrichum aenigma"                                    
#> [1883] "Colletotrichum communis"                                   
#> [1884] "Colletotrichum dianesei"                                   
#> [1885] "Colletotrichum endomangiferae"                             
#> [1886] "Colletotrichum hymenocallidis"                             
#> [1887] "Colletotrichum jasmini-sambac"                             
#> [1888] "Colletotrichum melanocaulon"                               
#> [1889] "Colletotrichum siamense"                                   
#> [1890] "Pterocles gutturalis"                                      
#> [1891] "Aethina tumida"                                            
#> [1892] "Galleria mellonella"                                       
#> [1893] "Phalaena mellonella"                                       
#> [1894] "Bicyclus anynana"                                          
#> [1895] "Mycalesis anynana"                                         
#> [1896] "Leptopilina boulardi"                                      
#> [1897] "Zootermopsis nevadensis"                                   
#> [1898] "Achroia grisella"                                          
#> [1899] "Tinea grisella"                                            
#> [1900] "Acremonium falciforme"                                     
#> [1901] "Cephalosporium falciforme"                                 
#> [1902] "Fusarium falciforme"                                       
#> [1903] "Neocosmospora falciformis"                                 
#> [1904] "Drosophila mohavensis"                                     
#> [1905] "Drosophila mojavensis"                                     
#> [1906] "Drosophila innubila"                                       
#> [1907] "Bombus huntii"                                             
#> [1908] "Cuculus indicator"                                         
#> [1909] "Indicator indicator"                                       
#> [1910] "Dendroctonus ponderosae"                                   
#> [1911] "Ardea helias"                                              
#> [1912] "Eurypyga helias"                                           
#> [1913] "Loa loa"                                                   
#> [1914] "Cadophora gregata"                                         
#> [1915] "Cephalosporium gregatum"                                   
#> [1916] "Phialophora gregata"                                       
#> [1917] "Nestor notabilis_notabilis"                                
#> [1918] "Nestor notabilis"                                          
#> [1919] "Colymbus stellatus"                                        
#> [1920] "Gavia stellata"                                            
#> [1921] "Cynthia cardui"                                            
#> [1922] "Papilio cardui"                                            
#> [1923] "Vanessa cardui"                                            
#> [1924] "Cladosporium fulvum"                                       
#> [1925] "Fulvia fulva"                                              
#> [1926] "Mycovellosiella fulva"                                     
#> [1927] "Passalora fulva"                                           
#> [1928] "Plodia interpunctella"                                     
#> [1929] "Tinea interpunctella"                                      
#> [1930] "Stilbospora angustata"                                     
#> [1931] "Truncatella angustata"                                     
#> [1932] "Truncatella truncata"                                      
#> [1933] "Chlamydotis macqueenii_macqueenii"                         
#> [1934] "Chlamydotis macqueenii_macqueeni"                          
#> [1935] "Chlamydotis macqueenii"                                    
#> [1936] "Chlamydotis undulata_macqueenii"                           
#> [1937] "Otis macqueenii"                                           
#> [1938] "Anopheles funestus"                                        
#> [1939] "Fusarium fujikuroi_IMI_58289"                              
#> [1940] "Cephalosporium keratoplasticum_(nom._inval.)"              
#> [1941] "Fusarium keratoplasticum"                                  
#> [1942] "Neocosmospora keratoplastica"                              
#> [1943] "Drosophila lebanonensis"                                   
#> [1944] "Scaptodrosophila lebanonensis"                             
#> [1945] "Merops nubicus"                                            
#> [1946] "Coniothyrium fuckelii_var._sporulosum"                     
#> [1947] "Coniothyrium sporulosum"                                   
#> [1948] "Paraconiothyrium sporulosum"                               
#> [1949] "Paraconyotrichium sporulosum"                              
#> [1950] "Paraphaeosphaeria sporulosa"                               
#> [1951] "Fusarium venenatum"                                        
#> [1952] "Fusarium venetum"                                          
#> [1953] "Amyelois transitella"                                      
#> [1954] "Nephopteryx transitella"                                   
#> [1955] "Pelecanus carbo"                                           
#> [1956] "Phalacrocorax carbo"                                       
#> [1957] "Naegleria lovaniensis"                                     
#> [1958] "Papilio machaon"                                           
#> [1959] "Gaeumannomyces tritici_R3-111a-1"                          
#> [1960] "Papilio aegeria"                                           
#> [1961] "Pararge aegeria"                                           
#> [1962] "Lophyrus lecontei"                                         
#> [1963] "Neodiprion lecontei"                                       
#> [1964] "Sclerotinia sclerotiorum_1980_UF-70"                       
#> [1965] "Aegialitis vocifera"                                       
#> [1966] "Charadrius vociferous"                                     
#> [1967] "Charadrius vociferus"                                      
#> [1968] "Oxyechus vociferus"                                        
#> [1969] "Drosophila erecta"                                         
#> [1970] "Anopheles gambiae"                                         
#> [1971] "Arabidopsis thaliana"                                      
#> [1972] "Bos taurus"                                                
#> [1973] "Canis familiaris"                                          
#> [1974] "Gallus gallus"                                             
#> [1975] "Pan troglodytes"                                           
#> [1976] "Escherichia coli"                                          
#> [1977] "Drosophila melanogaster"                                   
#> [1978] "Homo sapiens"                                              
#> [1979] "Mus musculus"                                              
#> [1980] "Sus scrofa"                                                
#> [1981] "Rattus norvegicus"                                         
#> [1982] "Macaca mulatta"                                            
#> [1983] "Caenorhabditis elegans"                                    
#> [1984] "Xenopus laevis"                                            
#> [1985] "Saccharomyces cerevisiae"                                  
#> [1986] "Danio rerio"

Note that, if genome_type is specified to a supported genome, the human / mouse gene ontology annotation will be automatically generated.


What is Mappability and why should I care about it?
For the most part, the SpliceWiz reference can be built with just the FASTA and GTF files. This is sufficient for assessment for most forms of alternative splicing events.

For intron retention, accurate assessment of intron depth is important. However, introns contain many repetitive regions that are difficult to map. We refer to these regions as “mappability exclusions”.

We adopt IRFinder’s algorithm to identify these mappability exclusions. This is determined empirically by generating synthetic reads systematically from the genome, then aligning these reads back to the same genome. Regions that contain less than the expected coverage depth of reads define “mappability exclusions”.

See the vignette: SpliceWiz cookbook for details on how to generate “mappability exclusions” for any genome.

How do I use pre-built mappability exclusions to generate human and mouse references?
For human and mouse genomes, SpliceWiz provides pre-built mappability exclusion references that can be used to build the SpliceWiz reference. SpliceWiz provides these annotations via the NxtIRFdata package.

Simply specify the genome in the parameter genome_type in the buildRef() function (which accepts hg38, hg19, mm10 and mm9).

Additionally, a reference for non-polyadenylated transcripts is used. This has a minor role in QC of samples (to assess the adequacy of polyA capture).

For example, assuming your genome file "genome.fa" and a transcript annotation "transcripts.gtf" are in the working directory, a SpliceWiz reference can be built using the built-in hg38 low mappability regions and non-polyadenylated transcripts as follows:

## NOT RUN

ref_path_hg38 <- "./Reference"
buildRef(
    reference_path = ref_path_hg38,
    fasta = "genome.fa",
    gtf = "transcripts.gtf",
    genome_type = "hg38"
)


Process BAM files using SpliceWiz

The function SpliceWiz_example_bams() retrieves 6 example BAM files from ExperimentHub and places a copy of these in the temporary directory.

bams <- SpliceWiz_example_bams()
What are these example BAM files and how were they generated?
In this vignette, we provide 6 example BAM files. These were generated based on aligned RNA-seq BAMs of 6 samples from the Leucegene AML dataset (GSE67039). Sequences aligned to hg38 were filtered to only include genes aligned to that used to create the chrZ chromosome. These sequences were then re-aligned to the chrZ reference using STAR.

How can I easily locate multiple BAM files?
Often, alignment pipelines process multiple samples. SpliceWiz provides convenience functions to recursively locate all the BAM files in a given folder, and tries to ascertain sample names. Often sample names can be gleaned when: * The BAM files are named by their sample names, e.g. “sample1.bam”, “sample2.bam”. In this case, level = 0 * The BAM files have generic names but are contained inside parent directories labeled by their sample names, e.g. “sample1/Unsorted.bam”, “sample2/Unsorted.bam”. In this case, level = 1

# as BAM file names denote their sample names
bams <- findBAMS(tempdir(), level = 0) 

# In the case where BAM files are labelled using sample names as parent 
# directory names (which oftens happens with the STAR aligner), use level = 1


Process these BAM files using SpliceWiz:

pb_path <- file.path(tempdir(), "pb_output")
processBAM(
    bamfiles = bams$path,
    sample_names = bams$sample,
    reference_path = ref_path,
    output_path = pb_path
)

Using the GUI
After building the demo reference as shown in the previous section, start SpliceWiz GUI in demo mode. Then, click the Experiment tab from the menu side bar. The following interface will be shown:

The Experiment Panel - GUI

The Experiment Panel - GUI

The buttons on the left hand side are as follows:

  1. Set the folders containing the SpliceWiz reference, BAM files, and the output (NxtSE) folder
  2. Run processBAM (process BAM files)
  3. Import annotations from a tabular text file (such as a csv file)
  4. Settings for collateData - collating the experiment
  5. Run collateData (collating the experiment)
  6. Import/Export current sample annotations from/to the NxtSE folder, or export annotations as a csv file
  7. Set the number of threads used to run processBAM and `collateData functions

Also, (8) is a row of tabs that toggle between different tables, showing details of the Reference, BAM files, processBAM output Files, and sample Annotations

To continue with our example, click on (1) Define Project Folders to bring up the following drop-down box:

Define Project Folders

Define Project Folders

We need to define the folders that contain our reference, BAM files, as well as the experiment (NxtSE) output folder for the final compiled experiment

  • Click on Choose Reference Folder and select the Reference directory (where the SpliceWiz reference was generated by the previous step. Then,
  • Click on Choose BAM Folder and select the bams directory (where the demo BAM files have been generated).
  • Click on Choose / Create Experiment (NxtSE) Folder and select the NxtSE directory (which should currently be empty except for the pbOutput subdirectory). Note that when an Experiment folder is chosen via this step, a pbOutput subdirectory will be created if it does not already exist

After our folders have been defined, on the right hand side, an interactive table should be displayed that looks like the following:

Running processBAM

Running processBAM

To process the example BAM files, first make sure the BAM files you wish to process have been selected (BAM files can be unselected by removing the ticks in the selected column). Also, users have the option of renaming the samples (by setting the names in the sampleName column).

To continue with our example, lets leave the names as-is. Click the Process Selected BAMs button. A prompt should pop up asking for confirmation. Click OK to start running processBAM.


What is the processBAM() function
SpliceWiz’s processBAM() function can process one or more BAM files. This function is ultra-fast, relying on an internal native C++ function that uses OpenMP multi-threading (via the ompBAM C++ API).

Input BAM files can be either read-name sorted or coordinate sorted (although SpliceWiz prefers the former). Indexing of coordinate-sorted BAMs are not necessary.

processBAM() loads the SpliceWiz reference. Then, it reads each BAM file in their entirety, and quantifies the following:

  • Basic QC parameters including number of reads, directionality, etc
  • Counts of gapped (junction) reads / fragments
  • Intron coverage depths and other parameters (identical output to IRFinder)
  • COV files (which are like BigWig files but record strand-specific coverage)
  • Miscellaneous quants including coverage of chromosomes, intergenic regions, rRNAs, and non-polyadenylated regions
For each BAM file, processBAM() generates two output files. The first is a gzipped text file containing all the quantitation data. The second is a COV file which contains the per-nucleotide RNA-seq coverage of the sample.

More details on the processBAM() function
At minimum, processBAM() requires four parameters:

  • bamfiles : The paths of the BAM files
  • sample_names : The sample names corresponding to the given BAM files
  • reference_path : The directory containing the SpliceWiz reference
  • output_path : The directory where the output of processBAM() should go
pb_path <- file.path(tempdir(), "pb_output")
processBAM(
    bamfiles = bams$path,
    sample_names = bams$sample,
    reference_path = ref_path,
    output_path = pb_path
)

processBAM() also takes several optional, but useful, parameters:

  • n_threads : The number of threads for multi-threading
  • overwrite : Whether existing files in the output directory should be overwritten
  • run_featureCounts : (Requires the Rsubread package) runs featureCounts to obtain gene counts (which outputs results as an RDS file)

For example, to run processBAM() using 2 threads, disallow overwrite of existing processBAM() outputs, and run featureCounts afterwards, one would run the following:

# NOT RUN

# Re-run IRFinder without overwrite, and run featureCounts
require(Rsubread)

processBAM(
    bamfiles = bams$path,
    sample_names = bams$sample,
    reference_path = ref_path,
    output_path = pb_path,
    n_threads = 2,
    overwrite = FALSE,
    run_featureCounts = TRUE
)

# Load gene counts
gene_counts <- readRDS(file.path(pb_path, "main.FC.Rds"))

# Access gene counts:
gene_counts$counts


Collate the experiment

The helper function findSpliceWizOutput() organises the output files of SpliceWiz’s processBAM() function. It identifies matching "txt.gz" and "cov" files for each sample, and organises these file paths conveniently into a 3-column data frame:

expr <- findSpliceWizOutput(pb_path)

Using this data frame, collate the experiment using collateData(). We name the output directory as NxtSE_output as this folder will contain the data needed to import the NxtSE object:

nxtse_path <- file.path(tempdir(), "NxtSE_output")
collateData(
    Experiment = expr,
    reference_path = ref_path,
    output_path = nxtse_path
)

What is the collateData() function
collateData() combines the processBAM() output files of multiple samples and builds a single database. collateData() creates a number of files in the chosen output directory. These outputs can then be imported into the R session as a NxtSE data object for downstream analysis.

At minimum, collateData() takes the following parameters:

  • Experiment : The 2- or 3- column data frame. The first column should contain (unique) sample names. The second and (optional) third columns contain the "txt.gz" and "cov" file paths
  • reference_path : The directory containing the SpliceWiz reference
  • output_path : The directory where the output of processBAM() should go

collateData() can take some optional parameters:

  • IRMode : Whether to use SpliceWiz’s SpliceOver method, or IRFinder’s SpliceMax method, to determine total spliced transcript abundance. Briefly, SpliceMax considers junction reads that have either flanking splice site coordinate. SpliceOver considers additional junction reads that splices across exon clusters in common. Exon clusters are groups of mutually-overlapping exons. SpliceOver is the default option.
  • overwrite : Whether files in the output directory should be overwritten
  • n_threads : Use multi-threaded operations where possible
  • lowMemoryMode : Minimise memory usage where possible. Note that most of the collateData pipeline will be single-threaded if this is set to TRUE.
collateData() is a memory-intensive operation when run using multiple threads. We estimate it can use up to 6-7 Gb per thread. lowMemoryMode will minimise RAM usage to ~ 8 Gb, but will be slower and run on a single thread.

Enabling novel splicing detection
Novel splicing detection can be switched on by setting novelSplicing = TRUE from within the collateData() function:

# Modified pipeline - collateData with novel ASE discovery:

nxtse_path <- file.path(tempdir(), "NxtSE_output_novel")
collateData(
    Experiment = expr,
    reference_path = ref_path,
    output_path = nxtse_path,
    novelSplicing = TRUE     ## NEW ##
)

collateData() uses split reads that are not annotated introns to help construct hypothetical minimal transcripts. These are then injected into the original transcriptome annotation (GTF) file, whereby the SpliceWiz reference is rebuilt. The new SpliceWiz reference (which contains these novel transcripts) is then used to collate the samples.

To reduce false positives in novel splicing detection, SpliceWiz provides several filters to reduce the number of novel junctions fed into the analysis:

  • Novel junctions that are lowly expressed (only in a small number of samples) are removed. The minimum number of samples required to retain a novel junction is set using novelSplicing_minSamples parameter
  • Alternately, junctions are retained if its expression exceeds a certain threshold (set using novelSplicing_countThreshold) in a smaller number of samples (set using novelSplicing_minSamplesAboveThreshold)
  • Further, novel junctions can be filtered by requiring at least one end to be an annotated splice site (this is enabled using novelSplicing_requireOneAnnotatedSJ = TRUE).

For example, if one wished to retain novel reads seen in 3 or more samples, or novel spliced reads with 10 or more counts in at least 1 sample, and requiring at least one end of a novel junction being an annotated splice site:

By default, tandem junction reads (reads that align across two or more splice junctions) are used to detect novel exons. This can be turned off by setting novelSplicing_useTJ = FALSE.

nxtse_path <- file.path(tempdir(), "NxtSE_output_novel")
collateData(
    Experiment = expr,
    reference_path = ref_path,
    output_path = nxtse_path,
    
        ## NEW ##
    novelSplicing = TRUE,
        # switches on novel splice detection
    
    novelSplicing_requireOneAnnotatedSJ = TRUE,
        # novel junctions must share one annotated splice site

    novelSplicing_minSamples = 3,
        # retain junctions observed in 3+ samples (of any non-zero expression)
    
    novelSplicing_minSamplesAboveThreshold = 1,
        # only 1 sample required if its junction count exceeds a set threshold
    novelSplicing_countThreshold = 10  ,
        # threshold for previous parameter

    novelSplicing_useTJ = TRUE
        # whether tandem junction reads should be used to identify novel exons
)

Using the GUI to annotate the experiment
After running the Reference and processBAM steps as indicated in the previous sections (of the GUI instructions), there is an option to assign annotations to the experiment prior to collation. Annotations can be assigned from existing tabular files (such as csv files). For this example, we will demonstrate how to use a csv file containing annotations to annotate our experiment. Note that the annotation table should contain matching sample names in its leftmost column.

To select a file containing annotations, click on Import Annotations from file, then select the demo_annotations.csv file that should be in the working directory. Press ok. Your interface should now look like this:

Importing annotations

Importing annotations

Note that an extra button Add / Remove Annotation Columns (*) has appeared. Clicking on this button allows us to add/remove annotation columns

Adding annotation columns

Adding annotation columns

Using this panel, columns can be added or removed by clicking their corresponding buttons. Data types for columns can also be defined here.

Using the GUI to collate the experiment

After annotating the experiment in the step above, click on the Experiment Settings button. Then enable Look for Novel Splicing to bring up the following display:

Customizing the Experiment Settings

Customizing the Experiment Settings

This drop-down dialog box contains several parameters related to novel splicing detection:

  1. Toggle on/off novel splice detection
  2. Restrict novel junction reads to having one annotated splice site
  3. Filter novel junction counts based on the number of samples in which the novel junction was observed (any non-zero amount)
  4. Filter novel junction counts based on expression threshold (min number of samples set here)
  5. Threshold junction read count (expression threshold based novel junction filter)
  6. Whether to utilize tandem junction to define novel exons

Also, there is an option (7) to overwrite a previously-compiled NxtSE in the same folder. There is also an option to clear all the Reference/BAM/NxtSE folders (8).

For this example, we will leave the settings as above. Proceed to run collateData() by clicking on Collate Experiment. After several moments, a pop-up message should be shown when the experiment has been successfully collated.


Importing the experiment

Before differential analysis can be performed, the collated experiment must be imported into the R session as an NxtSE data object.

After running collateData(), import the experiment using the makeSE() function:

se <- makeSE(nxtse_path)

Using the GUI
After running the steps in the previous GUI sections, navigate to Analysis and then click the Load Experiment on the menu bar. The display should look like this:

Analysis - Loading the Experiment - GUI

Analysis - Loading the Experiment - GUI

The buttons on the left hand side are as follows:

  1. Select the NxtSE folder containing the collated experiment
  2. Import annotations from file (tabular format such as csv file)
  3. Load NxtSE from folder (into current session for downstream analysis)
  4. After NxtSE has been loaded, it can be saved as an RDS file to send to collaborators (note that COV files will be disconnected once the file has been moved to a different location; it is best to give your collaborators the NxtSE folder instead, with the COV files inside the pbOutput subdirectory)
  5. Load NxtSE from RDS file (saved via the prior step)

To continue with our example, click the Select Experiment (NxtSE) Folder, then select the NxtSE directory. The interface should now look like this:

Loading the NxtSE after reviewing the samples and annotations - GUI

Loading the NxtSE after reviewing the samples and annotations - GUI

To view any existing annotations, click the Annotations tab in the Experiment Display above the sample table. If you followed the prior steps in the Collate the experiment section, there should already be annotations here. If not, feel free to add annotations using the Import Annotations from File (and click on “demo_annotations.csv” file), or manually add and annotate columns by clicking on the Add / Remove Annotation Columns button to open the drop-down box.

To load the NxtSE object, click the Load NxtSE from Folder to which will load the NxtSE object into the current session. A pop-up will appear once the NxtSE object has been successfully completed.


What is the makeSE() function
The makeSE() function imports the compiled data generated by the collateData() function. Data is imported as an NxtSE object. Downstream analysis, including differential analysis and visualization, is performed using the NxtSE object.

More details about the makeSE() function
By default, makeSE() uses delayed operations to avoid consuming memory until the data is actually needed. This is advantageous in analysis of hundreds of samples on a computer with limited resources. However, it will be slower. To load all the data into memory, we need to “realize” the NxtSE object, as follows:

se <- realize_NxtSE(se)

Alternatively, makeSE() can realize the NxtSE object at construction:

se <- makeSE(nxtse_path, realize = TRUE)

By default, makeSE() constructs the NxtSE object using all the samples in the collated data. It is possible (and particularly useful in large data sets) to read only a subset of samples. In this case, construct a data frame object with the first column containing the desired sample names and parse this into the colData parameter as shown:

subset_samples <- colnames(se)[1:4]
df <- data.frame(sample = subset_samples)
se_small <- makeSE(nxtse_path, colData = df, RemoveOverlapping = TRUE)
In complex transcriptomes including those of human and mouse, alternative splicing implies that introns are often overlapping. Thus, algorithms run the risk of over-calling intron retention where overlapping introns are assessed. SpliceWiz removes overlapping introns by considering only introns belonging to the major splice isoforms. It estimates a list of introns of major isoforms by assessing the compatible splice junctions of each isoform, and removes overlapping introns belonging to minor isoforms. To disable this functionality, set RemoveOverlapping = FALSE.


Differential analysis

Assigning annotations to samples

colData(se)$condition <- rep(c("A", "B"), each = 3)
colData(se)$batch <- rep(c("K", "L", "M"), 2)

NB: to add annotations via the GUI workflow, see the Collate the experiment section.


Saving and reloading the NxtSE as an RDS file (GUI)
Once an NxtSE object has been loaded into memory, you can save it as an RDS object so it can be reloaded in a later session. To do this, click the Save NxtSE as RDS button. Choose a file name and location and press OK. This RDS file can be loaded as an NxtSE object in a later GUI session by clicking the Load NxtSE from RDS button.


What is the NxtSE object
NxtSE is a data object which contains all the required data for downstream analysis after all the BAM alignment files have been process and the experiment is collated.

se
#> class: NxtSE 
#> dim: 167 6 
#> metadata(14): Up_Inc Down_Inc ... ref row_gr
#> assays(5): Included Excluded Depth Coverage minDepth
#> rownames(167): TRA2B/ENST00000453386_Intron8/clean
#>   TRA2B/ENST00000453386_Intron7/clean ...
#>   RI:SRSF2-203-exon2;SRSF2-202-intron2
#>   RI:SRSF2-203-exon2;SRSF2-206-intron1
#> rowData names(20): EventName EventType ... is_annotated_IR
#>   NMD_direction
#> colnames(6): 02H003 02H025 ... 02H043 02H046
#> colData names(2): condition batch

The NxtSE object inherits the SummarizedExperiment object. This means that the functions for SummarizedExperiment can be used on the NxtSE object. These include row and column annotations using the rowData() and colData() accessors.

Rows in the NxtSE object contain information about each alternate splicing event. For example:

head(rowData(se))
#> DataFrame with 6 rows and 20 columns
#>                                                  EventName   EventType
#>                                                <character> <character>
#> TRA2B/ENST00000453386_Intron8/clean TRA2B/ENST0000045338..          IR
#> TRA2B/ENST00000453386_Intron7/clean TRA2B/ENST0000045338..          IR
#> TRA2B/ENST00000453386_Intron6/clean TRA2B/ENST0000045338..          IR
#> TRA2B/ENST00000453386_Intron5/clean TRA2B/ENST0000045338..          IR
#> TRA2B/ENST00000453386_Intron4/clean TRA2B/ENST0000045338..          IR
#> TRA2B/ENST00000453386_Intron3/clean TRA2B/ENST0000045338..          IR
#>                                          EventRegion         gene_id
#>                                          <character>     <character>
#> TRA2B/ENST00000453386_Intron8/clean chrZ:1921-2559/- ENSG00000136527
#> TRA2B/ENST00000453386_Intron7/clean chrZ:2634-3631/- ENSG00000136527
#> TRA2B/ENST00000453386_Intron6/clean chrZ:3692-5298/- ENSG00000136527
#> TRA2B/ENST00000453386_Intron5/clean chrZ:5383-6205/- ENSG00000136527
#> TRA2B/ENST00000453386_Intron4/clean chrZ:6322-7990/- ENSG00000136527
#> TRA2B/ENST00000453386_Intron3/clean chrZ:8180-9658/- ENSG00000136527
#>                                           gene_id_b              intron_id
#>                                         <character>            <character>
#> TRA2B/ENST00000453386_Intron8/clean ENSG00000136527 ENST00000453386_Intr..
#> TRA2B/ENST00000453386_Intron7/clean ENSG00000136527 ENST00000453386_Intr..
#> TRA2B/ENST00000453386_Intron6/clean ENSG00000136527 ENST00000453386_Intr..
#> TRA2B/ENST00000453386_Intron5/clean ENSG00000136527 ENST00000453386_Intr..
#> TRA2B/ENST00000453386_Intron4/clean ENSG00000136527 ENST00000453386_Intr..
#> TRA2B/ENST00000453386_Intron3/clean ENSG00000136527 ENST00000453386_Intr..
#>                                     Inc_Is_Protein_Coding Exc_Is_Protein_Coding
#>                                                 <logical>             <logical>
#> TRA2B/ENST00000453386_Intron8/clean                  TRUE                  TRUE
#> TRA2B/ENST00000453386_Intron7/clean                  TRUE                  TRUE
#> TRA2B/ENST00000453386_Intron6/clean                  TRUE                  TRUE
#> TRA2B/ENST00000453386_Intron5/clean                  TRUE                  TRUE
#> TRA2B/ENST00000453386_Intron4/clean                  TRUE                  TRUE
#> TRA2B/ENST00000453386_Intron3/clean                  TRUE                  TRUE
#>                                     Exc_Is_NMD Inc_Is_NMD     Inc_TSL
#>                                      <logical>  <logical> <character>
#> TRA2B/ENST00000453386_Intron8/clean      FALSE       TRUE           1
#> TRA2B/ENST00000453386_Intron7/clean      FALSE       TRUE           1
#> TRA2B/ENST00000453386_Intron6/clean      FALSE       TRUE           1
#> TRA2B/ENST00000453386_Intron5/clean      FALSE       TRUE           1
#> TRA2B/ENST00000453386_Intron4/clean      FALSE       TRUE           1
#> TRA2B/ENST00000453386_Intron3/clean      FALSE       TRUE           1
#>                                         Exc_TSL          Event1a     Event2a
#>                                     <character>      <character> <character>
#> TRA2B/ENST00000453386_Intron8/clean           1 chrZ:1921-2559/-          NA
#> TRA2B/ENST00000453386_Intron7/clean           1 chrZ:2634-3631/-          NA
#> TRA2B/ENST00000453386_Intron6/clean           1 chrZ:3692-5298/-          NA
#> TRA2B/ENST00000453386_Intron5/clean           1 chrZ:5383-6205/-          NA
#> TRA2B/ENST00000453386_Intron4/clean           1 chrZ:6322-7990/-          NA
#> TRA2B/ENST00000453386_Intron3/clean           1 chrZ:8180-9658/-          NA
#>                                         Event1b     Event2b
#>                                     <character> <character>
#> TRA2B/ENST00000453386_Intron8/clean          NA          NA
#> TRA2B/ENST00000453386_Intron7/clean          NA          NA
#> TRA2B/ENST00000453386_Intron6/clean          NA          NA
#> TRA2B/ENST00000453386_Intron5/clean          NA          NA
#> TRA2B/ENST00000453386_Intron4/clean          NA          NA
#> TRA2B/ENST00000453386_Intron3/clean          NA          NA
#>                                     is_always_first_intron
#>                                                  <logical>
#> TRA2B/ENST00000453386_Intron8/clean                  FALSE
#> TRA2B/ENST00000453386_Intron7/clean                  FALSE
#> TRA2B/ENST00000453386_Intron6/clean                  FALSE
#> TRA2B/ENST00000453386_Intron5/clean                  FALSE
#> TRA2B/ENST00000453386_Intron4/clean                  FALSE
#> TRA2B/ENST00000453386_Intron3/clean                  FALSE
#>                                     is_always_last_intron is_annotated_IR
#>                                                 <logical>       <logical>
#> TRA2B/ENST00000453386_Intron8/clean                 FALSE           FALSE
#> TRA2B/ENST00000453386_Intron7/clean                 FALSE           FALSE
#> TRA2B/ENST00000453386_Intron6/clean                 FALSE           FALSE
#> TRA2B/ENST00000453386_Intron5/clean                 FALSE           FALSE
#> TRA2B/ENST00000453386_Intron4/clean                 FALSE            TRUE
#> TRA2B/ENST00000453386_Intron3/clean                 FALSE           FALSE
#>                                     NMD_direction
#>                                         <numeric>
#> TRA2B/ENST00000453386_Intron8/clean             1
#> TRA2B/ENST00000453386_Intron7/clean             1
#> TRA2B/ENST00000453386_Intron6/clean             1
#> TRA2B/ENST00000453386_Intron5/clean             1
#> TRA2B/ENST00000453386_Intron4/clean             1
#> TRA2B/ENST00000453386_Intron3/clean             1

Columns contain information about each sample. By default, no annotations are assigned to each sample. These can be assigned as shown above.

Also, NxtSE objects can be subsetted by rows (ASEs) or columns (samples). This is useful if one wishes to perform analysis on a subset of the dataset, or only on a subset of ASEs (say for example, only skipped exon events). Subsetting is performed just like for SummarizedExperiment objects:

# Subset by columns: select the first 2 samples
se_sample_subset <- se[,1:2]

# Subset by rows: select the first 10 ASE events
se_ASE_subset <- se[1:10,]


Filtering high-confidence events

SpliceWiz offers default filters to identify and remove low confidence alternative splice events (ASEs). Run the default filter using the following:

se.filtered <- se[applyFilters(se),]
#> Running Depth filter
#> Running Participation filter
#> Running Participation filter
#> Running Consistency filter
#> Running Terminus filter
#> Running ExclusiveMXE filter
#> Running StrictAltSS filter

Using the GUI
After following the GUI tutorials in the prior sections, click on Analysis and then Filters from the menu bar. It should look like this:

Analysis - Filters - GUI

Analysis - Filters - GUI

To load SpliceWiz’s default filters, click the top right button Load Default Filters. Then to apply these filters to the NxtSE, click Apply Filters. After the filters have been run, your session should now look like this:

SpliceWiz default filters - GUI

SpliceWiz default filters - GUI


Why do we need to filter alternative splicing events?
Often, the gene annotations contain isoforms for all discovered splicing events. Most annotated transcripts are not expressed, and their inclusion in differential analysis complicates results including adjusting for multiple testing. It is prudent to filter these out using various approaches, akin to removing genes with low gene counts in differential gene analysis. We suggest using the default filters which work well for small experiments with sequencing depths at 100-million paired-end reads.

To learn more about filters, consult the documentation via ?ASEFilters


Performing differential analysis

Using the edgeR wrapper ASE_edgeR(), perform differential ASE analysis between conditions “A” and “B”:

# Requires edgeR to be installed:
require("edgeR")
res_edgeR <- ASE_edgeR(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A"
)

Using the GUI
After running the previous sections (of the GUI instructions), click Analysis and then Differential Expression Analysis on the menu side bar. It should look something like this:

Analysis - Differential Expression Analysis - GUI

Analysis - Differential Expression Analysis - GUI

To perform edgeR-based differential analysis, first ensure Method is set to edgeR. Using the Variable drop-down box, select condition. Then, select the Nominator and Denominator fields to B and A, respectively. Leave the batch factor fields as (none). Then, click Perform DE.

Once differential expression analysis has finished, your session should look like below. The output is a DT-based data table equivalent to the ASE_edgeR() function.

Example Differential Analysis using edgeR - GUI

Example Differential Analysis using edgeR - GUI

NB: The interface allows users to choose to sort the results either by nominal or (multiple-testing) adjusted P values

NB2: There are 3 different ways Intron Retention events can be quantified and analysed - see “What are the different ways intron retention is measured?” below for further details.

NB3: Analyses can be saved or loaded to/from RDS files using the corresponding buttons.


What are the options for differential ASE analysis?
SpliceWiz provides wrappers to three established algorithms:

  • ASE_limma uses limma to model isoform counts as log-normal distributions. Limma is probably the fastest method and is ideal for large datasets. Time series analysis is available for this mode.
  • ASE_DESeq uses DESeq2 to model isoform counts as negative binomial distribution. This method is the most computationally expensive, but gives robust results. Time series analysis is also available for this mode
  • ASE_edgeR uses edgeR to model isoform counts as negative binomial distributions. SpliceWiz uses the quasi-likelihood method that deals better with variance at near-zero junction counts, resulting in reduced false positives.
  • ASE_DoubleExpSeq uses the lesser-known CRAN package DoubleExpSeq. This package uses the beta-binomial distribution to model isoform counts. The method is at least as fast as limma, but for now it is restricted to analysis between two groups (i.e. batch correction is not implemented)

We recommend the following for differential analysis:

  • For quick comparisons between two groups, where no batch factors are involved, we recommend DoubleExpSeq
  • For large complex experiments where quick results are required for a preliminary or exploratory analysis, we recommend limma
  • For final analysis where accuracy is paramount, we recommend edgeR or DESeq2
# Requires limma to be installed:
require("limma")
res_limma <- ASE_limma(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A"
)

# Requires DoubleExpSeq to be installed:
require("DoubleExpSeq")
res_DES <- ASE_DoubleExpSeq(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A"
)

# Requires DESeq2 to be installed:
require("DESeq2")
res_deseq <- ASE_DESeq(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A",
    n_threads = 1
)

# Requires edgeR to be installed:
require("edgeR")
res_edgeR <- ASE_edgeR(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A"
)


What are the different ways intron retention is measured?
Intron retention can be measured via two approaches.

The first (and preferred) approach is using IR-ratio. We presume that every intron is potentially retained (thus ignoring annotation). Given this results in many overlapping introns, SpliceWiz adjusts for this via the following:

  • Where there are mutually-overlapping introns, the less abundant intron is removed from the analysis. Abundance is estimated across the entire dataset, and less-abundant overlapping introns are removed at the makeSE() step.
  • IR-ratio measures included (intronic) abundance using an identical approach to IRFinder, i.e., it calculates the trimmed mean of sequencing depth across the intron, excluding annotated outliers (other exons, intronic elements). Given we cannot assume whether the exact intron corresponds to the major isoform, we estimate splicing abundance by summing junction reads that share either exon cluster as the intron of interest (SpliceOver method in SpliceWiz). Alternately, users can choose to use IRFinder’s SpliceMax method, summing junction reads that share either splice junction with the intron of interest. This choice is also made by the user at the makeSE() step.

At the differential analysis step, users can choose the following:

  • IRmode = "all" - all introns are potentially retained, use IR-ratio to quantify IR (EventType = "IR")
  • IRmode = "annotated" - only annotated retained intron events are considered, but use IR-ratio to quantify IR (EventType = "IR")
  • IRmode = "annotated_binary" - only annotated retained intron events are considered, use PSI to quantify IR - which considers the IR-transcript and the transcript isoform with the exactly-spliced intron as binary alternatives. Splicing of overlapping introns are not considered in PSI quantitation.
res_edgeR_allIntrons <- ASE_edgeR(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A",
    IRmode = "all"
)

res_edgeR_annotatedIR <- ASE_edgeR(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A",
    IRmode = "annotated"
)

res_edgeR_annotated_binaryIR <- ASE_edgeR(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A",
    IRmode = "annotated_binary"
)


Can I account for batch factors?
ASE_limma, ASE_edgeR, and ASE_DESeq can accept up to 2 categories of batches from which to normalize. For example, to normalize the analysis by the batch category, one would run:

require("edgeR")
res_edgeR_batchnorm <- ASE_edgeR(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A",
    batch1 = "batch"
)


Can I do time series analysis?
Time series analysis can be performed using limma, edgeR, and DESeq2.

For limma and edgeR, time series analysis is done using the ASE_limma_timeseries() and ASE_edgeR_timeseries() function. test_factor, despite its name, should be a column in colData(se) containing numerical values that represent time series data.

Note that these time series wrappers function requires the splines package.

colData(se.filtered)$timevar <- rep(c(0,1,2), 2)

require("splines")
require("limma")
res_limma_cont <- ASE_limma_timeseries(
    se = se.filtered,
    test_factor = "timevar"
)

require("splines")
require("edgeR")
res_edgeR_cont <- ASE_edgeR_timeseries(
    se = se.filtered,
    test_factor = "timevar"
)

For DESeq2, time series analysis is performed using the ASE_DESeq() funcction. The key difference is that, for time series analysis, simply do not specify the test_nom and test_denom parameters. As long as the test_factor contains numeric values, ASE_DESeq will treat it as a continuous variable. See the following example:

colData(se.filtered)$timevar <- rep(c(0,1,2), 2)

require("DESeq2")
res_deseq_cont <- ASE_DESeq(
    se = se.filtered,
    test_factor = "timevar"
)


Advanced GLM-based differential ASE analysis with edgeR

We have implemented wrapper functions enabling advanced users to perform differential ASE analysis by constructing their own design matrices. This allows users to evaluate effects of covariates in complex experimental models.

We will be building a separate vignette to illustrate the full functionality of these edgeR-based functions, but for now a quick example can be found in the relevant documentation, which can be viewed via:

?`ASE-GLM-edgeR`


Visualization

Volcano plots

Volcano plots show changes in PSI levels (log fold change, x axis) against statistical significance (-log10 p values, y axis):

library(ggplot2)

ggplot(res_edgeR,
        aes(x = logFC, y = -log10(FDR))) + 
    geom_point() +
    labs(title = "Differential analysis - B vs A",
         x = "Log2-fold change", y = "FDR (-log10)")

Can I visualize significant events for each modality of alternative splicing events?
Yes. We can use ggplot2’s facet_wrap function to separately plot volcanos for each modality of ASE. The type of ASE is contained in the EventType column of the differential results data frame.

ggplot(res_edgeR,
        aes(x = logFC, y = -log10(FDR))) + 
    geom_point() + facet_wrap(vars(EventType)) +
    labs(title = "Differential analysis - B vs A",
         x = "Log2-fold change", y = "FDR (-log10)")


Using the GUI
After following the previous sections including differential analysis, navigate to Display and then Volcano Plot. Notice that there will be a message that says “No events found. Consider relaxing some filters”.

This message occurs because our example dataset has no differential events that surpass an adjusted P value of less than 0.05 (which is the default filter setting). The SpliceWiz GUI avoids plotting all ASEs as this will crowd the visualization. In this example, change the Filter Events by to Nominal P value, and move the P-value/FDR threshold all the way to the right. There should now be a volcano plot but most events have near-zero significance because the default y-axis setting is to Plot adjusted P values. Switch this off to show the following:

Volcano Plot - GUI

Volcano Plot - GUI

You can customize this volcano plot using the following controls:

  1. Use the filtering panel to adjust the number of events to show. Here, events can be filtered by Nominal or adjusted P values, or by top events. We can also filter by “highlighted events” which is useful for gene ontology analysis and heatmaps later. Events can also be filtered by the type of alternative splicing
  2. The volcano plot can be faceted by the type of alternative splicing.
  3. The y axis can show either nominal or adjusted P values (Benjamini Hochberg).
  4. If NMD Mode is ON, the horizontal axis represents whether splicing is shifted towards (positive values) or away from (negative values) a NMD substrate transcript
  5. The plot can be exported as a pdf file (ggplot object)
  6. Clear settings

Also, on the (right hand side) main panel:

  1. This clears any selected events (selected events are highlighted in red)
  2. Toggle between using tools to select or deselect events
  3. Plotly interactive toolbox. Users can use the box-select (9a) or lasso- select (9b) to select (or de-select) one or more events of interest. Selected events are highlighted in other plots and non-selected points can be removed by selecting “highlighted events” in the event filter panel (1).


Scatter plots

Scatter plots are useful for showing splicing levels (percent-spliced-in, PSI) between two conditions. The results from differential analysis contains these values and can be plotted:

library(ggplot2)

ggplot(res_edgeR, aes(x = 100 * AvgPSI_B, y = 100 * AvgPSI_A)) + 
    geom_point() + xlim(0, 100) + ylim(0, 100) +
    labs(title = "PSI values across conditions",
         x = "PSI of condition B", y = "PSI of condition A")
#> Warning: Removed 1 row containing missing values or values outside the scale range
#> (`geom_point()`).

Using the GUI
After following the previous sections including differential analysis, navigate to Display and then Scatter Plot. After relaxing the event filters similar to the previous section, change the Variable to condition, X-axis condition to A and Y-axis condition to B. A scatter plot should be automatically generated as follows:

Scatter plot - GUI

Scatter plot - GUI

You can customize the scatter plot using the following controls:

  1. Again, the filtering panel is available to filter the number of events by significance threshold, rank, or by highlighted events, as explained in the previous section (volcano plots - GUI)
  2. Variable refers to the annotation table column name
  3. and (4) - X-axis and Y-axis dropdowns refer to the condition categories (that are used to contrast between two experimental conditions)
  4. If NMD Mode is ON, the PSI values are altered such that they represent the inclusion values of the NMD substrate (instead of that of the “included” isoform)
  5. The plot can be exported as a pdf file (ggplot object)
  6. Clear settings

As in the volcano plot, the scatter plot is interactive and points highlighted on this plot will stay highlighted in other plots.

  1. Clears all highlighted events
  2. Toggles between tools that select or de-select events
  3. Plotly interactive plot where users can use box or lasso select tools to select / de-select events.


Selecting ASEs of interest using the interactive plots (GUI)
SpliceWiz GUI generates plotly interactive figures. For volcano and scatter plots, points of interest can be selected using the lasso or box select tools. For example, we can select the top hits from the faceted volcano plot as shown:

Volcano plot - selecting ASEs of interest via the GUI

Volcano plot - selecting ASEs of interest via the GUI

These ASEs of interest will then be highlighted in other plots, for example scatter plot:

Scatter plot - highlighted ASE events

Scatter plot - highlighted ASE events


How do I generate average PSI values for many conditions?
SpliceWiz provides the makeMeanPSI() function that can generate mean PSI values for each condition of a condition category. For example, the below code will calculate the mean PSIs of each “batch” of this example experiment:

meanPSIs <- makeMeanPSI(
    se = se,
    condition = "batch",
    conditionList = list("K", "L", "M")
)


Gene ontology (GO) analysis

Gene ontology analysis using the GUI
Our working example does not have enough genes to demonstrate a workable gene ontology analysis. Instead, the following explains the controls found in the Gene Ontology panel:

Gene ontology analysis

Gene ontology analysis

The controls are as follows:

  1. Again, the filtering panel is available to filter the number of events by significance threshold, rank, or by highlighted events, as explained in the previous sections. Filtered events are considered “enriched events” in the analysis.
  2. Type of gene ontology category - “Biological Function”, “Molecular Function” or “Cellular Compartment”
  3. Enrich for upregulated, downregulated events, or both
  4. Which set of genes to consider as background. Default is genes belonging to All ASE Events (as they contain introns). Alternatives are to either test for genes of ASEs of the specified modality (a popup select input box will appear, allowing users to select which modalities of ASE to derive background genes), or by all genes in the genome (NB this option will contain intronless genes which may bias the result).
  5. Gene ontology plots can be saved to PDF
  6. Gene ID’s of enriched genes (6a) or background genes (6b) can be exported to file, allowing users to repeat GO analysis using their own gene ontology enrichment tool of choice.


SpliceWiz has built-in gene ontology analysis. For now, only a limited number of species are supported for Ensembl gene ID’s. To see a list of supported organisms:

getAvailableGO()
#>    [1] "Triticum aestivum"                                         
#>    [2] "Triticum aestivum_subsp._aestivum"                         
#>    [3] "Triticum vulgare"                                          
#>    [4] "Brassica napus"                                            
#>    [5] "Arachis hypogaea"                                          
#>    [6] "Hibiscus syriacus"                                         
#>    [7] "Acridium cancellatum"                                      
#>    [8] "Schistocerca cancellata"                                   
#>    [9] "Triticum dicoccoides"                                      
#>   [10] "Triticum turgidum_subsp._dicoccoides"                      
#>   [11] "Triticum turgidum_var._dicoccoides"                        
#>   [12] "Dendrohyas sarda"                                          
#>   [13] "Hyla arborea_sarda"                                        
#>   [14] "Hyla sarda"                                                
#>   [15] "Locusta gregaria"                                          
#>   [16] "Schistocerca gregaria"                                     
#>   [17] "Gossypium hirsutum"                                        
#>   [18] "Gossypium hirsutum_subsp._mexicanum"                       
#>   [19] "Gossypium lanceolatum"                                     
#>   [20] "Gossypium purpurascens"                                    
#>   [21] "Camelina sativa"                                           
#>   [22] "Carassius auratus_gibelio"                                 
#>   [23] "Carassius gibelio_gibelio"                                 
#>   [24] "Carassius gibelio"                                         
#>   [25] "Carassius gibelio_subsp._gibelio"                          
#>   [26] "Cyprinus gibelio"                                          
#>   [27] "Schistocerca piceifrons"                                   
#>   [28] "Papaver somniferum"                                        
#>   [29] "Zingiber officinale"                                       
#>   [30] "Trichomonas vaginalis_G3"                                  
#>   [31] "Trichomonas vaginalis_strain_G3"                           
#>   [32] "Carassius auratus"                                         
#>   [33] "Carassius carassius_auratus"                               
#>   [34] "Cyprinus auratus"                                          
#>   [35] "Helianthus annuus"                                         
#>   [36] "Schistocerca americana"                                    
#>   [37] "Acipenser ruthenus"                                        
#>   [38] "Schistocerca serialis_cubense"                             
#>   [39] "Panicum virgatum"                                          
#>   [40] "Nicotiana tabacum"                                         
#>   [41] "Oncorhynchus mykiss"                                       
#>   [42] "Oncorhynchus nerka_mykiss"                                 
#>   [43] "Parasalmo mykiss"                                          
#>   [44] "Salmo mykiss"                                              
#>   [45] "Schistocerca nitens"                                       
#>   [46] "Schistocerca vaga"                                         
#>   [47] "Salvia splendens"                                          
#>   [48] "Carassius carassius"                                       
#>   [49] "Cyprinus carassius"                                        
#>   [50] "Vicia villosa"                                             
#>   [51] "Camellia sinensis"                                         
#>   [52] "Thea sinensis"                                             
#>   [53] "Oncorhynchus keta"                                         
#>   [54] "Salmo keta"                                                
#>   [55] "Pisum sativum"                                             
#>   [56] "Salmo salar"                                               
#>   [57] "Raphanus sativus"                                          
#>   [58] "Oncorhynchus kisutch"                                      
#>   [59] "Oncorhyncus kisutch"                                       
#>   [60] "Salmo kisatch"                                             
#>   [61] "Lolium rigidum"                                            
#>   [62] "Aegilops squarrosa_subsp._squarrosa"                       
#>   [63] "Aegilops squarrosa"                                        
#>   [64] "Aegilops tauschii"                                         
#>   [65] "Patropyrum tauschii_subsp._tauschii"                       
#>   [66] "Patropyrum tauschii"                                       
#>   [67] "Triticum aegilops"                                         
#>   [68] "Triticum tauschii"                                         
#>   [69] "Salmo trutta"                                              
#>   [70] "Cryptomeria japonica"                                      
#>   [71] "Coregonus clupeaformis"                                    
#>   [72] "Salmo clupeaformis"                                        
#>   [73] "Oncorhynchus gorbuscha"                                    
#>   [74] "Salmo gorbuscha"                                           
#>   [75] "Cyprinus carpio"                                           
#>   [76] "Glycine max_subsp._soja"                                   
#>   [77] "Glycine soja"                                              
#>   [78] "Salmo fontinalis"                                          
#>   [79] "Salvelinus fontinalis"                                     
#>   [80] "Glycine max"                                               
#>   [81] "Phaseolus max"                                             
#>   [82] "Chenopodium quinoa"                                        
#>   [83] "Hordeum sativum"                                           
#>   [84] "Hordeum vulgare_subsp._vulgare"                            
#>   [85] "Hordeum vulgare_var._nudum"                                
#>   [86] "Hordeum vulgare_var._vulgare"                              
#>   [87] "Festuca perennis_(L.)_Columbus_&_J.P.Sm.,_2010"            
#>   [88] "Festuca perennis"                                          
#>   [89] "Lolium perenne"                                            
#>   [90] "Lolium vulgare"                                            
#>   [91] "Coffea arabica"                                            
#>   [92] "Barbus grahami"                                            
#>   [93] "Sinocyclocheilus grahami"                                  
#>   [94] "Sinocyclocheilus rhinocerous"                              
#>   [95] "Gossypium arboreum"                                        
#>   [96] "Brassica oleracea"                                         
#>   [97] "Malus sylvestris"                                          
#>   [98] "Pyrus malus_var._sylvestris"                               
#>   [99] "Astyanax mexicanus"                                        
#>  [100] "Tetragonopterus mexicanus"                                 
#>  [101] "Arachis stenosperma"                                       
#>  [102] "Prosopis alba"                                             
#>  [103] "Sinocyclocheilus anshuiensis"                              
#>  [104] "Brassica rapa"                                             
#>  [105] "Lactuca sativa"                                            
#>  [106] "Dreissena polymorpha"                                      
#>  [107] "Mytilus polymorphus"                                       
#>  [108] "Hydractinia symbiolongicarpus"                             
#>  [109] "Hevea brasiliensis"                                        
#>  [110] "Oncorhynchus tschawytscha"                                 
#>  [111] "Oncorhynchus tshawytscha"                                  
#>  [112] "Salmo tshawytscha"                                         
#>  [113] "Arachis ipaensis"                                          
#>  [114] "Zea mays"                                                  
#>  [115] "Zea mays_var._japonica"                                    
#>  [116] "Salmo namaycush"                                           
#>  [117] "Salvelinus namaycush"                                      
#>  [118] "Capsicum annuum"                                           
#>  [119] "Brienomyrus brachyistius"                                  
#>  [120] "Marcusenius brachyistius"                                  
#>  [121] "Convolvulus nil"                                           
#>  [122] "Ipomoea nil"                                               
#>  [123] "Pharbitis nil"                                             
#>  [124] "Olea europaea_subsp._europaea_var._sylvestris"             
#>  [125] "Olea europaea_var._oleaster"                               
#>  [126] "Olea europaea_var._sylvestris"                             
#>  [127] "Olea europea_subsp._sylvestris"                            
#>  [128] "Alosa sapidissima"                                         
#>  [129] "Clupea sapidissima"                                        
#>  [130] "Carpiodes asiaticus"                                       
#>  [131] "Myxocyprinus asiaticus"                                    
#>  [132] "Actinidia eriantha"                                        
#>  [133] "Gossypium raimondii"                                       
#>  [134] "Salmo alpinus"                                             
#>  [135] "Salvelinus alpinus"                                        
#>  [136] "Catostomus texanus"                                        
#>  [137] "Xyrauchen texanus"                                         
#>  [138] "Doryrhamphus excisus"                                      
#>  [139] "Quercus lobata"                                            
#>  [140] "Malus communis"                                            
#>  [141] "Malus domestica"                                           
#>  [142] "Malus pumila_auct."                                        
#>  [143] "Malus pumila_var._domestica"                               
#>  [144] "Malus sylvestris_var._domestica"                           
#>  [145] "Malus x_domestica"                                         
#>  [146] "Pyrus malus"                                               
#>  [147] "Pyrus malus_var._domestica"                                
#>  [148] "Quercus suber"                                             
#>  [149] "Oncorhynchus nerka"                                        
#>  [150] "Salmo nerka"                                               
#>  [151] "Nicotiana tomentosiformis"                                 
#>  [152] "Carya illinoensis"                                         
#>  [153] "Carya illinoinensis"                                       
#>  [154] "Mercenaria mercenaria"                                     
#>  [155] "Venus mercenaria"                                          
#>  [156] "Quercus robur"                                             
#>  [157] "Durio zibethinus"                                          
#>  [158] "Pongo abelii"                                              
#>  [159] "Pongo pygmaeus_abelii"                                     
#>  [160] "Pongo pygmaeus_abeli"                                      
#>  [161] "Mya arenaria"                                              
#>  [162] "Arachis duranensis"                                        
#>  [163] "Arachis spegazzinii"                                       
#>  [164] "Pyrus x_bretschneideri"                                    
#>  [165] "Trifolium pratense"                                        
#>  [166] "Gorilla gorilla_gorilla"                                   
#>  [167] "Cobitis anguillicaudata"                                   
#>  [168] "Misgurnus anguillicaudatus"                                
#>  [169] "Scaphiopus bombifrons"                                     
#>  [170] "Spea bombifrons"                                           
#>  [171] "Haliotis rufenscens"                                       
#>  [172] "Haliotis rufescens"                                        
#>  [173] "Oreochromis nilotica"                                      
#>  [174] "Oreochromis niloticus"                                     
#>  [175] "Perca nilotica"                                            
#>  [176] "Tilapia nilotica"                                          
#>  [177] "Acropora convexa"                                          
#>  [178] "Acropora millepora"                                        
#>  [179] "Acropora singularis"                                       
#>  [180] "Cebus apella"                                              
#>  [181] "Sapajus apella"                                            
#>  [182] "Simia apella"                                              
#>  [183] "Eucalyptus grandis"                                        
#>  [184] "Dasypus novemcinctus"                                      
#>  [185] "Callithrix jacchus_jacchus"                                
#>  [186] "Callithrix jacchus"                                        
#>  [187] "Simia jacchus"                                             
#>  [188] "Pistacia vera"                                             
#>  [189] "greater Indian_fruit_bat"                                  
#>  [190] "Pteropus giganteus"                                        
#>  [191] "Pteropus medius"                                           
#>  [192] "Salvia miltiorhiza"                                        
#>  [193] "Salvia miltiorrhiza"                                       
#>  [194] "Daphnia pulicaria"                                         
#>  [195] "Magnolia sinica"                                           
#>  [196] "Manglietia sinica"                                         
#>  [197] "Manglietiastrum sinicum"                                   
#>  [198] "Pachylarnax sinica_(Y.W.Law)_N.H.Xia_&_C.Y.Wu"             
#>  [199] "Rosa chinensis"                                            
#>  [200] "Rosa indica_auct.,_non_L."                                 
#>  [201] "Mytilus californianus"                                     
#>  [202] "Pteropus vampyrus"                                         
#>  [203] "Vespertilio vampyrus"                                      
#>  [204] "Chinemys reevesii"                                         
#>  [205] "Chinemys reevesi"                                          
#>  [206] "Emys reevesii"                                             
#>  [207] "Geoclemys reevesii"                                        
#>  [208] "Geoclemys reevessi"                                        
#>  [209] "Mauremys reevesii"                                         
#>  [210] "Mauremys reevesi"                                          
#>  [211] "Choloepus brasiliensis_Fitzinger_1871"                     
#>  [212] "Choloepus brasiliensis"                                    
#>  [213] "Choloepus didactylus"                                      
#>  [214] "Macaca nemestrina"                                         
#>  [215] "Simia nemestrina"                                          
#>  [216] "Bubalus arnee_carabanensis"                                
#>  [217] "Bubalus bubalis_carabanesis"                               
#>  [218] "Bubalus carabanensis_carabanensis"                         
#>  [219] "Bubalus carabanensis"                                      
#>  [220] "Lotus corniculatus_var._japonicus"                         
#>  [221] "Lotus japonicus"                                           
#>  [222] "Nicotiana sylvestris"                                      
#>  [223] "Tupaia belangeri_chinensis"                                
#>  [224] "Tupaia chinensis"                                          
#>  [225] "Clarias gariepinus"                                        
#>  [226] "Clarias lazera"                                            
#>  [227] "Silurus gariepinus"                                        
#>  [228] "Canis dingo"                                               
#>  [229] "Canis familiaris_dingo"                                    
#>  [230] "Canis lupus_dingo"                                         
#>  [231] "Barbus tetrazona"                                          
#>  [232] "Capoeta tetrazona"                                         
#>  [233] "Puntigrus tetrazona"                                       
#>  [234] "Puntius tetrazona"                                         
#>  [235] "Systomus tetrazona"                                        
#>  [236] "Lycium ferocissimum"                                       
#>  [237] "Nicotiana attenuata"                                       
#>  [238] "Denticeps clupeoides"                                      
#>  [239] "Octodon degus"                                             
#>  [240] "Haliotis rubra"                                            
#>  [241] "Aedes albopictus"                                          
#>  [242] "Stegomyia albopicta"                                       
#>  [243] "Spinacia oleracea"                                         
#>  [244] "Paramecium aurelia_syngen_4"                               
#>  [245] "Paramecium tetraurelia"                                    
#>  [246] "Salvia hispanica"                                          
#>  [247] "Medicago truncatula"                                       
#>  [248] "Crassostrea virginica"                                     
#>  [249] "Ostrea virginica"                                          
#>  [250] "Felis catus"                                               
#>  [251] "Felis domesticus"                                          
#>  [252] "Felis silvestris_catus"                                    
#>  [253] "Anubis baboon"                                             
#>  [254] "Papio anubis"                                              
#>  [255] "Papio cynocephalus_anubis"                                 
#>  [256] "Papio doguera"                                             
#>  [257] "Papio hamadryas_anubis"                                    
#>  [258] "Papio hamadryas_doguera"                                   
#>  [259] "Pongo pygmaeus"                                            
#>  [260] "Simia pygmaeus"                                            
#>  [261] "Sorex etruscus"                                            
#>  [262] "Suncus etruscus"                                           
#>  [263] "Prosopis cineraria"                                        
#>  [264] "Nycticebus coucang"                                        
#>  [265] "Tardigradus coucang"                                       
#>  [266] "Rhododendron vialii"                                       
#>  [267] "Pan paniscus"                                              
#>  [268] "Nematostella vectensis"                                    
#>  [269] "Ixodes dammini"                                            
#>  [270] "Ixodes scapularis"                                         
#>  [271] "Lupinus angustifolius"                                     
#>  [272] "Ipomoea triloba"                                           
#>  [273] "Equus asinus"                                              
#>  [274] "Emiliania huxleyi_CCMP1516"                                
#>  [275] "Emiliania huxleyi_CCMP2090"                                
#>  [276] "Mangifera indica"                                          
#>  [277] "Pteropus alecto"                                           
#>  [278] "Rana temporaria"                                           
#>  [279] "Crassostrea gigas"                                         
#>  [280] "Ostrea gigas"                                              
#>  [281] "Crassostrea angulata"                                      
#>  [282] "Etheostoma spectabile"                                     
#>  [283] "Poecilichthys spectabilis"                                 
#>  [284] "Macadamia integrifolia"                                    
#>  [285] "Megalobrama amblycephala"                                  
#>  [286] "Halichoerus grypus"                                        
#>  [287] "Phoca grypus"                                              
#>  [288] "Juglans regia"                                             
#>  [289] "Selaginella moellendorffii"                                
#>  [290] "Selaginella moellendorfii"                                 
#>  [291] "Pleuronectes platessa"                                     
#>  [292] "Presbytis francoisi"                                       
#>  [293] "Trachypithecus francoisi"                                  
#>  [294] "Tripterygium wilfordii"                                    
#>  [295] "Argiope bruennichi"                                        
#>  [296] "Lepus cuniculus"                                           
#>  [297] "Oryctolagus cuniculus"                                     
#>  [298] "Huro salmoides"                                            
#>  [299] "Labrus salmoides"                                          
#>  [300] "Labrus salmonides"                                         
#>  [301] "Micropterus nigricans"                                     
#>  [302] "Micropterus salmoides"                                     
#>  [303] "Solanum stenotomum"                                        
#>  [304] "Heterocephalus glaber"                                     
#>  [305] "Neosciurus carolinensis"                                   
#>  [306] "Sciurus carolinensis"                                      
#>  [307] "Cervus elaphus"                                            
#>  [308] "Polyodon spathula"                                         
#>  [309] "Squalus spathula"                                          
#>  [310] "Gadus chalcogrammus"                                       
#>  [311] "Theragra chalcogramma_finnmarchica"                        
#>  [312] "Theragra chalcogramma"                                     
#>  [313] "Theragra finnmarchica"                                     
#>  [314] "Nothobranchius furzeri"                                    
#>  [315] "Bos bubalis"                                               
#>  [316] "Bubalus arnee_bubalis"                                     
#>  [317] "Bubalus bubalis"                                           
#>  [318] "Pleuronectes solea"                                        
#>  [319] "Solea solea"                                               
#>  [320] "Solea vulgaris"                                            
#>  [321] "Mastomys coucha"                                           
#>  [322] "Praomys coucha"                                            
#>  [323] "Impatiens glandulifera"                                    
#>  [324] "Dermacentor andersoni"                                     
#>  [325] "Felis nebulosa"                                            
#>  [326] "Neofelis nebulosa"                                         
#>  [327] "Pteropus egyptiacus"                                       
#>  [328] "Rousettus aegyptiacus"                                     
#>  [329] "Rousettus aegypticus"                                      
#>  [330] "Rousettus egyptiacus"                                      
#>  [331] "Phoenix dactylifera"                                       
#>  [332] "Pimephales promelas"                                       
#>  [333] "Ostrea edulis"                                             
#>  [334] "Cebus capucinus_imitator"                                  
#>  [335] "Cebus imitator"                                            
#>  [336] "Peromyscus maniculatus_bairdii"                            
#>  [337] "Gasterosteus pungitius"                                    
#>  [338] "Pungitius pungitius"                                       
#>  [339] "Populus alba"                                              
#>  [340] "Cricetus auratus"                                          
#>  [341] "Golden hamsters"                                           
#>  [342] "Mesocricetus auratus"                                      
#>  [343] "Syrian hamsters"                                           
#>  [344] "Chromis aureus"                                            
#>  [345] "Oreochromis aurea"                                         
#>  [346] "Oreochromis aureus"                                        
#>  [347] "Daucus carota_subsp._sativus"                              
#>  [348] "Daucus carota_var._sativus"                                
#>  [349] "Dermacentor silvarum"                                      
#>  [350] "Hylobates syndactylus"                                     
#>  [351] "Simia syndactyla"                                          
#>  [352] "Symphalangus syndactylus"                                  
#>  [353] "Saccharolobus solfataricus"                                
#>  [354] "Sulfolobus solfataricus"                                   
#>  [355] "Felis geoffroyi"                                           
#>  [356] "Leopardus geoffroyi"                                       
#>  [357] "Oncifelis geoffroyi"                                       
#>  [358] "Felis yagouaroundi"                                        
#>  [359] "Herpailurus yagouaroundi"                                  
#>  [360] "Herpailurus yaguarondi"                                    
#>  [361] "Puma yagouaroundii"                                        
#>  [362] "Puma yagouaroundi"                                         
#>  [363] "Cervus canadensis"                                         
#>  [364] "Populus diversifolia"                                      
#>  [365] "Populus euphratica"                                        
#>  [366] "Cucurbita pepo_subsp._pepo"                                
#>  [367] "Cucurbita pepo_var._medullosa"                             
#>  [368] "Cucurbita pepo_var._pepo"                                  
#>  [369] "Macaca cynomolgus"                                         
#>  [370] "Macaca fascicularis"                                       
#>  [371] "Macaca irus"                                               
#>  [372] "Simia fascicularis"                                        
#>  [373] "Emys muticus"                                              
#>  [374] "Geoclemmys mutica"                                         
#>  [375] "Mauremys mutica"                                           
#>  [376] "Suricata suricatta"                                        
#>  [377] "Viverra suricatta"                                         
#>  [378] "Hylobates moloch"                                          
#>  [379] "Simia moloch"                                              
#>  [380] "Solanum dulcamara"                                         
#>  [381] "Cucurbita moschata"                                        
#>  [382] "Coffea eugeniodes"                                         
#>  [383] "Coffea eugenioides"                                        
#>  [384] "Cucurbita maxima"                                          
#>  [385] "Colobus tephrosceles"                                      
#>  [386] "Piliocolobus tephrosceles"                                 
#>  [387] "Procolobus badius_tephrosceles"                            
#>  [388] "Procolobus rufomitratus_tephrosceles"                      
#>  [389] "Labrus bergylta"                                           
#>  [390] "Centropristis striata"                                     
#>  [391] "Labrus striatus"                                           
#>  [392] "Oryza sativa_(japonica_cultivar-group)"                    
#>  [393] "Oryza sativa_Japonica_Group"                               
#>  [394] "Oryza sativa_subsp._japonica"                              
#>  [395] "Jaculus jaculus"                                           
#>  [396] "Mus jaculus"                                               
#>  [397] "Dioscorea cayenensis_subsp._rotundata"                     
#>  [398] "Dioscorea rotundata"                                       
#>  [399] "Cercopithecus aethiops_sabaeus"                            
#>  [400] "Cercopithecus sabaeus"                                     
#>  [401] "Cercopithecus sabeus"                                      
#>  [402] "Chlorocebus aethiops_sabaeus"                              
#>  [403] "Chlorocebus aethiops_sabeus"                               
#>  [404] "Chlorocebus sabaeus"                                       
#>  [405] "Chlorocebus sabeus"                                        
#>  [406] "Simia sabaea"                                              
#>  [407] "Marmota monax"                                             
#>  [408] "Mus monax"                                                 
#>  [409] "Pygathrix roxellana"                                       
#>  [410] "Rhinopithecus roxellana"                                   
#>  [411] "Semnopithecus roxellana"                                   
#>  [412] "Callorhinus ursinus"                                       
#>  [413] "Callorhynus ursius"                                        
#>  [414] "Phoca ursina"                                              
#>  [415] "Cricetulus barabensis_griseus"                             
#>  [416] "Cricetulus griseus"                                        
#>  [417] "Elephantulus edwardii"                                     
#>  [418] "Macroscelides edwardii"                                    
#>  [419] "Cobitis heteroclita"                                       
#>  [420] "Fundulus heteroclitus"                                     
#>  [421] "Neothunnus macropterus"                                    
#>  [422] "Scomber albacares"                                         
#>  [423] "Thunnus albacares"                                         
#>  [424] "Telopea speciosissima"                                     
#>  [425] "Danio aesculapii"                                          
#>  [426] "Marmota marmota_marmota"                                   
#>  [427] "Apodemus sylvaticus"                                       
#>  [428] "Mus sylvaticus"                                            
#>  [429] "Sylvaemus sylvaticus"                                      
#>  [430] "Populus balsamifera_subsp._trichocarpa"                    
#>  [431] "Populus trichocarpa"                                       
#>  [432] "Mercurialis annua"                                         
#>  [433] "Syzygium oleosum"                                          
#>  [434] "Citellus tridecemlineatus"                                 
#>  [435] "Ictidomys tridecemlineatus"                                
#>  [436] "Spermophilus tridecemlineatus"                             
#>  [437] "Ovis ammon_aries"                                          
#>  [438] "Ovis aries"                                                
#>  [439] "Ovis orientalis_aries"                                     
#>  [440] "Ovis ovis"                                                 
#>  [441] "Solanum verrucosum"                                        
#>  [442] "Leo pardus"                                                
#>  [443] "Panthera pardus"                                           
#>  [444] "Microtus oregoni"                                          
#>  [445] "Arabidopsis lyrata_subsp._lyrata"                          
#>  [446] "Arabis lyrata_subsp._lyrata"                               
#>  [447] "Arabis lyrata"                                             
#>  [448] "Cardaminopsis lyrata"                                      
#>  [449] "Manihot esculenta"                                         
#>  [450] "Manihot utilissima"                                        
#>  [451] "Mustela erminea"                                           
#>  [452] "Phaseolus unguiculatus"                                    
#>  [453] "Vigna unguiculata"                                         
#>  [454] "Lycopersicon pennellii_(Correll)_D'Arcy,_1982"             
#>  [455] "Solanum pennellii_Correll,_1958"                           
#>  [456] "Solanum pennellii"                                         
#>  [457] "Setaria viridis"                                           
#>  [458] "Musa AA_Group"                                             
#>  [459] "Musa acuminata_AA_Group"                                   
#>  [460] "Musa acuminata"                                            
#>  [461] "Musa nana"                                                 
#>  [462] "Gymnostomus macrolepis"                                    
#>  [463] "Onychostoma macrolepis"                                    
#>  [464] "Scaphesthes macrolepis"                                    
#>  [465] "Varicorhinus macrolepis"                                   
#>  [466] "Varicorhinus (Scaphesthes)_macrolepis"                     
#>  [467] "Oryza glaberrima"                                          
#>  [468] "Pelteobagrus fulvidraco"                                   
#>  [469] "Pimelodus fulvidraco"                                      
#>  [470] "Pseudobagrus fulvidraco"                                   
#>  [471] "Tachysurus fulvidraco"                                     
#>  [472] "Hylobates concolor_leucogenys"                             
#>  [473] "Hylobates concolor_leucogyneus"                            
#>  [474] "Hylobates leucogenys_leucogenys"                           
#>  [475] "Hylobates leucogenys"                                      
#>  [476] "Nomascus leucogenys_leucogenys"                            
#>  [477] "Nomascus leucogenys"                                       
#>  [478] "Nomascus leukogenys"                                       
#>  [479] "Nannospalax ehrenbergi_galili"                             
#>  [480] "Nannospalax galili"                                        
#>  [481] "Spalax galili"                                             
#>  [482] "Equus caballus"                                            
#>  [483] "Equus przewalskii_f._caballus"                             
#>  [484] "Equus przewalskii_forma_caballus"                          
#>  [485] "Thunnus maccoyii"                                          
#>  [486] "Thynnus maccoyii"                                          
#>  [487] "Chromis diagramma"                                         
#>  [488] "Simochromis diagramma"                                     
#>  [489] "Diplophysa dalaica"                                        
#>  [490] "Triplophysa dalaica"                                       
#>  [491] "Panthera tigris"                                           
#>  [492] "Strongylocentrotus purpuratus"                             
#>  [493] "Lucioperca lucioperca"                                     
#>  [494] "Perca lucioperca"                                          
#>  [495] "Sander lucioperca"                                         
#>  [496] "Stizostedion lucioperca"                                   
#>  [497] "Dipodomys spectabilis"                                     
#>  [498] "Acinonyx jubatus"                                          
#>  [499] "Felis jubata"                                              
#>  [500] "Conyza canadensis"                                         
#>  [501] "Erigeron canadensis"                                       
#>  [502] "Mustela lutreola"                                          
#>  [503] "Camelus bactrianus_ferus"                                  
#>  [504] "Camelus ferus"                                             
#>  [505] "Cajanus cajan"                                             
#>  [506] "Didelphys domestica"                                       
#>  [507] "Monodelphis domestica"                                     
#>  [508] "Pygathrix bieti"                                           
#>  [509] "Rhinopithecus bieti"                                       
#>  [510] "Saimiri boliviensis"                                       
#>  [511] "Hesperomys eremicus"                                       
#>  [512] "Peromyscus eremicus"                                       
#>  [513] "Arabidopsis salsuginea"                                    
#>  [514] "Eutrema salsugineum"                                       
#>  [515] "Hesperis salsuginea"                                       
#>  [516] "Sisymbrium salsugineum"                                    
#>  [517] "Stenophragma salsugineum"                                  
#>  [518] "Thellungiella salsuginea"                                  
#>  [519] "Thelypodium salsugineum"                                   
#>  [520] "Coetomys damarensis"                                       
#>  [521] "Cryptomys damarensis"                                      
#>  [522] "Fukomys damarensis"                                        
#>  [523] "Gadus morhua"                                              
#>  [524] "Leptonychotes weddellii"                                   
#>  [525] "Leptonychotes weddelli"                                    
#>  [526] "Otaria weddellii"                                          
#>  [527] "Grammomys dolichurus_surdaster"                            
#>  [528] "Grammomys surdaster"                                       
#>  [529] "Thamnomys surdaster"                                       
#>  [530] "Solanum tuberosum"                                         
#>  [531] "Andropogon sorghum"                                        
#>  [532] "Sorghum bicolor"                                           
#>  [533] "Sorghum bicolor_subsp._bicolor"                            
#>  [534] "Sorghum nervosum"                                          
#>  [535] "Sorghum saccharatum"                                       
#>  [536] "Sorghum vulgare"                                           
#>  [537] "Holocentrus calcarifer"                                    
#>  [538] "Lates calcarifer"                                          
#>  [539] "Hippopotamus amphibius_kiboko"                             
#>  [540] "Ixodes sanguineus"                                         
#>  [541] "Rhipicephalus sanguineus"                                  
#>  [542] "Clupea harengus_harengus"                                  
#>  [543] "Clupea harengus"                                           
#>  [544] "Bos indicus_x_Bos_taurus"                                  
#>  [545] "Bos primigenius_indicus_x_Bos_primigenius_taurus"          
#>  [546] "Bos taurus_indicus_x_Bos_taurus_taurus"                    
#>  [547] "Bos taurus_x_Bos_indicus"                                  
#>  [548] "Chrysochloris asiatica"                                    
#>  [549] "Talpa asiatica"                                            
#>  [550] "Macacus gelada"                                            
#>  [551] "Theropithecus gelada"                                      
#>  [552] "Bufo bufo"                                                 
#>  [553] "Rana bufo"                                                 
#>  [554] "Maylandia zebra"                                           
#>  [555] "Metriaclima zebra"                                         
#>  [556] "Pseudotropheus sp._'Pseudotropheus_zebra_complex'"         
#>  [557] "Pseudotropheus zebra"                                      
#>  [558] "Otaria californiana"                                       
#>  [559] "Zalophus californianus"                                    
#>  [560] "Ictalurus punctatus"                                       
#>  [561] "Silurus punctatus"                                         
#>  [562] "Mus caroli"                                                
#>  [563] "Mus formosanus"                                            
#>  [564] "Oryx dammah"                                               
#>  [565] "Camelus dromedarius"                                       
#>  [566] "Asparagus officinalis"                                     
#>  [567] "Amaranthus gangeticus"                                     
#>  [568] "Amaranthus mangostanus"                                    
#>  [569] "Amaranthus tricolor"                                       
#>  [570] "Pneumatophorus japonicus"                                  
#>  [571] "Scomber japonicus"                                         
#>  [572] "Lutra lutra"                                               
#>  [573] "Peromyscus leucopus"                                       
#>  [574] "Perca fluviatilis"                                         
#>  [575] "Pagothenia bernacchii"                                     
#>  [576] "Pseudotrematomus bernacchii"                               
#>  [577] "Trematomus bernacchii"                                     
#>  [578] "Trematomus bernacchi"                                      
#>  [579] "Phascolarctos cinereus"                                    
#>  [580] "Mustela furo"                                              
#>  [581] "Mustela putorius_furo"                                     
#>  [582] "Chaetochloa italica"                                       
#>  [583] "Panicum italicum"                                          
#>  [584] "Pennisetum macrochaetum"                                   
#>  [585] "Setaria italica"                                           
#>  [586] "Setaria viridis_subsp._italica"                            
#>  [587] "Elaeis guineensis"                                         
#>  [588] "Mus rattus"                                                
#>  [589] "Rattus rattoides"                                          
#>  [590] "Rattus rattus"                                             
#>  [591] "Rattus wroughtoni"                                         
#>  [592] "Acropora digitifera"                                       
#>  [593] "Madrepora digitifera"                                      
#>  [594] "Leo leo"                                                   
#>  [595] "Panthera leo"                                              
#>  [596] "Echinops telfairii"                                        
#>  [597] "Echinops telfairi"                                         
#>  [598] "Madrepora verrucosa"                                       
#>  [599] "Pocillopora danae"                                         
#>  [600] "Pocillopora verrucosa"                                     
#>  [601] "Myotis daubentonii"                                        
#>  [602] "Myotis daubentoni"                                         
#>  [603] "Vespertilio daubentonii"                                   
#>  [604] "Limia formosa"                                             
#>  [605] "Mollienesia formosa"                                       
#>  [606] "Poecilia formosa"                                          
#>  [607] "Phyllostomus discolor"                                     
#>  [608] "Aurata aurata"                                             
#>  [609] "Sparus aurata"                                             
#>  [610] "Sparus auratus"                                            
#>  [611] "Microcebus murinus"                                        
#>  [612] "Peromyscus californicus_insignis"                          
#>  [613] "Peromyscus californicus_subsp._insignis"                   
#>  [614] "Galago garnettii"                                          
#>  [615] "Galago garnetti"                                           
#>  [616] "Otolemur garnettii"                                        
#>  [617] "Arvicanthis niloticus"                                     
#>  [618] "Didelphis ursina"                                          
#>  [619] "Vombatus ursinus"                                          
#>  [620] "Phaseolus angularis"                                       
#>  [621] "Vigna angularis"                                           
#>  [622] "Haitia acuta"                                              
#>  [623] "Physa acuta"                                               
#>  [624] "Physa heterostropha"                                       
#>  [625] "Physa integra"                                             
#>  [626] "Physella acuta"                                            
#>  [627] "Physella heterostropha"                                    
#>  [628] "Physella integra"                                          
#>  [629] "Ctenopharyngodon idella"                                   
#>  [630] "Ctenopharyngodon idellus"                                  
#>  [631] "Leuciscus idella"                                          
#>  [632] "Thalassophryne amazonica"                                  
#>  [633] "Cyprinus rohita"                                           
#>  [634] "Labeo rohita"                                              
#>  [635] "Talpa occidentalis"                                        
#>  [636] "Bombina bombina"                                           
#>  [637] "Rana bombina"                                              
#>  [638] "Cavia aperea_porcellus"                                    
#>  [639] "Cavia cobaya"                                              
#>  [640] "Cavia porcellus"                                           
#>  [641] "Mus porcellus"                                             
#>  [642] "Odocoileus virginianus"                                    
#>  [643] "Amphibalanus amphitrite"                                   
#>  [644] "Balanus amphitrite"                                        
#>  [645] "Panicum hallii"                                            
#>  [646] "Angill angill"                                             
#>  [647] "Anguilla anguilla_anguilla"                                
#>  [648] "Anguilla anguilla"                                         
#>  [649] "Muraena anguilla"                                          
#>  [650] "Orcinus orca"                                              
#>  [651] "Cannabis sativa"                                           
#>  [652] "Penaeus bubulus"                                           
#>  [653] "Penaeus carinatus"                                         
#>  [654] "Penaeus durbani"                                           
#>  [655] "Penaeus monodon"                                           
#>  [656] "Penaeus (Penaeus)_monodon"                                 
#>  [657] "Didelphis vulpecula"                                       
#>  [658] "Trichosurus vulpecula"                                     
#>  [659] "Myotis lucifugus"                                          
#>  [660] "Vespertilio lucifugus"                                     
#>  [661] "Brachypodium distachyon"                                   
#>  [662] "Aotus nancymaae"                                           
#>  [663] "Aotus nancymai"                                            
#>  [664] "Astatotilapia calliptera"                                  
#>  [665] "Chromis callipterus"                                       
#>  [666] "Ctenochromis callipterus"                                  
#>  [667] "Haplochromis calliptera"                                   
#>  [668] "Haplochromis callipterus"                                  
#>  [669] "Rhamnus zizyphus"                                          
#>  [670] "Ziziphus jujuba"                                           
#>  [671] "Ailuropoda melanoleuca"                                    
#>  [672] "Micropterus dolomieu"                                      
#>  [673] "Micropterus velox"                                         
#>  [674] "Lycopersicon esculentum"                                   
#>  [675] "Lycopersicon esculentum_var._esculentum"                   
#>  [676] "Solanum esculentum"                                        
#>  [677] "Solanum lycopersicum"                                      
#>  [678] "Solanum lycopersicum_var._humboldtii"                      
#>  [679] "Poecilia mexicana"                                         
#>  [680] "Manis pentadactyla"                                        
#>  [681] "Meles meles"                                               
#>  [682] "Ursus meles"                                               
#>  [683] "Ornithorhynchus anatinus"                                  
#>  [684] "Platypus anatinus"                                         
#>  [685] "Felis uncia"                                               
#>  [686] "Panthera uncia"                                            
#>  [687] "Uncia uncia"                                               
#>  [688] "Alligator mississippiensis"                                
#>  [689] "Crocodilus mississipiensis"                                
#>  [690] "Myrmecophaga aculeata"                                     
#>  [691] "Tachyglossus aculeatus"                                    
#>  [692] "Colossoma macropomum"                                      
#>  [693] "Myletes macropomus"                                        
#>  [694] "Cordylus capensis"                                         
#>  [695] "Cordylus (Hemicordylus)_capensis"                          
#>  [696] "Hemicordylus capensis"                                     
#>  [697] "Pseudocordylus capensis"                                   
#>  [698] "Zonurus capensis"                                          
#>  [699] "Eptesicus fuscus"                                          
#>  [700] "Vespertilio fuscus"                                        
#>  [701] "Dromiciops australis"                                      
#>  [702] "Dromiciops gliroides"                                      
#>  [703] "Camelus pacos"                                             
#>  [704] "Lama guanicoe_pacos"                                       
#>  [705] "Lama pacos"                                                
#>  [706] "Vicugna pacos"                                             
#>  [707] "Mollienesia latipinna"                                     
#>  [708] "Poecilia latipinna"                                        
#>  [709] "Elephas maximus_indicus"                                   
#>  [710] "Corylus avellana"                                          
#>  [711] "Ostrea maxima"                                             
#>  [712] "Pecten maximus"                                            
#>  [713] "Felis viverrina"                                           
#>  [714] "Prionailurus viverrinus"                                   
#>  [715] "Gymnodraco acuticeps"                                      
#>  [716] "Thalarctos maritimus"                                      
#>  [717] "Ursus maritimus"                                           
#>  [718] "Lemur catta"                                               
#>  [719] "Myotis myotis"                                             
#>  [720] "Vespertilio myotis"                                        
#>  [721] "Lytechinus pictus"                                         
#>  [722] "Litopenaeus vannamei"                                      
#>  [723] "Penaeus (Litopenaeus)_vannamei"                            
#>  [724] "Penaeus vannamei"                                          
#>  [725] "Ursus arctos"                                              
#>  [726] "Vitis riparia"                                             
#>  [727] "Felis bengalensis"                                         
#>  [728] "Prionailurus bengalensis"                                  
#>  [729] "Clethrionomys glareolus"                                   
#>  [730] "Mus glareolus"                                             
#>  [731] "Myodes glareolus"                                          
#>  [732] "Mustela nigripes"                                          
#>  [733] "Putorius nigripes"                                         
#>  [734] "Pygocentrus nattereri"                                     
#>  [735] "Serrasalmus nattereri"                                     
#>  [736] "Alopex lagopus"                                            
#>  [737] "Canis lagopus"                                             
#>  [738] "Vulpes lagopus"                                            
#>  [739] "Cercocebus atys"                                           
#>  [740] "Cercocebus torquatus_atys"                                 
#>  [741] "Simia atys"                                                
#>  [742] "Lepidosiren annectens"                                     
#>  [743] "Protopterus annectens"                                     
#>  [744] "Rhinocryptis annectens"                                    
#>  [745] "Cerasus avium"                                             
#>  [746] "Prunus avium"                                              
#>  [747] "Prunus cerasus_var._avium"                                 
#>  [748] "Procambarus clarkii"                                       
#>  [749] "Sorex fumeus"                                              
#>  [750] "Macrorhinus angustirostris"                                
#>  [751] "Mirounga angustirostris"                                   
#>  [752] "Beta vulgaris_subsp._vulgaris"                             
#>  [753] "Beta vulgaris_subsp._vulgaris_var._altissima"              
#>  [754] "Beta vulgaris_Sugar_Beet_Group"                            
#>  [755] "Beta vulgaris_var._altissima"                              
#>  [756] "Eumetopias jubatus"                                        
#>  [757] "Phoca jubata"                                              
#>  [758] "Centruroides sculpturatus"                                 
#>  [759] "Diceros bicornis_minor"                                    
#>  [760] "Cicer arietinum"                                           
#>  [761] "Cleome hassleriana_Chodat,_1898"                           
#>  [762] "Tarenaya hassleriana"                                      
#>  [763] "Sebastes umbrosus"                                         
#>  [764] "Sebastichthys umbrosus"                                    
#>  [765] "Eriocheir chinensis"                                       
#>  [766] "Eriocheir japonica_sinensis"                               
#>  [767] "Eriocheir sinensis"                                        
#>  [768] "Dicentrarchus labrax"                                      
#>  [769] "Labrax labrax"                                             
#>  [770] "Morone labrax"                                             
#>  [771] "Perca labrax"                                              
#>  [772] "Roccus labrax"                                             
#>  [773] "Sciaena labrax"                                            
#>  [774] "Acanthopagrus latus"                                       
#>  [775] "Sparus latus"                                              
#>  [776] "Xiphophorus hellerii"                                      
#>  [777] "Xiphophorus helleri"                                       
#>  [778] "Acanthochromis polyacanthus"                               
#>  [779] "Acanthochromis polyacathus"                                
#>  [780] "Dascyllus polyacanthus"                                    
#>  [781] "Mustela vison"                                             
#>  [782] "Neogale vison"                                             
#>  [783] "Neovison vison"                                            
#>  [784] "Lingula anatina"                                           
#>  [785] "Lingula lingua"                                            
#>  [786] "Lingula nipponica"                                         
#>  [787] "Lingula unguis"                                            
#>  [788] "Madrepora faveolata"                                       
#>  [789] "Montastraea faveolata"                                     
#>  [790] "Montastrea faveolata"                                      
#>  [791] "Orbicella faveolata"                                       
#>  [792] "Chinchilla lanigera"                                       
#>  [793] "Chinchilla velligera"                                      
#>  [794] "Chinchilla villidera"                                      
#>  [795] "Mirounga leonina"                                          
#>  [796] "Phoca leonina"                                             
#>  [797] "Perognathus longimembris_pacificus"                        
#>  [798] "Cynocephalus variegatus"                                   
#>  [799] "Galeopithecus variegatus"                                  
#>  [800] "Galeopterus variegatus"                                    
#>  [801] "Vigna radiata"                                             
#>  [802] "Vitis vinifera"                                            
#>  [803] "Vitis vinifera_subsp._vinifera"                            
#>  [804] "Characodon multiradiatus"                                  
#>  [805] "Girardinichthys multiradiatus"                             
#>  [806] "Marmota flaviventris"                                      
#>  [807] "Phaseolus calcaratus"                                      
#>  [808] "Phaseolus chrysanthos"                                     
#>  [809] "Phaseolus chrysanthus"                                     
#>  [810] "Vigna calcarata"                                           
#>  [811] "Vigna umbellata"                                           
#>  [812] "Balaenoptera acutorostrata"                                
#>  [813] "Canis procyonoides"                                        
#>  [814] "Nyctereutes procyonoides"                                  
#>  [815] "Amphioxus floridae"                                        
#>  [816] "Branchiostoma floridae"                                    
#>  [817] "Moschus berezovskii"                                       
#>  [818] "Erythranthe guttata"                                       
#>  [819] "Mimulus guttatus_subsp._guttatus"                          
#>  [820] "Mimulus guttatus"                                          
#>  [821] "Camelus bactrianus"                                        
#>  [822] "Desmodus rotundus"                                         
#>  [823] "Phyllostoma rotundum"                                      
#>  [824] "Octopus sinensis"                                          
#>  [825] "Physeter catodon"                                          
#>  [826] "Physeter macrocephalus"                                    
#>  [827] "Alexandromys fortis"                                       
#>  [828] "Microtus fortis"                                           
#>  [829] "Apogon orbicularis"                                        
#>  [830] "Sphaeramia orbicularis"                                    
#>  [831] "Dendronephthya gigantea"                                   
#>  [832] "Canis hyaena"                                              
#>  [833] "Hyaena hyaena"                                             
#>  [834] "Helicophagus hypophthalmus"                                
#>  [835] "Pangasianodon hypophthalmus"                               
#>  [836] "Pangasius hypophthalmus"                                   
#>  [837] "Pangasius sutchi"                                          
#>  [838] "Castor canadensis"                                         
#>  [839] "Coelomys parahi"                                           
#>  [840] "Mus pahari"                                                
#>  [841] "Pseudochaenichthys georgianus"                             
#>  [842] "Capsella rubella"                                          
#>  [843] "Perkinsus marinus_ATCC_50983"                              
#>  [844] "Holocentrus leopardus"                                     
#>  [845] "Plectropomus leopardus"                                    
#>  [846] "Hippocampus zosterae"                                      
#>  [847] "Seriola dorsalis"                                          
#>  [848] "Seriola lalandi_dorsalis"                                  
#>  [849] "Felis canadensis"                                          
#>  [850] "Lynx canadensis"                                           
#>  [851] "Artibeus jamaicensis"                                      
#>  [852] "Citrus sinensis"                                           
#>  [853] "Citrus x_sinensis"                                         
#>  [854] "Punica granatum"                                           
#>  [855] "Abrus cyaneus"                                             
#>  [856] "Abrus precatorius"                                         
#>  [857] "Polypterus senegalus"                                      
#>  [858] "Acomys russatus"                                           
#>  [859] "Hemibagrus wyckioides"                                     
#>  [860] "Macrones wyckioides"                                       
#>  [861] "Mystus wyckioides"                                         
#>  [862] "Melanotaenia boesemani"                                    
#>  [863] "Sturnira hondurensis"                                      
#>  [864] "Amphilophus centrarchus"                                   
#>  [865] "Archocentrus centrarchus"                                  
#>  [866] "Cichlasoma centrarchus"                                    
#>  [867] "Heros centrarchus"                                         
#>  [868] "Delphinus melas"                                           
#>  [869] "Globicephala melaena"                                      
#>  [870] "Globicephala melas"                                        
#>  [871] "Manis javanica"                                            
#>  [872] "Phyllostomus hastatus"                                     
#>  [873] "Vespertilio hastatus"                                      
#>  [874] "Scyliorhinus canicula"                                     
#>  [875] "Silurana tropicalis"                                       
#>  [876] "Xenopus laevis_tropicalis"                                 
#>  [877] "Xenopus (Silurana)_tropicalis"                             
#>  [878] "Xenopus tropicalis"                                        
#>  [879] "Pipistrellus kuhlii"                                       
#>  [880] "Pipistrellus kuhli"                                        
#>  [881] "Vespertilio kuhlii"                                        
#>  [882] "Solea senegalensis"                                        
#>  [883] "Blennius fasciatus"                                        
#>  [884] "Salarias fasciatus"                                        
#>  [885] "Mugil cephalotus"                                          
#>  [886] "Mugil cephalus"                                            
#>  [887] "Mugil galapagensis"                                        
#>  [888] "Mugil japonicus"                                           
#>  [889] "Siphostoma scovelli"                                       
#>  [890] "Syngnathus scovelli"                                       
#>  [891] "Canis vulpes"                                              
#>  [892] "Vulpes vulpes"                                             
#>  [893] "Capra aegagrus_hircus"                                     
#>  [894] "Capra hircus"                                              
#>  [895] "Poeciliopsis prolifica"                                    
#>  [896] "Gopherus flavomarginatus"                                  
#>  [897] "Lontra canadensis"                                         
#>  [898] "Lutra canadensis"                                          
#>  [899] "Hesperomys torridus"                                       
#>  [900] "Onychomys torridus"                                        
#>  [901] "Elephas africanus"                                         
#>  [902] "Loxodonta africana_africana"                               
#>  [903] "Loxodonta africana"                                        
#>  [904] "Limia couchiana"                                           
#>  [905] "Xiphophorus couchianus"                                    
#>  [906] "Boophilus microplus"                                       
#>  [907] "Rhipicephalus (Boophilus)_microplus"                       
#>  [908] "Rhipicephalus microplus"                                   
#>  [909] "Betta splendens"                                           
#>  [910] "Molossus molossus"                                         
#>  [911] "Vespertilio molossus"                                      
#>  [912] "Lagenorhynchus obliquidens"                                
#>  [913] "Delphinus truncatus"                                       
#>  [914] "Tursiops truncatus"                                        
#>  [915] "Morone flavescens"                                         
#>  [916] "Perca flavescens"                                          
#>  [917] "Euarctos americanus"                                       
#>  [918] "Ursus americanus"                                          
#>  [919] "Arvicola nivalis"                                          
#>  [920] "Chionomys nivalis"                                         
#>  [921] "Microtus nivalis"                                          
#>  [922] "Felis rufus"                                               
#>  [923] "Lynx rufus"                                                
#>  [924] "Myotis brandtii"                                           
#>  [925] "Vespertilio brandtii"                                      
#>  [926] "Astatotilapia burtoni"                                     
#>  [927] "Chromis burtoni"                                           
#>  [928] "Haplochromis burtoni"                                      
#>  [929] "Sorex araneus"                                             
#>  [930] "Aplocheilus melastigmus"                                   
#>  [931] "Oryzias melastigma"                                        
#>  [932] "Silurus meridionalis"                                      
#>  [933] "Silurus soldatovi_meridionalis"                            
#>  [934] "Cucumis melo"                                              
#>  [935] "Hydra attenuata"                                           
#>  [936] "Hydra carnea"                                              
#>  [937] "Hydra littoralis"                                          
#>  [938] "Hydra magnipapillata"                                      
#>  [939] "Hydra vulgaris"                                            
#>  [940] "Anoplopoma fimbria"                                        
#>  [941] "Gadus fimbria"                                             
#>  [942] "Alosa alosa"                                               
#>  [943] "Clupea alosa"                                              
#>  [944] "Chelonia mydas"                                            
#>  [945] "Testudo mydas"                                             
#>  [946] "Ctenocephalides felis"                                     
#>  [947] "Brienomyrus kingsleyae"                                    
#>  [948] "Brienomyrus sp._CAB"                                       
#>  [949] "Mormyrus kingsleyae"                                       
#>  [950] "Paramormyrops kingsleyae"                                  
#>  [951] "Pollimyrus kingsleyae"                                     
#>  [952] "Stylophora pistillata"                                     
#>  [953] "Cyrtodiopsis dalmanii"                                     
#>  [954] "Diopsis dalmanni"                                          
#>  [955] "Teleopsis dalmanni"                                        
#>  [956] "Rhagoletis zephyria"                                       
#>  [957] "Rhodamnia argentea"                                        
#>  [958] "Gasterosteus aculeatus"                                    
#>  [959] "Labrus celidotus"                                          
#>  [960] "Notolabrus celidotus"                                      
#>  [961] "Budorcas taxicolor"                                        
#>  [962] "Nelumbo nucifera"                                          
#>  [963] "Amphiprion ocellaris"                                      
#>  [964] "Arvicola amphibius"                                        
#>  [965] "Arvicola terrestris_(Linnaeus,_1758)"                      
#>  [966] "Mus amphibius"                                             
#>  [967] "Daphnia magna"                                             
#>  [968] "Phaseolus vulgaris"                                        
#>  [969] "Psammomys obesus"                                          
#>  [970] "Carlito syrichta"                                          
#>  [971] "Simia syrichta"                                            
#>  [972] "Tarsius syrichta"                                          
#>  [973] "Cyprinodon tularosa"                                       
#>  [974] "Gouania willdenowi"                                        
#>  [975] "Lepadogaster willdenowi"                                   
#>  [976] "Ochotona princeps"                                         
#>  [977] "Phytophthora sojae"                                        
#>  [978] "Equus caballus_przewalskii"                                
#>  [979] "Equus ferus_przewalskii"                                   
#>  [980] "Equus przewalskii"                                         
#>  [981] "Phoca vitulina"                                            
#>  [982] "Coecilia bivitatum"                                        
#>  [983] "Rhinatrema bivitattum"                                     
#>  [984] "Rhinatrema bivittatum"                                     
#>  [985] "Gambusia affinis"                                          
#>  [986] "Heterandria affinis"                                       
#>  [987] "Lagomys curzoniae"                                         
#>  [988] "Ochotona curzonae"                                         
#>  [989] "Ochotona curzoniae"                                        
#>  [990] "Kogia breviceps"                                           
#>  [991] "Physeter breviceps"                                        
#>  [992] "Ambassis ranga"                                            
#>  [993] "Chanda ranga"                                              
#>  [994] "Parambassis ranga"                                         
#>  [995] "Pseudambassis ranga"                                       
#>  [996] "Clupea cyprinoides"                                        
#>  [997] "Megalops cyprinoides"                                      
#>  [998] "Diospyros lotus"                                           
#>  [999] "Hippoglossus stenolepis"                                   
#> [1000] "Phacochoerus africanus"                                    
#> [1001] "Corythoichthys intestinalis"                               
#> [1002] "Syngnatus intestinalis"                                    
#> [1003] "Mandrillus leucophaeus"                                    
#> [1004] "Papio leucophaeus"                                         
#> [1005] "Simia leucophaea"                                          
#> [1006] "Epinephelus fuscoguttatus"                                 
#> [1007] "Perca summana_fuscoguttata"                                
#> [1008] "Asterina miniata"                                          
#> [1009] "Patiria miniata"                                           
#> [1010] "Rhinolophus rouxii_sinicus"                                
#> [1011] "Rhinolophus sinicus"                                       
#> [1012] "Lampris incognitus"                                        
#> [1013] "Monachus schauinslandi"                                    
#> [1014] "Neomonachus schauinslandi"                                 
#> [1015] "Hippoglossus hippoglossus"                                 
#> [1016] "Pleuronectes hippoglossus"                                 
#> [1017] "Andrographis paniculata"                                   
#> [1018] "Etheostoma cragini"                                        
#> [1019] "Perca chuatsi"                                             
#> [1020] "Siniperca chuatsi"                                         
#> [1021] "Meriones unguiculatus"                                     
#> [1022] "Colobus angolensis_palliatus"                              
#> [1023] "Notothenia coriiceps"                                      
#> [1024] "Hypomesus transpacificus"                                  
#> [1025] "Dermochelys coriacea"                                      
#> [1026] "Testudo coriacea"                                          
#> [1027] "Bufo bufo_gargarizans"                                     
#> [1028] "Bufo gargarizans"                                          
#> [1029] "Bufo japonicus_gargarizans"                                
#> [1030] "Delphinapterus leucas"                                     
#> [1031] "Delphinus leucas"                                          
#> [1032] "Fugu flavidus"                                             
#> [1033] "Takifugu flavidus"                                         
#> [1034] "Pteronotus mesoamericanus"                                 
#> [1035] "Pteronotus parnellii_mesoamericanus"                       
#> [1036] "Citrus clementina"                                         
#> [1037] "Citrus deliciosa_x_Citrus_sinensis"                        
#> [1038] "Fugu rubripes"                                             
#> [1039] "Sphaeroides rubripes"                                      
#> [1040] "Takifugu rubripes"                                         
#> [1041] "Tetraodon rubripes"                                        
#> [1042] "Homarus americanus"                                        
#> [1043] "Osteoglossum formosum"                                     
#> [1044] "Scleropages formosus"                                      
#> [1045] "Larimichthys crocea"                                       
#> [1046] "Pseudosciaena amblyceps"                                   
#> [1047] "Pseudosciaena crocea"                                      
#> [1048] "Sciaena crocea"                                            
#> [1049] "Fragaria vesca"                                            
#> [1050] "Folsomia candida"                                          
#> [1051] "Limulus polyphemus"                                        
#> [1052] "Doryrhamphus dactyliophorus"                               
#> [1053] "Dunckerocampus dactyliophorus"                             
#> [1054] "Syngnathus dactyliophorus"                                 
#> [1055] "Epinephelus lanceolatus"                                   
#> [1056] "Holocentrus lanceolatus"                                   
#> [1057] "Promicrops lanceolatus"                                    
#> [1058] "Mizuhopecten yessoensis"                                   
#> [1059] "Patinopecten yessoensis"                                   
#> [1060] "Patiopecten yessoensis"                                    
#> [1061] "Pecten yessoensis"                                         
#> [1062] "Calamoichthys calabaricus"                                 
#> [1063] "Erpetoichthys calabaricus"                                 
#> [1064] "Platypoecilus maculatus"                                   
#> [1065] "Xiphophorus maculatus"                                     
#> [1066] "Echeneis naucrates"                                        
#> [1067] "Triplophysa rosa"                                          
#> [1068] "Antechinus flavipes"                                       
#> [1069] "Phascogale flavipes"                                       
#> [1070] "Balaena musculus"                                          
#> [1071] "Balaenoptera musculus"                                     
#> [1072] "Rhinolophus ferrumequinum"                                 
#> [1073] "Vespertilio ferrumequinum"                                 
#> [1074] "Oryza brachyantha"                                         
#> [1075] "Chrysemys picta"                                           
#> [1076] "Testudo picta"                                             
#> [1077] "Trachemys picta"                                           
#> [1078] "Tetrahymena thermophila_SB210"                             
#> [1079] "Myripristis murdjan"                                       
#> [1080] "Ostichthys murdjan"                                        
#> [1081] "Perca murdjan"                                             
#> [1082] "Sciaena murdjan"                                           
#> [1083] "Amphiprion testudineus"                                    
#> [1084] "Anabas testudineus"                                        
#> [1085] "Anthias testudineus"                                       
#> [1086] "Antias testudineus"                                        
#> [1087] "Amygdalus communis"                                        
#> [1088] "Prunus amygdalus"                                          
#> [1089] "Prunus communis"                                           
#> [1090] "Prunus dulcis"                                             
#> [1091] "Prunus dulcis_var._sativa"                                 
#> [1092] "Oryzias latipes"                                           
#> [1093] "Poecilia latipes"                                          
#> [1094] "Sarcophilus harrisii"                                      
#> [1095] "Sarcophilus laniarius_(Owen,_1838)"                        
#> [1096] "Sarcophilus laniarius"                                     
#> [1097] "Ursinus harrisii"                                          
#> [1098] "Ictalurus furcatus"                                        
#> [1099] "Pimelodus furcatus"                                        
#> [1100] "Branchiostoma belcheri"                                    
#> [1101] "Gigantopelta aegis"                                        
#> [1102] "Lytechinus variegatus"                                     
#> [1103] "Diaphorina citri"                                          
#> [1104] "Epinephelus moara"                                         
#> [1105] "Serranus moara"                                            
#> [1106] "Stegodyphus dumicola"                                      
#> [1107] "Boleophthalmus pectinirostris"                             
#> [1108] "Gobius pectinirostris"                                     
#> [1109] "Lacerrta muralis"                                          
#> [1110] "Podarcis muralis"                                          
#> [1111] "Seps muralis"                                              
#> [1112] "Austrofundulus limnaeus"                                   
#> [1113] "Columba livia_domestica"                                   
#> [1114] "Columba livia"                                             
#> [1115] "Citellus parryii"                                          
#> [1116] "Spermophilus parryii"                                      
#> [1117] "Spermophilus parryi"                                       
#> [1118] "Urocitellus parryii"                                       
#> [1119] "Latimeria chalumnae"                                       
#> [1120] "Pleuronectes maximus"                                      
#> [1121] "Psetta maxima"                                             
#> [1122] "Rhombus maximus"                                           
#> [1123] "Scophthalmus maximus"                                      
#> [1124] "Sesamum indicum"                                           
#> [1125] "Sesamum orientale"                                         
#> [1126] "Cyclopterus lumpus"                                        
#> [1127] "Armeniaca mume"                                            
#> [1128] "Prunus mume"                                               
#> [1129] "Myotis davidii"                                            
#> [1130] "Vespertilio Davidii"                                       
#> [1131] "Didelphys agilis"                                          
#> [1132] "Gracilinanus agilis"                                       
#> [1133] "Phocoena sinus"                                            
#> [1134] "Acanthophacelus reticulata"                                
#> [1135] "Poecilia (Acanthophacelus)_reticulata"                     
#> [1136] "Poecilia latipinna_reticulata"                             
#> [1137] "Poecilia reticulata"                                       
#> [1138] "Gopherus evgoodei"                                         
#> [1139] "Australorbis glabratus"                                    
#> [1140] "Biomphalaria glabrata"                                     
#> [1141] "Planorbis glabratus"                                       
#> [1142] "Hypudaeus ochrogaster"                                     
#> [1143] "Microtus ochrogaster"                                      
#> [1144] "Amygdalus persica"                                         
#> [1145] "Persica vulgaris"                                          
#> [1146] "Prunus persica"                                            
#> [1147] "Prunus persica_var._densa"                                 
#> [1148] "Chiloscyllium plagiosum"                                   
#> [1149] "Scyllium plagiosum"                                        
#> [1150] "Cheilinus undulatus"                                       
#> [1151] "Phodopus roborovskii"                                      
#> [1152] "Caenorhabditis remanei"                                    
#> [1153] "Caenorhabditis vulgaris"                                   
#> [1154] "Lamprologus brichardi"                                     
#> [1155] "Neolamprologus brichardi"                                  
#> [1156] "Gymnopis unicolor"                                         
#> [1157] "Microcaecilia unicolor"                                    
#> [1158] "Rhinatrema unicolor"                                       
#> [1159] "Rhizophagus irregularis_DAOM_181602=DAOM_197198"           
#> [1160] "Sciaena jaculatrix"                                        
#> [1161] "Toxotes jaculatrix"                                        
#> [1162] "Bos indicus"                                               
#> [1163] "Bos primigenius_indicus"                                   
#> [1164] "Bos taurus_indicus"                                        
#> [1165] "Lacerta sicula_raffonei"                                   
#> [1166] "Podarcis raffoneae"                                        
#> [1167] "Podarcis raffonei"                                         
#> [1168] "Podarcis wagleriana_raffonei"                              
#> [1169] "Benincasa cerifera"                                        
#> [1170] "Benincasa hispida"                                         
#> [1171] "Benincasa pruriens"                                        
#> [1172] "Cucurbita hispida"                                         
#> [1173] "Lagenaria siceraria_var._hispida"                          
#> [1174] "Dendrobium catenatum"                                      
#> [1175] "Marsupenaeus japonicus"                                    
#> [1176] "Penaeus japonicus"                                         
#> [1177] "Penaeus (Marsupenaeus)_japonicus"                          
#> [1178] "Penaeus (Melicertus)_japonicus"                            
#> [1179] "Chaetodon argus"                                           
#> [1180] "Scatophagus argus"                                         
#> [1181] "Chanos chanos"                                             
#> [1182] "Mugil chanos"                                              
#> [1183] "Bison bison_bison"                                         
#> [1184] "Bos bison_bison"                                           
#> [1185] "Amblyraja radiata"                                         
#> [1186] "Raja radiata"                                              
#> [1187] "Amphimedon queenslandica"                                  
#> [1188] "Electrophorus electricus"                                  
#> [1189] "Gymnotus electricus"                                       
#> [1190] "Hippocampus comes"                                         
#> [1191] "Hipposideros armiger"                                      
#> [1192] "Rhinolophus armiger"                                       
#> [1193] "Monodon monoceros"                                         
#> [1194] "Cynoglossus (Arelia)_semilaevis"                           
#> [1195] "Cynoglossus semilaevis"                                    
#> [1196] "Anneissia japonica"                                        
#> [1197] "Oxycomanthus japonicus"                                    
#> [1198] "Ananas comosus"                                            
#> [1199] "Ananas comosus_var._comosus"                               
#> [1200] "Ananas lucidus"                                            
#> [1201] "Bromelia comosa"                                           
#> [1202] "Callionymus splendidus"                                    
#> [1203] "Pterosynchiropus splendidus"                               
#> [1204] "Synchiropus splendidus"                                    
#> [1205] "Neophocaena asiaeorientalis_asiaeorientalis"               
#> [1206] "Coluber guttatus"                                          
#> [1207] "Elaphe guttata"                                            
#> [1208] "Pantherophis guttatus"                                     
#> [1209] "Pollicipes cornucopia"                                     
#> [1210] "Pollicipes pollicipes"                                     
#> [1211] "Pseudoliparis swirei"                                      
#> [1212] "Chelonoidis abingdonii"                                    
#> [1213] "Chelonoidis abingdoni"                                     
#> [1214] "Chelonoidis nigra_abingdonii"                              
#> [1215] "Geochelone nigra_abigdonii"                                
#> [1216] "Geochelone nigra_abingdoni"                                
#> [1217] "Geochelone nigra_ephippium"                                
#> [1218] "Testudo abingdonii"                                        
#> [1219] "Rhincodon typus"                                           
#> [1220] "Aphritis gobio"                                            
#> [1221] "Cottoperca gobio"                                          
#> [1222] "Ricinus communis"                                          
#> [1223] "Ricinus sanguineus"                                        
#> [1224] "Malania oleifera"                                          
#> [1225] "Ceratotherium simum_simum"                                 
#> [1226] "Kryptolebias marmoratus"                                   
#> [1227] "Rivulus marmoratus"                                        
#> [1228] "Patella vulgata"                                           
#> [1229] "Rhagoletis pomonella"                                      
#> [1230] "Trypanosoma cruzi"                                         
#> [1231] "Cistudo triunguis"                                         
#> [1232] "Terrapene carolina_triunguis"                              
#> [1233] "Terrapene mexicana_triunguis"                              
#> [1234] "Terrapene triunguis"                                       
#> [1235] "Odobenus rosmarus_divergens"                               
#> [1236] "Trichechus manatus_latirostris"                            
#> [1237] "Carcharodon carcharias"                                    
#> [1238] "Squalus carcharias"                                        
#> [1239] "Macrognathus armatus"                                      
#> [1240] "Mastacembelus armatus"                                     
#> [1241] "Anas boschas"                                              
#> [1242] "Anas domesticus"                                           
#> [1243] "Anas platyrhynchos_f._domestica"                           
#> [1244] "Anas platyrhynchos"                                        
#> [1245] "Theobroma cacao"                                           
#> [1246] "Diabrotica virgifera_virgifera"                            
#> [1247] "Actinia diaphana"                                          
#> [1248] "Aiptasia pallida"                                          
#> [1249] "Aiptasia pulchella"                                        
#> [1250] "Dysactis pallida"                                          
#> [1251] "Exaiptasia diaphana"                                       
#> [1252] "Exaiptasia pallida"                                        
#> [1253] "Syngnathus acus_rubescens"                                 
#> [1254] "Syngnathus acus"                                           
#> [1255] "Syngnathus rubescens"                                      
#> [1256] "Caretta caretta"                                           
#> [1257] "Testudo caretta"                                           
#> [1258] "Guillardia theta_CCMP2712"                                 
#> [1259] "Anarrhichthys ocellatus"                                   
#> [1260] "Pelodiscus sinensis"                                       
#> [1261] "Trionyx sinensis"                                          
#> [1262] "Hippoglossus olivaceus"                                    
#> [1263] "Paralichthys olivaceus"                                    
#> [1264] "Xiphias gladius"                                           
#> [1265] "Cyprinodon variegatus"                                     
#> [1266] "Bos grunniens_mutus"                                       
#> [1267] "Bos mutus"                                                 
#> [1268] "Poephagus mutus"                                           
#> [1269] "Alligator sinensis"                                        
#> [1270] "Morus notabilis"                                           
#> [1271] "Nymphaea colorata"                                         
#> [1272] "Photinus pyralis"                                          
#> [1273] "Periophthalmus magnuspinnatus"                             
#> [1274] "Meleagris gallopavo"                                       
#> [1275] "Pomacea canaliculata"                                      
#> [1276] "Haplochromis nyererei"                                     
#> [1277] "Pundamilia nyererei"                                       
#> [1278] "Cyanistes caeruleus"                                       
#> [1279] "Parus caeruleus"                                           
#> [1280] "Caranx dumerili"                                           
#> [1281] "Seriola dumerili"                                          
#> [1282] "Macrosteles (Macrosteles)_quadrilineatus"                  
#> [1283] "Macrosteles quadrilineatus"                                
#> [1284] "Enhydra lutris_kenyoni"                                    
#> [1285] "Fluta alba"                                                
#> [1286] "Monopterus albus"                                          
#> [1287] "Muraena alba"                                              
#> [1288] "Caecilia seraphini"                                        
#> [1289] "Caecilia Seraphini"                                        
#> [1290] "Geotrypetes seraphini"                                     
#> [1291] "Hypogeophis seraphini"                                     
#> [1292] "Chaetodon rostratus"                                       
#> [1293] "Chelmon rostratus"                                         
#> [1294] "Cucumis sativus"                                           
#> [1295] "Cyrtodactylus macularius"                                  
#> [1296] "Eublepharis macularius"                                    
#> [1297] "Felis concolor"                                            
#> [1298] "Panthera concolor"                                         
#> [1299] "Puma concolor"                                             
#> [1300] "Fenneropenaeus chinensis"                                  
#> [1301] "Penaeus chinensis"                                         
#> [1302] "Pomacentrus partitus"                                      
#> [1303] "Stegastes partitus"                                        
#> [1304] "Phascum patens"                                            
#> [1305] "Physcomitrella patens_subsp._patens"                       
#> [1306] "Physcomitrella patens"                                     
#> [1307] "Physcomitrium patens"                                      
#> [1308] "Anas jamaicensis"                                          
#> [1309] "Oxyura jamaicensis"                                        
#> [1310] "Drosophila miranda"                                        
#> [1311] "Lottia gigantea"                                           
#> [1312] "Eurytemora affinis"                                        
#> [1313] "Crotalus tigris"                                           
#> [1314] "Argentina anserina_subsp._anserina"                        
#> [1315] "Argentina anserina"                                        
#> [1316] "Potentilla anserina"                                       
#> [1317] "Achaearanea tepidariorum"                                  
#> [1318] "Parasteatoda tepidariorum"                                 
#> [1319] "Theridion tepidariorum"                                    
#> [1320] "Uranotaenia lowii"                                         
#> [1321] "Cynolebias whitei"                                         
#> [1322] "Nematolebias whitei"                                       
#> [1323] "Simpsonichthys whitei"                                     
#> [1324] "Sceloporus undulatus"                                      
#> [1325] "Stellio undulatus"                                         
#> [1326] "Helobdella robusta"                                        
#> [1327] "Styela clava"                                              
#> [1328] "Orycteropus afer_afer"                                     
#> [1329] "Dipodomys ordii"                                           
#> [1330] "Leucoraja erinacea"                                        
#> [1331] "Raja erinacea"                                             
#> [1332] "Raja erinaceus"                                            
#> [1333] "Raja erinacia"                                             
#> [1334] "Phytophthora parasitica_INRA-310"                          
#> [1335] "Anas olor"                                                 
#> [1336] "Cygnus olor"                                               
#> [1337] "Lacerta agilis"                                            
#> [1338] "Naja scutata"                                              
#> [1339] "Notechis scutatus"                                         
#> [1340] "Millepora damicornis"                                      
#> [1341] "Pocillopora caespitosa_laysanensis"                        
#> [1342] "Pocillopora damicornis_laysanensis"                        
#> [1343] "Pocillopora damicornis"                                    
#> [1344] "Morone saxatilis"                                          
#> [1345] "Perca saxatilis"                                           
#> [1346] "Miniopterus natalensis"                                    
#> [1347] "Miniopterus schreibersii_natalensis"                       
#> [1348] "Vespertilio natalensis"                                    
#> [1349] "Anas cygnoid"                                              
#> [1350] "Anser cygnoides"                                           
#> [1351] "Actinia tenebrosa"                                         
#> [1352] "Neptunus trituberculatus"                                  
#> [1353] "Portunus (Portunus)_trituberculatus"                       
#> [1354] "Portunus trituberculatus"                                  
#> [1355] "Lacerta vivipara"                                          
#> [1356] "Zootoca vivipara"                                          
#> [1357] "Propithecus coquereli"                                     
#> [1358] "Propithecus verreauxi_coquereli"                           
#> [1359] "Erinaceus europaeus"                                       
#> [1360] "Jatropha curcas"                                           
#> [1361] "Caenorhabditis briggsae"                                   
#> [1362] "Cherax quadricarinatus"                                    
#> [1363] "Homalodisca coagulata"                                     
#> [1364] "Homalodisca vitripennis"                                   
#> [1365] "Tettigonia coagulata"                                      
#> [1366] "Tettigonia vitripennis"                                    
#> [1367] "Furina textilis"                                           
#> [1368] "Pseudonaja textilis"                                       
#> [1369] "Anolis carolinensis"                                       
#> [1370] "Python bivittatus"                                         
#> [1371] "Python molurus_bivittatus"                                 
#> [1372] "Chrysemys scripta_elegans"                                 
#> [1373] "Emys elegans"                                              
#> [1374] "Pseudemys scripta_elegans"                                 
#> [1375] "Trachemys scripta_elegans"                                 
#> [1376] "Protobothrops mucrosquamatus"                              
#> [1377] "Trigonocephalus mucrosquamatus"                            
#> [1378] "Trimeresurus mucrosquamatus"                               
#> [1379] "Daphnia pulex"                                             
#> [1380] "Paramacrobiotus metropolitanus"                            
#> [1381] "Lipotes vexillifer"                                        
#> [1382] "Petromyzon marinus"                                        
#> [1383] "Poephila guttata"                                          
#> [1384] "Taeniopygia guttata"                                       
#> [1385] "Taenopygia guttata"                                        
#> [1386] "Amphibolurus vitticeps"                                    
#> [1387] "Pogona vitticeps"                                          
#> [1388] "Aplysia californica"                                       
#> [1389] "Phalaenopsis equestris"                                    
#> [1390] "Saccoglossus kowalevskii"                                  
#> [1391] "Saccoglossus kowalevskyi"                                  
#> [1392] "Numida meleagris"                                          
#> [1393] "Phasianus meleagris"                                       
#> [1394] "Momordica charantia"                                       
#> [1395] "Callorhinchus milii"                                       
#> [1396] "Sphaerodactylus townsendi"                                 
#> [1397] "Eutainia elegans"                                          
#> [1398] "Thamnophis elegans"                                        
#> [1399] "Corvus hawaiiensis"                                        
#> [1400] "Manacus candei"                                            
#> [1401] "Pipra candei"                                              
#> [1402] "Euleptes europaea"                                         
#> [1403] "Euleptes europea"                                          
#> [1404] "Phyllodactylus europaea"                                   
#> [1405] "Phyllodactylus europaeus"                                  
#> [1406] "Ptyodactylus caudivolvolus"                                
#> [1407] "Lepisosteus oculatus"                                      
#> [1408] "Altirana parkeri"                                          
#> [1409] "Nanorana parkeri"                                          
#> [1410] "Aquila chrysaetos_chrysaetos"                              
#> [1411] "Ahaetulla prasina"                                         
#> [1412] "Fusarium oxysporum_f._sp._lycopersici_4287"                
#> [1413] "Heteropelma chrysocephalum"                                
#> [1414] "Neopelma chrysocephalum"                                   
#> [1415] "Musca domestica"                                           
#> [1416] "Pristis pectinata"                                         
#> [1417] "Ischnura elegans"                                          
#> [1418] "Vidua chalybeata"                                          
#> [1419] "Coturnix coturnix_japanica"                                
#> [1420] "Coturnix coturnix_japonica"                                
#> [1421] "Coturnix coturnix_Japonicus"                               
#> [1422] "Coturnix japonica_japonica"                                
#> [1423] "Coturnix japonica"                                         
#> [1424] "Gekko japonicus"                                           
#> [1425] "Platydactylus japonicus"                                   
#> [1426] "Nilaparvata lugens"                                        
#> [1427] "Ardea americana"                                           
#> [1428] "Grus americana"                                            
#> [1429] "Grus americanus"                                           
#> [1430] "Harpia harpyja"                                            
#> [1431] "Vultur harpyja"                                            
#> [1432] "Corvus moneduloides"                                       
#> [1433] "Pipra filicauda"                                           
#> [1434] "Herrania umbratica"                                        
#> [1435] "Ilyonectria robusta"                                       
#> [1436] "Tympanuchus pallidicinctus"                                
#> [1437] "Topomyia yanbarensis"                                      
#> [1438] "Parus atricapillus"                                        
#> [1439] "Poecile atricapilla"                                       
#> [1440] "Poecile atricapillus"                                      
#> [1441] "Corapipo altera"                                           
#> [1442] "Acyrthosiphon pisum"                                       
#> [1443] "Acyrthosiphum pisum"                                       
#> [1444] "Varanus komodoensis"                                       
#> [1445] "Saprolegnia parasitica_CBS_223.65"                         
#> [1446] "Vidua macroura"                                            
#> [1447] "Carica papaya"                                             
#> [1448] "Chiroxiphia lanceolata"                                    
#> [1449] "Pipra lanceolata"                                          
#> [1450] "Octopus bimaculoides"                                      
#> [1451] "Lagopus muta"                                              
#> [1452] "Tetrao mutus"                                              
#> [1453] "Bradysia coprophila"                                       
#> [1454] "Sciara coprophila"                                         
#> [1455] "Coluber sirtalis"                                          
#> [1456] "Thamnophis sirtalis"                                       
#> [1457] "Falco peregrinus"                                          
#> [1458] "Falco cherrug"                                             
#> [1459] "Asteracanthion distichum"                                  
#> [1460] "Asterias attenuata"                                        
#> [1461] "Asterias clathrata"                                        
#> [1462] "Asterias disticha"                                         
#> [1463] "Asterias gigantea"                                         
#> [1464] "Asterias pallida"                                          
#> [1465] "Asterias rubens"                                           
#> [1466] "Asterias stimpsoni"                                        
#> [1467] "Asterias vulgaris"                                         
#> [1468] "Manduca sexta"                                             
#> [1469] "Sphinx sexta"                                              
#> [1470] "Condylura cristata"                                        
#> [1471] "Sorex cristatus"                                           
#> [1472] "Cuculus canorus"                                           
#> [1473] "Pezoporus wallicus"                                        
#> [1474] "Aedes aegypti"                                             
#> [1475] "Aedes (Stegomyia)_aegypti"                                 
#> [1476] "Stegomyia aegypti"                                         
#> [1477] "Falco naumanni"                                            
#> [1478] "Corvus kubaryi"                                            
#> [1479] "Necator americanus"                                        
#> [1480] "Larus tridactylus"                                         
#> [1481] "Rissa tridactyla"                                          
#> [1482] "Aphanomyces astaci"                                        
#> [1483] "Culex (Culex)_pipiens_pallens"                             
#> [1484] "Culex pipiens_pallens"                                     
#> [1485] "Catharus ustulatus"                                        
#> [1486] "Turdus ustulatus"                                          
#> [1487] "Accipiter gentilis"                                        
#> [1488] "Accipiter gentillis"                                       
#> [1489] "Falco gentilis"                                            
#> [1490] "Crocodylus porosus"                                        
#> [1491] "Amborella trichopoda"                                      
#> [1492] "Falco biarmicus"                                           
#> [1493] "Lagopus leucura"                                           
#> [1494] "Lagopus leucurus"                                          
#> [1495] "Falco rusticolus"                                          
#> [1496] "Phasianus colchicus"                                       
#> [1497] "Strigops habroptila"                                       
#> [1498] "Strigops habroptilis"                                      
#> [1499] "Strigops habroptilus"                                      
#> [1500] "Corvus brachyrhynchos"                                     
#> [1501] "Uloborus diversus"                                         
#> [1502] "Phytophthora infestans_strain_T30-4"                       
#> [1503] "Phytophthora infestans_T30-4"                              
#> [1504] "Empidonax traillii"                                        
#> [1505] "Muscicapa traillii"                                        
#> [1506] "Strix alba"                                                
#> [1507] "Tyto alba"                                                 
#> [1508] "Parus major"                                               
#> [1509] "Lepeophtheirus salmonis"                                   
#> [1510] "Gavialis gangeticus"                                       
#> [1511] "Lacerta gangetica"                                         
#> [1512] "Casuarius novaehollandiae"                                 
#> [1513] "Dromaius novaehollandiae"                                  
#> [1514] "Dromaius novae-hollandiae"                                 
#> [1515] "Lepidothrix coronata"                                      
#> [1516] "Pipra coronata"                                            
#> [1517] "Daphnia carinata"                                          
#> [1518] "Aphis gossypii"                                            
#> [1519] "Hyalella azteca"                                           
#> [1520] "Hyalella knickerbockeri"                                   
#> [1521] "Colletotrichum lupini"                                     
#> [1522] "Gloeosporium lupini"                                       
#> [1523] "Sphaeroforma arctica_JP610"                                
#> [1524] "Suillus fuscotomentosus"                                   
#> [1525] "Mollisia scopiformis"                                      
#> [1526] "Phialocephala scopiformis"                                 
#> [1527] "Myiozetetes cayanensis"                                    
#> [1528] "Hyaloscypha bicolor_E"                                     
#> [1529] "Lonchura domestica"                                        
#> [1530] "Lonchura striata_domestica"                                
#> [1531] "Melopsittacus undulatus"                                   
#> [1532] "Psittacus undulatus"                                       
#> [1533] "Apteryx australis_mantelli"                                
#> [1534] "Apteryx mantelli_mantelli"                                 
#> [1535] "Fringilla montana"                                         
#> [1536] "Passer montanus"                                           
#> [1537] "Coccinella axyridis"                                       
#> [1538] "Harmonia axyridis"                                         
#> [1539] "Calidris pugnax"                                           
#> [1540] "Machetes pugnax"                                           
#> [1541] "Pavoncella pugnax"                                         
#> [1542] "Philomachus pugnax"                                        
#> [1543] "Tringa pugnax"                                             
#> [1544] "Aimophila crissalis"                                       
#> [1545] "Kieneria crissalis"                                        
#> [1546] "Kieneria crissalis_(Vigors,_1839)"                         
#> [1547] "Melozone crissalis"                                        
#> [1548] "Pipilo crissalis"                                          
#> [1549] "Pipilo fuscus_crissalis"                                   
#> [1550] "Stomoxis calcitrans"                                       
#> [1551] "Stomoxys calcitrans"                                       
#> [1552] "Anas atrata"                                               
#> [1553] "Cygnus atratus"                                            
#> [1554] "Culex fatigans"                                            
#> [1555] "Culex pipiens_fatigans"                                    
#> [1556] "Culex pipiens_quinquefasciatus"                            
#> [1557] "Culex quinquefasciatus"                                    
#> [1558] "Hirundo rustica"                                           
#> [1559] "Acanthaster planci"                                        
#> [1560] "Molothrus ater"                                            
#> [1561] "Oriolus ater"                                              
#> [1562] "Laccaria bicolor_S238N-H82"                                
#> [1563] "Apteryx australis_rowii"                                   
#> [1564] "Apteryx rowii"                                             
#> [1565] "Apteryx rowi"                                              
#> [1566] "Anastrepha obliqua"                                        
#> [1567] "Grapholitha glycinivorella"                                
#> [1568] "Leguminivora glycinivorella"                               
#> [1569] "Crypturus perdicarius"                                     
#> [1570] "Nothoprocta perdicaria"                                    
#> [1571] "Ammospiza nelsoni"                                         
#> [1572] "Nylanderia fulva"                                          
#> [1573] "Paratrechina fulva"                                        
#> [1574] "Agelaius phoeniceus"                                       
#> [1575] "Agelaius phoniceus"                                        
#> [1576] "Oriolus phoeniceus"                                        
#> [1577] "Colletotrichum fructicola"                                 
#> [1578] "Colletotrichum ignotum"                                    
#> [1579] "Euthrips occidentalis"                                     
#> [1580] "Frankliniella brunnescens"                                 
#> [1581] "Frankliniella californica"                                 
#> [1582] "Frankliniella occidentalis_brunnescens"                    
#> [1583] "Frankliniella occidentalis"                                
#> [1584] "Motacilla alba_alba"                                       
#> [1585] "Fusarium solani_(Mart.)_Sacc.,_1881"                       
#> [1586] "Fusarium solani"                                           
#> [1587] "Fusisporium solani"                                        
#> [1588] "Neocosmospora solani"                                      
#> [1589] "Sitophilus oryzae"                                         
#> [1590] "Corvus cornix_cornix"                                      
#> [1591] "Fringilla canaria_Linnaeus,_1758"                          
#> [1592] "Serinus canaria"                                           
#> [1593] "Serinus canarius"                                          
#> [1594] "Drosophila subpulchrella"                                  
#> [1595] "Chlamydomonas reinhardtii"                                 
#> [1596] "Chlamydomonas smithii"                                     
#> [1597] "Puccinia striiformis_f._sp._tritici"                       
#> [1598] "Bactrocera cucurbitae"                                     
#> [1599] "Bactrocera (Zeugodacus)_cucurbitae"                        
#> [1600] "Zeugodacus cucurbitae"                                     
#> [1601] "Zeugodacus (Zeugodacus)_cucurbitae"                        
#> [1602] "Antrodia serialis"                                         
#> [1603] "Neoantrodia serialis"                                      
#> [1604] "Drosophila suzukii"                                        
#> [1605] "Wyeomyia smithii"                                          
#> [1606] "Montifringilla ruficollis"                                 
#> [1607] "Pyrgilauda ruficollis"                                     
#> [1608] "Gymnogyps californianus"                                   
#> [1609] "Vultur californianus"                                      
#> [1610] "Bactrocera (Bactrocera)_dorsalis"                          
#> [1611] "Bactrocera (Bactrocera)_invadens"                          
#> [1612] "Bactrocera dorsalis"                                       
#> [1613] "Bactrocera invadens"                                       
#> [1614] "Bactrocera papayae"                                        
#> [1615] "Bactrocera philippinensis"                                 
#> [1616] "Trichoplusia ni"                                           
#> [1617] "Leptothorax curvispinosus"                                 
#> [1618] "Temnothorax curvispinosus"                                 
#> [1619] "Saprolegnia declina_VS20"                                  
#> [1620] "Saprolegnia diclina_VS20"                                  
#> [1621] "Zonotrichia albicollis"                                    
#> [1622] "Bactrocera neohumeralis"                                   
#> [1623] "Sphaeria pertusa"                                          
#> [1624] "Trematosphaeria pertusa"                                   
#> [1625] "Fusarium oxysporum_var._redolens"                          
#> [1626] "Fusarium redolens"                                         
#> [1627] "Anastrepha ludens"                                         
#> [1628] "Cantharellus anzutake"                                     
#> [1629] "Malurus melanocephalus"                                    
#> [1630] "Muscicapa melanocephala"                                   
#> [1631] "Melitaea cinxia"                                           
#> [1632] "Papilio cinxia"                                            
#> [1633] "Maniola jurtina"                                           
#> [1634] "Papilio jurtina"                                           
#> [1635] "Anas fuligula"                                             
#> [1636] "Aythya fuligula"                                           
#> [1637] "Bombyx mori"                                               
#> [1638] "Phalaena mori"                                             
#> [1639] "Botys furnacalis"                                          
#> [1640] "Ostrinia furnacalis"                                       
#> [1641] "Priapula caudata"                                          
#> [1642] "Priapulus caudatus"                                        
#> [1643] "Apanteles glomeratus"                                      
#> [1644] "Cotesia glomerata"                                         
#> [1645] "Centrocercus urophasianus"                                 
#> [1646] "Centrocerus urophasianus"                                  
#> [1647] "Tetrao urophasianus"                                       
#> [1648] "Montifringilla taczanowskii_(Przewalski,_1876)"            
#> [1649] "Onychostruthus taczanowskii"                               
#> [1650] "Monomorium pharaonis"                                      
#> [1651] "Daktulosphaira vitifoliae"                                 
#> [1652] "Pemphigus vitifoliae"                                      
#> [1653] "Viteus vitifoliae"                                         
#> [1654] "Helicoverpa armigera"                                      
#> [1655] "Heliothis armigera"                                        
#> [1656] "Heliothis (Helicoverpa)_armigera"                          
#> [1657] "Noctua armigera"                                           
#> [1658] "Drosophila biarmipes"                                      
#> [1659] "Myzus (Nectarosiphon)_persicae"                            
#> [1660] "Myzus persicae"                                            
#> [1661] "Lucilia sericata"                                          
#> [1662] "Phaenicia sericata"                                        
#> [1663] "Tinamus guttatus"                                          
#> [1664] "Solenopsis invicta"                                        
#> [1665] "Fringilla georgiana"                                       
#> [1666] "Melospiza georgiana"                                       
#> [1667] "Helicoverpa zea"                                           
#> [1668] "Heliothis zea"                                             
#> [1669] "Phalaena zea"                                              
#> [1670] "Drosophila ananassae"                                      
#> [1671] "Drosophila annanassae"                                     
#> [1672] "Fusarium odoratissimum_NRRL_54006"                         
#> [1673] "Coccinella 7-punctata"                                     
#> [1674] "Coccinella septempunctata"                                 
#> [1675] "Spodoptera frugiperda"                                     
#> [1676] "Tigriopus californicus"                                    
#> [1677] "Ficedula albicollis"                                       
#> [1678] "Muscicapa albicollis"                                      
#> [1679] "Drosophila pseudoobscura"                                  
#> [1680] "Mytilidion resinicola"                                     
#> [1681] "Mytilinidion resinicola"                                   
#> [1682] "Halyomorpha halys"                                         
#> [1683] "Phycomyces blakesleeanus_NRRL_1555(-)"                     
#> [1684] "Camarhynchus parvulus"                                     
#> [1685] "Geospiza parvula"                                          
#> [1686] "Drosophila willistoni"                                     
#> [1687] "Monoraphidium neglectum"                                   
#> [1688] "Sturnus vulgaris"                                          
#> [1689] "Bactrocera tryoni"                                         
#> [1690] "Apus apus"                                                 
#> [1691] "Hirundo apus"                                              
#> [1692] "Suillus paluster"                                          
#> [1693] "Naegleria gruberi"                                         
#> [1694] "Suillus discolor"                                          
#> [1695] "Suillus tomentosus_var._discolor"                          
#> [1696] "Manacus vitellinus"                                        
#> [1697] "Pipra vitellina"                                           
#> [1698] "Trichinella spiralis"                                      
#> [1699] "Onthophagus taurus"                                        
#> [1700] "Epistrophe balteatus"                                      
#> [1701] "Episyrphus balteatus"                                      
#> [1702] "Episyrphus (Episyrphus)_balteatus"                         
#> [1703] "Leptinotarsa decemlineata"                                 
#> [1704] "Leptinotarsa decimlineata"                                 
#> [1705] "Stilodes decemlineata"                                     
#> [1706] "Struthio australis"                                        
#> [1707] "Struthio camelus_australis"                                
#> [1708] "Boletus plorans"                                           
#> [1709] "Suillus plorans"                                           
#> [1710] "Dryobates pubescens"                                       
#> [1711] "Picoides pubescens_(Linnaeus,_1766)"                       
#> [1712] "Picoides pubescens"                                        
#> [1713] "Fusarium proliferatum_ET1"                                 
#> [1714] "Fusarium oxysporum_Fo47"                                   
#> [1715] "Drosophila sechellia"                                      
#> [1716] "Schizophyllum commune_H4-8"                                
#> [1717] "Depressaria gossypiella"                                   
#> [1718] "Pectinophora gossypiella"                                  
#> [1719] "Parus humilis"                                             
#> [1720] "Podoces humilis"                                           
#> [1721] "Pseudopoces humilis"                                       
#> [1722] "Pseudopodoces humilis"                                     
#> [1723] "Ascidia intestinalis"                                      
#> [1724] "Ciona intestinalis"                                        
#> [1725] "Opisthorchis viverrini"                                    
#> [1726] "Puccinia graminis_f._sp._tritici_CRL_75-36-700-3"          
#> [1727] "Plutella xylostella"                                       
#> [1728] "Melampsora larici-populina_98AG31"                         
#> [1729] "Drosophila obscura"                                        
#> [1730] "Fusarium verticillioides_7600"                             
#> [1731] "Anoplophora glabripennis"                                  
#> [1732] "Anoplophora nobilis"                                       
#> [1733] "Cerosterna glabripennis"                                   
#> [1734] "Melanauster nobilis"                                       
#> [1735] "Calypte anna"                                              
#> [1736] "Ornismya anna"                                             
#> [1737] "Microdochium trichocladiopsis"                             
#> [1738] "Anopheles merus"                                           
#> [1739] "Bactrocera (Daculus)_oleae"                                
#> [1740] "Bactrocera (Dacus)_oleae"                                  
#> [1741] "Bactrocera oleae"                                          
#> [1742] "Dacus oleae"                                               
#> [1743] "Fusarium mangiferae"                                       
#> [1744] "Drosophila yakuba"                                         
#> [1745] "Contarinia nasturtii"                                      
#> [1746] "Parastagonospora nodorum_SN15"                             
#> [1747] "Drosophila virilis"                                        
#> [1748] "Zasmidium cellare_ATCC_36951"                              
#> [1749] "Drosophila mauritiana"                                     
#> [1750] "Geospiza fortis"                                           
#> [1751] "Eupeodes corollae"                                         
#> [1752] "Eupeodes (Eupeodes)_corollae"                              
#> [1753] "Metasyrphus corollae"                                      
#> [1754] "Spodoptera litura"                                         
#> [1755] "Sitodiplosis mosellana"                                    
#> [1756] "Microgaster mediator"                                      
#> [1757] "Microplitis medianus"                                      
#> [1758] "Microplitis mediator"                                      
#> [1759] "Drosophila kikkawai"                                       
#> [1760] "Diaporthe citri"                                           
#> [1761] "Phomopsis citri"                                           
#> [1762] "Mesites unicolor"                                          
#> [1763] "Mesitornis unicolor"                                       
#> [1764] "Suillus subaureus"                                         
#> [1765] "Colletotrichum capsici"                                    
#> [1766] "Colletotrichum dematium_f._truncatum_(Schwein.)_Arx,_1957" 
#> [1767] "Colletotrichum truncatum"                                  
#> [1768] "Glomerella glycines"                                       
#> [1769] "Vermicularia capsici"                                      
#> [1770] "Vermicularia truncata"                                     
#> [1771] "Drosophila simulans"                                       
#> [1772] "Anisochrysa carnea"                                        
#> [1773] "Chrysopa carnea"                                           
#> [1774] "Chrysoperla carnea"                                        
#> [1775] "Drosophila takahashii"                                     
#> [1776] "Lucilia cuprina"                                           
#> [1777] "Drosophila persimilis"                                     
#> [1778] "Falco albicilla"                                           
#> [1779] "Haliaeetus albicilla"                                      
#> [1780] "Antrostomus carolinensis"                                  
#> [1781] "Caprimulgus carolinensis"                                  
#> [1782] "Nasonia vitripennis"                                       
#> [1783] "Athene cunicularia"                                        
#> [1784] "Speotyto cunicularia"                                      
#> [1785] "Strix cunicularia"                                         
#> [1786] "Colias crocea"                                             
#> [1787] "Colias croceus"                                            
#> [1788] "Papilio croceus"                                           
#> [1789] "Leptidea sinapis"                                          
#> [1790] "Papilio sinapis"                                           
#> [1791] "Anopheles arabiensis"                                      
#> [1792] "Drosophila ficusphila"                                     
#> [1793] "Vollenhovia emeryi"                                        
#> [1794] "Hermetia illucens"                                         
#> [1795] "Fusarium vanettenii_77-13-4"                               
#> [1796] "Nectria haematococca_mpVI_77-13-4"                         
#> [1797] "Thrips palmi"                                              
#> [1798] "Falco leucocephalus"                                       
#> [1799] "Haliaeetus leucocephalus"                                  
#> [1800] "Malaya genurostris"                                        
#> [1801] "Colletotrichum gloeosporioides_(Penz.)_Penz._&_Sacc.,_1884"
#> [1802] "Colletotrichum gloeosporioides"                            
#> [1803] "Glomerella cingulata"                                      
#> [1804] "Glomerella rufomaculans-vaccinii"                          
#> [1805] "Gnomoniopsis cingulata"                                    
#> [1806] "Vermicularia gloeosporioides"                              
#> [1807] "Acanthamoeba castellanii_Neff_strain"                      
#> [1808] "Acanthamoeba castellanii_strain_Neff"                      
#> [1809] "Acanthamoeba castellanii_str._Neff"                        
#> [1810] "Drosophila albomicans"                                     
#> [1811] "Drosophila nasuta_albomicans"                              
#> [1812] "Diaporthe amygdali"                                        
#> [1813] "Fusicoccum amygdali"                                       
#> [1814] "Phomopsis amygdali"                                        
#> [1815] "Pelecanus crispus"                                         
#> [1816] "Pelecanus philippensis_crispus"                            
#> [1817] "Drosophila rhopaloa"                                       
#> [1818] "Aphantopus hyperantus"                                     
#> [1819] "Maniola hyperantus"                                        
#> [1820] "Papilio hyperantus"                                        
#> [1821] "Drosophila serrata"                                        
#> [1822] "Leptopilina heterotoma"                                    
#> [1823] "Peronospora halstedii"                                     
#> [1824] "Plasmopara halstedii"                                      
#> [1825] "Cuculus discolor"                                          
#> [1826] "Leptosomus discolor"                                       
#> [1827] "Aphanomyces invadans"                                      
#> [1828] "Drosophila santomea"                                       
#> [1829] "Sipha flava"                                               
#> [1830] "Drosophila teissieri"                                      
#> [1831] "Aptenodytes forsteri"                                      
#> [1832] "Phaethon lepturus"                                         
#> [1833] "Drosophila bipectinata"                                    
#> [1834] "Fulmaris glacialis"                                        
#> [1835] "Fulmarus glacialis"                                        
#> [1836] "Procellaria glacialis"                                     
#> [1837] "Ardea garzetta"                                            
#> [1838] "Egretta garzetta"                                          
#> [1839] "Anopheles mysorensis"                                      
#> [1840] "Anopheles stephensi_mysorensis"                            
#> [1841] "Anopheles stephensi"                                       
#> [1842] "Anopheles stephensi_var._mysorensis"                       
#> [1843] "Neocellia intermedia_Rothwell,_1907"                       
#> [1844] "Neocellia intermedia"                                      
#> [1845] "Cryptotermes secundus"                                     
#> [1846] "Pestalotiopsis fici_W106-1"                                
#> [1847] "Aricia agestis"                                            
#> [1848] "Papilio agestis"                                           
#> [1849] "Polyommatus agestis"                                       
#> [1850] "Artogeia napi"                                             
#> [1851] "Papilio napi"                                              
#> [1852] "Pieris napi"                                               
#> [1853] "Drosophila eugracilis"                                     
#> [1854] "Wasmannia auropunctata"                                    
#> [1855] "Oppia nitens"                                              
#> [1856] "Adelges cooleyi"                                           
#> [1857] "Chermes cooleyi"                                           
#> [1858] "Gilletteella cooleyi"                                      
#> [1859] "Acanthisitta chloris"                                      
#> [1860] "Sitta chloris"                                             
#> [1861] "Agrilus feretrius"                                         
#> [1862] "Agrilus marcopoli"                                         
#> [1863] "Agrilus planipennis"                                       
#> [1864] "Drosophila elegans"                                        
#> [1865] "Hyposmocoma kahamanoa"                                     
#> [1866] "Cariama cristata"                                          
#> [1867] "Palamedea cristata"                                        
#> [1868] "Aleurodes tabaci"                                          
#> [1869] "Aleyrodes tabaci"                                          
#> [1870] "Bemisia tabaci"                                            
#> [1871] "Ibis nippon"                                               
#> [1872] "Nipponia nippon"                                           
#> [1873] "Balearica gibbericeps"                                     
#> [1874] "Balearica pavonina_gibbericeps"                            
#> [1875] "Balearica regulorum_gibbericepse"                          
#> [1876] "Balearica regulorum_gibbericeps"                           
#> [1877] "Bombus affinis"                                            
#> [1878] "Varroa jacobsoni"                                          
#> [1879] "Drosophila gunungcola"                                     
#> [1880] "Colius striatus"                                           
#> [1881] "Tauraco erythrolophus"                                     
#> [1882] "Colletotrichum aenigma"                                    
#> [1883] "Colletotrichum communis"                                   
#> [1884] "Colletotrichum dianesei"                                   
#> [1885] "Colletotrichum endomangiferae"                             
#> [1886] "Colletotrichum hymenocallidis"                             
#> [1887] "Colletotrichum jasmini-sambac"                             
#> [1888] "Colletotrichum melanocaulon"                               
#> [1889] "Colletotrichum siamense"                                   
#> [1890] "Pterocles gutturalis"                                      
#> [1891] "Aethina tumida"                                            
#> [1892] "Galleria mellonella"                                       
#> [1893] "Phalaena mellonella"                                       
#> [1894] "Bicyclus anynana"                                          
#> [1895] "Mycalesis anynana"                                         
#> [1896] "Leptopilina boulardi"                                      
#> [1897] "Zootermopsis nevadensis"                                   
#> [1898] "Achroia grisella"                                          
#> [1899] "Tinea grisella"                                            
#> [1900] "Acremonium falciforme"                                     
#> [1901] "Cephalosporium falciforme"                                 
#> [1902] "Fusarium falciforme"                                       
#> [1903] "Neocosmospora falciformis"                                 
#> [1904] "Drosophila mohavensis"                                     
#> [1905] "Drosophila mojavensis"                                     
#> [1906] "Drosophila innubila"                                       
#> [1907] "Bombus huntii"                                             
#> [1908] "Cuculus indicator"                                         
#> [1909] "Indicator indicator"                                       
#> [1910] "Dendroctonus ponderosae"                                   
#> [1911] "Ardea helias"                                              
#> [1912] "Eurypyga helias"                                           
#> [1913] "Loa loa"                                                   
#> [1914] "Cadophora gregata"                                         
#> [1915] "Cephalosporium gregatum"                                   
#> [1916] "Phialophora gregata"                                       
#> [1917] "Nestor notabilis_notabilis"                                
#> [1918] "Nestor notabilis"                                          
#> [1919] "Colymbus stellatus"                                        
#> [1920] "Gavia stellata"                                            
#> [1921] "Cynthia cardui"                                            
#> [1922] "Papilio cardui"                                            
#> [1923] "Vanessa cardui"                                            
#> [1924] "Cladosporium fulvum"                                       
#> [1925] "Fulvia fulva"                                              
#> [1926] "Mycovellosiella fulva"                                     
#> [1927] "Passalora fulva"                                           
#> [1928] "Plodia interpunctella"                                     
#> [1929] "Tinea interpunctella"                                      
#> [1930] "Stilbospora angustata"                                     
#> [1931] "Truncatella angustata"                                     
#> [1932] "Truncatella truncata"                                      
#> [1933] "Chlamydotis macqueenii_macqueenii"                         
#> [1934] "Chlamydotis macqueenii_macqueeni"                          
#> [1935] "Chlamydotis macqueenii"                                    
#> [1936] "Chlamydotis undulata_macqueenii"                           
#> [1937] "Otis macqueenii"                                           
#> [1938] "Anopheles funestus"                                        
#> [1939] "Fusarium fujikuroi_IMI_58289"                              
#> [1940] "Cephalosporium keratoplasticum_(nom._inval.)"              
#> [1941] "Fusarium keratoplasticum"                                  
#> [1942] "Neocosmospora keratoplastica"                              
#> [1943] "Drosophila lebanonensis"                                   
#> [1944] "Scaptodrosophila lebanonensis"                             
#> [1945] "Merops nubicus"                                            
#> [1946] "Coniothyrium fuckelii_var._sporulosum"                     
#> [1947] "Coniothyrium sporulosum"                                   
#> [1948] "Paraconiothyrium sporulosum"                               
#> [1949] "Paraconyotrichium sporulosum"                              
#> [1950] "Paraphaeosphaeria sporulosa"                               
#> [1951] "Fusarium venenatum"                                        
#> [1952] "Fusarium venetum"                                          
#> [1953] "Amyelois transitella"                                      
#> [1954] "Nephopteryx transitella"                                   
#> [1955] "Pelecanus carbo"                                           
#> [1956] "Phalacrocorax carbo"                                       
#> [1957] "Naegleria lovaniensis"                                     
#> [1958] "Papilio machaon"                                           
#> [1959] "Gaeumannomyces tritici_R3-111a-1"                          
#> [1960] "Papilio aegeria"                                           
#> [1961] "Pararge aegeria"                                           
#> [1962] "Lophyrus lecontei"                                         
#> [1963] "Neodiprion lecontei"                                       
#> [1964] "Sclerotinia sclerotiorum_1980_UF-70"                       
#> [1965] "Aegialitis vocifera"                                       
#> [1966] "Charadrius vociferous"                                     
#> [1967] "Charadrius vociferus"                                      
#> [1968] "Oxyechus vociferus"                                        
#> [1969] "Drosophila erecta"                                         
#> [1970] "Anopheles gambiae"                                         
#> [1971] "Arabidopsis thaliana"                                      
#> [1972] "Bos taurus"                                                
#> [1973] "Canis familiaris"                                          
#> [1974] "Gallus gallus"                                             
#> [1975] "Pan troglodytes"                                           
#> [1976] "Escherichia coli"                                          
#> [1977] "Drosophila melanogaster"                                   
#> [1978] "Homo sapiens"                                              
#> [1979] "Mus musculus"                                              
#> [1980] "Sus scrofa"                                                
#> [1981] "Rattus norvegicus"                                         
#> [1982] "Macaca mulatta"                                            
#> [1983] "Caenorhabditis elegans"                                    
#> [1984] "Xenopus laevis"                                            
#> [1985] "Saccharomyces cerevisiae"                                  
#> [1986] "Danio rerio"

To enable gene ontology analysis, one must use a SpliceWiz reference with prepared GO annotations for the specified organism. To view the gene ontology annotation for a given SpliceWiz reference:

ref_path <- file.path(tempdir(), "Reference")
ontology <- viewGO(ref_path)
head(ontology)
#>           gene_id      go_id evidence              go_term ontology gene_name
#> 1 ENSG00000121410 GO:0003674       ND   molecular_function       MF      <NA>
#> 2 ENSG00000121410 GO:0005576      HDA extracellular region       CC      <NA>
#> 3 ENSG00000121410 GO:0005576      IDA extracellular region       CC      <NA>
#> 4 ENSG00000121410 GO:0005576      TAS extracellular region       CC      <NA>
#> 5 ENSG00000121410 GO:0005615      HDA  extracellular space       CC      <NA>
#> 6 ENSG00000121410 GO:0005886      IBA      plasma membrane       CC      <NA>

Note that gene_names are not available for our example reference because there are only 7 genes in the example reference. Nevertheless, the GO annotation is complete and relies on Ensembl gene_ids for matching.

For simple gene over-representation analysis, we use the goGenes function. As an example, to analyse for enriched biological functions of the first 1000 genes in the reference:

allGenes <- sort(unique(ontology$gene_id))
exampleGeneID <- allGenes[1:1000]
exampleBkgdID <- allGenes

go_byGenes <- goGenes(
    enrichedGenes = exampleGeneID, 
    universeGenes = exampleBkgdID, 
    ontologyRef = ontology
)

To visualize the top gene ontology categories of the above analysis:

plotGO(go_byGenes, filter_n_terms = 12)

Of course, we wish to perform GO analysis on the top differential events in our analysis:

res_edgeR <- ASE_edgeR(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A"
)
#> Oct 16 21:04:49 Performing edgeR contrast for included / excluded counts separately
#> Oct 16 21:04:50 Performing edgeR contrast for included / excluded counts together

go_byASE <- goASE(
  enrichedEventNames = res_edgeR$EventName[1:10],
  universeEventNames = NULL,
  se = se
)
head(go_byASE)
#>         go_id                                                   go_term
#>        <char>                                                    <char>
#> 1: GO:0000381  regulation of alternative mRNA splicing, via spliceosome
#> 2: GO:0045892        negative regulation of DNA-templated transcription
#> 3: GO:0048026     positive regulation of mRNA splicing, via spliceosome
#> 4: GO:0000122 negative regulation of transcription by RNA polymerase II
#> 5: GO:0000375           RNA splicing, via transesterification reactions
#> 6: GO:0000423                                                 mitophagy
#>         pval      padj overlap  size overlapGenes expected foldEnrichment
#>        <num>     <num>   <int> <int>       <list>    <num>          <num>
#> 1: 0.4761905 0.7862282       2     2 ENSG0000....        2              1
#> 2: 0.4761905 0.7862282       2     2 ENSG0000....        2              1
#> 3: 0.4761905 0.7862282       2     2 ENSG0000....        2              1
#> 4: 0.7142857 0.7862282       1     1 ENSG0000....        1              1
#> 5: 0.7142857 0.7862282       1     1 ENSG0000....        1              1
#> 6: 0.7142857 0.7862282       1     1 ENSG0000....        1              1

In most cases, users will wish to use the set of genes represented in the background ASEs as the background genes. This is because some genes do not have alternative splicing events, most often because they are intronless genes!

To perform GO analysis using background genes from analysed ASEs:

go_byASE <- goASE(
  enrichedEventNames = res_edgeR$EventName[1:10],
  universeEventNames = res_edgeR$EventName,
  se = se
)

Heatmaps

Heatmaps are useful for visualizing differential expression of individual samples, as well as potential patterns of expression.

First, obtain a matrix of PSI values:

# Create a matrix of values of the top 10 differentially expressed events:
mat <- makeMatrix(
    se.filtered,
    event_list = res_edgeR$EventName[1:10],
    method = "PSI"
)


How does makeMatrix() work?
makeMatrix() provides a matrix of PSI values from the given NxtSE object. The parameters event_list and sample_list allows subsetting for ASEs and/or samples, respectively.

The parameter method accepts 3 options:

  • "PSI" : outputs raw PSI values
  • "logit" : outputs logit PSI values
  • "Z-score" : outputs Z-score transformed PSI values

Also, makeMatrix() facilitates exclusion of low confidence PSI values. These can occur when counts of both isoforms are too low. Setting the depth_threshold (default 10) will set samples with total isoform count below this value to be converted to NA.

Splicing events (ASEs) with too many NA values are filtered out. Setting the parameter na.percent.max (default 0.1) means any ASE with the proportion of NA above this threshold will be removed from the final matrix.


Plot this matrix of values in a heatmap:

library(pheatmap)

anno_col_df <- as.data.frame(colData(se.filtered))
anno_col_df <- anno_col_df[, 1, drop=FALSE]

pheatmap(mat, annotation_col = anno_col_df)


Using the GUI
Navigate to Display, then Heatmap in the menu side bar. After relaxing the event filters as per the prior sections, a heatmap will be automatically generated:

Heatmap - GUI

Heatmap - GUI

The heatmap can be customized as follows:

  1. Again, the filtering panel is available to filter the number of events by significance threshold, rank, or by highlighted events, as explained in the previous sections. Highlighted events are particularly useful for heatmaps as users can (if they want) cherry-pick events of interest
  2. After above filtering, an additional filter by gene ontology category can be performed. GO analysis must first be performed, and the top enriched categories are listed here
  3. Samples can be annotated by one or more annotation categories
  4. Samples can be sorted by the chosen annotation category
  5. If (4) is selected, whether the sort order should be ascendeng or descending
  6. The maximum number of rows of the heatmap to display
  7. Whether the heatmap values are raw PSI, logit-transformed PSI, or Z-score transformed values
  8. Users can customize their heatmap color palettes from this list
  9. A static plot (using the pheatmap R package) will be saved to PDF


SpliceWiz Coverage Plots

Coverage plots visualize RNA-seq coverage in individual samples. SpliceWiz uses its coverage normalization algorithm to visualize group differences in PSIs.

What are SpliceWiz coverage plots and how are they generated?
SpliceWiz produces RNA-seq coverage plots of analysed samples. Coverage data is compiled simultaneous to the IR and junction quantitation performed by processBAM(). This data is saved in “COV” files, which is a BGZF compressed and indexed file. COV files show compression and performance gains over BigWig files.

Additionally, SpliceWiz visualizes plots group-averaged coverages, based on user-defined experimental conditions. This is a powerful tool to illustrate group-specific differential splicing or IR. SpliceWiz does this by normalising the coverage depths of each sample based on transcript depth at the splice junction / intron of interest. By doing so, the coverage depths of constitutively expressed flanking exons are normalised to unity. As a result, the intron depths reflect the fraction of transcripts with retained introns and can be compared across samples.


Coverage plots of individual samples

First, lets obtain a list of differential events with delta PSI > 5%:

res_edgeR <- ASE_edgeR(
    se = se.filtered,
    test_factor = "condition",
    test_nom = "B",
    test_denom = "A"
)
#> Oct 16 21:04:53 Performing edgeR contrast for included / excluded counts separately
#> Oct 16 21:04:54 Performing edgeR contrast for included / excluded counts together

res_edgeR.filtered <- res_edgeR[res_edgeR$abs_deltaPSI > 0.05,]
res_edgeR.filtered$EventName[1]
#> [1] "NSUN5/ENST00000252594_Intron2/clean"

We can see here the top differential event belongs to NSUN5. The first step to plotting this event is to create a data object that contains the requisite data for the gene NSUN5:

dataObj <- getCoverageData(se, Gene = "NSUN5", tracks = colnames(se))

This step retrieves all coverage and junction data for NSUN5 (and surrounding genes).

Next, we need to create event-specific data for the IR event in NSUN5 intron 2.

plotObj <- getPlotObject(dataObj, Event = res_edgeR.filtered$EventName[1])

This step normalizes the coverage and junction data from the viewpoint of NSUN5 intron 2.

The final step is to generate the plot:

plotView(
    plotObj,
    centerByEvent = TRUE, # whether the plot should be centered at the `Event`
    trackList = list(1,2,3,4,5,6),
    plotJunctions = TRUE
)

This plots all 6 individual coverage plots, each in its own track.

Why are the tracks referred to using numbers

In the covPlotObject which is returned by the above function, each “track” is a sample. These can be viewed using the tracks() function:

tracks(plotObj)
#> [1] "02H003" "02H025" "02H026" "02H033" "02H043" "02H046"

For convenience, users have the option of referring to tracks by their actual names, or by numbers. So, in the above example, the two parameters below are equivalent:

  • trackList = list(1,2)
  • `trackList = list(“02H003”, “02H025”)


Normalized Coverage plots of individual samples

To plot “normalized” coverage plots, where coverage is normalized to transcript depth (i.e. sum of all transcripts = 1), set normalizeCoverage = TRUE. We can also include multiple samples in each trace, so lets stack all samples in the same condition in the same trace:

plotView(
    plotObj,
    centerByEvent = TRUE,
    trackList = list(c(1,2,3), c(4,5,6)), 
    # Each list element contains a vector of track id's
    
    normalizeCoverage = TRUE
)

NB: junction (sashimi) arcs are not plotted in tracks with more than 1 trace (to avoid cluttering)

Group Coverage plots

To plot group coverage plots, first we need to generate a new covPlotObject. This is because the previous covPlotObject was generated on the basis of each track being an individual sample. To generate a plot object where each track represents a condition:

plotObj_group <- getPlotObject(
    dataObj,
    Event = res_edgeR.filtered$EventName[1],
    condition = "condition",
    tracks = c("A", "B") 
)

# NB:
# when `condition` is not specified, tracks are assumed to be the same samples
# as that of the covDataObject
# when `condition` is specified, tracks must refer to the condition categories
# that are desired for the final plot

Note that there are several scenarios where a new covPlotObject is required:

  • When plotting normalized coverages using a different normalization ASE,
  • When plotting by a different condition or different tracks,
  • When plotting on a different strand (default is * - unstranded)

To generate the group coverage plot, with the two conditions on the same track:

plotView(
    plotObj_group,
    centerByEvent = TRUE,
    trackList = list(c("A", "B"))
)

Coverage plots using exon windows

Sometimes, we are interested in the differential coverage of exons but not of the intervening introns. Given introns are often much longer than exons, it is useful to plot by the exons of interest.

For example, to plot the skipped casette exon in TRA2B:

dataObj <- getCoverageData(se, Gene = "TRA2B", tracks = colnames(se))

plotObj <- getPlotObject(
    dataObj, 
    Event = "SE:TRA2B-206-exon2;TRA2B-205-int1", 
    condition = "condition", tracks = c("A", "B")
)

plotView(
    plotObj, 
    centerByEvent = TRUE, 
    trackList = list(c(1,2)), 
    filterByEventTranscripts = TRUE
)

NB: setting filterByEventTranscripts = TRUE means only transcripts involved in the specified splicing event are plotted in the annotation

Note that the involved exons are in only a small area of the coverage plot. To zoom in on the exons, we first have to plot an “exon view” so the exons are labelled, and at the same time return their coordinates:

gr <- plotView(
    plotObj, 
    centerByEvent = TRUE, 
    trackList = list(c(1,2)), 
    filterByEventTranscripts = TRUE, 
    showExonRanges = TRUE
)

By setting the parameter showExonRanges = TRUE, the plotView function shows a plot with exons and their names in the annotation track, and returns a GRanges object, with ranges named by their corresponding exon names:

names(gr)
#> [1] "TRA2B-205-E3" "TRA2B-206-E4" "TRA2B-206-E3" "TRA2B-206-E2" "TRA2B-206-E1"
#> [6] "TRA2B-205-E2" "TRA2B-213-E2" "TRA2B-213-E1" "TRA2B-205-E1"

We can see in the above figure that he exons of interest are c("TRA2B-206-E1", "TRA2B-206-E2", "TRA2B-206-E3"). To plot these in an “exons view” coverage plot:

plotView(
    plotObj, 
    centerByEvent = TRUE, 
    trackList = list(c(1,2)), 
    filterByEventTranscripts = TRUE, 
    plotRanges = gr[c("TRA2B-206-E1", "TRA2B-206-E2", "TRA2B-206-E3")]
)

Using the GUI

The main coverage plot interface is as follows

Coverage Plots Main Panel - GUI

Coverage Plots Main Panel - GUI

The top bar contains controls to locate the genomic loci of interest:

  1. Genes - locate by the specified gene
  2. Events - locate by the specified alternate splicing event. The list of events can be modified by the right-hand controls in the top bar
  3. Coordinates - locate by the specified chromosome and start/end coordinate
  4. Zoom in or out (each step by a factor of 3)
  5. Strand - view coverage by strand-specific (+/-) or non-specific (*) modes
  6. Select events either by all events in differential analysis (which can be filtered using the html-table column filters), user-highlighted events (using volcano/scatter plots), or by gene ontology category. If using gene-ontology category, an extra drop-down box will appear allowing users to specify from the top gene ontology categories from prior GO analysis
  7. Limit the number of events displayed in the Event drop-down box (2)

In the plot control panel on the left hand side of the main plot area:

  1. Select which alternative splicing event that should be normalized against. This is critical when plotting normalized coverages (14) or plotting by condition (9)
  2. Select whether to plot by individual samples, or specify the condition to plot group-plots. See the next section for condition-specific controls.
  3. The track list table allows users to specify track ID’s for one or more of the specified controls. The left hand column displays available sample names (if plotting by individual samples), or condition categories (if a condition is specified)
  4. Interactive plots can be slow to render because of too many data points. Resolution (the number of data points across the main display) can be controlled using this slider.
  5. For group plots, variance of each group can be displayed either as standard deviation (SD), standard error of the mean (SEM), 95% confidence interval (95% CI) or none.
  6. Whether junction (sashimi) arcs should be plotted. Note that junction arcs will be disabled if there are multiple traces on the same track
  7. When plotting individual samples, whether raw or normalized coverages should be displayed. Junction counts will also be normalized (they are automatically normalized in group coverage plots)
  8. Whether to display isoforms belonging to the included / excluded isoform described in the selected normalizing alternative splicing event
  9. Whether transcripts should be condensed by gene. This is useful if there are too many transcripts cluttering the display.
  10. If selected, brings up a transcript table where users can select 1 or more transcripts to display in the annotation track.
  11. If selected, converts the annotation track into an exons display track with named exons. A table will pop up allowing users to select 2 or more exon ranges. Selecting these exons will generate a static exons-only coverage plot.

The main plot (19) is a plotly-based interactive plot. Users can zoom in to a genomic loci of interest using the zoom tool.

There are several options that appear if a specific condition is set:

Group Coverage Plots - GUI

Group Coverage Plots - GUI

  1. Note that the track list table now shows condition categories instead of names of samples
  2. Difference of normalized coverage between conditions can be objectively measured. Users have the option to use the Student’s T-test to measure difference in normalized coverage between replicates of a pair of conditions
  3. Two drop-down boxes allowing users to select a pair of conditions to assess difference in normalized coverage

As mentioned, when the “Exon Plot mode” is selected, an exons table is displayed where users can select one or more exon ranges can be selected, as shown below:

Generating Exon Coverage (Static) Plots - GUI

Generating Exon Coverage (Static) Plots - GUI

Selecting 2 or more exon ranges will trigger (after a 3 second delay) a static exon-window coverage plot to be generated.

Note that at the bottom of the left hand panel, users can save the interactive (top) plot, or the static exon-window coverage (bottom) plot to PDF as ggplot- based figures.


SessionInfo

sessionInfo()
#> R version 4.4.1 (2024-06-14)
#> Platform: x86_64-pc-linux-gnu
#> Running under: Ubuntu 22.04.5 LTS
#> 
#> Matrix products: default
#> BLAS:   /home/biocbuild/bbs-3.19-bioc/R/lib/libRblas.so 
#> LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
#> 
#> locale:
#>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
#>  [3] LC_TIME=en_GB              LC_COLLATE=C              
#>  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
#>  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
#>  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
#> 
#> time zone: America/New_York
#> tzcode source: system (glibc)
#> 
#> attached base packages:
#> [1] splines   stats4    stats     graphics  grDevices utils     datasets 
#> [8] methods   base     
#> 
#> other attached packages:
#>  [1] pheatmap_1.0.12             ggplot2_3.5.1              
#>  [3] DESeq2_1.44.0               SummarizedExperiment_1.34.0
#>  [5] Biobase_2.64.0              MatrixGenerics_1.16.0      
#>  [7] matrixStats_1.4.1           GenomicRanges_1.56.2       
#>  [9] GenomeInfoDb_1.40.1         IRanges_2.38.1             
#> [11] S4Vectors_0.42.1            DoubleExpSeq_1.1           
#> [13] edgeR_4.2.2                 limma_3.60.6               
#> [15] fstcore_0.9.18              AnnotationHub_3.12.0       
#> [17] BiocFileCache_2.12.0        dbplyr_2.5.0               
#> [19] BiocGenerics_0.50.0         SpliceWiz_1.6.6            
#> [21] NxtIRFdata_1.10.0          
#> 
#> loaded via a namespace (and not attached):
#>   [1] later_1.3.2               BiocIO_1.14.0            
#>   [3] bitops_1.0-9              filelock_1.0.3           
#>   [5] tibble_3.2.1              R.oo_1.26.0              
#>   [7] XML_3.99-0.17             lifecycle_1.0.4          
#>   [9] processx_3.8.4            lattice_0.22-6           
#>  [11] dendextend_1.18.1         magrittr_2.0.3           
#>  [13] plotly_4.10.4             sass_0.4.9               
#>  [15] rmarkdown_2.28            jquerylib_0.1.4          
#>  [17] yaml_2.3.10               httpuv_1.6.15            
#>  [19] cowplot_1.1.3             chromote_0.3.1           
#>  [21] DBI_1.2.3                 RColorBrewer_1.1-3       
#>  [23] abind_1.4-8               zlibbioc_1.50.0          
#>  [25] rvest_1.0.4               purrr_1.0.2              
#>  [27] R.utils_2.12.3            RCurl_1.98-1.16          
#>  [29] rappdirs_0.3.3            seriation_1.5.6          
#>  [31] GenomeInfoDbData_1.2.12   genefilter_1.86.0        
#>  [33] annotate_1.82.0           DelayedMatrixStats_1.26.0
#>  [35] codetools_0.2-20          DelayedArray_0.30.1      
#>  [37] DT_0.33                   xml2_1.3.6               
#>  [39] tidyselect_1.2.1          UCSC.utils_1.0.0         
#>  [41] farver_2.1.2              rhandsontable_0.3.8      
#>  [43] viridis_0.6.5             TSP_1.2-4                
#>  [45] shinyWidgets_0.8.7        webshot_0.5.5            
#>  [47] GenomicAlignments_1.40.0  jsonlite_1.8.9           
#>  [49] fst_0.9.8                 survival_3.7-0           
#>  [51] iterators_1.0.14          foreach_1.5.2            
#>  [53] tools_4.4.1               progress_1.2.3           
#>  [55] Rcpp_1.0.13               glue_1.8.0               
#>  [57] gridExtra_2.3             SparseArray_1.4.8        
#>  [59] xfun_0.48                 websocket_1.4.2          
#>  [61] dplyr_1.1.4               ca_0.71.1                
#>  [63] HDF5Array_1.32.1          numDeriv_2016.8-1.1      
#>  [65] shinydashboard_0.7.2      withr_3.0.1              
#>  [67] BiocManager_1.30.25       fastmap_1.2.0            
#>  [69] rhdf5filters_1.16.0       fansi_1.0.6              
#>  [71] digest_0.6.37             R6_2.5.1                 
#>  [73] mime_0.12                 colorspace_2.1-1         
#>  [75] GO.db_3.19.1              RSQLite_2.3.7            
#>  [77] R.methodsS3_1.8.2         utf8_1.2.4               
#>  [79] tidyr_1.3.1               generics_0.1.3           
#>  [81] data.table_1.16.2         rtracklayer_1.64.0       
#>  [83] prettyunits_1.2.0         httr_1.4.7               
#>  [85] htmlwidgets_1.6.4         S4Arrays_1.4.1           
#>  [87] pkgconfig_2.0.3           gtable_0.3.5             
#>  [89] blob_1.2.4                registry_0.5-1           
#>  [91] XVector_0.44.0            htmltools_0.5.8.1        
#>  [93] fgsea_1.30.0              scales_1.3.0             
#>  [95] ompBAM_1.8.0              png_0.1-8                
#>  [97] knitr_1.48                rjson_0.2.23             
#>  [99] curl_5.2.3                cachem_1.1.0             
#> [101] rhdf5_2.48.0              BiocVersion_3.19.1       
#> [103] parallel_4.4.1            AnnotationDbi_1.66.0     
#> [105] restfulr_0.0.15           pillar_1.9.0             
#> [107] grid_4.4.1                vctrs_0.6.5              
#> [109] promises_1.3.0            shinyFiles_0.9.3         
#> [111] xtable_1.8-4              evaluate_1.0.1           
#> [113] locfit_1.5-9.10           cli_3.6.3                
#> [115] compiler_4.4.1            Rsamtools_2.20.0         
#> [117] rlang_1.1.4               crayon_1.5.3             
#> [119] heatmaply_1.5.0           labeling_0.4.3           
#> [121] ps_1.8.0                  fs_1.6.4                 
#> [123] stringi_1.8.4             viridisLite_0.4.2        
#> [125] BiocParallel_1.38.0       assertthat_0.2.1         
#> [127] munsell_0.5.1             Biostrings_2.72.1        
#> [129] lazyeval_0.2.2            Matrix_1.7-0             
#> [131] BSgenome_1.72.0           hms_1.1.3                
#> [133] patchwork_1.3.0           sparseMatrixStats_1.16.0 
#> [135] bit64_4.5.2               Rhdf5lib_1.26.0          
#> [137] statmod_1.5.0             KEGGREST_1.44.1          
#> [139] shiny_1.9.1               highr_0.11               
#> [141] memoise_2.0.1             bslib_0.8.0              
#> [143] fastmatch_1.1-4           bit_4.5.0