SeqPlots 1.22.2
Tracks:
Features:
Files must be formatted according to UCSC guidelines. All widely used chromosome names conventions are accepted, e.g. for human files either ‘chr1’ or ‘1’ can be used, however these conventions should not be mixed within single files.
Press the Add files
button to bring up the file upload panel.
You can drag and drop files here or press the Add files...
button to open a file selection menu. Before starting the upload the following mandatory information must be provided about each file:
Comments are optional.
The contents of the text field can be copied to all files by clicking the icon at the left of the field. The default values can be set using Set defaults...
button. Default values are stored using the browser cookies, and the settings will be remembered across different sessions as long as the same web browser is used. File extensions that are not supported will raise an error.
Individual files can be uploaded by pressing ‘start’ next to the file name or all files can be uploaded at once by pressing the Start upload
button at the top of file upload panel.
During the upload process a progress bar is displayed. After upload SeqPlots gives a message that upload was successful or or gives an error message. Common errors are misformatted file formats or chromosome names do not matched the reference genome. For more information please refer to Error explained chapter.
To dismiss the upload window, click on X
or outside the window.
Clicking the New plot set
button brings up the file collection window. The primary function of this window is to choose signal tracks and feature files to use for calculating the plots. However, it also provides basic file management capabilities. Information on files can be reviewed and files can be downloaded or deleted. Fields can be searched, filtered and sorted by any column. The red x
button on the right site of file table removes a single file from the collection, while Remove selected files
button will erase all selected files.
Pressing the New plot set
button brings up the file collection window from which you can choose signal tracks and feature files to calculate average plots and heat maps. If you wish to upload more files please refer to adding new files documentation. The file collection window has three tabs:
Tracks
- signal files, i.e., Wiggle, BigWiggle and BedGraph files.Features
- genomic feature files, i.e., BED, GFF and GTF filesSequence features
- input any motif of interest that you want to plot.The Tracks
and Features
tabs displays information about the files and allows you to filter and sort by any column. The “Search:” dialog allows you to find any keyword in any field, while dropdowns below the file grid allow for more advanced filtering on specific columns.
Select files by clicking on the file name or any other part of the row beside Show comment
and Download
or Remove
buttons. Chosen files are highlighted in light blue. Clicking the file name again will cancel the selection. At least one signal track or motif and one feature file must be selected before starting the calculation.
Options controlling the plot settings is found below the file selection window:
Bin track @ [bp]:
- this numeric input determines the resolution of data acquisition; the default value 10 means that 10bp intervals within the plotting range will be summarized by calculating the mean. Higher values increases the speed of calculation, but decreases resolution. See the explanations.Choose the plot type
- there are three options:
Point Features
- anchor plot on the start of a feature. By default, plot will be directional if strand information is present (i.e, use start position and plot on positive strand for + strand features and use end position and plot on negative strand for minus strand features). If strand information is not present in the feature file (or if the “ignore strand” option is chosen), plot will use start position of feature and be plotted on the positive strand (see explanations). User chooses length of upstream and downstream sequence to plot.Midpoint Features
- similar to point feature, but plot is centered on the midpoint of the feature.Endpoint Features
- similar to point feature, but plot is centered on the end of the feature. Strand information is used by default to determine the end side.Anchored Features
- features are anchored at start and stop positions and given pseudo-length chosen by the user. Additionally, the user chooses the length of sequence upstream of the start and downstream of the end to plot.Ignore strand
- the directionality (strand) will be ignored all features plotted on the positive strand.Ignore zeros
- signal values of 0 in the track will be be excluded from calculationsCalculate heatmap
- selecting this generates and saves a heat map matrix. Select if you wish to generate heatmap; uncheck if you only wish to generate average plots, as this will speed calculations.Plotting distances in [bp]
- the distances in to be plotted:
Upstream
- the plotting distance in base pairs upstream to the featureAnchored
- the pseudo-length, to which the features will be extended or shrunk using linear approximation (only for anchored plots)Downstream
- the plotting distance in base pairs downstream to the featureThe Sequence features
tab allows you to calculate and plot the density of any user-defined motif around the chosen genomic feature using the reference sequence package. Motif plots can be mixed with track files’ signal plots. The following options can be set:
DNA motif
- the DNA motifSliding window size in base pairs [bp]
- the size of the sliding window for motif calculation. The value (number of matching motifs within the window) is reported in the middle of the window, e.g. if window is set to 200bp, DNA motif is “GC” and there are 8 CpGs in first 200 bp of the chromosome the value 8 will be reported at 100th bp.Display name
- The name of the motif that will be shown in key and heatmap labels. Leave blank to use DNA motif
value.Plot heatmap or error estimates
- this checkbox determines if heatmap matrix and error estimates should be calculated. If unchecked much faster algorithm will be used for motif density calculation, but only the average plot without the error estimates will be available.Match reverse complement as well
- select if reverse complement motif should be reported as well. For example the TATA motif will report both TATA and ATAT with this option selected.Clicking Add
button adds the motif to plot set, while Reset All
clears the motif selection. On the right side of the motif setting panel gives a list summary of included motifs.
The options are executed by pressing the Run calculation
button. This dismisses the file collection window and brings up the calculation dialog, which shows the progress. On Linux and Mac OS X (systems supporting fork based parallelization) the calculation can be stopped using the Cancel
button - this will bring back all settings in file collection window.
After successful execution the plot array and plot preview panel will appear. In case of error an informative error pop-up will explain the problem. Please refer to the error section for further information.
This section focuses on average (line) plots and options common between these and heatmaps. For heatmap options please refer to heatmap documentation.
After calculating or loading a plot set, a plot array of checkboxes is displayed to select the desired pairs of features and tracks/motifs. Clicking on the column name (tracks/motifs) or row name (features) selects/deselects the whole column or row. Clicking on top-left most cell of plot array toggles the selection of whole array.
If at least one pair on plot array is selected pressing the Line plot
button produces an average plot preview and the Heatmap
button produces a heatmap preview. Alternatively, pressing the [RETURN] key will also produce the new selection and options. These operations are done automatically in reactive mode (see Advanced options chapter). Plots can be downloaded as PDF files using the Line plot and Heatmap buttons next to Download (at the top of the panel).
Below the plotting buttons are options for labeling plots and setting axes. On application start the first panel responsible for bringing file upload, management and plot set calculation modals is active. The further three panels hold common plot settings.
This panel groups settings influencing the plot main title, axis labels, various font sizes plus vertical and horizontal plot limits.
Title
- The main title of the plot, shown in top-center part of the figure; default emptyX-axis label
- Label shown below horizontal axis; default emptyY-axis label
- Label shown below vertical axis; default emptyTitle font size
- Font size of the title in points (point = ~1/72 an inch for standard A4 output); default 20 pointsLabels font size
- Font size of axis labels in points; default 16 pointsAxis font size
- Controls axis ticks font size, that is size of the numbers indicating position in base pairs on X-axis and means signal value on X-axis; default 14 pointsSet X-axis limits
- Set hard plotting limits for X-axis; default values are whole range chosen during plot set calculationSet Y-axis limits
- Set hard plotting limits for Y-axis; default values are a range between lowest and highest mean signal extended by error estimateControls in this panel controls the display of guide lines and error estimates, and allows to log scale the signal prior to plotting.
Transform signal
- if set to Log2 transform
performs log2 transformation of the signal prior to plotting; default setting is Do not transform
Show vertical guide line
- show the vertical line at point 0 - beginning of the feature or midpoint and end of the pseudo-length scaled features (only for anchored plots); turn on by defaultShow horizontal guide line
- show the horizontal line at user determined height; turn off by defaultShow error estimates
- show error standard error and 95% confidence interval as fields, if turned off only the line representing the mean signal is shown; turn on by default