The START app allows users to visualize RNA-seq data starting with count data.
Note, this is version 1.1.1, published on 12/9/19. If you need the previous version, it is hosted here: kcvi.shinyapps.io/STARTapp_2018/
Visualize your data:
GROUPNAME_REPLICATE#
, e.g. Treat_1, Treat_2, Treat_3, Control_1, Control_2, High_1, High_2
Count or Expression Data
Analyzed Data
After uploading your data to START, click red button
to download an .RData file to upload your data to START with one click.
Next time use the “Input Data” tab –> “START RData file” option.
Additional help information and more detailed instructions are provided under the “Instructions” tab.
The START app has been developed by Jessica Minnier, Jiri Sklenar, Anthony Paul Barnes, and Jonathan Nelson of Oregon Health & Science University, Knight Cardiovascular Institute and School of Public Health.
Please cite our app:
The source code of START is available on Github.
We would appreciate reports of any issues with the app via the issues option of Github or by emailing start.app.help-at-gmail.com.
The START app allows users to visualize RNA-seq data starting with count data.
The app is hosted on the website: https://kcvi.shinyapps.io/START/
Note, this is version 1.1.1, published on 12/9/19. If you need the previous version, it is hosted here: kcvi.shinyapps.io/STARTapp_2018/
Code can be found on github: https://github.com/jminnier/STARTapp
Please post issues on github, and feel free to contribute by forking and submitting development branches.
To run this app locally on your machine, download R or RStudio and run the following commands once to set up the environment:
install.packages(c("reshape2","ggplot2","ggthemes","gplots","ggvis","dplyr","tidyr","DT",
"RColorBrewer","pheatmap","shinyBS","plotly","markdown","NMF","scales","heatmaply"))
## try http:// if https:// URLs are not supported
source("https://bioconductor.org/biocLite.R")
biocLite(c("limma","edgeR"))
You may now run the shiny app with just one command in R:
shiny::runGitHub("STARTapp", "jminnier")
You may use this app by
GROUPNAME_REPLICATE#
, e.g. Treat_1, Treat_2, Treat_3, Control_1, Control_2, High_1, High_2
Count data contains read counts for each gene for each sample, along with gene identifiers.
Analysis: When raw counts are uploaded, the data is then analyzed by the app. The app uses the voom method from the ‘limma’ Bioconductor package to transform the raw counts into logged and normalized intensity values. These values are then analyzed via linear regression where gene intensity is regressed on the group factor. P-values from all pairwise regression tests for group effect are computed and Benjamini-Hochberg false discovery rate adjusted p-values are computed for each pairwise comparison. The “log2cpm” values are the log2-counts-per-million values. The “log2cpm_voom” values are the normalized logcpm values from the voom method. Both methods use an offset of 0.5, which means 0.5 is added to all count values before normalizing (in the case of voom) and log transforming so that 0 counts have non infinite values.
Example file: https://github.com/jminnier/STARTapp/blob/master/data/examplecounts_short.csv
Analyzed data must contain some kind of expression measure for each sample (i.e. counts, normalized intensities, CPMs), and a set of p-values with corresponding fold changes for those p-values. For instance, if you have a p-value for the comparison of group1 vs group2, you can upload the observed fold change or log2(fold change) between group1 vs group2. If you have a more complex design and do not have fold changes readily available, you may upload the test statistics or other similar measures of effect size as placeholders. The fold changes are mainly used in the volcano plots. We recommend uploading p-values that are adjusted for multiple comparisons (such as q-values from the qvalue package, or adjusted p-values from p.adjust() function in R).
Example file: https://github.com/jminnier/STARTapp/blob/master/data/exampleanalysisres_short.csv
After submitting a raw data or analyzed file, you may download the .csv file with the analysis results for your own use (or to upload as an “analyzed data”) or more conveniently click the button “Save Results as RData File for Future Upload” so that you may easily and quickly upload your data to the START app in the future under the “RData from previous START upload” option with one click.
After uploading your data to START, click red button
to download an .RData file to upload your data to START with one click.
Next time use the “Input Data” tab –> “START RData file” option.
This plot uses Principal Component Analysis (PCA) to calculate the principal components of the expression data using data from all genes. Euclidean distances between expression values are used. Samples are projected on the first two principal components (PCs) and the percent variance explained by those PCs are displayed along the x and y axes. Ideally your samples will cluster by group identifier.
This plot displays unsupervised clustering of the Euclidean distances between samples using data from all genes. Again your data should cluster by group.
These plots use the p-values and fold changes to visualize your data.
This is a scatter plot log fold changes vs –log10(p-values) so that genes with the largest fold changes and smallest p-values are shown on the extreme top left and top right of the plot. Hover over points to see which gene is represented by each point. (https://en.wikipedia.org/wiki/Volcano_plot_(statistics))
This is a scatter plot of average gene expression in one group against another group. This allows the viewer to observe which genes have the largest differences between two groups. The smallest distances will be along the diagonal line, and points far away from the diagonal show the most differences. Hover over points to see which gene is represented by each point.
Use the search bar to look up genes in your data set. For selected gene(s) the stripchart (dotplot) and boxplots of the expression values are presented for each group. You may plot one or multiple genes along side each other. Hover over points for more information about the data.
A heatmap of expression values are shown, with genes and samples arranged by unsupervised clustering. You may filter on test results as well as P-value cutoffs. By default the top 100 genes (with lowest P-values) are shown.
Development See github issues. App site: kcvi.shinyapps.io/START_devel/
Version 1.1.1 December 9, 2019. Bug fixes. Lots of changes “under the hood” in adding functions for analysis.
Version 1.1.0 August 15, 2018. Bug fixes. Addition of edgeR and limma analysis functions. Improved visualizations. App site: kcvi.shinyapps.io/STARTapp_2018/
Version 1.0.0. September 23, 2016. START is published and the first publically released version is online.
Version 0.9.0001 May 26, 2014. First development version (previously KCVI Heart Atlas) is online via shinyapps.io alpha!
September 23, 2016. START App manuscript is accepted for publication in Bioinformatics! Please read our manuscript and cite our app when you use it for your work:
The START App: A Web-Based RNAseq Analysis and Visualization Resource
Jonathan W. Nelson; Jiri Sklenar; Anthony P. Barnes; Jessica Minnier
Bioinformatics 2016;
doi: 10.1093/bioinformatics/btw624
A development version of this app can be found at https://kcvi.shinyapps.io/START_devel/.
The code for this development app is in the devel
branch of the github repository.
If you have concerns about the terms of use for this web hosted application, please run the app locally on your computer. See the “Instructions” tab for more information on this.
This application is hosted on a Shinyapps.io server https://www.shinyapps.io/.
By using this app you are agreeing to the terms of use as described by Shinnyaps.io: https://www.rstudio.com/about/shinyapps-terms-use/
We (the authors and maintainers of this app) will not save your data on our servers. However, as the Shinyapps server is not HIPAA compliant, you must refrain from uploading protected health information or confidential data with this app. You may instead download the code and run the app locally on your private computer and network (see above). We are not responsible for the confidentiality, availability, security, loss, misuse or misappropriation of any data you submit to this application.
From the Shinyapps terms of use (https://www.rstudio.com/about/shinyapps-terms-use/): “If you choose to upload data to an application you are using via the RStudio Service, you acknowledge and agree you are giving certain legal rights to the licensor of the application to process and otherwise use your data. Please carefully review any license terms accompanying any application to which you submit your data for the legal rights which you are giving the application licensor. Further, RStudio does not claim ownership of your data; however, you hereby grant RStudio a worldwide, perpetual, irrevocable, royalty-free, fully paid up, transferable and non-exclusive license, as applicable, to use and copy your data in connection with making your data available to the application (and the licensor of such application) to which you have submitted your data. You acknowledge and agree that even if you remove your data from the RStudio Service, your data may have been downloaded by, and remain accessible to, the licensors of those applications to which you submitted your data. Accordingly, do not submit data which you desire to remain confidential or which you wish to limit the right to access or use. You should never submit to the RStudio Service any data which consists of personally identifiable information, credit card information, or protected health information, as such terms are defined by relevant laws, rules, and regulations. RSTUDIO IS NOT RESPONSIBLE FOR THE CONFIDENTIALITY, AVAILABILITY, SECURITY, LOSS, MISUSE OR MISAPPROPRIATION OF ANY DATA YOU SUBMIT TO THE RSTUDIO SERVICE OR ANY APPLICATION MADE AVAILABLE VIA THE RSTUDIO SERVICE.”
ShinyApp created by {Jessica Minnier + Jiri Sklenar + Jonathan Nelson} of
Knight Cardiovascular Institute, Oregon Health & Science University
Copyright (C) 2016, code licensed under GPLv3
Code available on Github: https://github.com/jminnier/STARTapp