Data Visualization
Data Visualization
martin krzywinski
outline
best practices of graphical data design data-to-ink ratio cartjunk circos
anscombes quartet each of the values below is the same for each set number of points average x average y regression line standard error of slope sum of squares q residual sum of squares correlation coefficient r2
TGV
the visual display of quantitative information edward r tufte, 2001, 2nd ed
strive to give your viewer the greatest number of ideas in the shortest time with the least ink ith in the smallest space
the visual display of quantitative information edward r tufte, 2001, 2nd ed
cartjunk
excessive use of grids and patterns cause perceived vibrations avoid hatched patterns to limit moire avoid excessive use of decorative forms
the visual display of quantitative information edward r tufte, 2001, 2nd ed
circos
there are many genome browsers and yg visualizers already available do we really need another one? communicating data visually critical for large data sets yp there certain types of data that obfuscate common diagram formats
standard 2D plots (2 perpendicular axes) are inadequate
scalar mappings
scalar valued mappings are common and easily handled
input genomic position i a scalar i i t i iti is l input t when the output is real-valued (GC content, conservation, etc) use a histogram, line plot, scatter plot genome position on x-axis function value on y-axis
f :g y
f : g g
genome position genome position
sc7 I
sc15 I
I chr04 chr09
I ch
http://www.egg.isu.edu/Members/deborah/genomics
http://www.genome.wustl.edu/projects/human/chr7paper/chr7data/030113/segmental/index.php
f : g g c
reduce sampling
rearrange axes
partition data
circos
written in Perl Apache-style configuration file plain text data input p PNG output
G2G in circos
display characteristics of most elements are f t l t customizable data driven data-driven formatting rules support for data layers
2D data in circos
2D data in circos
box
scatter
line
2D data in circos
tiles tiles
histogram heatmaps
chr2
human chr1
single alignment
circos is flexible
mkweb.bcgsc.ca/circos
download documentation tutorials circos art