TY - CHAP
T1 - Rapid Sampling for Visualizations with Ordering Guarantees
AU - Kim, Albert
AU - Blais, Eric
AU - Parameswaran, Aditya
AU - Indyk, Piotr
AU - Madden, Sam
AU - Rubinfeld, Ronitt
PY - 2015
Y1 - 2015
N2 - Visualizations are frequently used as a means to understand trends and gather insights from datasets, but often take a long time to generate. In this paper, we focus on the problem of rapidly generating approximate visualizations while preserving crucial visual properties of interest to analysts. Our primary focus will be on sampling algorithms that preserve the visual property of ordering; our techniques will also apply to some other visual properties. For instance, our algorithms can be used to generate an approximate visualization of a bar chart very rapidly, where the comparisons between any two bars are correct. We formally show that our sampling algorithms are generally applicable and provably optimal in theory, in that they do not take more samples than necessary to generate the visualizations with ordering guarantees. They also work well in practice, correctly ordering output groups while taking orders of magnitude fewer samples and much less time than conventional sampling schemes.
AB - Visualizations are frequently used as a means to understand trends and gather insights from datasets, but often take a long time to generate. In this paper, we focus on the problem of rapidly generating approximate visualizations while preserving crucial visual properties of interest to analysts. Our primary focus will be on sampling algorithms that preserve the visual property of ordering; our techniques will also apply to some other visual properties. For instance, our algorithms can be used to generate an approximate visualization of a bar chart very rapidly, where the comparisons between any two bars are correct. We formally show that our sampling algorithms are generally applicable and provably optimal in theory, in that they do not take more samples than necessary to generate the visualizations with ordering guarantees. They also work well in practice, correctly ordering output groups while taking orders of magnitude fewer samples and much less time than conventional sampling schemes.
UR - http://www.scopus.com/inward/record.url?scp=84955560112&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84955560112&partnerID=8YFLogxK
U2 - 10.14778/2735479.2735485
DO - 10.14778/2735479.2735485
M3 - Chapter
C2 - 26779380
AN - SCOPUS:84955560112
T3 - Proceedings of the VLDB Endowment
SP - 521
EP - 532
BT - Proceedings of the VLDB Endowment
PB - Association for Computing Machinery
T2 - 3rd Workshop on Spatio-Temporal Database Management, STDBM 2006, Co-located with the 32nd International Conference on Very Large Data Bases, VLDB 2006
Y2 - 11 September 2006 through 11 September 2006
ER -