Overview of findings

These are four of the research findings that emerged from our empirical investigation. Each is supported with an interactive visualization on the pages linked below. Please see the background discussion for details on experimental methodology and key concepts underlying the empirical findings.

Competitive Algorithms

Is there one algorithm that outperforms the rest, across diverse input settings?

Finding: No single algorithm offers uniformly low error. At small scales, data-dependent algorithms dominate; at large scales data-independent algorithms dominate.

Variation with Shape

Are different data-dependent algorithms exploiting the same properties of datasets?

Finding: Algorithm error varies significantly with dataset shape and algorithms differ on the dataset shapes on which they perform well.

Trade-offs of data dependence

When do data-dependent algorithms offer improved error rates?

Finding: Data-dependence can offer significant improvements in error, at smaller scales or lower epsilon values, but some data-dependent algorithms fail to offer benefits at larger scales or higher epsilons.

Comparison with Baselines

Do sophisticated algorithms always outperform simple baseline methods?

Finding: Many algorithms are beaten by the IDENTITY baseline at large scales, in both 1D and 2D. At low scales, many algorithms result in error rates that are comparable to, or worse than, the Uniform baseline.