Make sure your graphs visually represent your results

I was just reading a presentation on SSD performance for PostgreSQL and came across a graph that made my head spin

If you eyeball it, it appears that the hard disk drive (HDD) is roughly 40% as fast as the solid state disk (SSD) for a specific test.  The presentation author did include the yellow star to the right highlighting the fact that the difference is actually 0.5%, but why not make the graph start at 0 so the graph visusally represented the results or  if the difference is that small, don’t have a graph at all and just say there is essentially no difference for this type of benchmark.

I think the way the graph was made was very likely the result of the default settings that the charting tool used, and I don’t think the author had any intention of misleading people (especially since he highlighted the real difference outside the graph),  but if I generated a graph that I felt didn’t reflect what I was trying to convey I would fix it or leave it out.

Besides the egregious graph, I think the presentation overall was very good.  I especially liked the recommendation section at the end, which was very actionable.

One Response to “Make sure your graphs visually represent your results”

  1. Bill Kratzer Says:

    That’s funny. When I first saw the graph, I saw the .50 as “50%” (and not 0.5%)!

    I agree… the graph should represent the data!

Leave a Reply


Copyright © 2010 williamhathaway.com. All Rights Reserved.
No computers were harmed in the 0.361 seconds it took to produce this page.

Designed/Developed by Lloyd Armbrust & hot, fresh, coffee.