You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

ChristianKl comments on Open Thread, September 30 - October 6, 2013 - Less Wrong Discussion

4 Post author: Coscott 30 September 2013 05:18AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (295)

You are viewing a single comment's thread.

Comment author: ChristianKl 01 October 2013 04:50:32PM 1 point [-]

Does anyone have a good resource on learning how to formate graphs and diagrams?

What are the effects on the reader between having 90%, 100% or 110% spacing between letters? When should one centralize text. What about bold and italics?

Is there good research based resource that explains the effects that those choices have on the reader?

Comment author: RolfAndreassen 02 October 2013 05:06:31AM 5 points [-]

Don't have a formal source, but I can give you a quick rundown of the advice my group ends up giving to every student we work with:

  • Label the dang axes.
  • Make the axis labels bigger.
  • Make histogram lines thicker; make dots larger.
  • If the dots are very dense, don't use dots, use a color scale.
  • For the sake of the absent gods, don't make your colour scale brown-yellow-lightgray-black-darkbrown-darkgray-darkyellow, as one often-used plotting package did by default. (It was an inheritance from the early nineties, and honestly it was still weird.) Make it something that humans naturally read as a scale, eg blue to red by way of violet, dark green to light green, or blue to red by way of the rainbow.
  • On a white background, do not use yellow or bright green unless the individual dots or areas are large. Lines, generally speaking, are not large.
  • Put a legend in one corner, explaining what the line styles mean.
  • If you're using (eg) triangles for one data type and circles for another, make the points bigger. Yes, it likely looks perfectly clear on your screen, to your young eyes, at a distance of a foot. You will eventually present it on a crappy twenty-year-old projector to men of sixty and seventy sitting at the back of a large auditorium. EMBIGGEN THE DANG POINTS. Also, use colours to further clarify the difference, unless colour is indicating a different dimension of information.
  • Make bin sizes a round number - 1, 2, or 5 - in a unit of interest.
  • If plotting numbers of something, indicate the bin size by labeling the y axis (for example) "Events / 2 MeV".
  • As a general rule, make both a linear and a semilog plot. You can skip the linear if there are no features of interest at high densities, and the semilog if there are no features of interest at low densities.
Comment author: [deleted] 03 October 2013 04:26:47AM 2 points [-]

blue to red by way of the rainbow.

Here's a few reason not to do that. (Not to mention the possibility of colour-blind viewers.)

Comment author: NancyLebovitz 05 October 2013 07:51:34PM *  0 points [-]

Thanks for the link. I recommend reading it to anyone who's interested in how data gets (mis)represented.

Comment author: RolfAndreassen 03 October 2013 04:33:30AM 0 points [-]

Well, you have to admit it's still a big improvement over the old ROOT default. :)

Comment author: [deleted] 03 October 2013 04:45:59AM 0 points [-]

Well, the old default does make local variations more visible, especially for the colour-blind. OTOH I agree that telling at a glance which of two widely separated spots on the graph has a higher value is all but outright impossible with it.

Comment author: ChristianKl 02 October 2013 12:31:53PM 1 point [-]

Make the axis labels bigger. Make histogram lines thicker; make dots larger.

How do I know that they are big enough?

Comment author: RolfAndreassen 02 October 2013 03:47:20PM 2 points [-]

When the seventy-year-old at the back of the large auditorium with the cheap, ancient projector can read them. Alternatively, when your boss stops complaining. Lines are too thick if they overlap; dots are too big when you can't easily tell the difference between high and medium density. (And if this happens at the default dot size, switch to a colour scale.)

If you're doing PowerPoint or similar presentation tools, you want your axis labels to be the same size as your bullet-point text. One trick I sometimes use is to whiteout the axis labels in the image file of my plot, and put them back in using the same text tool that's creating my bullets.

Comment author: Douglas_Knight 02 October 2013 08:50:58PM 0 points [-]

How many of those suggestions could be replaced by "use ggplot2"?

Comment author: RolfAndreassen 03 October 2013 05:24:05AM 0 points [-]

Within our group, none, because then we'd have to learn R. For ChristianKI, quite possibly all of them.

Comment author: Lumifer 01 October 2013 05:25:04PM *  4 points [-]

Look up Edward Tufte, and in particular his seminal book The Visual Display of Quantitative Information.

Comment author: [deleted] 03 October 2013 12:51:01AM 2 points [-]

Those sorts of questions are asked in a field called Information Visualization, which is a part of Human Factors Engineering.

Comment author: ChristianKl 03 October 2013 08:19:07AM 0 points [-]

What's a good resource to learn about it? Is there a textbook you can recommend?

Comment author: [deleted] 03 October 2013 04:30:02AM 0 points [-]

Look the graphs at 50% their actual size (or less) and notice how much effort it takes you to read them. I'd guess that decently correlates with how much effort it takes someone with worse visual acuity to read them at full size.