I'm assuming you are interested in learning about something by measuring one or more of its attributes, and then using statistics to extract information from the measurements, i.e., you are interested in a hands-on application, then books I found useful include:
Statistics for experimenters by Box, Hunter and Hunter
Design and Analysis of experiments by Montgomery.
Thanks! This is really helpful--I think this is exactly what I'm trying to do.
Are these texts part of a specific academic track/degree or field of study? It sounds like something someone in engineering would spend a semester on. But also like something someone could spend a career on studying.
Being able to accurately assess a paper's claims is, unfortunately, a very high bar. A large proportion of scientists fall short of it. see: [https://statmodeling.stat.columbia.edu/2022/03/05/statistics-is-hard-etc-again/]
Most people with a strong intuition for statistics have taken courses in probability. It is foundational material for the discipline.
If you haven't taken a probability course, and if you're serious about wanting to learn stats well, I would strongly recommend to start there. I think Harvard's intro probability course is good and has free materials: https://projects.iq.harvard.edu/stat110/youtube
I've taught out of Freedman, but not the other texts. It's well written, but it is targeted at a math-phobic audience. A fine choice if you do not wish to embark on the long path
Thanks! I'll look this over.
Out of curiosity,
Most people with a strong intuition for statistics have taken courses in probability. It is foundational material for the discipline.
Do some people learn statistics without learning probability? Or, what's different for someone who learns only stats and not probability?
(I'm trying to grasp what shape/boundaries are at play between these two bodies of knowledge)
A while back I read How to Measure Anything and found it fascinating. In my day job, I spend quite a bit of time trying to make sense of the world by looking at dashboards of requests, latencies, error rates, etc. (software systems).
After finishing the book and taking copious notes, I understood that it gave me a prepackaged process that I could apply as-is, but I found it very difficult to adapt to everyday situations. I don't think I picked up a good intuition about stats, in other words.
I'm looking to change that. Specifically, I want to learn to apply stats in these two situations:
The challenge I'm facing is that I know a bunch of techniques, but not how they relate to each other and the problems they're meant to solve. To illustrate what I mean: I know how to get percentiles and calculate means, but until today morning I didn't know why averaging percentiles is usually a bad idea. I'm missing the map.
I've seen these books recommended as a good way to start:
But I also wanted to ask someone familiar with the field: