One must always invert.
I'm grateful to orthonormal for mentioning the following math problem, because it allowed me to have a significant confusion-dissolving insight (actually going on two, but I'll only discuss one in this post), as well as providing an example of how bad I am at math:
"[I]f you want to average 40 mph on a trip, and you averaged 20 mph for the first half of the route, how fast do you have to go on the second half of the route?"
When I read this, my first thought was "Huh? If you spend an hour going 20 mph and then spend another hour going 60 mph, you've just gone 80 miles in 2 hours -- for an average speed of 40 mph, just as desired. So what do you people mean it's impossible?"
As you can see, my confusion resulted from interpreting "half of the route" to refer to the total time of the journey, rather than the total distance.
This misinterpretation reveals something fundamental about how I (I know better by now than to say "we") think about speed.
In my mind, speed is a mapping from times to distances. The way to compare different speeds is by holding time constant and looking at the different distances traversed in that fixed time. (I know I'm not a total mutant in this regard, because even other people tend to visually represent speeds as little arrows of varying length, with greater lengths corresponding to higher speeds.)
In particular, I don't think of it as a mapping from distances to times. I don't find it natural to compare speeds by imagining a fixed distance corresponding to different travel times. Which explains why I find this problem so difficult, and other people's explanations so unilluminating: they tend to begin with something along the lines of "let d be the total distance traveled", upon which my brain experiences an error message that is perhaps best verbalized as something like "wait, what? Who said anything about a fixed distance? If speeds are varying, distances have to be varying, too!"
If speed is a mapping from times to distances, then the way that you add speeds together and multiply them by numbers (the operations involved in averaging) is by performing the same operations on corresponding distances. (This is an instance of the general definition in mathematics of addition of functions: (f+g)(x) = f(x)+g(x), and similarly for multiplication by numbers: (af)(x) = a*f(x).) In concrete terms, what this means is that in order to add 30 mph and 20 mph together, all you have to do is add 30 and 20 and then stick "mph" on the result. Likewise with averages: provided the times involved are the same, if your speeds are 20 mph and 60 mph, your average speed is 40 mph.
You cannot do these operations nearly so easily, however, if distance is being held fixed and time varying. Why not? Because if our mapping is from times to distances, then finding the time that corresponds to a given distance requires us to invert that mapping, and there's no easy way to invert the sum of two mappings (we can't for example just add the inverses of the mappings themselves). As a result, I find it difficult to understand the notion of "speed" while thinking of time as a dependent variable.
And that, at least for me, is why this problem is confusing: the statement doesn't contain a prominent warning saying "Attention! Whereas you normally think of speed as the being the (longness-of-)distance traveled in a given time, here you need to think of it as the (shortness-of-)time required to travel a given distance. In other words, the question is actually about inverse speed, even though it talks about 'speed'."
Only when I have "inverse speed" in my vocabulary, can I then solve the problem -- which, properly formulated, would read: "If you want your inverse speed for the whole trip to be 1/40 hpm, and your inverse speed for the first half is 1/20 hpm, how 'slow' (i.e. inversely-fast) do you have to go on the second half?"
Solution: Now it makes sense to begin with "let d be the total distance"! For inverse speed, unlike speed, accepts distances as inputs (and produces times as outputs). So, instead of distance = speed*time -- or, as I would rather have it, distance = speed(time) -- we have the formula time = speed-1(distance). Just as the original formula converts questions about speed to questions about distance, this new formula conveniently converts our question about inverse speeds to a question about times: we'll find the time required for the whole journey, the time required for the first half, subtract to find the time required for the second half, then finally convert this back to an inverse speed.
So if d is the total distance, the total time required for the journey is (1/40)*d = d/40. The time required for the first half of the journey is (1/20)*(d/2) = d/40. So the time required for the second half is d/40 - d/40 = 0. Hence the inverse speed must be 0.
So we're being asked to travel a nonzero distance in zero time -- which happens to be an impossibility.
Problem solved.
Now, here's the interesting thing: I'll bet there are people reading this who (despite my best efforts) found the above explanation difficult to follow -- and yet had no trouble solving the problem themselves. And I'll bet there are probably also people who consider my explanation to be an example of belaboring the obvious.
I have a term for people in these categories: I call them "good at math". What unites them is the ability to produce correct solutions to problems like this without having to expend significant effort figuring out the sort of stuff I explained above.
If for any reason anyone is ever tempted to describe me as "good at math", I will invite them to reflect on the fact that an explicit understanding of the concept of "inverse speed" as described above (i.e. as a function that sends distances to times) was a necessary prerequisite for my being able to solve this problem, and then to consider that problems of this sort are customarily taught in middle- or high school, by middle- and high school teachers.
No indeed, I was not sorted into the tribe of "good at math".
I should find some sort of prize to award to anyone who can explain how to solve "mixing" problems in a manner I find comprehensible. (You know the type: how much of x% concentration do you add to your y% concentration to get z% concentration? et similia.)
I am fully on board with this. Higher-level concepts ftw. Of course, just setting up the algebra can help you recognize two problems as similar too. By “setting up the algebra” I mean “accepting the statements made in the word problem and translating them into math, introducing symbols to represent unknown quantities as necessary”.
Well yes, I expect you did recognize it, in context; I did everything I could to make the similarity explicit. And naturally, having recognized it as the same problem, you can then solve it by the same method you used for the first problem. But does having “inverse speed” in your arsenal really make it natural to approach this problem in the same way? I guess I’ll have to take your word for it.
Yeah, I get that you can do it that way. I agree that it’s kinda neat, even. It’s sort of like how you calculate the total resistance of resistors wired in parallel. The only thing I take issue with is that you cannot, or should not, do this problem without it.
Ok, using formulas by rote totally is a failure mode, so I would be against that. (Btw on the subject of “intuitive explanations”, so we can avoid arguing about terms, I understand that to mean not “an explanation that appeals to the intuition you already have,” but “an explanation that fixes your intuition.” Explanations like that are fun and cool, but I don’t insist on having one before I do every new problem.)
I guess I don't get why you can't just use "average rate of change = (total change in one variable)/(total change in other variable)" as your organizing concept, introduce symbols to represent the various quantities, and grind it out. That approach is certainly useful for much more than just finding average speeds or average mileages in this particular special case. You don’t need a new formula; you don’t even need a new concept. You can just use the ones you’ve already got.
The idea in my arsenal is not just inverse speed, of course; it's inverses in general, and the fact that finding a rate may first require finding the inverse of that rate.
I recognized the two problems as similar not merely because the context implied that they were constructed to be similar (in point of fact the mileage one contained a large amount of distracting detail), but because they involve the same difficulty -- I "got stuck"... (read more)