One must always invert.
I'm grateful to orthonormal for mentioning the following math problem, because it allowed me to have a significant confusion-dissolving insight (actually going on two, but I'll only discuss one in this post), as well as providing an example of how bad I am at math:
"[I]f you want to average 40 mph on a trip, and you averaged 20 mph for the first half of the route, how fast do you have to go on the second half of the route?"
When I read this, my first thought was "Huh? If you spend an hour going 20 mph and then spend another hour going 60 mph, you've just gone 80 miles in 2 hours -- for an average speed of 40 mph, just as desired. So what do you people mean it's impossible?"
As you can see, my confusion resulted from interpreting "half of the route" to refer to the total time of the journey, rather than the total distance.
This misinterpretation reveals something fundamental about how I (I know better by now than to say "we") think about speed.
In my mind, speed is a mapping from times to distances. The way to compare different speeds is by holding time constant and looking at the different distances traversed in that fixed time. (I know I'm not a total mutant in this regard, because even other people tend to visually represent speeds as little arrows of varying length, with greater lengths corresponding to higher speeds.)
In particular, I don't think of it as a mapping from distances to times. I don't find it natural to compare speeds by imagining a fixed distance corresponding to different travel times. Which explains why I find this problem so difficult, and other people's explanations so unilluminating: they tend to begin with something along the lines of "let d be the total distance traveled", upon which my brain experiences an error message that is perhaps best verbalized as something like "wait, what? Who said anything about a fixed distance? If speeds are varying, distances have to be varying, too!"
If speed is a mapping from times to distances, then the way that you add speeds together and multiply them by numbers (the operations involved in averaging) is by performing the same operations on corresponding distances. (This is an instance of the general definition in mathematics of addition of functions: (f+g)(x) = f(x)+g(x), and similarly for multiplication by numbers: (af)(x) = a*f(x).) In concrete terms, what this means is that in order to add 30 mph and 20 mph together, all you have to do is add 30 and 20 and then stick "mph" on the result. Likewise with averages: provided the times involved are the same, if your speeds are 20 mph and 60 mph, your average speed is 40 mph.
You cannot do these operations nearly so easily, however, if distance is being held fixed and time varying. Why not? Because if our mapping is from times to distances, then finding the time that corresponds to a given distance requires us to invert that mapping, and there's no easy way to invert the sum of two mappings (we can't for example just add the inverses of the mappings themselves). As a result, I find it difficult to understand the notion of "speed" while thinking of time as a dependent variable.
And that, at least for me, is why this problem is confusing: the statement doesn't contain a prominent warning saying "Attention! Whereas you normally think of speed as the being the (longness-of-)distance traveled in a given time, here you need to think of it as the (shortness-of-)time required to travel a given distance. In other words, the question is actually about inverse speed, even though it talks about 'speed'."
Only when I have "inverse speed" in my vocabulary, can I then solve the problem -- which, properly formulated, would read: "If you want your inverse speed for the whole trip to be 1/40 hpm, and your inverse speed for the first half is 1/20 hpm, how 'slow' (i.e. inversely-fast) do you have to go on the second half?"
Solution: Now it makes sense to begin with "let d be the total distance"! For inverse speed, unlike speed, accepts distances as inputs (and produces times as outputs). So, instead of distance = speed*time -- or, as I would rather have it, distance = speed(time) -- we have the formula time = speed-1(distance). Just as the original formula converts questions about speed to questions about distance, this new formula conveniently converts our question about inverse speeds to a question about times: we'll find the time required for the whole journey, the time required for the first half, subtract to find the time required for the second half, then finally convert this back to an inverse speed.
So if d is the total distance, the total time required for the journey is (1/40)*d = d/40. The time required for the first half of the journey is (1/20)*(d/2) = d/40. So the time required for the second half is d/40 - d/40 = 0. Hence the inverse speed must be 0.
So we're being asked to travel a nonzero distance in zero time -- which happens to be an impossibility.
Problem solved.
Now, here's the interesting thing: I'll bet there are people reading this who (despite my best efforts) found the above explanation difficult to follow -- and yet had no trouble solving the problem themselves. And I'll bet there are probably also people who consider my explanation to be an example of belaboring the obvious.
I have a term for people in these categories: I call them "good at math". What unites them is the ability to produce correct solutions to problems like this without having to expend significant effort figuring out the sort of stuff I explained above.
If for any reason anyone is ever tempted to describe me as "good at math", I will invite them to reflect on the fact that an explicit understanding of the concept of "inverse speed" as described above (i.e. as a function that sends distances to times) was a necessary prerequisite for my being able to solve this problem, and then to consider that problems of this sort are customarily taught in middle- or high school, by middle- and high school teachers.
No indeed, I was not sorted into the tribe of "good at math".
I should find some sort of prize to award to anyone who can explain how to solve "mixing" problems in a manner I find comprehensible. (You know the type: how much of x% concentration do you add to your y% concentration to get z% concentration? et similia.)
Interesting post. Just had me pondering it all day and I thought I'd propose another way to go about this: visually. Open THIS in another tab. What you have is a plot of distance vs. time. The black line is shown with a constant slope of 20mph.
I have plotted two points on that line, (t1,d1) and (t2,d2). Imagine these as snapshots of your journey. You travel along the black line and at some moment check your clock and odometer. These "checkpoints" would be like t1/t2 and d1/d2. You compare your travel time to your distance traveled and determine that d/t = 20mph.
At each of these snapshots, you decide to maximize your speed over the same distance traveled so far. If you were traveling along on the black line, this means that you take a sharp left and travel infinitely fast along the dotted line until your distance is twice what you'd traveled so far. Since you traveled infinitely fast, you spent no time and your new coordinates are either (t1, 2d1) or (t2, 2d2).
Note that both of these points lie on a new line, shown in green. This line's slope indicates the final rate achieved. We can now see that:
r = (2d1)/t1 = (2d2)/t2
But d1/t1 and d2/t2 are equal to d/t = 20mph because they lie on the original constant sloped line. So we know that the green line's slope is 2 d/t = 2 20mph = 40mph.
Just thought it might be interesting to see this portrayed another way. I've shown what happens if you travel infinitely fast and use a vertical line to head on up to the final distance. If you traveled less than infinitely fast, your line would lie between the original rate (20mph) and our shown theoretical maximum. This area is the solution set and is highlighted in light green.
Very nice! This illustrates the idea that at any time during the journey, the average speed up to that point constrains the possible average speeds for the whole journey.
I thought I'd point out that merely the first point (t1, d1) suffices to construct the second (green) line, since the lines both start at the same point (the origin).