I did the standard historical thing and looked at median adult males. Average is usually worse than median.
This is fair, because I'm comparing only Western Europe for the last few centuries, and divisions now are mostly geographical - situation of different people in the same country tends to be similar; but situation of different people in different countries varies drastically. It used to be the other way - and correctly only one way would be rather unfair.
As for women, I'm definitely not going to compare that, as situation of large-family stay-at-home housewife (a model which goes back at least to Ancient Greece) is simply completely different way of life than what 20th century women do. This kind of separation of gender roles is incompatible with modern economy, and modern separation of gender roles is incompatible with most historical economies. I don't think that assigning low value to such life is based on much more than prejudice, but for this discussion look at clusterfuck which spread out of Bryan Caplan's article about 19th century women's freedoms to half of the blogosphere by now.
GDP measures essentially how good we are at making widgets - and while widgets are useful, it is a very weak and indirect measure of welfare. For example UK GDP per capita doubled between 1975 and 2007 - and people's quality of life indeed improved - but it would be extremely difficult to argue that this improvement was "doubling", and that the gap between 2007's and 1975's quality of life is greater than between 1975's and hunter-gatherer times.
It's not essential to this post, but my very quick theory is that we overestimate GDP thanks to economic equivalent of Amdahl's Law - if someone's optimal consumption mix consisted of 9 units of widgets and 1 unit of personalized services - and their purchasing power increased so now they can acquire 100x as many widgets, but still the same number of services as before - amount of the mix they can purchase increased only 9x, not 90x you'd get by weighted average of original consumption levels (and they spend 92% of their purchasing power on services now). The least scalable factor - whichever it is - will be the bottleneck.
If we're unhappy with GDP there are alternative measures like HDI, but they're highly artificial. It would be very easy to construct completely different measures which would "feel" about as right.
Fortunately there exists a very natural measure of welfare, which I haven't seen used before in this context - preference utilitarian lotteries. Would you rather live in 1700, or take a 50% chance of living in 2010 or 700? Make a list of such bets, assign numbers coherent with bet values (with 100 for highest and 0 for your lowest value) and you're done! By averaging many people's estimates we can hopefully reduce the noise, and get some pretty reasonable welfare estimates.
And now disclaimer time. This approach has countless problems, here are just a few but I'm sure you can think about more.
I tried to think about such series of bets and my results are:
This seems far more reasonable than GDP's illusion of exponentially accelerating progress.
I used this Ruby code to convert bets to values on scale of 0 to 100 (bets ordered by preference, not chronologically):