You might have been led to believe the only way to make sense of the equation is with "zeta function regularization," involving analytic continuation or similar high-tech tools, and that there is no way to develop intuition for this.
You were probably was told this by mathematicians, to whom "intuition" means something very different than it does to me.
So here follows what I consider an intuitive explanation for this result. Prerequisites include an intuition for the arithmetic of positive integers (most readers will have this), and a willingness to stfu about rigor.
The "Counting" Function
For historical reasons, involving "counting" the number of individual objects (sheep, stones, sticks, whatever) in a collection, there exists a function that acts on ordered tuples of positive integers and outputs a single positive integer. For example,
Now, this function has several nice[2] properties (which have names like "commutativity" and "associativity"), which means that things like the following is also true:
.
I shall be dropping the superscript over the arrow for the rest of this.
Fractions, Negative numbers, Zero
Now, playing with this function, we might ask questions like,
Is there a number such that ?
Or Is there a number such that ?
Now, obviously, the answer is "No, of course not. The output of the 'counting' function is greater than every number in the input, and there is no number smaller than 1."
However, we're going to make them up in such a way that all the nice properties of the function hold; this is a silly exercise with no real-world meaning, but mathematicians like doing things like this, and they turn out to have surprising real-world applications, in fields like physics (something, something "unreasonable effectiveness"). These new "numbers" are called negative numbers and fractions.
and
Now, playing with these new "numbers," we find that to preserve the nice properties of we need to introduce something called a "zero" or "0" such that .
With me so far?
Infinitely Large Tuples and Pattern Consistency
What happens if we apply our function to a tuple with infinitely many elements? The obvious answer is "That doesn't make any sense! You can't count infinitely many elements; you'll never get to the end. Consider Achilles and the tortoise …." Yes, of course, but like before, just go with it, okay?
The nice property we're going to make up for is that it follows patterns: in cases where the output isn't clear (such as with an infinite number of terms when we've only ever seen finite tuples), but there's a neat pattern that would be formed if it had some specific value, that is the value of the function. This is a little vague, but an example might make it clear:
The pattern we're going to match is , , , , and so on.
The output, following the pattern , , , , … is .
The pattern is actually more general than that, and for any , ,
Here's the key idea:
The outputs we've assigned using this pattern-matching technique can themselves be used as part of new patterns!
For the tuple
the analogous sequence , , , , etc. doesn't suggest any particular value. However, we can match it to the metapattern , where
Important caveat: If there are multiple patterns that suggest different values for the output of a particular unclear tuple, this pattern matching approach won't work. That's fine, let's see how far it takes us! Remember, if some patterns don't suggest any value, that's not a problem as long as all the ones that do suggest the same value.
For some tuples, like the order of elements matters! Keep that in mind when matching patterns.
Finish
There isn't much more intuition involved to get from to . The hard part is finding the right patterns: many of the obvious things you might try don't suggest any value. After a lot of (other people's) work, here's an example of something that works.
…
and if you go far enough (use a computer!), the pattern suggests approximately
Now, we can use the results of these patterns to match a metapattern:
…
Notice that this pattern of infinite tuples does match what we want, . The value suggested by this pattern is . □
It turns out no pattern that suggests a value suggests any other, though this is hard to prove.
Concluding Remarks
In precisely the same sense that we can write
despite that no real-world process of "addition" involving infinitely many terms may be performed in a finite number of steps, we can write
It is only familiarity that keeps us from seeing this.
Further, "limit of partial sums" is only one of a whole class of "patterns" that may be used to assign values to infinite series. Its strength is that it's easy to describe in general, but it is not the only one, and where it fails to suggest a particular value in the limit, there are others that are no less reasonable.
My point is that there is a conflict for divergent series though, which is why 1 + 2 + 3 + … = -1/12 is confusing in the first place. People (wrongly) expect the extension of + and = to infinite series to imply stuff about approximations of partial sums and limits even when the series diverges.
My own suggestion for clearing up this confusion is that we should actually use less overloaded / extended notation even for convergent sums, e.g. (1,12,14,...)∼2 seems just as readable as the usual lim→ and +...+ notation.