a) The best action you should be doing now is to sleep or at least to take a nap, so your brain would process the information it has now and prepare itself for new information
I don't think this is how Kurzban's model would explain it. In Kurzban's model, the "feeling of exhaustion" stems from one or more monitoring mechanisms causing aversive states (in response to trying to decide what to do). The monitoring mechanisms aren't causing the feeling of exhaustion so that you'll feel sleepy - they are just voting by causing aversive states, and Kaj_Sotala happens to interpret that feeling of the aversive states with the word "exhaustion" (which has certain unhelpful connotations and I think should be taboo'd in favor of "aversive state feelings").
c) Your brain is wrong. For reasons that might make sense living in a jungle millenia ago, your brain is convinced that you should do X, which in reality is not an option, or is a bad option. All meaningful options are penalized because your brain insists that you should do X instead. For example, if your job made you too exhausted, your brain may think the best action is to prevent going to the same job tomorrow. Which may be not realistic, or merely not what you want to think about.
I also don't think this applies the model correctly. The feeling of exhaustion would need to stem from aversive states from monitoring mechanisms that are working at the moment you get back from work and are deciding what to do next. Returning to work the next day is a very distant action, relative to all the activities your brain would be thinking about doing at that moment.
b) You are censoring the best action from your thoughts because it does not fit your idea of what you should be doing now. For example you feel that you should work on your personal projects, but your brain thinks you should drink some beer with your friends. When you say you are too exhausted to do anything, you are actually lying, because you are not too exhausted to drink beer with your friends; it is merely outside of your set of acceptable answers.
I think this is more how Kurzban's model would explain this situation. The bad feeling (which Kaj calls "exhaustion") would need to come from multiple monitoring mechanisms returning aversive states for evaluating various possible next actions (in this case, maybe working on projects, watching tv, browsing the internet, practicing, or studying).
It could be that there is no strong winner (so Kaj ends up feeling just the aversive states but not a clear idea of what to do next and has to spend more time feeling that exhaustion before a clear winner emerges). In this model, I think it would be possible for no strong winner to emerge even when there are immediately gratifying activities due to some monitoring mechanisms paying attention to some sort of executive function-centered value system (like "I should feel bad for doing "). If that were the case, I would think Kaj might feel a sort of "oscillation" between deciding to do something instantly gratifying (which is generally a quick judgement for a brain to make) but then deciding not to after the executive function/value system evaluation results are returned (which I think generally takes longer to get a result from).
It could also be that Kaj's brain did actually reach a conclusion on what to do next, but either wasn't introspective enough to figure out what that was and put it into words or they meant the phrase "not doing anything at all" to be a catch-all for some unproductive but instantly gratifying action.
W.r.t. this seeming to occur after "a long day at work", I'm not really clear what is meant by "a long day".
Great comment!
An opportunity cost model of subjective effort and task performance (h/t lukeprog) is a very interesting paper on why we accumulate mental fatigue: Kurzban et al. suggest an opportunity cost model, where intense focus on a single task means that we become less capable of using our mental resources for anything else, and accumulating mental fatigue is part of a cost-benefit calculation that encourages us to shift our attention instead of monomaniacally concentrating on just one task which may not be the most rewarding possible. Correspondingly, the amount of boredom or mental fatigue we experience with a task should correspond with the perceived rewards from other tasks available at the moment. A task will feel more boring/effortful if there's something more rewarding that you could be doing instead (i.e. if the opportunity costs for pursuing your current task are higher), and if it requires exclusive use of cognitive resources that could also be used for something else.
This seems to make an amount of intuitive/introspective sense - I had a much easier time doing stuff without getting bored as a kid, when there simply wasn't much else that I could be doing instead. And it does roughly feel like I would get more quickly bored with things in situations where more engaging pursuits were available. I'm also reminded of the thing I noticed as a kid where, if I borrowed a single book from the library, I would likely get quickly engrossed in it, whereas if I had several alternatives it would be more likely that I'd end up looking at each for a bit but never really get around reading any of them.
An opportunity cost model also makes more sense than resource models of willpower which, as Kurzban quite persuasively argued in his earlier book, don't really fit together with the fact that the brain is an information-processing system. My computer doesn't need to use any more electricity in situations where it "decides" to do something as opposed to not doing something, but resource models of willpower have tried to postulate that we would need more of e.g. glucose in order to maintain willpower. (Rather, it makes more sense to presume that a low level of blood sugar would shift the cost-benefit calculations in a way that led to e.g. conservation of resources.)
This isn't just Kurzban et al's opinion - the paper was published in Behavioral and Brain Sciences, which invites diverse comments to all the papers that they publish. In this particular case, it was surprising how muted the defenses of the resource model were. As Kurzban et al point out in their response to responses:
(I'm leaving out discussion of some commentaries which do attempt to defend resource models.)
Though the model still seems to be missing pieces - as one of the commentaries points out, it doesn't really address the fact that some tasks are more inherently boring than others. Some of it might be explained by the argument given in Shouts, Whispers, and the Myth of Willpower: A Recursive Guide to Efficacy (I quote the most relevant bit here), where the author suggests that "self-discipline" in some domain is really about sensitivity for feedback in that domain: a novice in some task doesn't really manage to notice the small nuances that have become so significant for an expert, so they receive little feedback for their actions and it ends up being a boring vigilance task. Whereas an expert will instantly notice the effects that their actions have on the system and get feedback of their progress, which in the opportunity cost model could be interpreted as raising the worthwhileness of the task they're working on. If we go with Kurzban et al.'s notion of us acquiring further information about the expected utility of the task we're working on as we continue working on it, then getting feedback from the task could possibly be read as a sign of the task being one in which we can expect to succeed in.
Another missing piece with the model is that it doesn't really seem to explain the way that one can come home after a long day at work and then feel too exhausted to do anything at all - it can't really be about opportunity costs if you end up so tired that you can't come up with ~any activity that you'd want to do.