There were studies on than with kittens or pupps and it seems that in fact this works like this:
"When I do this trick, I get a reward! Let's do the trick! I didn't get the reward? Maybe I should try again! I got the reward, the world works like it should, yay! Let's get another one! No reward? Maybe they didn't notice? I have to try harder! Still no reward? Let's try again, I'm sure I'll get it this time".
The puppies noticed the reward, not the punishment. If it was as regular as one out of four, they would notice this regularity and act according to the expectation of the result - not try when a punishment was due and try when a reward was due.
http://measureofdoubt.com/2011/04/12/pulling-levers-killing-monsters-the-lure-of-unpredictable-rewards/ (how do I put a link like this in a word with blue letters?)
I've read that unpredictable rewards associated with a behavior actually encourage that behavior more effectively than consistent rewards.
The optimal habit-forming figure given in the link above is a 25% chance of reward for each instance of performing the behavior.
My hypothesis then, is that if I want to establish a habit by rewarding myself upon successfully performing a certain task, I should reward myself only 25% of the time if I want to ingrain the habit as forcefully as possible into my unconscious.
Anyone else think so, or have any other research to add?