One thing I wonder about voice recognition is the "control characters" issue. If you're typing, you can make capital letters by pressing "shift". A voice recognition system could allow someone to say "shift" to mean that the next word starts with a capital letter, but then what if someone wants the actual word "shift" in their document? There are a few work-arounds that I can think of, but I'm curious as to what approach the programmers went with.
There are cameras that can tell where someone is looking, so it seems to me that one input method would be to have a keyboard where you look at the letters that you want to type.
In Dragon you can easily enter control characters by saying "Press" or "Type" as in "Press alt", "type control alt". The same seems to be true of most keyboard alternatives out there these days, as is creating macros for frequently used phrases, commands, whatever, as you can see in the video posted by the OP. It took a while to adapt my speech to speech-recognition-friendly pronunciation (not just providing a corpus of my own pronunciation) but now the # of typos ("speechos?") doesn't seem to be outrageo...
I'm thinking about writing a more substantive post about how humans work and how we can work better, a little like this one. As is common with these sorts of things, once I started to do research and pull on various threads, it turned out that the field was pretty deep and would require time to understand. But in the meantime, I just thought I would link to this video of someone programming using only their voice.
As I suffer with symptoms of carpal tunnel syndrome, this is of particular interest to me. Once I watched it I decided to start looking at different voice recognition software so that I could still get some work done while typing less. I'm happy to say that even the default software for speech recognition which came with windows is actually very able and accurate. I dictated almost this entire post using that software.
As far as I can tell, Dragon Naturally Speaking is the gold standard in voice recognition software. It does come with a pretty hefty price tag, but it may be worth it if you have serious repetitive stress injuries, or as a preventative measure if you're someone who spends a lot of time at their computer. And if that doesn't work, chances are good your computer has adequate software pre-installed.