Omea, GTD, and Personal Productivity

Blogging with desperation

Controlling Omea with Voice Commands

Over Christmas, I bought myself a small gift that would have proven impossible to describe to my family: Dragon Naturally Speaking. Naturally Speaking is a natural language voice recognition system which is integrated with Microsoft Office tools and other popular applications. As a customer of Nuance's Paperport, I had chance to pick Naturally Speaking up for $49, so, with dreams of turning my computer into a responsive voice-command device, I ordered the product and began to play around with it.

The base Naturally Speaking package is more of a taste of voice recognition than flexible software you can use to control all your applications. The Pro version with flexibility to create new commands and text was about $700 on Nuance's web site, which put it well out of the range of my wallet - especially since I had no idea how well it worked.

After setting up and training the software, I began trying to use it, with mixed results. Dictation can be quick if you enunciate well and improves with training if you're willing to put in the effort. At the same time, though, you say "scratch that" a lot and it takes a bit to get used to doing things like controlling the cursor.

What I was really hoping to do, though, was quickly process email and RSS feeds, applying a GTD approach to designate whether an email needs to be archived, trashed, collected for processing, etc. I use flags and categories in Outlook for this, and have mapped those over to Omea so I can go back and forth between them (more on this in a future blog post).

Naturally Speaking has some ability to do this type of thing out-of-the box, but it has no built-in compatibility with Omea. Outlook support in the base Naturally Speaking edition covers the basics, but not a lot of what I have to do.

Fortunately, a solution offered itself in a freeware scripting framework called Vocola. Vocola uses the Python scripting language and some additional natural language libraries to control Dragon Naturally Speaking.

After installing the needed Python scripting support and other libraries, I found myself amazed at how well Vocola's support matched what I needed to do. In many respects, Omea was easier to control and command than Outlook, since Jetbrains has built in so many keystroke commands to help navigate through Omea.

It's really rare that you can find a software product so seemingly well-designed for exactly what it is you want to do. For me, Omea Pro is one of them, and Vocola for Dragon Naturally Speaking is another. Both are freeware. 

Creating your own scripts with Vocola/Python to control Naturally Speaking is a very simple task even if you have just a rudimentary knowledge of programming. With DNS running, you can simply say "Edit Voice Commands" in an application and a text editor will pop up with the voice commands for that app. Save changes to the text file, and then say "Load Voice Commands", and DNS will be able to use the commands just programmed for that application.

For Omea, I have crafted just a handful of voice commands to prove out the concept. I have been real happy with the results so far. Here is my Omea voice command file to date:

# Voice commands for omeapro

# Email or Flagged Items View

Clear Flag = {CTRL+0};
Flag It = {CTRL+1};
Mark Read = {ALT+E}r;
Archive = {ALT+E}R{CTRL+SHIFT+V}Curr{Enter};
Categorize = {CTRL+Y};
Reply = {CTRL+R};
Newsgroups = {CTRL+ALT+N};
Delete = {DEL};
Forward = {CTRL+SHIFT+F};

The word on the left is what you say into the microphone for DNS to understand, and the keystrokes on the right are processed. As you can see, so far all I've done is set up simple commands to operate Omea by voice. Using Python, there's far more possible than just keystroke macros.

The "Archive" command is one that is closest to where I want to go with this. This command is for emails which are not actionable, but which I want to keep around for reference and find perhaps later via Google Desktop Search or Omea.

NOTE ON EMAIL: As I've adopted GTD, I find that I don't folder mail other than to stuff it in a area called "Current Month" and then archive it by month and year. I long ago gave up trying to organize mail via folders and now simply use Google Desktop Search to find anything at a later time. Omea's ability to find email by contact is the other primary means I have to pull an email out of storage. Even tagging mail is beyond my time constraints with the volume that I get on a daily basis.

The "Archive" command above marks an email read and moves it out of my In-Box to the Current Month folder. At the end of each month, I move that big bundle of mail to another Outlook Data File for archiving.

Having gotten control of Omea by voice, the question then becomes "Is it any more efficient than using the keyboard?".

At this point, I really don't think I've proven to myself that using voice is any more efficient. That may be a matter of training DNS more so that I don't have to repeat commands or correct dictation. The nice thing about having this option, though, is that using voice commands gives you a nice break from being hunched over a keyboard all day, and it can be good from an ergonomic point of view. Being able to kick back in your chair and get things done just by talking at your computer for a bit is a needed change-of-pace when you get a constant stream of email like I do.

I'm going to continue to experiment with DNS, Omea, and Outlook, and will hopefully have some updated Vocola command files to share in the future.

 

Comments

No Comments

Leave a Comment

(required) 

(required) 

(optional)

(required)