IBM’s Watson and Analytics: Less Than It Seems, Maybe More Than It Will Seem

Updated: February 10, 2011

Deep Analysis of Deep Analysis

First, let's pierce through the hype to understand what, from my viewpoint, Watson is doing. It appears that Watson is building on top of a huge amount of "domain knowledge" amassed in the past at such research centers as GTE Labs, plus the enormous amount of text that the Internet has placed in the public domain - that's its data. On top of these, it places well-established natural-language processing, AI (rules-based and computer-learning-based), querying, and analytics capabilities, with its own "special sauce" being to fine-tune these for a Jeopardy-type answer-question interaction. Note that sometimes Watson must combine two or more different knowledge domains in order to provide its question: "We call the first version of this an abacus (history). What is a calculator (electronics)?"

Nothing in this design suggests that Watson has made a giant leap in AI (or natural-language processing, or analytics). For 40 years and more, researchers have been building up AI rules, domains, natural-language translators, and learning algorithms - but progress towards meeting a true Turing test, in which the human side of the interaction can never tell that a computer is the other side of the interaction, has been achingly slow. All that the Jeopardy challenge shows is that the computer can now provide one-word answers to a particular type of tricky question - using beyond-human amounts of data and of processing parallelism.

Nor should we expect this situation to change soon. The key and fundamental insight of AI is that when faced with a shallow layer of knowledge above a vast sea of ignorance, the most effective learning strategy is to make mistakes and adjust your model accordingly. As a result, brute-force computations without good models don't get you to intelligence, models that attempt to approximate human learning fall far short of reality, and models that try to invent a new way of learning have turned out to be very inefficient. To get as far as it does, Watson uses 40 years of mistake-driven improvements in all three approaches, showing that it's going to require many years of further improvements - not just letting the present approach "learn" more - before we can seriously talk about human and computer intelligence as apples and apples.

The next point is that Jeopardy is all about text data: not numbers, yes, but not video, audio, or graphics (so-called "unstructured" data), either. The amount of text on Web sites is enormous, but it's dwarfed by the amount of other data from our senses inside and outside the business, and in our heads. In fact, even in the "semi-structured data" category to which Watson's Jeopardy data belongs, other types of information such as e-mails, text messages, and perhaps spreadsheets are now comparable in amount - although Watson could to some extent extend to these without effort. In any case, the name of the game in BI/analytics these days is to tap into not only the text on Facebook and Twitter, but also the information inherent in the videos and pictures provided via Facebook, GPS locators, and cell phones. As a result, Watson is still a ways away from providing good unstructured "context" to analytics - rendering it far less useful to BI/analytics. And bear in mind that analysis of visual information in AI, as evidenced in such areas as robotics, is still in its infancy, used primarily in small doses to direct an individual robot.

As noted above, I see the immediate value of Watson's capabilities to the large enterprise (although I suppose the cloud can make it available to the SMB as well) to be more in the area of cross-domain correlation in existing text databases, including archived emails. There, Watson could be used in historical and legal querying to do preliminary context analysis, to avoid having eDiscovery take every reference to nuking one's competitors as a terrorist threat. Ex post facto analysis of help desk interactions (one example that IBM cites) may improve understanding of what the caller wants, but Watson will likely do nothing for user irritation at language or dialect barriers from offshoring, not to mention encouraging "interaction speedup" that the most recent Sloan Management Review suggests actually loses customers.

Featured Research
  • The Social Side of Service

    Did you know that 83% of Twitter users who tweeted a complaint said they loved receiving a response from the brand? In order to provide the best possible service to your customers, you MUST provide service on the channels that they are utilizing. Social customer service might seem scary and undefined, but can be much more effective and less expensive than traditional channels. more

  • Video Conferencing

    For many, the mere mention of video conferencing brings about bad memories of conference rooms full of people staring at a screen with dodgy sound, fuzzy images, and broken connections. What if we were to tell you that over the past decade, video conferencing solutions have evolved to where they are affordable to businesses of every size and have evolved beyond just the standard boardroom. Today, 74% of B2C marketers and 94% of B2B marketers use video in their marketing efforts. more

  • EHR Implementation

    More and more medical practices are selecting and implementing electronic health records (EHR) than ever before. In fact, statistics show that the number of practices who have purchased an EHR has doubled in just three years. That being said, many practices fail to prepare for their new EHR and thus do not gain the full benefits that come with implementing a solution. more

  • Selecting the Right EHR for Your Practice

    The purchase and implementation of an electronic health record (EHR) system is no small feat and is a big step for a practice, small or large, to take. Selecting your new EHR is one of the most important decisions that you will make for your practice. more

  • 8 Ways Business Travelers Can Save with VoIP

    Do you or any part of your workforce travel for work, or even telecommute? If that answer is yes, then you should be utilizing mobile VoIP. With VoIP, businesses have been found to save as much as 40% on local calls and a whopping 90% on international calling expenses. more