Analytics & Big Data

Visual Recognition Made Simple

“A picture paints a thousand words”

but only if you know how to read it . . 

Visual Recognition does for huge numbers of images what text analytics does for large collections of written text: it allows pictures to be processed automatically to find material of interest. This can provide the basis for open source intelligence or provide a means to process collected images to automatically find specified targets, such as military infrastructure.

The number of images from public sources, soldiers with (body) cameras, coalition partners and other sources is exploding. Clearly achieving timely, accurate classification of the images by “throwing people at the problem” does not scale and can be error prone.

Cognitive computing – in the form of unsupervised (machine) learning – will transform our ability to classify images.

Let’s consider recognising a military aircraft in an image using the Watson Visual Recognition (web) service. The service uses semantic classifiers built with machine-learning technology to recognize visual entities such as settings, objects, and events based on content such as color, texture, shape, and edges. Watson Visual Recognition allows us to build a classifier specific to our problem (“spot a military aircraft”) by learning from images we provide.

VizRec 1

Figure 1 – Watson Visual Recognition Service (in IBM BlueMix)

To train the new classifier, we provide a set of positive (correct) and negative (incorrect) images as training sets, as shown in Figure 2. Careful examination of the negative images shows a combination of civil aircraft and birds.

VizReq 2

Figure 2 – Training Sets: Positive & Negative

Once the classifier is trained, we can ask it to “recognize” images for us. Sample actual results are shown in Figure 3, but if the classifier misbehaves or the confidence is not high enough it can be re-trained with more complete and/ or comprehensive image libraries.

VIZREQ 3

Figure 3 – Classification Results

For a real life problem, several classifiers would be built and applied to the same image, resulting in competing “image type” results with associated confidences. System accuracy could improve over time if the training sets are augmented with correct and incorrect classifications and the classifiers periodically re-trained.

Want to see this Watson Visual Recognition in action? We will be demonstrating it at or SPADE customer conference [3] in Berlin on the 12th & 13th April. Alternatively take a look at our BlueMix cloud integration platform.

References & Credits

  1. I’m grateful to my close colleague & friend Dr Doug Dykeman (@doug_dykeman) for showing me just how easy this can be!
  2. For insights into the underlying technology “Unsupervised One-Class Learning for Automatic Outlier Removal”
    by John Smith et al
  3. If you are interested in attending our SPADE conference, please contact your local IBM account representative – or me if you don’t have one!

Agree, disagree, disinterested?  I’d much appreciate an active debate on this topic!  Contact me through leaving a comment, twitter or LinkedIn!

Add Comment
No Comments

Leave a Reply

Your email address will not be published.Required fields are marked *

More IBM Watson Stories

Open Innovation in Government

Open Innovation is a term originally coined by Dr Henry Chesbrough in the early 2000’s best summarised by Figures 1 & 2 below. Figure 1 – Open Innovation  Figure 2 – Open vs Closed Innovation Open Innovation has been compelling to Governments for some time, but lacking the technical platform to realise the enormous potential gains. This is where IBM’s Bluemix platform […]

How do you achieve real improvements in your organization?

In my working life, both 29 years in the Navy and 8 years at IBM, I have always been interested in what makes an organization perform its business in a more effective manner. This includes such diverse areas as getting a submarine crew to fire torpedoes more effectively (I was Commanding Officer of a submarine) […]

Blockchain, where’s my stuff?

Even though a physical supply chain may be integrated, the supporting information systems are often fragmented.  Members of the supply chain – ranging from the source of raw materials to the consumer – keep their own records of asset information, which drives up costs (through duplication & inefficiencies) increases complexity and causes regulatory challenges. Figure 1 – Simplified Supply […]