What AI can really do for your business (and what it can’t)

Artificial intelligence, machine learning, and deep learning are no silver bullets. A CIO explains what every business should know before investing in AI

What AI can really do for your business (and what it can’t)
Thinkstock

How can you tell whether an emerging technology such as artificial intelligence is worth investing time into when there is so much hype being published daily? We’re all enamored by some of the amazing results such as AlphaGo beating the champion Go player, advances in autonomous vehicles, the voice recognition being performed by Alexa and Cortana, and the image recognition being performed by Google Photos, Amazon Rekognition, and other photo-sharing applications.

When big, technically strong companies like Google, Amazon, Microsoft, IBM, and Apple show success with a new technology and the media glorifies it, businesses often believe these technologies are available for their own use. But is it true? And if so, where is it true?

This is the type of question CIOs think about every time a new technology starts becoming mainstream:

  • To a CIO, is it a technology that we need to invest in, research, pay attention to, or ignore? How do we explain to our business leaders where the technology has applicability to the business and whether it represents a competitive opportunity or a potential threat?
  • To the more inquisitive employees, how do we simplify what the technology does in understandable terms and separate out the hype, today’s reality, and its future potential?
  • When select employees on the staff show interest in exploring these technologies, should we be supportive, what problem should we steer them toward, and what aspects of the technology should they invest time in learning?
  • When vendors show up marketing the facts that their capabilities are driven by the emerging technology and that they have expert PhDs on their staff supporting the product’s development, how do we evaluate what has real business potential versus services that are too early to leverage versus others that are really hype, not substance?

What artificial intelligence really is, and how it got there

AI technology has been around for some time, but to me it got its big start in 1968-69 when the SHRDLU natural language processing (NLP) system came out, research papers on perceptrons and backpropagation were published, and the world became aware of AI through HAL in 2001: A Space Odyssey. The next major breakthroughs can be pinned to the late 1980s with the use of back propagation in learning algorithms and then their application to problems like handwriting recognition. AI took on large scale challenges in the late 1990s with the first chatbot (ALICE) and Deep Blue beating Garry Kasparov, the world chess champion.

I got my first hands-on experience with AI in the 1990s. In graduate school at the University of Arizona, several of us were programming neural networks in C to solve image-recognition problems in medical, astronomy, and other research areas. We experimented with various learning algorithms, techniques to solve optimization problems, and methods to make decisions around imprecise data.

If we were doing neural networks, we programmed the perceptron’s math by hand, then looped through the layers of the network to produce output, then looped backward to apply the backpropagation algorithms to adjust the network. We then waited long periods of time for the system to stabilize its output.

When early results failed, we were never sure if we were applying the wrong learning algorithms, hadn’t tuned our network optimally for the problem we were trying to solve, or simply had programming errors in the perceptron or backpropagation algorithms.

Flash-forward to today and it’s easy to see why there’s an exponential leap in AI results over the last several years thanks to several advances.

First, there’s cloud computing, which enables running large neural networks on a cluster of machines. Instead of looping through perceptrons one at a time and working with only one or two network layers, computation is distributed across a large array of computing nodes. This is enabling deep learning algorithms, which are essentially neural networks with a large number of nodes and layers that enable processing of large-scale problems in reasonable amounts of time.

Second, there’s the emergence of commercial and open source libraries and services like TensorFlow, Caffe, Apache MXNet, and other services providing data scientists and software developers the tools to apply machine learning and deep learning algorithms to their data sets without having to program the underlying mathematics or enable parallel computing. Future AI applications will be driven by AI on a chip or board driven by the innovation and competition among Nvidia, Intel, AMD, and others.

Don’t confuse AI hype with AI realities

Once you have a grasp of history and an understanding of the technology, it’s often useful to review where an emerging technology is in its life cycle.

Gartner has machine learning and deep learning at the peak of their hype cycles and forecasts that “general AI” (AI applied to any intelligence problem) will emerge after 2020. Venture Scanner shows that about two-thirds of startup funding in AI is going to early rounds (seed, Series A, and Series B rounds), indicating that many companies selling or marketing AI solutions are early in their product development and sales cycles. McKinsey states that only 20 percent of AI-aware firms are adopting AI and that more than 50 percent of AI investments are coming from tech giants and startups versus businesses that happen to use technology.

Those stats should give any CIO or business executive a pause before jumping into AI investments with both feet. Although AI is certainly demonstrating a lot of promise, the commercial application of these algorithms at scale is still relatively young.

And the early winners are big tech companies and startups with the talent, funding, and patience to experiment with new technologies. Most enterprises and medium businesses simply don’t have these luxuries and are just starting their AI journeys.

AI is a highly disruptive technology, so you should not ignore it. Just proceed judiciously and avoid getting hypnotized by the AI hype.

For example, when voice becomes a better human machine interface than screens and keyboards for some applications, or as chatbots become smarter and faster than human customer service agents, many businesses will have to upgrade their user experiences with these technologies.

Likewise, when deep learning algorithms get better at detecting fraud, risky transactions, or security threats, enterprises will have to be ready to use these approaches.

And when we begin to pull intelligence from spoken language, audio, and video as effectively as we can with more structured data, using these capabilities will provide significant competitive advantages to a large array of businesses.

When is the operative word.

Most businesses should aim to be fast followers, not early adopters. That means paying attention and even experimenting with AI in these early days, but waiting to rely on AI until the technology is sufficiently mature, proven, and able to deliver at scale.

As you learn about AI capabilities, look for tools and practical example to help evaluate applications of AI and their maturity. Examples include:

Start your AI program with fundamental business goals

The hype around AI and machine learning is driving some technology and business leaders to dive into a technology-first strategy. If you are starting your journey by experimenting with machine learning libraries or courting vendors that are beating the AI drums, you’re missing some key starting steps.

Instead, start by looking at business problems and opportunities with significant upside to offset the costs for research and development. These opportunities should be backed by very large data sets that you already have or that you can acquire and data integrate in easily. Draw inspiration from what other industries where other nontech companies are demonstrating success.

One reason to start with a defined business opportunity is that you might find solutions that don’t require the latest AI technologies. If some form of AI is required, this defined-business-opportunity approach lets you classify the type of solutions and evaluate the overall maturity of the AI required.

For example, if you’re trying to automate a highly manual business process involving visual inspection of parts coming off an assembly line, you might identify that a combination of image recognition and robotic process automation as part of the solution set. Both are more mature AI areas, as evidenced by the variety of successes and vendor solutions in this area.

On the other hand, if the solution requires significant cognitive evaluation and thinking, you’re heading into an immature AI space.

One way to gauge AI maturity is look at the various vendor landscapes published on AI startups, such as the ones from Venture Scanner, the current state of machine intelligence, O’Reilly’s bot landscape, and the AI fintech landscape. Reviewing these lists, you’ll see that many startups have focused solutions around discrete problem sets rather than generalized cognitive solutions.

Don’t be fooled when a vendor says something like, “Just throw your data at our AI” and expect expert intelligence to be returned. It won’t happen.

To truly work, your AI will need lots of data

That brings up the second prerequisite for running AI successfully: You need large amounts of relatively clean data to train AI solutions and evaluate outputs.

One reason autonomous vehicles are possible is the 4,000 GB generated from one hour of driving generated from lidar and other sensors found in these cars. That’s a lot of data being used to make what are really just a handful of fundamental decisions on whether the car should turn, speed up, slow down, or fully stop.

Many successful AI solutions fall into this same category of churning large amounts of data into a finite number of decisions. In image recognition, for example, am I looking at a picture that contains you in it, or not? In collaborative filtering, is a newly published article more relevant to you based on your past reading experiences versus other reading options? When evaluating a transaction, does it have similar patterns to fraudulent transactions?

The AI “inside the box” is trying to approximate a curve to make these decisions. In deep learning, for example, the number of layers and neurons in the network can approximate highly complex curves to differentiate outcomes. To develop this network, you need a large, tagged data set so the network can be trained by comparing its computed results against your tagged result with the desired outcome. The errors are then used to tune the network using backpropagation or other learning algorithms, and the exercise is repeated multiple times across all tagged data until the network stabilizes to an optimized curve. These are supervised learning solutions, developed using a training set.

If the data isn’t tagged, networks can use unsupervised learning approaches that rely on entropy expressions that evaluate the outcome. For example, when Google’s DeepMind was used to learn to play the Atari game Breakout, it used the score to evaluate outcomes.

Going beyond data sets, your organization needs a data integration and automation capability so you can move data into and out of any AI processing engine. If your organization is used to having people run scripts manually to push data around, I strongly suggest first investing in automation before diving into AI solutions.

Your options for experimenting with AI

Once you have business opportunities identified and large, cleansed data sets available, you are ready to consider an AI journey. Those two steps are prerequisites for preparing your organization for artificial intelligence. The main next steps are to consider the type of AI solution and implementation. If you have the talent, you can experiment with TensorFlow or one of the other AI engines. If you don’t have the expertise, think twice about trying to recruit for it; the tech giants are paying huge salaries for scarce AI talent, and so the costs are enormous to just get into the game.

A second option is to use vendors that have embedded AI in their solutions. One example is Salesforce Einstein, an AI platform that can perform forecasting and other functions on top of CRM data stored in Salesforce. Likewise, you can look at industry-specific solutions, such as Synechron’s Neo for financial tech (fintech).

Once you settle on one or more approaches, it’s important to set realistic expectations with stakeholders. Investing in AI requires a commitment to agile experimentation because you’re likely to encounter many dead ends and experiments that require many runs before they are optimized. Set those expectations for budget, timeframe, and talent upfront.

Copyright © 2017 IDG Communications, Inc.