Insights and Must-Knows
from the Big Data Blog

Big Data Blog

Darth Vader, Big Data, and Predictive Analytics

Oct 2

Written by:
10/2/2012 6:15 AM  RssIcon

“He’s more machine now than man.  His mind is twisted and evil.”

Those were the words of Jedi Master Obi-Wan Kenobi, describing his old friend Anakin Skywalker who, after being seduced by the Dark Side of the Force, became the cyborg Sith Lord Darth Vader.

I sense a similar sentiment in business leaders who resist the machine learning algorithms that are becoming more of a necessity in the era of big data.  Some business leaders fear that the future of big data analytics could twist decision making into becoming more machine than man — if more decisions are made by a computer algorithm, and fewer decisions are made by a human mind.

A common challenge for predictive analytics is that algorithmic predictions often seem counterintuitive to business leaders, whose intuition is rightfully based on their business expertise, which has guided their business success to date.  Even those who pride themselves on being data-driven in their decision making naturally resist any counterintuitive insights found in their data.

To paraphrase Obi-Wan, your intuition serves you well, but it could also be your undoing.

Instead, some business leaders are afraid that the Dark Side of the Analytical Force is more likely to be their undoing, citing examples such as the 2010 Flash Crash (the complexity of which included more than just algorithms gone wild), and the 2011 Amazon book price algorithm war that temporarily priced a book at well over a million dollars (the algorithmic-driven price topped out at $23,698,655.93 — plus an additional $3.99 for shipping, of course).

However, despite their profound lack of the tacit understanding provided by human cognition, it’s simply undeniable that computer automation has its advantages.

Computers have much greater memory capacity and much faster processing speed.  Computers excel at mathematics and statistics (even more so than humans with PhDs in mathematics and statistics).  Computers don’t get tired or hungry.  Computers don’t get upset when the local sports team loses or get despondent because the cute fax machine that works on the fifth floor won’t answer any of its calls.  And computers don’t get too distracted looking for cat videos on YouTube.

Algorithms make consistent decisions in that when given the same inputs, they always return the same answer.  Algorithms also more quickly learn when their previous answer was a mistake.  Thanks in part to suffering no embarrassment from making mistakes (and admitting that they made mistakes), algorithms can continually learn without ever assuming (or becoming adamant) that they have nothing left to learn because of their vast experience and proven expertise.

But it’s also undeniable that big data analytics is not a forced choice between human cognition and computer automation — instead, it requires a fusion of the best of both.

“One of the reasons that I am a firm believer in business rules and predictive analytics being used in tandem,” James Taylor explained during a podcast discussion about decision management, “is that I find that you’re less likely to go off the rails in either direction.”

”Business rules are very explicit,” Taylor continued.  “These are our policies.  These are the regulations.  This is our expertise.  This is what makes sense.  Business rules can be a little bit of a blunt instrument at times because they don’t have the nuances you would get out of data analysis, and they tend to rely on what everybody knows, which is not always true.  But business rules are explicit, easy to track, and bounded.”

”Predictive analytics, on the other hand, can sometimes produce somewhat surprising results.  Although predictive analytics learns quickly as new data comes in, it isn’t bounded quite as much by the sensibilities of the people who run the business.  Therefore, business rules and predictive analytics work much better when they work together.”

Remember that Darth Vader did not intentionally became a cyborg (i.e., a fusion of humanity and technology).  You needn’t lose your limbs in lightsaber battles with big data, or get burned in a lava flow of information overload, or literally become a cyborg in order to accept that humans and computers must work together to bring balance to the Analytical Force.

May the Analytical Force be with you.  Always.


Your name:
Gravatar Preview
Your email:
(Optional) Email used only to show Gravatar.
Your website:
Title:
Comment:
Security Code
CAPTCHA image
Enter the code shown above in the box below
Add Comment   Cancel 

Search Big Data Blogs

Tags

Big Data (113)
Analytics (59)
Pervasive (50)
DataRush (32)
Hadoop (29)
predictive analytics (20)
Scalability (20)
Industry trends (18)
Multicore (15)
Data Mining (12)
Parallelism (10)
Java (9)
Cloud (8)
Cyber Security (8)
MapReduce (8)
KNIME (7)
Volumes (7)
Algorithms (6)
Cost-effective (6)
Data Volumes (6)
Data Warehouse (6)
RushAnalytics (6)
RushAnalyzer (6)
David Loshin (5)
Decision Support (5)
machine learning (5)
Dataflow (4)
Forrester (4)
Google (4)
Green IT (4)
Healthcare (4)
Actian (3)
analytics processes (3)
BigQuery (3)
data centers (3)
data integration (3)
Data Preparation (3)
data-driven (3)
DataMatcher (3)
Jim Harris (3)
Malstone B (3)
Opera Solutions (3)
Retail Analytics (3)
Security (3)
Solutions (3)
telecommunications (3)
transportation analytics (3)
YARN (3)
analytics tools (2)
Austin (2)
big data analytics (2)
Big Data Science (2)
Bloor (2)
Bloor Research (2)
Data Rush (2)
David Inbar (2)
David Norris (2)
fraud (2)
fraud detection (2)
Gartner (2)
GigaOM (2)
Hadoop Summit (2)
IntegrationWorld (2)
Julie Hunt (2)
machine generated data (2)
meetup (2)
Mike Hoskins (2)
ParAccel (2)
Pervasive DataRush (2)
Rexer Analytics (2)
software (2)
#FollowFriday (1)
Amazon (1)
analytics accuracy (1)
analytics workflow (1)
Application Development (1)
automation (1)
Benchmarks (1)
Business Intelligence (1)
Cloud Analytics Summit (1)
cloud computing (1)
Cloudera (1)
contests (1)
cost (1)
cyber security issues (1)
Data Integrator - Hadoop Edition (1)
data quality (1)
Data Science (1)
data tools (1)
Door64 (1)
easy big data analytics (1)
Ericson (1)
Esri (1)
Facebook (1)
Fuzzy Matching (1)
Goverment (1)
Hadoop User Group (1)
Hadoop World (1)
intelligent transportation systems (1)
Jazoon (1)
Jim Falgout (1)
MalStoneB (1)
Mansour Raad (1)
McKinsey (1)
Neil Raden (1)
Netflix (1)
NetFlow (1)
para (1)
PIG (1)
Predictive Analytics World (1)
psychohistory (1)
Public Sector (1)
Redshift (1)
RushAccelerator (1)
Sampling (1)
Signal and Noise (1)
Smart Grid (1)
SmartDataCollective (1)
spatial analytics (1)
speed (1)
sports (1)
Stephen Swoyer (1)
Steve Shine (1)
Strata (1)
SXSW (1)
Telecom Analytics (1)
Telecommunications Industry Association (1)
TIA (1)
Transportation (1)
TurboRush (1)
VectorWise (1)
Zementis (1)

Big Data Blog Archives

Archive
<May 2013>
SunMonTueWedThuFriSat
2829301234
567891011
12131415161718
19202122232425
2627282930311
2345678
Monthly
Go