The following notes were live-blogged from the “Understanding Predictive Analytics” session given by Chuck Chakrapani (Leger Marketing) on June 10, 2014. Minimal editing has been done, so there may be typos. Below is a video interview with the presenter:
Chakrapani is interested in technology-enabled predictive analytics (as opposed to technology-driven).
What is Data Analysis:
- big data
- machine learning
- data mining
- predictive analytics
- text mining
- etc.
Everything is predictive:
- do we want to go to this session or another
- do I take this job offer
- will my stocks go up as well
Business
- will this new product succeed
- can I increase the price
- who will be my target audience
Steps:
What will happen: either A or B will happen, and each outcome carries its own consequences for the results
Google Fusion
- Enables you to pull information from the web
- This means we have access to a vast amount of secondary data
The New Science of Data Science
Data science is the study of the generalizable extraction of knowledge from data. It builds on techniques and theories from many fields:
- signal processing
- probability
- etc
What is big data?
- A large amount of data?
- More data than your desktop could handle?
- One zettabyte of data?
- No agreed upon definitions
- A tentative framework
- Drawn from the data universe, which is infinite and constantly in flux
Big Data and the Flu
- Google uses searches and conversations about the flu to predict infection rates. Big data is great when it works; the problem is that it gives you only correlations
Machine learning
- Example: Amazon tells me what I should read based on what I am reading now
- Machine learns and predicts
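The Amazon example above can be sketched in a few lines. This is only a hypothetical co-occurrence recommender, not Amazon's actual algorithm, and the book names and purchase histories are made up:

```python
from collections import Counter

# Hypothetical purchase histories: each set is the books one customer bought.
histories = [
    {"data_science_101", "stats_basics"},
    {"data_science_101", "stats_basics", "ml_intro"},
    {"data_science_101", "ml_intro"},
    {"cookbook"},
]

def recommend(current_book, histories, top_n=1):
    """Recommend the books that most often co-occur with the current one."""
    co_counts = Counter()
    for basket in histories:
        if current_book in basket:
            co_counts.update(basket - {current_book})
    return [book for book, _ in co_counts.most_common(top_n)]
```

Calling `recommend("data_science_101", histories)` surfaces the titles most often bought alongside that book; the "learning" is just counting what past customers did.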
What Happens When You Use Gmail
- Google ads based on emails
Two Functions of Predictive Analytics
- Classification
- Prediction
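A minimal sketch of the two functions, on made-up customer data: classification assigns a label (here with a 1-nearest-neighbour rule), while prediction estimates a numeric value (here with a least-squares line). The data and function names are hypothetical illustrations, not anything from the session:

```python
def classify_1nn(point, labeled_points):
    """Classification: copy the label of the closest known point."""
    nearest = min(labeled_points, key=lambda p: abs(p[0] - point))
    return nearest[1]

def fit_line(xs, ys):
    """Prediction: ordinary least squares for y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    b = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / \
        sum((x - mean_x) ** 2 for x in xs)
    a = mean_y - b * mean_x
    return a, b

# Classification: label customers as likely buyers by monthly spend (made-up data).
training = [(10, "no"), (20, "no"), (80, "yes"), (95, "yes")]
label = classify_1nn(70, training)  # nearest known customer is (80, "yes")

# Prediction: estimate sales from ad spend (made-up data that is exactly y = 1 + 2x).
a, b = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
```

Same data, two different questions: "which group does this case belong to?" versus "what value should we expect?"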
The objectives haven’t changed, but:
- Lower costs
- better predictability
- faster turn-around
Example
- 25 years ago, a single cluster analysis of 600 respondents on 30 variables would run for 24 hours on a PC
- Today you can run 100 cluster analyses of 1,000 respondents on 30 variables in one afternoon
How does that help?
Then:
- one respondent was chosen randomly to represent each segment
- everyone close to that respondent was assigned to the segment
- there was nothing to indicate whether the segmentation was reasonable
- there was no way of validating your segments
- a holdout sample was better than nothing, but not good enough
Now:
- We can use larger samples, which let us split the data into a training set and a test set
- We can run hundreds of cluster analyses on the same data
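The training/test idea can be sketched with a toy 1-D k-means. This is an illustrative assumption, not the presenter's method: fit segment centroids on a training set, then check how well those segments describe held-out respondents. The "respondent" values are made up and deliberately bimodal:

```python
def kmeans_1d(points, k=2, iters=20):
    """Tiny 1-D k-means: seed centroids at the extremes, then alternate
    between assigning points and recomputing centroid means."""
    centroids = [min(points), max(points)] if k == 2 else points[:k]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            idx = min(range(k), key=lambda i: abs(p - centroids[i]))
            clusters[idx].append(p)
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids

def mean_distance(points, centroids):
    """Average distance from each point to its nearest centroid:
    a simple score for how well segments fit unseen data."""
    return sum(min(abs(p - c) for c in centroids) for p in points) / len(points)

# Made-up respondent scores drawn from two obvious groups.
train = [1, 2, 3, 2, 1, 20, 21, 22, 19, 23]
test = [2, 21, 1, 22, 20]

centroids = kmeans_1d(train, k=2)
# If the segments are real, they should fit the held-out respondents too.
score = mean_distance(test, centroids)
```

A low test-set score suggests the segments generalize; a high one suggests they were an artifact of the training sample — exactly the validation the "then" workflow lacked.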
Message:
Do not think of big data as everything. Unless you combine data with analysis, the whole thing is useless. You need to have objectives.