Beth Schultz

UnitedHealthcare Wants High-Performance Analytics

Analytik
User Rank
Prospector
What is the value of High-Performance Analytics
Analytik   12/19/2012 11:02:25 AM
Many organizations struggle to identify the business value of accelerating analytics performance. Can Mark describe what business value UnitedHealthcare will gain from cutting the process time from 4 hours 15 minutes to 10 seconds, and how that will affect the business?

thomaswdinsmore
User Rank
Prospector
Re: Update?
thomaswdinsmore   11/29/2012 2:46:40 PM
Right -- because a run time of ten seconds is great, but you have to consider time to load from Greenplum to HPA when weighing in-memory analytics versus in-database analytics (which don't require the extra load).

Usually people love to talk about stuff that works well, so the load time must be an issue.

BethSchultz
User Rank
Blogger
Re: Several Questions
BethSchultz   11/29/2012 1:56:15 PM
Whoops! Missed this earlier. This OptumInsight project looks fascinating from a quick flip through the slides. I have to set aside some time to give it my full attention!

BethSchultz
User Rank
Blogger
Re: Update?
BethSchultz   11/29/2012 1:54:54 PM
@thomaswdinsmore -- Mark declined to go into further detail, but I would assume that, yes, data load time is a consideration!

thomaswdinsmore
User Rank
Prospector
Update?
thomaswdinsmore   11/29/2012 9:30:27 AM
Any response to the last two questions? It's curious that nobody wants to talk about the time to load data from Greenplum to HPA -- that strikes me as a material consideration when evaluating a co-located approach.

thomaswdinsmore
User Rank
Prospector
Re: Several Questions
thomaswdinsmore   11/26/2012 5:31:57 PM
It's production work, not a test. Here's a link. OptumInsight is a unit of UHG.

http://www.ehcca.com/presentations/predmodel5/wickstrom_2.pdf

BethSchultz
User Rank
Blogger
Re: Several Questions
BethSchultz   11/26/2012 4:46:12 PM
@thomaswdinsmore, while I'm checking in with Mark on these answers, maybe you could briefly share the results of the other UHG high-performance test?

thomaswdinsmore
User Rank
Prospector
Re: Several Questions
thomaswdinsmore   11/26/2012 10:22:17 AM
Beth,

Thanks for the response.  Two follow-up questions:

(1) How much time did it take to load the data from Greenplum into HPA's memory?

(2) Did you test any larger problems? There is a group inside UHG that successfully runs predictive analytics on billion-row datasets (using alternative technologies).

BethSchultz
User Rank
Blogger
Re: Several Questions
BethSchultz   11/26/2012 9:07:51 AM
@thomaswdinsmore, I've checked with Mark Pitts, and here are his responses:

(1) UHG has not yet purchased the product.

(2) The load rate referenced in the article -- is that the time needed to load raw data into EMC Greenplum, or the time needed to load data from Greenplum into HPA's memory?
    The load rate in the article is the time to load raw data into the EMC Greenplum DCA.

(3) The analysis on four million rows that takes four hours in the current state environment -- what analysis did Mr. Pitts's team test, and what environment does this currently run on?
    This was one of many tests we ran. This one was a simulation written in DS2 that ran a computationally intensive and I/O-intensive algorithm; we wrote it intentionally to put the DCA through its paces. The four-hour run of the same algorithm was on a dedicated 16-core SMP Unix server, with "dedicated" meaning that nothing else was running on the server during the test.

thomaswdinsmore
User Rank
Prospector
Several Questions
thomaswdinsmore   11/25/2012 12:54:46 PM
Several questions about this story.

(1) Did Mr. Pitts' firm actually license the product for production?  The article simply refers to a POC.

(2) The load rate referenced in the article -- is that the time needed to load raw data into EMC Greenplum, or the time needed to load data from Greenplum into HPA's memory?

(3) The analysis on four million rows that takes four hours in the current state environment -- what analysis did Mr. Pitts's team test, and what environment does this currently run on?
