In a recent Enterprise Strategy Group study, we asked participants to identify the biggest drivers for deploying a new data analytics platform for big data requirements and to tell us the greatest benefits they expected to realize as a result.
We found this: Organizations are looking at new data analytics platforms as a way to cut costs. But their expectations aren't centered on completing analytics on larger datasets closer to real time. Rather, "improving business agility" won the top spot. And that is exactly what high-performance computing (HPC) and big data analytics platforms such as Hadoop and massively parallel processing (MPP) analytical databases are advertised to deliver.
When you can support high-performance data analytics across your infrastructure, you can discover patterns or develop theses for predictive models in hours and days rather than weeks and months. Likewise, if you need to be able to execute a ranking algorithm in near real-time to identify a fraudulent activity, the model needs to be able to complete in microseconds to milliseconds. Delivering results in such speedy fashion means giving the business the ability to react quickly and with the agility that's much desired.
But you need to know what you need out of a high-performance, big data analytics platform.
With big data analytics platforms based on Hadoop, there's the expectation that high performance is relative -- meaning seconds, minutes, and hours versus the milliseconds, seconds, and minutes of the MPP analytical databases. More real-time analytics with high performance and high volumes that fall into HPC are really an extreme case requiring more strategic investments. If you can wait a few more seconds for an answer, Hadoop may be good enough for a fraction of the cost.
Either way, big data architectures aren't without their red flags for enterprise deployment.
Hadoop, for example, is still emerging and has a few holes when it comes to security, data protection, and high availability. There are workarounds, and not all applications will need advanced enterprise-class data management requirements. Yet, as Hadoop-based applications become mainstream, those requirements will absolutely surface. Until then, you will need to look at your IT Infrastructure Library, or ITIL, processes and make some exceptions for supporting a big-data analytics platform.
Also, if your network is not set up to move massive volumes of data from point A to point B in the timeframe the business requires, an upgrade in your network infrastructure may be on the cards.
What's more, any Hadoop or MPP analytical database-based application will require specialized skillsets that are already in short supply. You better have the budget allocated to either hire specialists or send your top developers to training.
I think for analysis along with platform, tools also matters. Whatever may be the platform, the ability of tools to interactive and make use of the underlined resources are important. If the tools are not compactable with the deployed hardware and associated software, then the analysis process may happen with lesser speed, there may be always certain bottlenecks.
Julie writes
if you need to be able to execute a ranking algorithm in near real-time to identify a fraudulent activity, the model needs to be able to complete in microseconds to milliseconds. Delivering results in such speedy fashion means giving the business the ability to react quickly and with the agility that's much desired.
One would think credit card companies would be big users. I wonder approximately what the percentage is of credit card companies that are delopying this technology (HPC analytics etc.). I'd also wonder if this enables some more technologically forward-thinking companies to react to fraud faster than others.
" high performance is relalitve"..."What makes one company agile isn't the same for everyone. "
High performance evaluation may indeed depend on the company's data processing needs and requirements. Companies should first try to audit and understand such needs in oder to choose the right analytics tools.
I guess it would only make sense high performance is relalitve. that different companies would have different values on saving costs, performance and speed.
As mentioned in the article, banks need answers in milliseconds when it comes to fraud prevention, while Procter and Gamble would have very different needs.
What makes one company agile isn't the same for everyone.
Hi Louis, This environment had both an HPC and data warehouse environment. The team wanted to be able to enhance the DW with more detailed data and have the HPC environment have access to it. This new architecture added Hadoop as the landing place for detailed data that was accessible to both environments. Once they found how much quicker it was to add data elements tot he model, that was just the beginning of new opportunities.
@JulieLockner, fewer false positives and quicker response time certainly would be ideal in fraud detection analytics! I'm curious, outside of latency-sensitive operational analytics are there some areas where companies just aren't willing to get quite as automated with their responses as they might?
Hi Shawn, From an agility perspective, one Hadoop user had a challenge with their ability to add attributes to their Cognos cube for a risk model. The job took 4 days to complete with Oracle. When they switched the underlying data store to Hadoop, they were able to reduce that rebuild to 1 day. From a new opportunity perspective, another user was not able to support their business users' request to mine detailed transactional data to identify opportunities to improve business process efficiencies because the warehouse could not scale. The warehouse was only designed to look at aggregate data. By opening up detailed transactional data cost affordably on a Hadoop cluster, they could investigate as to why processes were inefficient.
LEADERS FROM THE BUSINESS AND IT COMMUNITIES DUEL OVER CRITICAL TECHNOLOGY ISSUES
The Current Discussion
Visual Analytics: Who Carries the Onus? The Issue: Data visualization is an up-and-coming technology for businesses that want to deliver analytical results in a visual way, enabling analysts the ability to spot patterns more easily and business users to absorb the insight at a glance and better understand what questions to ask of the data. But does it make more sense to train everybody to handle the visualization mandate or bring on visualization expertise? Our experts are divided on the question. The Speakers: Hyoun Park, Principal Analyst, Nucleus Research; Jonathan Schwabish, US Economist & Data Visualizer
To save this item to your list of favorite AllAnalytics content so you can find it later in your Profile page, click the "Save It" button next to the item.
If you found this interesting or useful, please use the links to the services below to share it with other readers. You will need a free account with each service to share an item via that service.
Elizabeth Barth-Thacker, a BI and informatics technology manager at Humana, tells us how her team is creating data transparency and building engagement with the business – with the help of an internal collaboration portal called Humanalytics.
Speaking at SAS Global Forum Executive Conference, Rajeev Kaul, SVP of pricing at OfficeMax, uses a Chinese proverb to explain one of the reasons he's deploying SAS Visual Analytics.
In an All Analytics interview, Mike Cavaretta, technical leader, predictive analytics at Ford Research & Advanced Engineering, shares how big-data is fueling vehicle decisions.
Analytics professionals and SAS executives share how organizations can get on with their work so much faster when working in a high-performance and visual analytics environment.
Analytics professionals who attended SAS's recent Executive Briefing in New York share how they think visual analytics might help their organizations get better value from data.
At Boeing, effective decision making comes down to this simple formula: QxA=E, as executive Jerry Allyne explained at the recent INFORMS analytics conference.
Whether working in major league sports, financial services, or healthcare, analytics, and data, professionals are checking out how visual analytics and high-performance technologies can help them optimize their environments, shrink their cycle times, and improve decision making, as attendees at the recent SAS Executive Briefing in New York share with us.
SAS CEO Jim Goodnight speaks with us at a recent SAS Executive Briefing about getting a feel for what's in your big-data and other new realities powered by advanced analytics.
Dynamic data visualizations let analysts and business users interact with the data, changing variables or drilling down into data points, and see results in a flash. Advance your use of data visualization with tools that support features like auto-charting, explanatory pop-ups, and mobile sharing.
No doubt your enterprise is amassing loads of data for fact-based decision-making. Hand in hand with all that data comes big computational requirements. Can traditional IT infrastructure handle the increasing number and complexity of your analytical work? Probably not, which is why you need a backend rethink. Big data calls for a high-performance analytics infrastructure, as Fern Halper, a partner at the IT consulting and research firm, Hurwitz & Associates, discusses here.
Redbox's bright-red DVD kiosks are all but ubiquitous these days, located in more than 28,000 spots across the country. Jayson Tipp, Redbox VP of Analytics and CRM, provides an insider's look at how the company has accomplished its phenomenal nine-year growth.
InterContinental Hotels Group (IHG), a seven-brand global hotelier, has woven analytics into the fabric of its operations. David Schmitt, director of performance strategy and planning, shares IHG's analytics story and his lessons learned.