REGISTER   |   LOGIN   |   HELP
Home  |  Blogs  |  Message Boards  |  Webinars  |  Resources |  By Channel
Beth Schultz

Point/Counterpoint: Open-Source Analytics

NO RATINGS
View Comments: Newest First | Oldest First | Threaded View
Shyam Desigan
User Rank
Blogger
Re: The Rise of Open Source Analytics
Shyam Desigan   9/26/2011 11:00:58 PM
NO RATINGS
Robert,

Thanks for sharing your insights on R/Rattle. Companies like Revolution Analytics are taking the same approach with providing development support around R like with Red Hat did with Linux a decade earlier. As we all recognize, Linux has a huge presence in High performance computing today and given the trajectory that Open source tools like Hadoop or the approach that Lexis Nexis has recently taken open-sourcing their Big Data analysis tools, It is not difficult to fathom a time when Open source languages like R running on Eclipse would lead the Analytics space. The more interesting area is the confluence of Big Data/High performance Computing and Open-Source Analytics and how each could potentially leverage of each other to build a truly robust ecosystem a la the LAMP stack when it took Enterprise Architects by a storm.

BethSchultz
User Rank
Blogger
Re: The Rise of Open Source Analytics
BethSchultz   9/22/2011 1:43:15 PM
NO RATINGS
Robert, thanks for sharing. I think you make an interesting point about old school vs. new school requirements and the birth of a new generation of R-savvy statisticians. Of course we see this sort of new knowledge base all the time regarding social media, and so, I suppose, this similar sort of young worker/new technology in the analytics discipline should come as no surprise. Worth exploring ...

 




ryerex
User Rank
Prospector
The Rise of Open Source Analytics
ryerex   9/22/2011 1:25:54 PM
NO RATINGS

Having hands on experience with SAS for more than 25 years (several SUGI presentations/publications), including using the unmatched capabilities of Enterprise Miner, I can attest to what it means to have access to a really solid analytics tool to count on.

When open source analytical tools became available I took a look at them and was not impressed with the capabilities, correctness, or stability of these applications. However within the past couple of years this has changed dramatically. Tools like R have developed an extensive following and many of the packages for R are fairly cutting edge when it comes to algorithms, etc. Support is, of course, only available in an ad-hoc fashion.

Most recently, tools such as Revolution R have been made available that leverage the R platform and add capability, and support (for a fee).

I have successfully used R/ Rattle, running in Eclipse (an open source IDE which is now used by more developers than any commercial IDE). This kind of setup is becoming the de-facto environment for many “Data Scientists” who need access to a standard toolset but also need to interface to a range of data sources, write simple integration code in languages such as Python, and create deployable (at no additional cost) tools.

The most obvious benefit to using the open source tools is the low initial cost. I have consulted with a number of start-ups where high-end analytics is part of their product or service, or required in the development phase (for example automated cyber-crime detection using data mining/neural net techniques). Ideally such firms would be able to use SAS Enterprise Miner from the very start, but they simply cannot afford the cost. While they may understand the shortcomings of using an open source platform, and that they may “pay for it later” when the tools they are using hit their maximum capacity, simple survival dictates minimal investment now, and if they get to the point where they have a viable product/service, etc, they will deal with it then.

Students are becoming increasingly exposed to tools such as R in the classroom and then continue to use it once employed. Back in the 80s when I got my Ph.D., SAS was the only real tool available to students. It was provided to the University at a much reduced cost and this created a generation of SAS literate statisticians. We are now seeing a new generation of R (and other open source tool) savvy statisticians.

Taking a look at job descriptions for open positions, one sees an interesting dichotomy. Start-ups and smaller high-tech firms require experience in R/Rattle, etc, as well as SQL and often a language or two such as Ruby/Python. “Old School” firms are more likely to require SAS experience.

The currently available open-source tools do work, and one can build robust tools with them. I recently created a tool for a client that calculates real-time expected success rates for an online cybercrime detection system. It accesses hundreds of thousands of records (stored on a Google cloud server). The heavy lifting analysis work is done in R and the integration with the web portal and data are in Python, and SQL with local storage in MySQL. The most interesting part being that I needed to do some Markov Chain modeling, which I am no expert in, and was able to find a plug-in (PEPA) for Eclipse. I must admit that I spent a good deal of time just getting the environment up and running. Version compatibility in open source tools is a critical problem, especially when mixing environments. In the end I had a stable tool that can be used (by a knowledgeable person) to generate estimates. This could all have been done, possibly more easily, in SAS but would have required a number of add-on interfaces for the DB and the Web portal, and in the end the tool could not be deployed without additional licensing or additional seats of SAS.

Robert Yerex

Salient Analytics 

Shawn Hessinger
User Rank
Blogger
RE: Great debate with unique points
Shawn Hessinger   9/21/2011 3:41:00 PM
NO RATINGS
Beth,

A great debate with very unique points on both sides. The argument for commercial analytics with strong customer support and specialized features is the one most people make about commercial solutions. The argument for open source as more customized, however, is not one you see everyday. I think what's missing here is discussion of the benefits of a customized commercial solution, customized cloud solution (if such a thing exists) and what happens when analytics is not a core function and you do not have the expertise to customize solutions yourself.  

More Blogs from Beth Schultz
We honor eight years of YouTube with a random collection of videos on data.
Randy Bartlett, author and longtime business analytics leader, shares his views on the analytically smart organizational structure.
UPS should be as famous for its longtime commitment to business analytics and its analytics culture as it is for package delivery.
Patient engagement advocate Dave deBronkart encourages everyone to take ownership of their own and their family's medical data.
Randy Bartlett, author and seasoned analytics professional, will join us this Friday, May 17, at 2:00 p.m. ET for a radio show on ensuring organizational change for the good of business analytics.
Quick Poll
AllAnalytics Videos
Visual Analytics, Instant ...
Analytics results delivered in visual form are ...

2:06

1 comment
Big Data, Fast Infrastructure
Big data calls for a high-performance analytics ...

3:35

6 comments
Red Hot Analytics
Jayson Tipp, Redbox VP of Analytics and CRM, ...

3:51

3 comments
Hotelier Checks Out Analytics
InterContinental Hotels Group has woven analytics ...

06:55

11 comments
Cartoon
Most recent post: @Broadway   So it worked ?
CARTOON ARCHIVE
AllAnalytics Video Blogs
Marketing Your Analytics
Humana's Elizabeth Barth-Thacker tells us how her ...

2:21

0 comments
Amazon & Analytics
Amazon has expanded into the world of business ...

3:04

1 comment
The High Price of a Big Banana
There are no analytics to explain the volatility of ...

2:53

8 comments
Fraud Failure
Insurance companies have no excuse not to be using ...

2:26

2 comments
Teaching Users to 'Fish'
Rajeev Kaul, SVP of pricing at OfficeMax, explains ...

2:04

2 comments
Stuck on the Train
Cutting the number of cars on my commuter train was ...

2:22

11 comments
Strength in Numbers
Hear, hear! to the folks who count themselves among ...

1:32

1 comment
Fool's Gold
You don't always find what you want when you data-mine.

1:50

3 comments
Ford Revs Up With Big-Data
In an All Analytics interview, Mike Cavaretta, ...

2:44

2 comments
Get On With It!
Analytics professionals and SAS executives share ...

2:32

1 comment
Power to the Visualization
Analytics professionals who attended SAS's recent ...

2:03

1 comment
Mental Model Lifts Boeing
At Boeing, effective decision making comes down to ...

2:01

2 comments
What Users Want Next
Attendees at the recent SAS Executive Briefing in ...

2:31

4 comments
The Power to Discover
SAS CEO Jim Goodnight talks about new realities ...

3:36

1 comment
Breaking Down Big-Data ...
SAS's Jim Davis talks about how high-performance ...

3:06

0 comments
Digital Audio
Latest Archived Broadcast
Companies today must be analytically agile to compete based on their data and analytics.
Live Video
On-demand Video with Chat
Analytics-fueled data visualizations can be a real game-changer when you're exploring the data and assessing results.
Readerboards
Have a question or topic but don't want to write a blog? Post it on our readerboards and get feedback from the community!
MORE READERBOARDS
Upcoming Events
for the Business and IT Communities
Executive forums with additional hands-on learning opportunities offered around the world
Each ideal for practitioners, Business leaders & senior executives
NYC, Boston, Philadelphia, Chicago, Minneapolis/St. Paul, Rockville, San Francisco, Los Angeles/Irvine, Dallas, Atlanta
AllAnalytics on Twitter
AllAnalytics Twitter Feed
Like Us on Facebook
Point/CounterpointBlog
LEADERS FROM THE BUSINESS AND IT COMMUNITIES DUEL OVER CRITICAL TECHNOLOGY ISSUES

The Current Discussion

Visual Analytics: Who Carries the Onus?
The Issue: Data visualization is an up-and-coming technology for businesses that want to deliver analytical results in a visual way, enabling analysts the ability to spot patterns more easily and business users to absorb the insight at a glance and better understand what questions to ask of the data. But does it make more sense to train everybody to handle the visualization mandate or bring on visualization expertise? Our experts are divided on the question.
The Speakers: Hyoun Park, Principal Analyst, Nucleus Research; Jonathan Schwabish, US Economist & Data Visualizer
MORE POINT/COUNTERPOINT BLOGS
About Us  |  Contact Us  |  Help  |  Register  |  Twitter  |  Facebook  |  RSS


Videos
Visual Analytics, Instant Insight

5|16|13   |   2:06   |   (1) comment


Dynamic data visualizations let analysts and business users interact with the data, changing variables or drilling down into data points, and see results in a flash. Advance your use of data visualization with tools that support features like auto-charting, explanatory pop-ups, and mobile sharing.
Videos
Big Data, Fast Infrastructure

2|14|12   |   3:35   |   (6) comments


No doubt your enterprise is amassing loads of data for fact-based decision-making. Hand in hand with all that data comes big computational requirements. Can traditional IT infrastructure handle the increasing number and complexity of your analytical work? Probably not, which is why you need a backend rethink. Big data calls for a high-performance analytics infrastructure, as Fern Halper, a partner at the IT consulting and research firm, Hurwitz & Associates, discusses here.
Videos
Red Hot Analytics

1|10|12   |   3:51   |   (3) comments


Redbox's bright-red DVD kiosks are all but ubiquitous these days, located in more than 28,000 spots across the country. Jayson Tipp, Redbox VP of Analytics and CRM, provides an insider's look at how the company has accomplished its phenomenal nine-year growth.
Videos
Hotelier Checks In With Analytics

12|14|11   |   06:55   |   (11) comments


InterContinental Hotels Group (IHG), a seven-brand global hotelier, has woven analytics into the fabric of its operations. David Schmitt, director of performance strategy and planning, shares IHG's analytics story and his lessons learned.
Beth Schultz
Marketing Your Analytics

5|14|13   |   2:21   |   (0) comments


Elizabeth Barth-Thacker, a BI and informatics technology manager at Humana, tells us how her team is creating data transparency and building engagement with the business – with the help of an internal collaboration portal called Humanalytics.
Pierre DeBois
Amazon & Analytics

5|7|13   |   3:04   |   (1) comment


With Redshift, Amazon has expanded into the world of business intelligence. Could web analytic solutions for e-commerce be next?
Noreen Seebacher
The High Price of a Big Banana

5|6|13   |   2:53   |   (8) comments


There are no analytics to explain the volatility of banana prices in New York City.
Beth Schultz
Fraud Failure

5|3|13   |   2:26   |   (2) comments


Insurance companies have no excuse not to be using advanced analytics in their fight against fraud.
Beth Schultz
Teaching Users to 'Fish'

5|1|13   |   2:04   |   (2) comments


Speaking at SAS Global Forum Executive Conference, Rajeev Kaul, SVP of pricing at OfficeMax, uses a Chinese proverb to explain one of the reasons he's deploying SAS Visual Analytics.
Noreen Seebacher
Stuck on the Train

4|24|13   |   2:22   |   (11) comments


Cutting the number of cars on my commuter train was an analytics fail, simple as that.
Beth Schultz
Strength in Numbers

4|22|13   |   1:32   |   (1) comment


Hear, hear! to the folks who count themselves among analytics professionals and who will be gathering next week at SAS Global Forum.
Noreen Seebacher
Fool's Gold

4|15|13   |   1:50   |   (3) comments


You don't always find what you want when you data-mine.
Beth Schultz
Ford Revs Up With Big-Data

4|12|13   |   2:44   |   (2) comments


In an All Analytics interview, Mike Cavaretta, technical leader, predictive analytics at Ford Research & Advanced Engineering, shares how big-data is fueling vehicle decisions.
Beth Schultz
Get On With It!

4|11|13   |   2:32   |   (1) comment


Analytics professionals and SAS executives share how organizations can get on with their work so much faster when working in a high-performance and visual analytics environment.
Beth Schultz
Power to the Visualization

4|11|13   |   2:03   |   (1) comment


Analytics professionals who attended SAS's recent Executive Briefing in New York share how they think visual analytics might help their organizations get better value from data.
Beth Schultz
Mental Model Gives Boeing Lift

4|9|13   |   2:01   |   (2) comments


At Boeing, effective decision making comes down to this simple formula: QxA=E, as executive Jerry Allyne explained at the recent INFORMS analytics conference.
Beth Schultz
What Users Want Next

4|8|13   |   2:31   |   (4) comments


Whether working in major league sports, financial services, or healthcare, analytics, and data, professionals are checking out how visual analytics and high-performance technologies can help them optimize their environments, shrink their cycle times, and improve decision making, as attendees at the recent SAS Executive Briefing in New York share with us.
Beth Schultz
The Power to Discover

4|4|13   |   3:36   |   (1) comment


SAS CEO Jim Goodnight speaks with us at a recent SAS Executive Briefing about getting a feel for what's in your big-data and other new realities powered by advanced analytics.
Beth Schultz
Breaking Down Big-Data Barriers

4|4|13   |   3:06   |   (0) comments


Jim Davis, SVP and CMO at SAS, talks with us at a recent SAS Executive Briefing about how high-performance analytics and visual analytics take away the concerns over big-data and let companies get down to business with their data.