REGISTER   |   LOGIN   |   HELP
Home  |  Blogs  |  Message Boards  |  Webinars  |  Resources |  By Channel
Marshall Sponder

What to Do With Unstructured Data

NO RATINGS
View Comments: Newest First | Oldest First | Threaded View
Page 1 / 3   >   >>
BritInBigD
User Rank
Prospector
Graphic
BritInBigD   1/21/2013 8:45:31 AM
NO RATINGS
Hi Marshall,

I was curious to know the source of the graphic that accompanies your post (and specifically the indicative growth rates shown for the different data categories). Is that data from IDC? 

Maryam@Impact
User Rank
Blogger
Re: cleaning with integrity
Maryam@Impact   12/31/2012 6:56:50 PM
NO RATINGS
@Hospice I agree, but I don't know of any standard protocools because their is so much varaibaility with unstructured data depending on industry etc. Still a challenge to get standards.

kicheko
User Rank
Blogger
Re: Yikes - some misconceptions here that need to be addressed
kicheko   12/31/2012 8:50:05 AM
NO RATINGS
webmetricsguru, - One big challenge i see is analytics of sentiment oriented data even though the area has grown a great deal in the course of this year -- new apps and all. This kind of data at least for now may still need to be reviewed closer because its difficult to automate it in one given pattern without locking out new sentiments that are outside of that original set. However as intelligent systems learn this data, it will cut down what we have to look at even in sentiment analytics.

webmetricsguru
User Rank
Blogger
Re: Yikes - some misconceptions here that need to be addressed
webmetricsguru   12/30/2012 11:35:11 PM
NO RATINGS
If I understand you correctly, that's the bane of the PR / Marcom industry - that you can actually look at bunch of verbatim (maybe that's ok for 10-30) but what happens when you have thousands or more?

I think a discussion on just what cleaning data is and how to to best do it would be good for AllAnalytics.com personally.  I'd like to see what we come up with, and I bet a lot of others would too.

Hospice_Houngbo
User Rank
Prospector
Re: Yikes - some misconceptions here that need to be addressed
Hospice_Houngbo   12/30/2012 11:21:17 PM
NO RATINGS
"If we can find the essential information or pattern, we might not need to look at most of it"

I see. I suppose that those hand-written patterns can just be domain specific and will be difficult to generalize. I agree that extracting the most useful patterns might be enough in most cases, as it difficult to think of all possible patterns. One of drawback of such model is that human patterns are often low-recall, even if precision is high. 

Hospice_Houngbo
User Rank
Prospector
Re: Step 1
Hospice_Houngbo   12/30/2012 11:05:50 PM
NO RATINGS
It is true that some of the points you mentioned are debatable - like "Distribute the data in the cloud". But they are valid points to take into account when dealing with unstructured data. To the question how to clean unstructured data? I think that it depends on the shape and the model that has been defined.

webmetricsguru
User Rank
Blogger
Re: Yikes - some misconceptions here that need to be addressed
webmetricsguru   12/30/2012 11:05:17 PM
NO RATINGS
What I meant is that currently, people usually end up needing to look at the data to understand it (because it is un structured information) and attempts to use software to understand it, in my opinion, won't work, at least not today. What you can do, I think, and maybe our friends here can confirm or argue this, is cut down on what we have to look at. If we can find the essential information or pattern, we might not need to look at most of it - and hopefully the software created can help surface that information, and maybe that's the best we can hope for (big data hype or not). At any rate, this is an interesting discussion and I don't have all the answers - but I am wondering just what they are.

Hospice_Houngbo
User Rank
Prospector
Re: Yikes - some misconceptions here that need to be addressed
Hospice_Houngbo   12/30/2012 10:52:47 PM
NO RATINGS
@marshall,

"I define it as something a human needs to look at to fully process"

I still don't get it. Do you mean that there is the need for human intervention to figure out whether the data is unstructured or not? Won't that be time consuming and practically impossible for human to go through all the instances of the data due to it size? Maybe it is not what you mean?

Hospice_Houngbo
User Rank
Prospector
Re: Sifting through the sands of the unstructured
Hospice_Houngbo   12/30/2012 10:44:49 PM
NO RATINGS
@Jenn,

"So I guess, we need to find better search and organizing processes."

I think that is what the cleaning and storage processes are all about. Some data can fit in many categories depending on the search parameters. This may complicate the storage process as the same data will be duplicated - sometimes unnecessary.

Hospice_Houngbo
User Rank
Prospector
Re: cleaning with integrity
Hospice_Houngbo   12/30/2012 9:09:42 PM
NO RATINGS
@Maryam,

As the cleaning stage of unstructured data is becoming difficult with information explosion, I wonder if we can come up with an efficient "cleaning prototol" that can be applied to every scenario. 

Page 1 / 3   >   >>
More Blogs from Marshall Sponder
When the data we don't know is as important as the data we do, our analytics platform are all but guaranteed to fail us.
Segmentation, multichannel integration, and intelligent dashboard reporting are vital capabilities, yet many business analytics solutions fall short.
Social media is playing an important role in politics, but determining a victor based on what's happening out there isn't so easy.
Experts gathered at a conference to share the latest in this niche analytics technology.
Quick Poll
AllAnalytics Videos
Visual Analytics, Instant ...
Analytics results delivered in visual form are ...

2:06

1 comment
Big Data, Fast Infrastructure
Big data calls for a high-performance analytics ...

3:35

6 comments
Red Hot Analytics
Jayson Tipp, Redbox VP of Analytics and CRM, ...

3:51

3 comments
Hotelier Checks Out Analytics
InterContinental Hotels Group has woven analytics ...

06:55

11 comments
AllAnalytics Video Blogs
Marketing Your Analytics
Humana's Elizabeth Barth-Thacker tells us how her ...

2:21

0 comments
Amazon & Analytics
Amazon has expanded into the world of business ...

3:04

1 comment
The High Price of a Big Banana
There are no analytics to explain the volatility of ...

2:53

8 comments
Fraud Failure
Insurance companies have no excuse not to be using ...

2:26

2 comments
Teaching Users to 'Fish'
Rajeev Kaul, SVP of pricing at OfficeMax, explains ...

2:04

2 comments
Stuck on the Train
Cutting the number of cars on my commuter train was ...

2:22

11 comments
Strength in Numbers
Hear, hear! to the folks who count themselves among ...

1:32

1 comment
Fool's Gold
You don't always find what you want when you data-mine.

1:50

3 comments
Ford Revs Up With Big-Data
In an All Analytics interview, Mike Cavaretta, ...

2:44

2 comments
Get On With It!
Analytics professionals and SAS executives share ...

2:32

1 comment
Power to the Visualization
Analytics professionals who attended SAS's recent ...

2:03

1 comment
Mental Model Lifts Boeing
At Boeing, effective decision making comes down to ...

2:01

2 comments
What Users Want Next
Attendees at the recent SAS Executive Briefing in ...

2:31

4 comments
The Power to Discover
SAS CEO Jim Goodnight talks about new realities ...

3:36

1 comment
Breaking Down Big-Data ...
SAS's Jim Davis talks about how high-performance ...

3:06

0 comments
Digital Audio
Latest Archived Broadcast
Companies today must be analytically agile to compete based on their data and analytics.
Live Video
On-demand Video with Chat
Analytics-fueled data visualizations can be a real game-changer when you're exploring the data and assessing results.
Readerboards
Have a question or topic but don't want to write a blog? Post it on our readerboards and get feedback from the community!
5/23/2013 8:57:20 AM
Noreen Seebacher on Ain't wasting time no more?
5/22/2013 8:55:01 PM
Noreen Seebacher on Adults to students
MORE READERBOARDS
Upcoming Events
for the Business and IT Communities
Executive forums with additional hands-on learning opportunities offered around the world
Each ideal for practitioners, Business leaders & senior executives
NYC, Boston, Philadelphia, Chicago, Minneapolis/St. Paul, Rockville, San Francisco, Los Angeles/Irvine, Dallas, Atlanta
AllAnalytics on Twitter
AllAnalytics Twitter Feed
Like Us on Facebook
Point/CounterpointBlog
LEADERS FROM THE BUSINESS AND IT COMMUNITIES DUEL OVER CRITICAL TECHNOLOGY ISSUES

The Current Discussion

Visual Analytics: Who Carries the Onus?
The Issue: Data visualization is an up-and-coming technology for businesses that want to deliver analytical results in a visual way, enabling analysts the ability to spot patterns more easily and business users to absorb the insight at a glance and better understand what questions to ask of the data. But does it make more sense to train everybody to handle the visualization mandate or bring on visualization expertise? Our experts are divided on the question.
The Speakers: Hyoun Park, Principal Analyst, Nucleus Research; Jonathan Schwabish, US Economist & Data Visualizer
MORE POINT/COUNTERPOINT BLOGS
About Us  |  Contact Us  |  Help  |  Register  |  Twitter  |  Facebook  |  RSS


Videos
Visual Analytics, Instant Insight

5|16|13   |   2:06   |   (1) comment


Dynamic data visualizations let analysts and business users interact with the data, changing variables or drilling down into data points, and see results in a flash. Advance your use of data visualization with tools that support features like auto-charting, explanatory pop-ups, and mobile sharing.
Videos
Big Data, Fast Infrastructure

2|14|12   |   3:35   |   (6) comments


No doubt your enterprise is amassing loads of data for fact-based decision-making. Hand in hand with all that data comes big computational requirements. Can traditional IT infrastructure handle the increasing number and complexity of your analytical work? Probably not, which is why you need a backend rethink. Big data calls for a high-performance analytics infrastructure, as Fern Halper, a partner at the IT consulting and research firm, Hurwitz & Associates, discusses here.
Videos
Red Hot Analytics

1|10|12   |   3:51   |   (3) comments


Redbox's bright-red DVD kiosks are all but ubiquitous these days, located in more than 28,000 spots across the country. Jayson Tipp, Redbox VP of Analytics and CRM, provides an insider's look at how the company has accomplished its phenomenal nine-year growth.
Videos
Hotelier Checks In With Analytics

12|14|11   |   06:55   |   (11) comments


InterContinental Hotels Group (IHG), a seven-brand global hotelier, has woven analytics into the fabric of its operations. David Schmitt, director of performance strategy and planning, shares IHG's analytics story and his lessons learned.
Beth Schultz
Marketing Your Analytics

5|14|13   |   2:21   |   (0) comments


Elizabeth Barth-Thacker, a BI and informatics technology manager at Humana, tells us how her team is creating data transparency and building engagement with the business – with the help of an internal collaboration portal called Humanalytics.
Pierre DeBois
Amazon & Analytics

5|7|13   |   3:04   |   (1) comment


With Redshift, Amazon has expanded into the world of business intelligence. Could web analytic solutions for e-commerce be next?
Noreen Seebacher
The High Price of a Big Banana

5|6|13   |   2:53   |   (8) comments


There are no analytics to explain the volatility of banana prices in New York City.
Beth Schultz
Fraud Failure

5|3|13   |   2:26   |   (2) comments


Insurance companies have no excuse not to be using advanced analytics in their fight against fraud.
Beth Schultz
Teaching Users to 'Fish'

5|1|13   |   2:04   |   (2) comments


Speaking at SAS Global Forum Executive Conference, Rajeev Kaul, SVP of pricing at OfficeMax, uses a Chinese proverb to explain one of the reasons he's deploying SAS Visual Analytics.
Noreen Seebacher
Stuck on the Train

4|24|13   |   2:22   |   (11) comments


Cutting the number of cars on my commuter train was an analytics fail, simple as that.
Beth Schultz
Strength in Numbers

4|22|13   |   1:32   |   (1) comment


Hear, hear! to the folks who count themselves among analytics professionals and who will be gathering next week at SAS Global Forum.
Noreen Seebacher
Fool's Gold

4|15|13   |   1:50   |   (3) comments


You don't always find what you want when you data-mine.
Beth Schultz
Ford Revs Up With Big-Data

4|12|13   |   2:44   |   (2) comments


In an All Analytics interview, Mike Cavaretta, technical leader, predictive analytics at Ford Research & Advanced Engineering, shares how big-data is fueling vehicle decisions.
Beth Schultz
Get On With It!

4|11|13   |   2:32   |   (1) comment


Analytics professionals and SAS executives share how organizations can get on with their work so much faster when working in a high-performance and visual analytics environment.
Beth Schultz
Power to the Visualization

4|11|13   |   2:03   |   (1) comment


Analytics professionals who attended SAS's recent Executive Briefing in New York share how they think visual analytics might help their organizations get better value from data.
Beth Schultz
Mental Model Gives Boeing Lift

4|9|13   |   2:01   |   (2) comments


At Boeing, effective decision making comes down to this simple formula: QxA=E, as executive Jerry Allyne explained at the recent INFORMS analytics conference.
Beth Schultz
What Users Want Next

4|8|13   |   2:31   |   (4) comments


Whether working in major league sports, financial services, or healthcare, analytics, and data, professionals are checking out how visual analytics and high-performance technologies can help them optimize their environments, shrink their cycle times, and improve decision making, as attendees at the recent SAS Executive Briefing in New York share with us.
Beth Schultz
The Power to Discover

4|4|13   |   3:36   |   (1) comment


SAS CEO Jim Goodnight speaks with us at a recent SAS Executive Briefing about getting a feel for what's in your big-data and other new realities powered by advanced analytics.
Beth Schultz
Breaking Down Big-Data Barriers

4|4|13   |   3:06   |   (0) comments


Jim Davis, SVP and CMO at SAS, talks with us at a recent SAS Executive Briefing about how high-performance analytics and visual analytics take away the concerns over big-data and let companies get down to business with their data.