REGISTER   |   LOGIN   |   HELP
Home  |  Blogs  |  Message Boards  |  Webinars  |  Resources |  By Channel
Tom Reamy

How Text Analytics & Text Mining Can Be Mutually Enriching

NO RATINGS
View Comments: Newest First | Oldest First | Threaded View
Page 1 / 3   >   >>
BethSchultz
User Rank
Blogger
Re: Other text analytics subsets coming?
BethSchultz   4/13/2012 5:34:44 PM
NO RATINGS
Interesting idea, Tom! 

BethSchultz
User Rank
Blogger
Re: TM and TA clarification
BethSchultz   4/13/2012 5:32:20 PM
NO RATINGS
Tom, I like the hybrid content management/text analytics model you propose. I would think that'd be an approach that would be appealing to content authors and editors, too -- not too burdensome.

 

 

treamy
User Rank
Master Analyst
Re: TM and TA clarification
treamy   4/13/2012 4:06:32 PM
NO RATINGS

Beth, I always get nervous when software vendors talk about everything being automatic.  There is always a human touch – it just depends where and how.  Sometimes it is afterwards like your example, and sometimes it is prior – setting up the categorization rules or statistical models or a combination of the two.  One model I've pushed for years in terms of finding information is the hybrid model of a content management software with integrated text analytics capabilities helping authors and editors to add structure (metadata) to documents as they are published – using the strengths of both human and machine.

 

treamy
User Rank
Master Analyst
Re: Other text analytics subsets coming?
treamy   4/13/2012 3:52:24 PM
NO RATINGS

Beth, absolutely agree on the need for an interdisciplinary approach – in most things, but certainly for anything to do with getting value from text.  Hmm, we now have knowledge management departments in organizations, what about text analytics departments?  There would be a need for interdisciplinary cooperation between the TA department and other departments like IT, BI, CI, business units, and even sales.  And also, the TA department should be interdisciplinary – NLP experts, statisticians, linguists, programmers or script writers, librarian-taxonomists, cognitive modelers, predictive analysis experts, and business analysts along-side.  (Can you tell I was just reading about Bell Labs where they had theorists and practitioners sitting next to each other?)

 

And the model could scale down by having some people with multiple skills so even smaller organizations could have their own TA department. 

 

Right now, there are new departments / titles growing out of social media (Customer Experience analysts, etc.), so who knows maybe a TA department with connections throughout the organization might become a reality?

 

Jeff
User Rank
Data Doctor
Re: Rinse and Repeat
Jeff   4/13/2012 3:00:10 PM
NO RATINGS
I guess the kind of app I would like to see is something that can take the Mining results from an analysis of business environment, or a country political environment and then feed that into an internal program to generate a list of possible outcomes internal to a particular business.  I know this is not very acute, but as an example.  What if you pushed the reality of an earthquake in Japan six months ago and the program came back with, "hey go buy some hard drives now, you won't be able to get them soon."

 

Probably asking too much.

 

 

BethSchultz
User Rank
Blogger
Re: Other text analytics subsets coming?
BethSchultz   4/12/2012 4:58:06 PM
NO RATINGS
Tom, thanks so much for this explanation. The IT as platform analogy is perfect, and the TA/TM skills distinction an important one to note. I could see how more purposeful use could come from cross-functional teams and lots more knowledge sharing within companies on their TA, TM, BI, IT and business goals!

BethSchultz
User Rank
Blogger
Re: TM and TA clarification
BethSchultz   4/12/2012 4:46:31 PM
NO RATINGS
Tom, I recall talking with the director of information and semantics management for The Tribune Co. not so long about the need for the human touch, which I think applies to both sentiment analysis and ontology as you've described. He cited a few examples where the machine learning just isn't adequate. One of his examples:


While the Trib's text analytics platform might watch out for the term "Barack Obama" and the word "president" in the same sentence, what's it do when a new nickname -- say, President Numero Uno -- surfaces? "We might not find 'President Numero Uno' appear in a news article," says DeWeese, referring to the Trib's chief business case for text analytics, "but boy, when people start commenting, that's what they're calling him."

If you're trying to deliver all relevant content or collect sentiments about President Obama, any comments referencing that latest nickname would surely be among the most desirable but likely missed.

"You have to know or somehow be able to see this -- it has to get on your radar, and that's where the initial human analysis comes into play. Humans create that bridge to the machine."


treamy
User Rank
Master Analyst
Re: Other text analytics subsets coming?
treamy   4/12/2012 2:12:32 PM
NO RATINGS

Certainly Homeland Security has lots of people doing text analysis – they bought copies of every single vendor of TA/TM software out there – which is why companies finally stopped citing them as a customer.  And yes, all those other fields have some TA/TM capabilities that they are using – but the question is how are they using them?  Sentiment analysis became its own "field" for a lot of reasons (social media hype and promise of more sales, etc.) but also because it uses a specific set of words (the sentiment dictionary).  The methods used in sentiment analysis are really a combination of TM and TA when it's done right, so it is a difficult but interesting task to figure out which if any of these specific application areas will take off and become its own field.

 

I'm giving a talk on future directions in social media next week at the Social Media Analytics Summit in San Francisco in which I talk about expertise analysis and behavior prediction as two possibilities but I'm not sure is they have the potential for becoming their own field like sentiment analysis – combination of scale of interest along with specific techniques and/or terminology.  Have to give that one some more thought. 

 

It seems to me that the examples you cite are more applications areas rather than different fields (yes, it's a somewhat arbitrary line).

treamy
User Rank
Master Analyst
Re: TM and TA clarification
treamy   4/12/2012 1:52:35 PM
NO RATINGS

One immediate example is sentiment analysis which more often than not is done with mostly text mining approaches (statistical) but with a large dictionary of sentiment terms.  The vendors like to focus on the strength of their software out of the box and downplay the need for customization.  But in this case, customization is basically adding categorization to look at the context around various sentiment statements.  This context is what is needed to distinguish things like sarcasm, but also even to distinguish things like conditionals ("I'd like "this feature" if it had better design") which can come out as positive "like this feature". 

 

As far as ontology development, the situation is similar in that you are trying to extract "facts" or triples which can also be ambiguous in a number of ways.  First, in terms of identifying the entities (classic example is Ford the car, the person, the company, and a way to cross a stream).  Second,  in identifying the relationships between those entities where you have some of the same kinds of issues as with sentiment – conditionals, etc.

 

As an aside, I'd like to see ontology development have a text mining piece in it that would function like the taxonomy example I used earlier.

treamy
User Rank
Master Analyst
Re: Other text analytics subsets coming?
treamy   4/12/2012 1:26:19 PM
NO RATINGS

Beth, on silo effect – I don't see it so much as a traditional silo, but rather that TA/TM is a platform technology and most organizations are organized around specific applications.  So while, for example, Business Intelligence depends on TA/TM, they don't consider it as fundamental to what they do and the unfortunate effect of that is to downplay its importance and be satisfied with simple, one dimensional approaches. 

 

In some ways it is like IT as a platform for building applications, but in this case the platform involves the actual language of the business functions that want to have the application.  This sets up an unusual situation, which is that business people would not assume they could develop an IT application because it involves a specialized language, but too often think that because they know the business language, that they can develop a TA application.  However, this overlooks the fact that having an expertise in a given subject doesn't mean that the person has the expertise to categorize/organize/analyze the contents of that expertise.

 

 As far as the TA/TM split that does seem to be somewhat of a silo effect – but in this case a silo based on different skills such as mathematics and language or knowledge organization (I'd say librarian but that title carries such negative connotations that I hesitate to use it these days. The trick is to figure out how to enable communication between silos without losing the benefits of a silo (they do exist for a reason).

 

 

Page 1 / 3   >   >>
More Blogs from Tom Reamy
Presentation proposals for the fall Text Analytics World, in Boston Oct. 3 and 4, are due Friday, June 8.
Expertise analysis can top off sentiment analysis and social media analytics.
This sentiment analysis offshoot builds a characterization of the expertise level of documents, authors, and communities.
Here's a three-phase methodology for helping to ensure you buy the right text analytics software for your company needs.
AllAnalytics House Ad
AllAnalytics House Ad
AllAnalytics Video Blogs
Nate Silver Unplugged
Celebrity statistician Nate Silver at the SAS ...

1:37

2 comments
Marketing Your Analytics
Humana's Elizabeth Barth-Thacker tells us how her ...

2:21

0 comments
Amazon & Analytics
Amazon has expanded into the world of business ...

3:04

2 comments
The High Price of a Big Banana
There are no analytics to explain the volatility of ...

2:53

8 comments
Fraud Failure
Insurance companies have no excuse not to be using ...

2:26

2 comments
Teaching Users to 'Fish'
Rajeev Kaul, SVP of pricing at OfficeMax, explains ...

2:04

2 comments
Stuck on the Train
Cutting the number of cars on my commuter train was ...

2:22

11 comments
Strength in Numbers
Hear, hear! to the folks who count themselves among ...

1:32

1 comment
Fool's Gold
You don't always find what you want when you data-mine.

1:50

3 comments
Ford Revs Up With Big-Data
In an All Analytics interview, Mike Cavaretta, ...

2:44

2 comments
Get On With It!
Analytics professionals and SAS executives share ...

2:32

1 comment
Power to the Visualization
Analytics professionals who attended SAS's recent ...

2:03

1 comment
Mental Model Lifts Boeing
At Boeing, effective decision making comes down to ...

2:01

2 comments
What Users Want Next
Attendees at the recent SAS Executive Briefing in ...

2:31

4 comments
The Power to Discover
SAS CEO Jim Goodnight talks about new realities ...

3:36

1 comment
Quick Poll
AllAnalytics Videos
Visual Analytics, Instant ...
Analytics results delivered in visual form are ...

2:06

1 comment
Big Data, Fast Infrastructure
Big data calls for a high-performance analytics ...

3:35

6 comments
Red Hot Analytics
Jayson Tipp, Redbox VP of Analytics and CRM, ...

3:51

3 comments
Hotelier Checks Out Analytics
InterContinental Hotels Group has woven analytics ...

06:55

11 comments
Digital Audio
Latest Archived Broadcast
Companies today must be analytically agile to compete based on their data and analytics.
Live Video
On-demand Video with Chat
Analytics-fueled data visualizations can be a real game-changer when you're exploring the data and assessing results.
Readerboards
Have a question or topic but don't want to write a blog? Post it on our readerboards and get feedback from the community!
5/23/2013 8:57:20 AM
Noreen Seebacher on Ain't wasting time no more?
5/22/2013 8:55:01 PM
Noreen Seebacher on Adults to students
MORE READERBOARDS
Upcoming Events
for the Business and IT Communities
Executive forums with additional hands-on learning opportunities offered around the world
Each ideal for practitioners, Business leaders & senior executives
NYC, Boston, Philadelphia, Chicago, Minneapolis/St. Paul, Rockville, San Francisco, Los Angeles/Irvine, Dallas, Atlanta
AllAnalytics on Twitter
AllAnalytics Twitter Feed
Like Us on Facebook
Point/CounterpointBlog
LEADERS FROM THE BUSINESS AND IT COMMUNITIES DUEL OVER CRITICAL TECHNOLOGY ISSUES

The Current Discussion

Visual Analytics: Who Carries the Onus?
The Issue: Data visualization is an up-and-coming technology for businesses that want to deliver analytical results in a visual way, enabling analysts the ability to spot patterns more easily and business users to absorb the insight at a glance and better understand what questions to ask of the data. But does it make more sense to train everybody to handle the visualization mandate or bring on visualization expertise? Our experts are divided on the question.
The Speakers: Hyoun Park, Principal Analyst, Nucleus Research; Jonathan Schwabish, US Economist & Data Visualizer
MORE POINT/COUNTERPOINT BLOGS
About Us  |  Contact Us  |  Help  |  Register  |  Twitter  |  Facebook  |  RSS


Noreen Seebacher
Nate Silver Unplugged

5|23|13   |   1:37   |   (2) comments


Celebrity statistician Nate Silver shares some thoughts after delivering a keynote address at the SAS Government Leadership Summit in Washington, D.C.
Beth Schultz
Marketing Your Analytics

5|14|13   |   2:21   |   (0) comments


Elizabeth Barth-Thacker, a BI and informatics technology manager at Humana, tells us how her team is creating data transparency and building engagement with the business – with the help of an internal collaboration portal called Humanalytics.
Pierre DeBois
Amazon & Analytics

5|7|13   |   3:04   |   (2) comments


With Redshift, Amazon has expanded into the world of business intelligence. Could web analytic solutions for e-commerce be next?
Noreen Seebacher
The High Price of a Big Banana

5|6|13   |   2:53   |   (8) comments


There are no analytics to explain the volatility of banana prices in New York City.
Beth Schultz
Fraud Failure

5|3|13   |   2:26   |   (2) comments


Insurance companies have no excuse not to be using advanced analytics in their fight against fraud.
Beth Schultz
Teaching Users to 'Fish'

5|1|13   |   2:04   |   (2) comments


Speaking at SAS Global Forum Executive Conference, Rajeev Kaul, SVP of pricing at OfficeMax, uses a Chinese proverb to explain one of the reasons he's deploying SAS Visual Analytics.
Noreen Seebacher
Stuck on the Train

4|24|13   |   2:22   |   (11) comments


Cutting the number of cars on my commuter train was an analytics fail, simple as that.
Beth Schultz
Strength in Numbers

4|22|13   |   1:32   |   (1) comment


Hear, hear! to the folks who count themselves among analytics professionals and who will be gathering next week at SAS Global Forum.
Noreen Seebacher
Fool's Gold

4|15|13   |   1:50   |   (3) comments


You don't always find what you want when you data-mine.
Beth Schultz
Ford Revs Up With Big-Data

4|12|13   |   2:44   |   (2) comments


In an All Analytics interview, Mike Cavaretta, technical leader, predictive analytics at Ford Research & Advanced Engineering, shares how big-data is fueling vehicle decisions.
Beth Schultz
Get On With It!

4|11|13   |   2:32   |   (1) comment


Analytics professionals and SAS executives share how organizations can get on with their work so much faster when working in a high-performance and visual analytics environment.
Beth Schultz
Power to the Visualization

4|11|13   |   2:03   |   (1) comment


Analytics professionals who attended SAS's recent Executive Briefing in New York share how they think visual analytics might help their organizations get better value from data.
Beth Schultz
Mental Model Gives Boeing Lift

4|9|13   |   2:01   |   (2) comments


At Boeing, effective decision making comes down to this simple formula: QxA=E, as executive Jerry Allyne explained at the recent INFORMS analytics conference.
Beth Schultz
What Users Want Next

4|8|13   |   2:31   |   (4) comments


Whether working in major league sports, financial services, or healthcare, analytics, and data, professionals are checking out how visual analytics and high-performance technologies can help them optimize their environments, shrink their cycle times, and improve decision making, as attendees at the recent SAS Executive Briefing in New York share with us.
Beth Schultz
The Power to Discover

4|4|13   |   3:36   |   (1) comment


SAS CEO Jim Goodnight speaks with us at a recent SAS Executive Briefing about getting a feel for what's in your big-data and other new realities powered by advanced analytics.
Videos
Visual Analytics, Instant Insight

5|16|13   |   2:06   |   (1) comment


Dynamic data visualizations let analysts and business users interact with the data, changing variables or drilling down into data points, and see results in a flash. Advance your use of data visualization with tools that support features like auto-charting, explanatory pop-ups, and mobile sharing.
Videos
Big Data, Fast Infrastructure

2|14|12   |   3:35   |   (6) comments


No doubt your enterprise is amassing loads of data for fact-based decision-making. Hand in hand with all that data comes big computational requirements. Can traditional IT infrastructure handle the increasing number and complexity of your analytical work? Probably not, which is why you need a backend rethink. Big data calls for a high-performance analytics infrastructure, as Fern Halper, a partner at the IT consulting and research firm, Hurwitz & Associates, discusses here.
Videos
Red Hot Analytics

1|10|12   |   3:51   |   (3) comments


Redbox's bright-red DVD kiosks are all but ubiquitous these days, located in more than 28,000 spots across the country. Jayson Tipp, Redbox VP of Analytics and CRM, provides an insider's look at how the company has accomplished its phenomenal nine-year growth.
Videos
Hotelier Checks In With Analytics

12|14|11   |   06:55   |   (11) comments


InterContinental Hotels Group (IHG), a seven-brand global hotelier, has woven analytics into the fabric of its operations. David Schmitt, director of performance strategy and planning, shares IHG's analytics story and his lessons learned.