Sample Data for the Taking

When learning a new system, you need data! Here are some tips to help locate it.
NO RATINGS
1/9/2013 |  7
View Comments: Newest First | Oldest First | Threaded View
Louis Watson
User Rank
Blogger
Re: Lots of upsides, any downsides?
Louis Watson   1/30/2013 7:19:49 PM
NO RATINGS
Thank you Tricia for these tips on how to get some data for testing.  Your recommended sites are great resources for data manipulators of all kinds.

Pierre DeBois
User Rank
Blogger
Re: Lots of upsides, any downsides?
Pierre DeBois   1/11/2013 10:57:03 PM
NO RATINGS
I second Beth's thanks, Tricia.  It's a great overview of data samples that can give some inspired ideas as to the kind of model inputs possible.

BethSchultz
User Rank
Blogger
Re: Lots of upsides, any downsides?
BethSchultz   1/10/2013 1:17:47 PM
NO RATINGS
Great. Thanks for the additional info!

 

TAanderud
User Rank
Blogger
Re: Lots of upsides, any downsides?
TAanderud   1/10/2013 11:09:58 AM
NO RATINGS
If you want to learn how to use your new BI system or the companies BI system - then you most likely want to get the data to your system.

You can download it in several formats (i.e. CSV, TXT, or XLS) and then import into your system. The XLS file format would be the most likely to contain a virus - so if that is an issue you would probably choose to get the data as CSV and convert to something easy to import into your BI tool. 

Many of the sites also note that the data is virus free and checked so I have not seen that as an issue.

 

 

BethSchultz
User Rank
Blogger
Re: Lots of upsides, any downsides?
BethSchultz   1/10/2013 11:01:18 AM
NO RATINGS
So Tricia does playing around with sample data typically involved downloading it (ie, bringing it in house) or are we talking about playing around with it on the provider's site (in the cloud, in today's parlance)?

TAanderud
User Rank
Blogger
Re: Lots of upsides, any downsides?
TAanderud   1/10/2013 10:56:36 AM
NO RATINGS
Good question Beth.  Many companies have a development or test area where it is playpen area. My larger concern has always been making sure the data is good to use for learning.

I really liked the BIRT data because the data has issues (missing values, complicated joins) that force you to learn how to use the tool to overcome those situations. Real data often has issues so you have to know how to work with the data.

 

BethSchultz
User Rank
Blogger
Lots of upsides, any downsides?
BethSchultz   1/9/2013 4:26:32 PM
NO RATINGS
Hi Tricia, I love the idea of using sample data, especially when it's free. But do BI and analytics professionals need to be aware of any risks, especially if you bring the sample data inhouse? Do we have to be concerned about bugs or other security issues, for example?

Latest Blogs
To understand data virtualization, start by imagining the chatter across the multiple channels at NASA's mission control center.
If consumers in China are moving toward machine-based purchasing, maybe that explains who is paying $150 for bottles of fresh British air.
Quantum physics and quantum computing could turn many of the core principles of analytics on their heads. Big becomes small. True becomes maybe. Predictability becomes unpredictability.
Take a step back and think about the dramatic changes in healthcare that data and analytics have driven or enabled. It isn't your grandmother's doctor with black bag on a housecall.
Analytics on the evolving smart grid take center stage next week at the Distributech 2016 conference in Orlando.
Information Resources
Radio Show
A2 Conversations
UPCOMING
James M. Connolly
See How Data is Revolutionizing Healthcare


2/25/2016   REGISTER   0
ARCHIVE
James M. Connolly
The Analytics Job and Salary Outlook for 2016


1/28/2016  LISTEN   16
ARCHIVE
James M. Connolly
See How Analytics Drive Change in the Retail World


1/7/2016  LISTEN   103
ARCHIVE
James M. Connolly
All Analytics Conversations: Forecasts for Analytics in 2016


12/18/2015  LISTEN   3
ARCHIVE
James M. Connolly
Don't Make This Mistake With Big Data


12/11/2015  LISTEN   4
ARCHIVE
James M. Connolly
Understand the Difference Between Data Science and Analytics


11/23/2015  LISTEN   30
ARCHIVE
James M. Connolly
Shape the Next Generation of Data Scientists


10/20/2015  LISTEN   69
ARCHIVE
James M. Connolly
Analytics Best Practices: Plan for the Human Factor


10/14/2015  LISTEN   117
ARCHIVE
James M. Connolly
See How Your Analytics Initiative Measures Up


9/24/2015  LISTEN   50
ARCHIVE
James M. Connolly
Use Mobile Analytics to See the Big Picture


9/1/2015  LISTEN   83
ARCHIVE
James M. Connolly
Hire and Manage a Great Analytics Team


9/1/2015  LISTEN   152
Quick Poll
Quick Poll
About Us  |  Contact Us  |  Help  |  Register  |  Twitter  |  Facebook  |  RSS