Sample Data for the Taking

When learning a new system, you need data! Here are some tips to help locate it.
1/9/2013 |  7
Louis Watson
User Rank
Blogger
Re: Lots of upsides, any downsides?
Louis Watson   1/30/2013 7:19:49 PM
Thank you, Tricia, for these tips on how to get some data for testing. Your recommended sites are great resources for data manipulators of all kinds.

Pierre DeBois
User Rank
Blogger
Re: Lots of upsides, any downsides?
Pierre DeBois   1/11/2013 10:57:03 PM
I second Beth's thanks, Tricia. It's a great overview of data samples that can inspire ideas about the kinds of model inputs possible.

BethSchultz
User Rank
Blogger
Re: Lots of upsides, any downsides?
BethSchultz   1/10/2013 1:17:47 PM
Great. Thanks for the additional info!

 

TAanderud
User Rank
Blogger
Re: Lots of upsides, any downsides?
TAanderud   1/10/2013 11:09:58 AM
If you want to learn how to use your new BI system or your company's BI system, then you most likely want to get the data into your system.

You can download it in several formats (e.g., CSV, TXT, or XLS) and then import it into your system. XLS is the format most likely to carry a virus, so if that is a concern, you would probably choose to get the data as CSV and convert it to something easy to import into your BI tool.

Many of the sites also note that the data is checked and virus free, so I have not seen that as an issue.
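
For readers who want to try the CSV route described above, here is a minimal sketch using only the Python standard library. It assumes an in-memory SQLite database as a stand-in for "your system"; the sample rows, the `orders` table name, and the `load_csv_into_sqlite` helper are hypothetical and do not come from any of the sites mentioned.

```python
import csv
import io
import sqlite3

# Hypothetical sample rows standing in for a downloaded CSV file.
SAMPLE_CSV = """order_id,customer,amount
1001,Acme,250.00
1002,Globex,
1003,Initech,99.50
"""


def load_csv_into_sqlite(csv_text, table="orders"):
    """Parse CSV text and load every column as TEXT into an in-memory SQLite table."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = reader.fieldnames
    conn = sqlite3.connect(":memory:")
    col_defs = ", ".join(f'"{c}" TEXT' for c in columns)
    conn.execute(f'CREATE TABLE "{table}" ({col_defs})')
    placeholders = ", ".join("?" for _ in columns)
    rows = [[row[c] for c in columns] for row in reader]
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', rows)
    conn.commit()
    return conn


conn = load_csv_into_sqlite(SAMPLE_CSV)
row_count = conn.execute('SELECT COUNT(*) FROM "orders"').fetchone()[0]
print(row_count)
```

With a real download, you would read the file from disk instead of `SAMPLE_CSV`. A real BI tool will have its own import step, so treat the SQLite table only as a quick way to inspect the data first.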

 

 

BethSchultz
User Rank
Blogger
Re: Lots of upsides, any downsides?
BethSchultz   1/10/2013 11:01:18 AM
So, Tricia, does playing around with sample data typically involve downloading it (i.e., bringing it in house), or are we talking about playing around with it on the provider's site (in the cloud, in today's parlance)?

TAanderud
User Rank
Blogger
Re: Lots of upsides, any downsides?
TAanderud   1/10/2013 10:56:36 AM
Good question, Beth. Many companies have a development or test area that serves as a playpen. My larger concern has always been making sure the data is good to use for learning.

I really liked the BIRT data because it has issues (missing values, complicated joins) that force you to learn how to use the tool to overcome those situations. Real data often has issues, so you have to know how to work with it.
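
To make the two issues named above concrete, here is a small sketch in plain Python. The records are invented for illustration and are not taken from the BIRT sample data; the `left_join_orders` and `total_amount` helpers are hypothetical names. It shows one way to join two tables when some keys have no match, and to total a column that has missing values.

```python
# Hypothetical records; None marks a missing value.
customers = [
    {"customer_id": 1, "name": "Acme"},
    {"customer_id": 2, "name": "Globex"},
]
orders = [
    {"order_id": 1001, "customer_id": 1, "amount": 250.0},
    {"order_id": 1002, "customer_id": 2, "amount": None},  # missing amount
    {"order_id": 1003, "customer_id": 3, "amount": 99.5},  # no matching customer
]


def left_join_orders(orders, customers, default_name="UNKNOWN"):
    """Keep every order; fill in a placeholder name when no customer matches."""
    by_id = {c["customer_id"]: c for c in customers}
    joined = []
    for order in orders:
        match = by_id.get(order["customer_id"])
        joined.append({**order, "name": match["name"] if match else default_name})
    return joined


def total_amount(rows):
    """Sum the amounts that are present and count how many were missing."""
    present = [r["amount"] for r in rows if r["amount"] is not None]
    return sum(present), len(rows) - len(present)


joined = left_join_orders(orders, customers)
total, skipped = total_amount(joined)
print(joined[2]["name"], total, skipped)
```

Reporting the number of skipped rows alongside the total is a deliberate choice here: it keeps the missing data visible instead of silently hiding it, which is the habit messy sample data is meant to build.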

 

BethSchultz
User Rank
Blogger
Lots of upsides, any downsides?
BethSchultz   1/9/2013 4:26:32 PM
Hi Tricia, I love the idea of using sample data, especially when it's free. But do BI and analytics professionals need to be aware of any risks, especially if they bring the sample data in house? Do we have to be concerned about bugs or other security issues, for example?
