Sample Data for the Taking

When learning a new system, you need data! Here are some tips to help locate it.
1/9/2013 |  7
Re: Lots of upsides, any downsides?
louisw900   1/30/2013 7:19:49 PM
Thank you Tricia for these tips on how to get some data for testing.  Your recommended sites are great resources for data manipulators of all kinds.

Re: Lots of upsides, any downsides?
Zimana   1/11/2013 10:57:03 PM
I second Beth's thanks, Tricia. It's a great overview of data samples, and it can inspire ideas about the kinds of model inputs that are possible.

Re: Lots of upsides, any downsides?
BethSchultz   1/10/2013 1:17:47 PM
Great. Thanks for the additional info!

 

Re: Lots of upsides, any downsides?
TAanderud   1/10/2013 11:09:58 AM
If you want to learn how to use your new BI system or your company's BI system, then you most likely want to get the data into that system.

You can download it in several formats (e.g., CSV, TXT, or XLS) and then import it into your system. The XLS file format would be the most likely to contain a virus, since it can carry macros, so if that is a concern you would probably choose to get the data as CSV and convert it to something easy to import into your BI tool.

Many of the sites also note that the data is virus-free and checked, so I have not seen that as an issue.
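
For anyone who wants to try the CSV route before pointing a BI tool at the file, here is a minimal sketch in Python. It assumes nothing beyond the standard library; the column names and values in `SAMPLE_CSV` are made up for illustration and do not come from any of the sites mentioned in the post. A real download would be read from disk instead of from a string.

```python
import csv
import io

# Hypothetical sample contents standing in for a downloaded CSV file.
SAMPLE_CSV = """order_id,customer,amount
1001,Acme,250.00
1002,Globex,
1003,Initech,99.50
"""


def load_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


rows = load_rows(SAMPLE_CSV)

# A quick look at the shape of the data before importing it anywhere else.
print(len(rows), "rows; columns:", list(rows[0].keys()))
```

Because CSV is plain text, you can also open the file in a text editor and see exactly what you are about to import.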

Re: Lots of upsides, any downsides?
BethSchultz   1/10/2013 11:01:18 AM
So, Tricia, does playing around with sample data typically involve downloading it (i.e., bringing it in-house), or are we talking about playing around with it on the provider's site (in the cloud, in today's parlance)?

Re: Lots of upsides, any downsides?
TAanderud   1/10/2013 10:56:36 AM
Good question, Beth. Many companies have a development or test area that serves as a playpen. My larger concern has always been making sure the data is good to use for learning.

I really liked the BIRT data because it has issues (missing values, complicated joins) that force you to learn how to use the tool to overcome those situations. Real data often has issues, so you have to know how to work with it.
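
To make the two kinds of issues concrete, here is a small Python sketch that counts missing values per column and lists join keys with no match in a second table. The tables, column names, and both helper functions are hypothetical, invented for illustration; they are not the BIRT sample schema or part of any BI tool.

```python
def count_missing(rows):
    """Count empty or absent values per column in a list of dict rows."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            if value is None or str(value).strip() == "":
                counts[column] = counts.get(column, 0) + 1
    return counts


def unmatched_keys(left, right, key):
    """Return sorted key values in `left` that have no matching row in `right`."""
    right_keys = {row[key] for row in right}
    return sorted({row[key] for row in left if row[key] not in right_keys})


# Hypothetical tables: one order has no amount, one points at an unknown customer.
orders = [
    {"order_id": "1", "customer_id": "C1", "amount": "250.00"},
    {"order_id": "2", "customer_id": "C2", "amount": ""},
    {"order_id": "3", "customer_id": "C9", "amount": "99.50"},
]
customers = [
    {"customer_id": "C1", "name": "Acme"},
    {"customer_id": "C2", "name": None},
]

print(count_missing(orders))
print(unmatched_keys(orders, customers, "customer_id"))
```

Running checks like these on a sample data set first tells you which rows a report will silently drop or leave blank once the same joins are built in the BI tool.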

 

Lots of upsides, any downsides?
BethSchultz   1/9/2013 4:26:32 PM
Hi Tricia, I love the idea of using sample data, especially when it's free. But do BI and analytics professionals need to be aware of any risks, especially if they bring the sample data in-house? Do we have to be concerned about bugs or other security issues, for example?
