All Analytics Academy
The Internet of Things Joins the Enterprise

Jun 9 - Jun 23
Join 5 interactive classes & chat with peers
 

Sample Data for the Taking

When learning a new system, you need data! Here are some tips to help locate it.
NO RATINGS
1/9/2013 |  7
View Comments: Newest First | Oldest First | Threaded View
Louis Watson
User Rank
Blogger
Re: Lots of upsides, any downsides?
Louis Watson   1/30/2013 7:19:49 PM
NO RATINGS
Thank you Tricia for these tips on how to get some data for testing.  Your recommended sites are great resources for data manipulators of all kinds.

Pierre DeBois
User Rank
Blogger
Re: Lots of upsides, any downsides?
Pierre DeBois   1/11/2013 10:57:03 PM
NO RATINGS
I second Beth's thanks, Tricia.  It's a great overview of data samples that can give some inspired ideas as to the kind of model inputs possible.

BethSchultz
User Rank
Blogger
Re: Lots of upsides, any downsides?
BethSchultz   1/10/2013 1:17:47 PM
NO RATINGS
Great. Thanks for the additional info!

 

TAanderud
User Rank
Blogger
Re: Lots of upsides, any downsides?
TAanderud   1/10/2013 11:09:58 AM
NO RATINGS
If you want to learn how to use your new BI system or the companies BI system - then you most likely want to get the data to your system.

You can download it in several formats (i.e. CSV, TXT, or XLS) and then import into your system. The XLS file format would be the most likely to contain a virus - so if that is an issue you would probably choose to get the data as CSV and convert to something easy to import into your BI tool. 

Many of the sites also note that the data is virus free and checked so I have not seen that as an issue.

 

 

BethSchultz
User Rank
Blogger
Re: Lots of upsides, any downsides?
BethSchultz   1/10/2013 11:01:18 AM
NO RATINGS
So Tricia does playing around with sample data typically involved downloading it (ie, bringing it in house) or are we talking about playing around with it on the provider's site (in the cloud, in today's parlance)?

TAanderud
User Rank
Blogger
Re: Lots of upsides, any downsides?
TAanderud   1/10/2013 10:56:36 AM
NO RATINGS
Good question Beth.  Many companies have a development or test area where it is playpen area. My larger concern has always been making sure the data is good to use for learning.

I really liked the BIRT data because the data has issues (missing values, complicated joins) that force you to learn how to use the tool to overcome those situations. Real data often has issues so you have to know how to work with the data.

 

BethSchultz
User Rank
Blogger
Lots of upsides, any downsides?
BethSchultz   1/9/2013 4:26:32 PM
NO RATINGS
Hi Tricia, I love the idea of using sample data, especially when it's free. But do BI and analytics professionals need to be aware of any risks, especially if you bring the sample data inhouse? Do we have to be concerned about bugs or other security issues, for example?

Latest Blogs
Effective use of models help us execute on our thoughts, and there are tools to enable each part of the process, except for what our imagination brings to the game.
Maryanne Schretzman, executive director of the New York City Center for Innovation through Data Intelligence (CIDI), discusses why analytics professionals would want to work in an organization that is devoted to helping people live better lives.
Robert explores new ways to highlight the dental issues of the US elderly population.
Judging enterprise interest in analytics calls for a new metric that looks beyond what companies spend on hardware, software, and services.
Predictive analytics have been proving their worth in the retail sector, with examples showing showing the sector how predictive analytics can blend with industry experience in decision making.
VIDEO BLOGS
VIDEO BLOGS
Quick Poll
Quick Poll
Radio Show
Radio Shows
UPCOMING
James M. Connolly
Survive the Digital Transformation


8/18/2015   REGISTER   0
ARCHIVE
James M. Connolly
Health Analytics: Find Data Beyond the Hospital Doors


7/28/2015  LISTEN   47
ARCHIVE
James M. Connolly
Finding Answers Through Prescriptive Analytics


7/21/2015  LISTEN   117
ARCHIVE
James M. Connolly
Visualization: How to Bring Data to Life


6/22/2015  LISTEN   55
ARCHIVE
James M. Connolly
Learn Why Analytics Are at Home in the Cloud


6/15/2015  LISTEN   26
ARCHIVE
James M. Connolly
Analytics: Your Defense Against Cyber Threats


5/27/2015  LISTEN   60
ARCHIVE
James M. Connolly
Big Data & Big Pharma: How Analytics Might Save Your Life


5/19/2015  LISTEN   37
ARCHIVE
James M. Connolly
Live Interviews From SAS Global Forum


4/28/2015  LISTEN   11
ARCHIVE
James M. Connolly
How to Hire Great Analytics Talent


4/23/2015  LISTEN   51
ARCHIVE
James M. Connolly
Sports Analytics Mean Fun and Business


3/24/2015  LISTEN   3
ARCHIVE
James M. Connolly
Secure Your Big Data in the Cloud


2/26/2015  LISTEN   114
Information Resources
Infographic
Infographic
It Pays to Keep Insurance Fraud in Check
While 97% of insurers say that insurance fraud has increased or remained the same in the past two years, most of those companies report benefits from anti-fraud technology in limiting the impact of fraud, including higher quality referrals, the ability to uncover organized fraud, and improve efficiency for investigators.
Follow us on Twitter
Follow us on Twitter
Like us on Facebook
Like us on Facebook
About Us  |  Contact Us  |  Help  |  Register  |  Twitter  |  Facebook  |  RSS