So I followed the herd and had a look at the Kaggle site. Interesting indeed.
I liked the little video about the Titanic but missed Celine Dion.
I'm not involved in R programming but the analysis "Exploring Survival on the Titanic" by Megan Risdal is kinda interesting, especially the breakdown by family size that shows that the survival rate of single passengers was horrible compared with families of up to 4. But familes >4 also had a rotten rate.
I guess a word to the wise would be: Don't travel alone on a 1912 ocean liner ...
@magneticnorth: Anonymizing the data (or, at the very least, the data source, the organization, and related salient points) helps, I tend to think.
In any case, I think it depends upon the nature of the data and the business (e.g., something bland like bounce rates on an unidentified website versus trade secrets). Just as in regular conversation, there are some things you can talk about with outsiders (e.g., at a dinner party) when it comes to your work and your business, and some things you can't. Your mileage may vary.
@kq4ym It is amazing, isn't it? I was under the impression that most data scientists' work would be under some non-disclosure agreement, but now there's a community where you could showcase your work. Kaggle is something I wouldn't have thought would emerge but here it is. I guess there's more collaboration to be had than I'd foreseen.
Kaggle can also help users find free datasets from governments and other entities that might be hard to otherwise find. You also create and upload your own datasets and share links to it for potential employers.
HIPAA is supposed to protect the privacy of your medical information, but the sale of anonymized medical data and advances in analytics have made it possible for organizations to re-link your name to your private records.
Healthcare organizations have already provided some of the most interesting and promising applications of advanced analytics use cases. The HIMSS 2017 event next week in Orlando will provide a window into what's next.
Are you burning revenues and sending them up the chimney rather than applying them to your bottom line? In this episode, All Analytics Radio looks at how to apply analytics to identify potential energy savings.
In conjunction with Internet Safety Day, the Wikimedia Foundation has released two new public data sets of online harassment in Wikipedia edits. The Foundation leveraged machine learning to detect harassment.