Waqas, right, Bill does make an interesting point. In the book he points to Web logs as the perfect example of semi- or multistructured data: "Web logs are pretty ugly when you look at them; however, each piece of information does, in fact, serve a purpose of some sort." The trick is figuring out whether any given piece of a Web log serves your particular purpose -- and that's more complex and will take more time than working with structured data.
I love the point Bill made about the usual complaints made by people that most data is unstructured or semi-structured. A thorough analysis can reveal that the data was structured and just the apparent look made it look like unstructured.
By the its always great to hear about how the industry defines Big Data or how the industry looks at the importance of keeping the records of sources of data from a professional such as Chief Analytics Officer. People like Bill can tell us facts about data management, analytics and big data which university professors cant esp when we talk from the practical perspective.
It may be impossible to predict the perfect bracket, but these academics have managed to predict perfectly the "at large" bids that were included in the March Madness NCAA college basketball tournament this year and with 96% accuracy over the last 6 years.