I have to agree with one of the previous reviewers when he said that given the absence of practically *ANY* documentation provided by Microsoft, this book is your only real source of information about Microsoft's data mining product.I'm a big fan of OLAP amd data mining which made me better appreciate the time the author took to lay the groundwork for the discipline of data mining. Unlike a previous reviewer, I think that the author shares lots of real-world evperience which you can see by the way he bring up problems (which I have encountered myself) that occur when moving from raw data to a data mining model. He also catches some glitches and unreported features in the product for you and shows you how to work around them.
The book is actually very complete considering that the data mining product put out by Microsoft is promising, but extremely rudimentary. It provides only two basic data mining algorithms and gives a very clumsy way to try to add other algorithms. Thankfully, the author discusses techniques and pitfalls of mining numerical data and even shows you how to use SQL Server 2000 to perform a regression analysis for that purpose.
I would have given this book five stars except for two points :
1: The mushroom database is a good illustration of the use of the decision tree algorithm, but I think it may have been good to include a more business-oriented example that would bring data mining closer to it's intended purpose.
2: I was a little disappointed not to see any explanation as to how to add your own algorithms to the data mining product. Even if doing so requires C++ experience, it would have been perfectly fine to include it in a separate chapter or in an appendix. I don't know why the author chose not to include it.
Byond that, I would definitely recommend this book if you need to use MS data mining. The book is well written, and considering the infancy of the product, it's also very complete. Besides, you have no other real resource out there!
A lot of the information found in this book should have already been in the online documentation. The lack of documentation for data mining under sql server 2000 makes this book the only usefull reference out there. But overall, the book is poorly organised, badly written and requires a lot more in-dept information in order to put data mining into practical use.You will find some information on DTS, but there are much better books out there on the topic. You will find some sample code for using DSO, but this topic is only touched upon and the code is NOT explained very well. The most important chapters were very thin (programming data mining and data mining queries). After reading the book, you will have an introduction to data mining, but you won't be able to use it effectively.
The examples in the book have no commercial value and are completely worthless. There is no CDROM that contains the data the author is using, and the sample data on the web is different to the data in the book. You will also have to start with chapter 8 (DTS) to load the sample data before you can follow the examples in the book.
I was really looking forward to get a copy of this book, but now that I have a copy, I am very dissapointed. The contents of this book shows that the author has no real world experience on the topic or is not willing to share it.