Weka Machine Learning Software Weka Data Mining Book Weka Machine Learning Project



Errata (First Printing)

This page lists known errors in the first printing of the book "Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations" by Ian H. Witten and Eibe Frank.

Chapter 4

Page 96
    Table 4.7: "0.029/1.362" should be "gain ratio: 0.029/1.557" and "0.021" should be "0.019"
    (reported by Chiung-Wei Huang, Department of Electronic Engineering, Ching-Yun Institute of Technology, December 2002)
Page 104
    Line 34: "divide-and-conquer" should be "separate-and-conquer"
Page 110
    Lines 5-6: ", because otherwise the resulting item set would contain more than four items. These are" should be ". For example, we do not consider (A C D) and (B C D) because (A B C D) can also be generated from (A B C) and (A B D) and if these two are not candidate three-item sets then (A B C D) cannot be a candidate four-item set. This leaves"
    (reported by J.C. Bioch, Department of Computer Science, Erasmus University Rotterdam, March 2001)

Chapter 5

Pages 123 and 125
    Page 123, Line 25; Page 123, Line 28; Page 125, Line 5: "95%" should be "80%"
    (reported by Ray Littler, Department of Statistics, University of Waikato, January 2000)
Page 123
    Lines 25-26: "73.3%" should be "73.2%" and "76.8%" should be "76.7%"
    (reported by Juan Carlos Borr�s, August 2000)
Page 123
    Lines 28-29: "70%" should be "69.1%" and "81%" should be "80.1%"
    (reported by Juan Carlos Borr�s, August 2000)
Page 124
    Line 10: "z" should be "-z"
Page 125
    Line 6: "[0.733,0.768]" should be "[0.732,0.767]" and "[0.70,0.81]" should be "[0.691,0.801]"
    (reported by Juan Carlos Borr�s, August 2000)
Page 125
    Lines 8-9: "[0.65,1.02]" should be "[0.549,0.881]" and ", particularly the upper one!" should be "!"
    (reported by Juan Carlos Borr�s, August 2000)
Page 125
    Line 10: "Probabilities, of course, can never be greater than 1." should be deleted
    (reported by Juan Carlos Borr�s, August 2000)
Page 127
    Line 36: "repeating it all:" should be "repeating it at all:"
Page 131
    Line 2: the Greek character mu (denoting the true mean) should be replaced by an overlined x (denoting the mean of the sample)
Page 144
    Line 8: "ptA+qtA" should be "ptA+qtB" and "pfA+qfA" should be "pfA+qfB"
Page 146
    Table 5.5: precision should be "TP / (TP + FP)" instead of "TP / (TP + FN)"
    (reported by Stuart Inglis, Intelligenesis Corporation, December 1999)
Page 146
    Line 11: "plotted as two" should be "plotted as"
Page 148
    Table 5.6: the denominator should be replaced by its square root in the calculation of the correlation coefficient
    (reported by Huan Liu, Arizona State University, September 2000)

Chapter 6

Page 165
    Line 34: "the error rate could not possibly" should be replaced by "it would be a bad mistake to let the error rate"
Page 167
    Lines 1-2: ", which frequently exceed the maximum possible value of 0.5, underscore the fact that" should be replaced by " should be taken with a grain of salt because"
Page 174
    Line 23: "0.17%" should be replaced by "0.017%"
    (reported by Xin Xu, University of Waikato, June 2001)
Page 207
    Figure 6.11, Line 39: "subtreEerror" should be "subtreeError"
Page 218
    Line 3: ")" missing before ","
Page 218
    Line 4: "of values that the set of attributes can assume" should be "of attributes"
Page 218
    Line 15: The square root in the denominator should only enclose pi

Chapter 7

Page 237
    Line 1: "up to k*2k" should be "up to 2k"
Page 237
    Line 2: "the effect of adding every attribute to" should be deleted
Page 238
    Line 29: "minimum number of instances" should be "minimum number of instances in the majority class"
Page 244
    Line 22: "given impurity function" should be "given additive impurity function"

Chapter 8

Page 280
    Table 8.1, Line 13: "-I" should be "-i"
    (reported by Leonard Trigg, Intelligenesis Corporation, January 2000)
Page 282
    Table 8.2: last line in first column should be "-S"
Page 286
    Line 37: "jaws.classifiers.j48.J48" should be "weka.classifiers.j48.J48"
    (reported by Leonard Trigg, Intelligenesis Corporation, November 1999)
Page 308
    Line 26: "* }" should be "}"
    (reported by Leonard Trigg, Intelligenesis Corporation, November 1999)
Page 313
    Line 21: "weka.classifiers" should be "weka.core"