mirror of
https://github.com/sjwhitworth/golearn.git
synced 2025-04-28 13:48:56 +08:00

This patch adds: * Gini index and information gain ratio as DecisionTree split options; * handling for numeric Attributes (split point chosen naïvely on the basis of maximum entropy); * A couple of additional utility functions in base/ * A new dataset (see sources.txt) for testing. Performance on Iris performs markedly without discretisation.
165 B
165 B
1 | Attribute1 | Attribute2 | Attribute3 | Class |
---|---|---|---|---|
2 | A | 70 | T | A |
3 | A | 90 | T | B |
4 | A | 85 | F | B |
5 | A | 95 | F | B |
6 | A | 70 | F | A |
7 | B | 90 | T | A |
8 | B | 78 | F | A |
9 | B | 65 | T | A |
10 | B | 75 | F | A |
11 | C | 80 | T | B |
12 | C | 70 | T | B |
13 | C | 80 | F | A |
14 | C | 80 | F | A |
15 | C | 96 | F | A |