Ayush
abed408f9b
Updating Dataset + Naming
2020-07-26 11:21:20 +05:30
Ayush
c0c3b2e1bf
Fixing Sorting
2020-07-25 13:22:15 +05:30
Ayush
b16b60fcb5
Adding Example script for CART
2020-07-23 16:45:31 +05:30
Ayush
c083759523
Adding Changes
2020-07-22 14:34:59 +05:30
Ayush
08529c42cf
Added Comments for Regressor
2020-07-18 14:21:50 +05:30
Ayush
16eac7d86d
Adding Regression Trees
2020-07-18 12:26:50 +05:30
Ayush
d1228c5508
Adding Integration For Fixed Data Grid in Predict And Evaluate
2020-07-18 10:47:22 +05:30
Ayush
8848652943
Added Decision Tree Classifier
...
CART implementation of Decision Tree Classifier, based on Gini Impurity or Entropy, as selected by the user.
2020-07-16 13:37:34 +05:30
Richard Townsend
ac9fa85307
Fix for error that happens on Go 1.11 and above
2019-03-23 18:54:04 +00:00
Soypete
019f1015dd
change gonum matrix definitions to match with current gonum version
2019-03-19 22:49:58 -06:00
ss8651twtw
1e1b5f11fb
Format code
2018-06-16 22:14:18 +08:00
yenck
bf907556f5
testcase
2018-06-16 22:11:59 +08:00
yenck
80bc1ac6f8
some test for C0
2018-06-16 22:11:59 +08:00
yenck
30071eb8a4
some test for C9
2018-06-16 22:11:59 +08:00
Ilya Tocar
676f69a426
trees: speed-up training
...
Avoid quadratic loop in getNumericAttributeEntropy.
We don't need to recalculate whole distribution for each split,
just move changed values. Also use array of slices instead of
map of maps of strings to avoid map overhead.
For our case I see time reductions from 100+ hours to 50 minutes.
I've added benchmark with synthetic data (iris.csv repeated 100 times)
and it also shows a nice improvement:
name old time/op new time/op delta
RandomForestFit-8 117s ± 4% 0s ± 1% -99.61% (p=0.001 n=5+10)
0 is a rounding quirk of benchstat, it should be closer to 0.5s:
name time/op
RandomForestFit-8 460ms ± 1%
2018-05-08 14:59:41 -05:00
Richard Townsend
58ae6f4d1b
trees: Try to fix premature write-after-Close issue
2018-01-28 16:35:55 +00:00
Richard Townsend
e2279995c1
Fixing all tests
2018-01-28 16:22:33 +00:00
Richard Townsend
ce78cd0406
Passes the tests
2018-01-27 18:56:01 +00:00
Richard Townsend
f722f2e59d
trees: implement serialization
2018-01-27 18:00:52 +00:00
Richard Townsend
e7fee0a2d1
Reformat, fix tests
2017-09-10 21:10:54 +01:00
Richard Townsend
fc110aab48
Fix bad import, reformat
2017-09-10 20:35:34 +01:00
Richard Townsend
aee475ca14
Fix the trees tests
2017-09-10 20:13:41 +01:00
Richard Townsend
e27215052b
ensemble: tests pass
2017-09-10 19:30:02 +01:00
Richard Townsend
768d2cd19f
meta: tests are almost passing
2017-09-10 16:59:05 +01:00
Richard Townsend
57e6054404
base: fix unmarshalling attributes, add JSON
2017-08-26 14:56:31 +01:00
Richard Townsend
e68361c162
Genericize for ensemble use
2017-08-08 12:37:57 +01:00
Richard Townsend
a90ef09781
Remove excessive logging
2017-08-08 12:29:00 +01:00
Richard Townsend
d23619eac2
OK, but with a lot of extra printing
2017-08-07 17:26:11 +01:00
meirwahnon
674de9cae3
change Probability order
2017-07-17 16:01:49 +03:00
meirwahnon
518c0d84c4
extren fields of ClassProba
2017-07-17 15:35:35 +03:00
meirwahnon
2b478a0513
fix to float precise
2017-07-17 15:01:08 +03:00
meirwahnon
f56fce1a43
support PredictProba
2017-07-17 14:48:38 +03:00
Ryan Schmukler
cf6192c81c
fix(id3): fix panic on SplitAttribute being nil
2016-06-28 14:36:48 -04:00
Richard Townsend
7ba57fe6df
trees: Handling FloatAttributes.
...
This patch adds:
* Gini index and information gain ratio as
DecisionTree split options;
* handling for numeric Attributes (split point
chosen naïvely on the basis of maximum entropy);
* A couple of additional utility functions in base/
* A new dataset (see sources.txt) for testing.
Performance on Iris performs markedly without discretisation.
2014-10-26 17:40:38 +00:00
Amit Kumar Gupta
4d93b9de89
Convert remaining tests to goconvey
2014-08-23 05:22:16 +00:00
Amit Kumar Gupta
1809a8b358
RandomForest returns error when fitting data with fewer features than the RandomForest plans to use
...
- BaseClassifier Predict and Fit methods return errors
- go fmt ./...
Conflicts:
ensemble/randomforest.go
ensemble/randomforest_test.go
trees/tree_test.go
2014-08-22 13:39:29 +00:00
Amit Kumar Gupta
529b3bcaa5
Avoid renaming packages on import
2014-08-22 13:39:29 +00:00
Amit Kumar Gupta
947ee8380e
Return error instead of panicking when unable to get confusion matrix
2014-08-22 13:39:29 +00:00
Amit Kumar Gupta
14aad31821
Consistently use (t *testing.T) instead of T or testEnv
2014-08-22 08:44:41 +00:00
Amit Kumar Gupta
695aec6eb6
Favor idiomatic t.Fatalf over panic for test failures
2014-08-22 08:07:55 +00:00
Amit Kumar Gupta
45545d6ebd
Remove Println's from automated test suite since they aren't assertions
2014-08-22 07:58:01 +00:00
Amit Kumar Gupta
21bb2fc9fa
Remove redundant import renames
2014-08-22 07:21:24 +00:00
Richard Townsend
f9c1e24e5b
neural: stop-gap support for neural networks
2014-08-09 19:27:20 +01:00
Richard Townsend
47341b2869
base: Cleaned up duplicate Attribute resolution functions
2014-08-03 15:17:20 +01:00
Richard Townsend
c2d040af30
trees: merge from v2-instances
2014-08-03 15:17:13 +01:00
albrow
132e3f4527
Create a new default logger and change some print statements to use the logger instead of fmt.Println.
2014-07-20 15:26:13 -04:00
Niclas Jern
627a5537d3
Comments should be of the form "<Struct> ..." or "<MethodName> ..."
2014-07-18 13:48:28 +03:00
Niclas Jern
32f36f28c3
if block ends with a return statement -> drop this else and outdent its block
2014-07-18 13:20:46 +03:00
Remo Hertig
f77c1dcde0
use multiple return values instead of an array in InstancesTrainTestSplit
2014-06-06 21:33:17 +02:00
Richard Townsend
a6072ac9de
Package documentation
2014-05-19 12:59:11 +01:00