Package: ForestDisc 0.1.0

ForestDisc: Forest Discretization

Supervised, multivariate, and non-parametric discretization algorithm based on tree ensembles learning and moment matching optimization. This version of the algorithm relies on random forest algorithm to learn a large set of split points that conserves the relationship between attributes and the target class, and on moment matching optimization to transform this set into a reduced number of cut points matching as well as possible statistical properties of the initial set of split points. For each attribute to be discretized, the set S of its related split points extracted through random forest is mapped to a reduced set C of cut points of size k. This mapping relies on minimizing, for each continuous attribute to be discretized, the distance between the four first moments of S and the four first moments of C subject to some constraints. This non-linear optimization problem is performed using k values ranging from 2 to 'max_splits', and the best solution returned correspond to the value k which optimum solution is the lowest one over the different realizations. ForestDisc is a generalization of RFDisc discretization method initially proposed by Berrado and Runger (2009) <doi:10.1109/AICCSA.2009.5069327>, and improved by Berrado et al. in 2012 by adopting the idea of moment matching optimization related by Hoyland and Wallace (2001) <doi:10.1287/mnsc.47.2.295.9834>.

Authors:Haddouchi Maïssae

ForestDisc_0.1.0.tar.gz
ForestDisc_0.1.0.zip(r-4.7)ForestDisc_0.1.0.zip(r-4.6)ForestDisc_0.1.0.zip(r-4.5)
ForestDisc_0.1.0.tgz(r-4.6-any)ForestDisc_0.1.0.tgz(r-4.5-any)
ForestDisc_0.1.0.tar.gz(r-4.7-any)ForestDisc_0.1.0.tar.gz(r-4.6-any)
ForestDisc_0.1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
ForestDisc/json (API)

# Install 'ForestDisc' in R:
install.packages('ForestDisc', repos = c('https://hmais.r-universe.dev', 'https://cloud.r-project.org'))

On CRAN:

Conda:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

1.00 score 1 scripts 139 downloads 4 exports 3 dependencies

Last updated from:ea960566ba. Checks:9 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK108
source / vignettesOK155
linux-release-x86_64OK105
macos-release-arm64OK182
macos-oldrel-arm64OK159
windows-develOK91
windows-releaseOK61
windows-oldrelOK58
wasm-releaseOK98

Exports:Extract_cont_splitsForestDiscRF2SelectedtreesSelect_cont_splits

Dependencies:momentsnloptrrandomForest