Release Notes#
FACET 2.0#
FACET 2.0 brings numerous API enhancements and improvements, accelerates model inspection by up to a factor of 50 in many practical applications, introduces a new, more flexible and user-friendly API for hyperparameter tuning – with support for scikit-learn’s native hyperparameter searchers – and improves the styling of all visualizations.
FACET 2.0 requires pytools 2.0 and sklearndf 2.2, and is now
fully type-checked by mypy.
2.0.1#
API: class
LearnerInspectornow supports inspecting individual regressors and classifiers; it is no longer necessary to wrap them into aRegressorPipelineDForClassifierPipelineDFinstance with empty preprocessingFIX: replace a call to method
get_text_heights()ofmatplotlib.axes.Axes, which is deprecated as ofmatplotlib3.6
2.0.0#
facet.data#
API: class
Sampleraises an exception if the name of any used column is not a stringAPI: class
RangePartitionersupports new optional argumentslower_boundandupper_boundin methodfit()and no longer accepts them in the class initializer
facet.explanation#
REFACTOR: moved explainer factories from module
facet.inspectionto new modulefacet.explanation.API: new explainer factories
ExactExplainerFactoryandPermutationExplainerFactory, in addition to theTreeExplainerFactoryandKernelExplainerFactoryintroduced in FACET 1.0
facet.inspection#
API: new
FunctionInspectorclass for inspecting arbitrary functions, using aExactExplainerFactoryby defaultAPI:
LearnerInspectorno longer uses learner crossfits and instead inspects models using a single pass of SHAP calculations, usually leading to performance gains of up to a factor of 50API: return
LearnerInspectormatrix outputs asMatrixinstancesAPI: diagonals of feature synergy, redundancy, and association matrices are now
naninstead of 1.0API: the leaf order of
LinkageTreeobjects generated byfeature_…_linkagemethods ofLearnerInspectoris now the same as the row and column order ofMatrixobjects returned by the correspondingfeature_…_matrixmethods ofLearnerInspector, minimizing the distance between adjacent leaves. The old sorting behaviour of FACET 1.x can be restored using methodsort_by_weight()
facet.selection#
API:
LearnerSelectorreplaces FACET 1.x classLearnerRanker, and provides a new, more flexible and user-friendly API for hyperparameter tuningAPI:
LearnerSelectorintroduces support for any CV searcher implementing scikit-learn’s CV search API, including scikit-learn’s native searchers such asGridSearchCVorRandomizedSearchCVAPI: new classes
ParameterSpaceandMultiEstimatorParameterSpaceoffer a more convenient and robust mechanism for declaring options or distributions for hyperparameter tuningAPI: new class
LearnerSelectorsupports a new, more flexible and user-friendly API for hyperparameter tuning
facet.simulation#
API: simulations no longer depend on learner crossfits and instead are carried out as a single pass on the full dataset, using the standard error of mean predictions to obtain confidence intervals that less conservative yet more realistic
VIZ: minor tweaks to simulation plots and reports generated by
SimulationDrawer
facet.validation#
API: removed class
FullSampleValidator
Other#
VIZ: significant updates to the styling of all visualizations, especially those generated for output of
LearnerInspector, using the all-new versions ofpytoolsmatrix and dendrogram drawersAPI: class
LearnerCrossfitis no longer needed in FACET 2.0 and has been removedAPI: support new
fitted_onlydecorator introduced inpytools2.1.
FACET 1.2#
FACET 1.2 adds support for sklearndf 1.2 and scikit-learn 0.24. It also introduces the ability to run simulations on a subsample of the data used to fit the underlying crossfit. One example where this can be useful is to use only a recent period of a time series as the baseline of a simulation.
1.2.3#
BUILD: pin down matplotlib version to < 3.6 and scipy version to < 1.9 to ensure compatibility with pytools 1.2 and sklearndf 1.2
1.2.2#
catch up with FACET 1.1.2
1.2.1#
FIX: fix a bug in
UnivariateProbabilitySimulatorthat was introduced in FACET 1.2.0catch up with FACET 1.1.1
1.2.0#
BUILD: added support for sklearndf 1.2 and scikit-learn 0.24
API: new optional parameter
subsamplein methodBaseUnivariateSimulator.simulate_feature()can be used to specify a subsample to be used in the simulation (but simulating using a crossfit based on the full sample)
FACET 1.1#
FACET 1.1 refines and enhances the association/synergy/redundancy calculations
provided by the LearnerInspector.
1.1.2#
DOC: use a downloadable dataset in the getting started notebook
FIX: import catboost if present, else create a local module mockup
FIX: correctly identify if
sample_weightsis undefined when re-fitting a model on the full dataset in aLearnerCrossfitBUILD: relax package dependencies to support any numpy version 1.`x` from 1.16
1.1.1#
DOC: add reference to FACET research paper on the project landing page
FIX: correctly count positive class frequency in UnivariateProbabilitySimulator
1.1.0#
API: SHAP interaction vectors can (in part) also be influenced by redundancy among features. This can inflate quantifications of synergy, especially in cases where two variables are highly redundant. FACET now corrects interaction vectors for redundancy prior to calculating synergy. Technically we ensure that each interaction vector is orthogonal w.r.t the main effect vectors of both associated features.
API: FACET now calculates synergy, redundancy, and association separately for each model in a crossfit, then returns the mean of all resulting matrices. This leads to a slight increase in accuracy, and also allows us to calculate the standard deviation across matrices as an indication of confidence for each calculated value.
API: Method
LearnerInspector.shap_plot_data()now returns SHAP values for the positive class of binary classifiers.API: Increase efficiency of
ModelSelectorparallelization by adopting the newpytools.parallelization.JobRunnerAPI provided bypytoolsBUILD: add support for
shap0.38 and 0.39
FACET 1.0#
1.0.3#
FIX: restrict package requirements to gamma-pytools 1.0 and sklearndf 1.0, since FACET 1.0 is not compatible with gamma-pytools 1.1
1.0.2#
This is a maintenance release focusing on enhancements to the CI/CD pipeline and bug fixes.
API: add support for shap 0.36 and 0.37 via a new
BaseExplainerstub classFIX: apply color scheme to the histogram section in
SimulationMatplotStyleBUILD: add support for
numpy1.20BUILD: updates and changes to the CI/CD pipeline
1.0.1#
Initial release.