Release Notes#

sklearndf 2.3#

2.3.0#

sklearndf 2.3 adds support for scikit-learn 1.3 and drops support for scikit-learn 0.24.

  • API: add DF wrapper classe HDBSCANDF for native estimator HDBSCAN

  • API: add DF wrapper class TargetEncoderDF for native estimator TargetEncoder

sklearndf 2.2#

sklearndf 2.2 adds support for scikit-learn 1.2, and enhances the EstimatorDF API.

2.2.1#

  • VIZ: use scikit-learn’s native HTML representation of estimators, if available

2.2.0#

sklearndf 2.2 adds support for scikit-learn 1.2. It drops support for scikit-learn 0.23 and earlier due to incomplete support of sparse output (see below).

  • API: DF estimators now support native estimators using sparse matrices as input or output, and automatically convert them to or from sparse DataFrame objects

  • API: new property EstimatorDF.output_names_ to get the names of the output columns the estimator was fitted with

  • API: new method LearnerPipelineDF.preprocess to apply the preprocessing step to a data frame

  • API: remove properties feature_names_out_ and feature_names_original_ from class LearnerPipelineDF

  • API: Index instances obtained from EstimatorDF.feature_names_in_ and TransformerDF.feature_names_out_ are now named "feature" instead of "feature_in" and "feature_out", respectively, and Series instances obtained from TransformerDF.feature_names_original_ are now named "feature_original" instead of "feature_in", and their indices are now named "feature" instead of "feature_out"; this is to separate the semantics of the originating property from the column index, which may be used in other contexts

sklearndf 2.1#

sklearndf 2.1 adds support for scikit-learn 1.1.

2.1.1#

This is a maintenance release to catch up with sklearndf 2.0.2.

2.1.0#

sklearndf 2.0#

sklearndf 2.0 adds support for scikit-learn 1.0, adds data frame support for clusterers along with additional API enhancements and improvements, and is now subject to static type checking with mypy.

2.0.2#

  • BUILD: add support for pandas 2.0 and above

  • FIX: property PCADF.n_components_ now returns the value of n_components_, not n_components

  • FIX: detect missing and extra columns when validating data frames resulting from transforms, even when the total column count is correct

2.0.1#

  • API: upon declaration of new wrapper classes, automatically validate that their associated native estimators are compatible with the wrapper class

  • API: new public constants DROP and PASSTHROUGH in ColumnTransformerDF

  • FIX: base LGBMClassifierDF and XGBClassifierDF on the the correct wrapper class ClassifierWrapperDF

  • FIX: support array-like values for parameter drop of OneHotEncoderDF

  • FIX: various minor tweaks and stability improvements

2.0.0#

sklearndf 1.2#

This release adds support for scikit-learn 0.24.

1.2.3#

This is a maintenance release to catch up with sklearndf 1.1.3.

1.2.2#

This release makes small API tweaks, and catches up with sklearndf 1.1.2.

  • API: make type hints more specific in signatures for make_df_transformer(), make_df_classifier(), and make_df_regressor()

1.2.1#

This is a maintenance release to catch up with sklearndf 1.1.1.

1.2.0#

sklearndf 1.1#

1.1.3#

This release relaxes package dependencies to support any numpy version 1.x from 1.16.

1.1.2#

This release improves compatibility with scikit-learn and fixes bugs.

1.1.1#

This release addresses compatibility issues with meta-estimators.

  • FIX: support complex DF estimators inside StackingEstimatorDF

  • FIX: raise an exception if a base estimator is not supported by one of sklearndf’s implementations for DF meta-estimators

1.1.0#

This release exposes the wrapper API used to generate augmented DF estimators from native scikit-learn estimators.

  • API: expose the EstimatorWrapperDF class hierarchy through the new sklearndf.wrapper package

  • API: create new scikit-learn wrapper classes with the new functions make_df_estimator(), make_df_classifier(), make_df_regressor(), and make_df_transformer()

sklearndf 1.0#

1.0.2#

This is a maintenance release focusing on enhancements to the CI/CD pipeline and bug fixes.

  • FIX: correctly mirror __init__ signatures of native estimators to their corresponding DF estimators

  • FIX: do not mirror native estimator class attributes and protected members to DF estimators

  • FIX: support "passthrough" transformer in ColumnTransformerDF

  • FIX: support drop parameter in OneHotEncoderDF

  • BUILD: add support for numpy 1.20

  • BUILD: updates and changes to the CI/CD pipeline

1.0.1#

Initial release.