Category "cross-validation"

how to use the train_x and train_y from sklearn k-fold split generator

I am using the sklearn k-fold generator to split some data 10 times. When I run the code below I expect train_x,train_y,test_x,test_y to contain all 10 splits h

How can I get the history of the KerasRegressor?

I want to get KerasRegressor history but all the time I get (...) object has no attribute 'History' ''' # Regression Example With Boston Dataset: Standardized a

Right way to use RFECV and Permutation Importance - Sklearn

There is a proposal to implement this in Sklearn #15075, but in the meantime, eli5 is suggested as a solution. However, I'm not sure if I'm using it the right w

SMOTE within a recipe versus SMOTE in trainControl

I am trying to understand where exactly SMOTE-ing should occur when training a model with cross-validation. I understand that all pre-processing steps should oc

Catboost overfits training data but test performance increases

I'm training catboost on a dataset made of 41k observations and ~60 features. The dataset is a longitudinal series (9 years) that is spatially distributed. At t

Difference between GroupSplitShuffle and GroupKFolds

As the title says, I want to know the difference between sklearn's GroupKFold and GroupShuffleSplit. Both make train-test splits given for data that has a group

data partitionning function CreateDataPartition cross validation problem

I am trying to get predictions of a multiple variables model, its eplt, its made of 7 scores and one final exam score moy_exam2, I want to predict the later usi

Does sklearn LogisticRegressionCV use all data for final model

I was wondering how the final model (i.e. decision boundary) of LogisticRegressionCV in sklearn was calculated. So say I have some Xdata and ylabels such that

RandomForestClassifier instance not fitted yet. Call 'fit' with appropriate arguments before using this method

I am trying to train a decision tree model, save it, and then reload it when I need it later. However, I keep getting the following error: This DecisionTre

Does the caret package for R properly implement repeated CV when passed a multifold object to trainControl's index option?

I'm hoping the answer to this question is a quick "yes" or "no" but I cannot find it explicitly in the caret documentation or elsewhere online. I want to perfor

Category "cross-validation"

how to use the train_x and train_y from sklearn k-fold split generator

How can I get the history of the KerasRegressor?

Right way to use RFECV and Permutation Importance - Sklearn

SMOTE within a recipe versus SMOTE in trainControl

Catboost overfits training data but test performance increases

Difference between GroupSplitShuffle and GroupKFolds

data partitionning function CreateDataPartition cross validation problem

Does sklearn LogisticRegressionCV use all data for final model

RandomForestClassifier instance not fitted yet. Call 'fit' with appropriate arguments before using this method

Does the caret package for R properly implement repeated CV when passed a multifold object to trainControl's index option?

Optuna catboost pruning

How i can extracte x_train and y_train from train_generator?

pandas create Cross-Validation based on specific columns

GridSearchCV on LogisticRegression in scikit-learn

Category "cross-validation"

Other Categories