GridSearchCV Random Forest Regressor optimiert die besten Parameter

GridSearchCV Random Forest Regressor optimiert die besten Parameter ⇐ Python

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Anonymous

GridSearchCV Random Forest Regressor optimiert die besten Parameter

Report
Quote

Post by Anonymous » 30 Nov 2025, 02:02

Ich möchte die Parameter dieses GridSearchCV für einen Random Forest Regressor verbessern.

Code: Select all

def Grid_Search_CV_RFR(X_train, y_train):
from sklearn.model_selection import GridSearchCV
from sklearn.model_selection import ShuffleSplit
from sklearn.ensemble import RandomForestRegressor

estimator = RandomForestRegressor()
param_grid = {
"n_estimators"      : [10,20,30],
"max_features"      : ["auto", "sqrt", "log2"],
"min_samples_split" : [2,4,8],
"bootstrap": [True, False],
}

grid = GridSearchCV(estimator, param_grid, n_jobs=-1, cv=5)

grid.fit(X_train, y_train)

return grid.best_score_ , grid.best_params_

def RFR(X_train, X_test, y_train, y_test, best_params):
from sklearn.ensemble import RandomForestRegressor
estimator = RandomForestRegressor(n_jobs=-1).set_params(**best_params)
estimator.fit(X_train,y_train)
y_predict = estimator.predict(X_test)
print "R2 score:",r2(y_test,y_predict)
return y_test,y_predict

def splitter_v2(tab,y_indicator):
from sklearn.model_selection import train_test_split
# Asignamos X e y, eliminando la columna y en X
X = correlacion(tab,y_indicator)
y = tab[:,y_indicator]
# Separamos Train y Test respectivamente para X e y
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
return X_train, X_test, y_train, y_test

Ich habe diese Funktion 5 Mal mit diesem Code verwendet:

Code: Select all

for i in range(5):
print "Loop: " , i
print "--------------"
X_train, X_test, y_train, y_test = splitter_v2(tabla,1)
best_score, best_params = Grid_Search_CV_RFR(X_train, y_train)
y_test , y_predict = RFR(X_train, X_test, y_train, y_test, best_params)
print "Best Score:" ,best_score
print "Best params:",best_params

Dies sind die Ergebnisse:

Code: Select all

Loop:  0
--------------
R2 score: 0.900071279487
Best Score: 0.61802821072
Best params: {'max_features': 'log2', 'min_samples_split': 2, 'bootstrap': False, 'n_estimators': 10}
Loop:  1
--------------
R2 score: 0.993462885564
Best Score: 0.671309726329
Best params: {'max_features': 'log2', 'min_samples_split': 4, 'bootstrap': False, 'n_estimators': 10}
Loop:  2
--------------
R2 score: -0.181378339338
Best Score: -30.9012120698
Best params: {'max_features': 'log2', 'min_samples_split': 4, 'bootstrap': True, 'n_estimators': 20}
Loop:  3
--------------
R2 score: 0.750116663033
Best Score: 0.71472985391
Best params: {'max_features': 'log2', 'min_samples_split': 4, 'bootstrap': False, 'n_estimators': 30}
Loop:  4
--------------
R2 score: 0.692075744759
Best Score: 0.715012972471
Best params: {'max_features': 'sqrt', 'min_samples_split': 2, 'bootstrap': True, 'n_estimators': 30}

¿Warum erhalte ich unterschiedliche Ergebnisse im R2-Score?, ¿Das liegt daran, dass ich CV=5 ausgewählt habe?, ¿Das liegt daran, dass ich auf meinem RandomForestRegressor() keinen random_state=0 ermittelt habe?

1764464560

Anonymous

[url=viewtopic.php?t=30561]Ich möchte[/url] die Parameter dieses [b]GridSearchCV[/b] für einen [b]Random Forest Regressor[/b] verbessern.

[code]def Grid_Search_CV_RFR(X_train, y_train):
from sklearn.model_selection import GridSearchCV
from sklearn.model_selection import ShuffleSplit
from sklearn.ensemble import RandomForestRegressor

estimator = RandomForestRegressor()
param_grid = {
"n_estimators"      : [10,20,30],
"max_features"      : ["auto", "sqrt", "log2"],
"min_samples_split" : [2,4,8],
"bootstrap": [True, False],
}

grid = GridSearchCV(estimator, param_grid, n_jobs=-1, cv=5)

grid.fit(X_train, y_train)

return grid.best_score_ , grid.best_params_

def RFR(X_train, X_test, y_train, y_test, best_params):
from sklearn.ensemble import RandomForestRegressor
estimator = RandomForestRegressor(n_jobs=-1).set_params(**best_params)
estimator.fit(X_train,y_train)
y_predict = estimator.predict(X_test)
print "R2 score:",r2(y_test,y_predict)
return y_test,y_predict

def splitter_v2(tab,y_indicator):
from sklearn.model_selection import train_test_split
# Asignamos X e y, eliminando la columna y en X
X = correlacion(tab,y_indicator)
y = tab[:,y_indicator]
# Separamos Train y Test respectivamente para X e y
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
return X_train, X_test, y_train, y_test
[/code]

Ich habe diese [b]Funktion[/b] 5 Mal mit diesem Code verwendet:

[code]for i in range(5):
print "Loop: " , i
print "--------------"
X_train, X_test, y_train, y_test = splitter_v2(tabla,1)
best_score, best_params = Grid_Search_CV_RFR(X_train, y_train)
y_test , y_predict = RFR(X_train, X_test, y_train, y_test, best_params)
print "Best Score:" ,best_score
print "Best params:",best_params
[/code]

Dies sind die [b]Ergebnisse[/b]:

[code]Loop:  0
--------------
R2 score: 0.900071279487
Best Score: 0.61802821072
Best params: {'max_features': 'log2', 'min_samples_split': 2, 'bootstrap': False, 'n_estimators': 10}
Loop:  1
--------------
R2 score: 0.993462885564
Best Score: 0.671309726329
Best params: {'max_features': 'log2', 'min_samples_split': 4, 'bootstrap': False, 'n_estimators': 10}
Loop:  2
--------------
R2 score: -0.181378339338
Best Score: -30.9012120698
Best params: {'max_features': 'log2', 'min_samples_split': 4, 'bootstrap': True, 'n_estimators': 20}
Loop:  3
--------------
R2 score: 0.750116663033
Best Score: 0.71472985391
Best params: {'max_features': 'log2', 'min_samples_split': 4, 'bootstrap': False, 'n_estimators': 30}
Loop:  4
--------------
R2 score: 0.692075744759
Best Score: 0.715012972471
Best params: {'max_features': 'sqrt', 'min_samples_split': 2, 'bootstrap': True, 'n_estimators': 30}
[/code]

¿Warum erhalte ich [b]unterschiedliche Ergebnisse[/b] im [b]R2-Score[/b]?, ¿Das liegt daran, dass ich [b]CV=5[/b] ausgewählt habe?, ¿Das liegt daran, dass ich auf meinem [b]RandomForestRegressor()[/b] keinen [b]random_state=0[/b] ermittelt habe?

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Quick Reply

Subject:

Username:

Change Text Case:

Smilies

View more smilies

Similar Topics

Replies

Views

Last post

Wann sollte numpy.random.randn(...) und wann numpy.random.rand(...) verwendet werden?

Last post by Guest « 04 Jan 2025, 05:55
Posted in Python

by Guest » 04 Jan 2025, 05:55 » in Python

In meiner Deep-Learning-Übung musste ich einen Parameter D1 mit der gleichen Größe wie A1 initialisieren, also habe ich Folgendes getan:
D1 = np.random.randn(A1.shape ,A1.shape )

Aber nachdem ich...

0 Replies

81 Views

Last post by Guest
04 Jan 2025, 05:55
Keine Standardwerte im Xgboost -Regressor -Modell [geschlossen]

Last post by Anonymous « 11 Jul 2025, 11:39
Posted in Python

by Anonymous » 11 Jul 2025, 11:39 » in Python

Ich stoße auf ein Problem in Bezug auf Xgboost -Regressor. Es erzeugt keine Standardwerte, wie in Abbildung unten gezeigt. Was könnte der Grund dafür sein, dass die Standardwerte für das XSGBOOST...

0 Replies

8 Views

Last post by Anonymous
11 Jul 2025, 11:39
GridSearchCV mit nach Zeit indizierten Daten

Last post by Guest « 05 Jan 2025, 15:49
Posted in Python

by Guest » 05 Jan 2025, 15:49 » in Python

Ich versuche, den GridSearchCV von sklearn.model_selection zu verwenden. Meine Daten sind eine Reihe von Klassifizierungen, die nach Zeit indiziert sind. Daher möchte ich bei der Kreuzvalidierung,...

0 Replies

17 Views

Last post by Guest
05 Jan 2025, 15:49
Verschachtelte Parallelität mit GridSearchCV verursacht unendlich hängen

Last post by Anonymous « 06 Mar 2025, 12:14
Posted in Python

by Anonymous » 06 Mar 2025, 12:14 » in Python

Ich führe eine GridSearchCV -Optimierung in eine parallelisierte Funktion aus. Der Pseudocode sieht so aus
from tqdm.contrib.concurrent import process_map
from sklearn.model_selection import...

0 Replies

12 Views

Last post by Anonymous
06 Mar 2025, 12:14
Probleme beim Entschlüsseln von Video- und Audio-Neuordnung mithilfe von Random State in Python

Last post by Guest « 12 Jan 2025, 04:33
Posted in Python

by Guest » 12 Jan 2025, 04:33 » in Python

Ich arbeite an einem Projekt, bei dem ich die Sekunden von Audio und Videobildern mithilfe eines zufälligen Zustands in Python neu anordne, mit dem Ziel, die Reihenfolge zu „verschlüsseln“. Nach der...

0 Replies

47 Views

Last post by Guest
12 Jan 2025, 04:33

Return to “Python”