Learning scikit-learn: Machine Learning in Python by 2013

Author:2013
Language: eng
Format: mobi, epub
Publisher: Packt Publishing


Our tree has an accuracy of 0.838 on the training set. But remember that this is not a good indicator, especially for decision trees, which are highly susceptible to overfitting. Since we did not set aside an evaluation set, we should apply cross-validation. For this example, we will use an extreme case of cross-validation called leave-one-out cross-validation: for each instance in the training sample, we train on the rest of the sample and evaluate the resulting model on the single instance left out. After performing as many classifications as there are training instances, we calculate the accuracy simply as the proportion of times our method correctly predicted the class of the left-out instance. As expected, it turns out to be a little lower than the resubstitution accuracy on the training set.
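The leave-one-out procedure can be sketched without any library, which may make the bookkeeping easier to follow. This is only an illustration under stated assumptions: the `samples` list is hypothetical toy data, and the one-nearest-neighbor rule in `predict_1nn` is a stand-in classifier chosen for brevity, not the decision tree used in this chapter.

```python
# Hypothetical toy data: (feature, label) pairs, invented for illustration.
samples = [(1.0, 0), (1.2, 0), (0.8, 0), (3.0, 1), (3.2, 1), (2.9, 1), (2.0, 1)]

def predict_1nn(train, x):
    """Stand-in classifier: return the label of the training point closest to x."""
    nearest = min(train, key=lambda pair: abs(pair[0] - x))
    return nearest[1]

def resubstitution_accuracy(data, predict):
    """Train and evaluate on the same instances (an optimistic estimate)."""
    hits = sum(1 for x, y in data if predict(data, x) == y)
    return hits / len(data)

def leave_one_out_accuracy(data, predict):
    """For each instance, train on all the others and test on the one left out."""
    hits = 0
    for i, (x, y) in enumerate(data):
        train = data[:i] + data[i + 1:]  # every instance except the i-th
        if predict(train, x) == y:
            hits += 1
    return hits / len(data)

resub_accuracy = resubstitution_accuracy(samples, predict_1nn)
loo_accuracy = leave_one_out_accuracy(samples, predict_1nn)
print("Resubstitution accuracy: {0:.3f}".format(resub_accuracy))
print("Leave-one-out accuracy:  {0:.3f}".format(loo_accuracy))
```

On this toy data the resubstitution accuracy is perfect because each point is its own nearest neighbor, while the leave-one-out estimate is lower, which mirrors the gap between training accuracy and cross-validated accuracy discussed above.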

>>> import numpy as np
>>> from scipy.stats import sem
>>> from sklearn import metrics
>>> # Note: sklearn.cross_validation was removed in scikit-learn 0.20;
>>> # newer releases provide LeaveOneOut in sklearn.model_selection.
>>> from sklearn.cross_validation import cross_val_score, LeaveOneOut
>>>
>>> def loo_cv(X_train, y_train, clf):
>>>     # Perform leave-one-out cross-validation.
>>>     # We are performing 1313 classifications!
>>>     loo = LeaveOneOut(X_train[:].shape[0])
>>>     scores = np.zeros(X_train[:].shape[0])
>>>     for train_index, test_index in loo:
>>>         # Split into all-but-one training data and the single held-out instance
>>>         X_train_cv, X_test_cv = X_train[train_index], X_train[test_index]
>>>         y_train_cv, y_test_cv = y_train[train_index], y_train[test_index]
>>>         clf = clf.fit(X_train_cv, y_train_cv)
>>>         y_pred = clf.predict(X_test_cv)
>>>         # Each fold scores 1.0 (correct) or 0.0 (incorrect)
>>>         scores[test_index] = metrics.accuracy_score(
>>>             y_test_cv.astype(int), y_pred.astype(int))
>>>     print("Mean score: {0:.3f} (+/-{1:.3f})".format(np.mean(scores), sem(scores)))
>>>
>>> loo_cv(X_train, y_train, clf)
Mean score: 0.837 (+/-0.012)
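For readers on a recent scikit-learn release, the same evaluation can probably be written more compactly with `sklearn.model_selection`. The following is a minimal sketch, not the book's code: the iris dataset stands in for the training data used in this chapter, and the `max_depth` and `random_state` values are illustrative assumptions.

```python
import numpy as np
from scipy.stats import sem
from sklearn.datasets import load_iris
from sklearn.model_selection import LeaveOneOut, cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Assumption: iris is used here only as a small stand-in dataset.
X, y = load_iris(return_X_y=True)

# Assumption: these hyperparameters are illustrative, not tuned.
clf = DecisionTreeClassifier(max_depth=3, random_state=33)

# One fold per instance; each fold's accuracy is either 0.0 or 1.0.
scores = cross_val_score(clf, X, y, cv=LeaveOneOut())

print("Mean score: {0:.3f} (+/-{1:.3f})".format(np.mean(scores), sem(scores)))
```

Here `cross_val_score` handles the fit-and-predict loop internally, so the mean of `scores` is the proportion of left-out instances classified correctly, and `sem` gives the standard error of that mean.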


