Alex Harlan
Blog
Projects
Tagged in
k-fold
RECENT POSTS
Blog
Alex Harlan
—
Wed 25 July 2018
You Might Be Leaking Data Even if You Cross Validate
Tags
SQL
churn rate
classification
decision trees
random forest
boosting
Tableau
crunchbase
scraping
logistic regression
PCA
optimization
gradient descent
cross validation
k-fold
nested cv
leakage
t-test
ANOVA
underfitting
overfitting
variance
bias
confusion matrix
jupyter-notebook
regression
p-hacking
bonferroni correction
multiple testing
regularization
lasso
ridge
Knn
logit
numpy
scipy
pandas
matplot
sqlite
polynomial regression
OLS
precision
recall
binary classification
python
beautifulsoup