Seeing the wood for the trees
A technique for interactively visualising “bigger data”. The post Seeing the wood for the trees appeared first on thinkr.continue reading.
A technique for interactively visualising “bigger data”. The post Seeing the wood for the trees appeared first on thinkr.continue reading.
Well, 2018 has flown by and today seems like an appropriate time to take a look at the landscape of R bloggerdom as seen through the eyes of readers of...continue reading.
Giddy up, giddy it upWanna move into a fool’s gold roomWith my pulse on the animal jewelsOf the rules that you choose to use to get looseWith the luminous movesBored...continue reading.
The truth is out there R readers, but often it is not what we have been led to believe. The previous post examined the strong positive results bias in optimism...continue reading.
In the previous parts of the series we demonstrated a positive results bias in optimism corrected bootstrapping by simply adding random features to our labels. This problem is due to...continue reading.
Welcome to part III of debunking the optimism corrected bootstrap in high dimensions (quite high number of features) in the Christmas holidays. Previously we saw with a reproducible code implementation...continue reading.
A cheezy Le Monde mathematical puzzle : (which took me much longer to find [in the sense of locating] than to solve, as Warwick U does not get a daily...continue reading.
Some people are very fond of the technique known as ‘optimism corrected bootstrapping’, however, this method is clearly bias and this becomes apparent as we increase the number of features...continue reading.
Creating and plotting decision trees (like one below) for the models created in H2O will be main objective of this post:Figure 1. Decision Tree Visualization in RDecision Trees with H2OWith...continue reading.
In the scorecard development, the area under ROC curve, also known as AUC, has been widely used to measure the performance of a risk scorecard. Given everything else equal, the...continue reading.
There are lots of ways to assess how predictive a model is while correcting for overfitting. In Caret the main methods I use are leave one out cross validation, for...continue reading.
With the holidays approaching, one of the most discussed questions at STATWORX was whether we’ll have a white Christmas or not. And what better way to get our hopes up,...continue reading.
In the last 23 days I presented one function each day from the helfRlein package we created here at STATWORX. I hope you found some of the functions useful and...continue reading.
Searching for Speed In my previous post, I described the method I use for compiling Fortran (or C) into an R package using the .Call interface. This post will compare...continue reading.
Phishing is [still] the primary way attackers either commit a primary criminal act (i.e. phish a target to, say, install ransomware) or is the initial vehicle used to gain a...continue reading.
This little helper adds functionality to the base R function strsplit – hence the same name! Der Beitrag Day 23 – little helper strsplit erschien zuerst auf STATWORX.continue reading.
This little helper does the same thing as the “Find in files” search within RStudio. Der Beitrag Day 22 – little helper get_files erschien zuerst auf STATWORX.continue reading.
Week 16 Gold Mining and Fantasy Football Projection Roundup now available. Go get that free agent gold! The post Gold-Mining Week 16 (2018) appeared first on Fantasy Football Analytics.continue reading.
This little helper returns indices of recurring patterns. It works with numbers as well as with characters. Der Beitrag Day 21 – little helper get_sequence erschien zuerst auf STATWORX.continue reading.
R 3.5.2, the latest version of the R language for statistical computation and graphics from the R Foundation, was released today. (This release is codenamed “Eggshell Igloo”, likely in reference...continue reading.