Summer 2019 workshops: clinical audit stats, and a data science primer for managers
I am doing two BayesCamp workshops in central London this summer: Statistical Analysis for Clinical Audit, 21 June [bookings] Data … continue reading.
After working on the MOB package, I received requests from multiple users asking whether I could write a binning function that takes the weighting scheme into consideration. It is a legitimate...continue reading.
After wrapping up the function batch_woe() today, which lets users apply WoE transformations to many independent variables simultaneously, I have completed the development of major functions...continue reading.
A Latin square Le Monde mathematical puzzle that I found rather dreary: A hidden 3×3 board contains all numbers from 1 to 9. Anselm wants to guess the board and...continue reading.
After Stata 15 came out in the summer of ’17, I wrote a series of blog posts for Timberlake, the … continue reading.
Although one would think that the basic concepts of statistics should be the same across all sciences, there is an amazing heterogeneity between fields in how statistics is taught and...continue reading.
In my GitHub repository (https://github.com/statcompute/MonotonicBinning), multiple R functions have been developed to implement the monotonic binning by using either iterative discretization or isotonic regression. With these functions, we can run...continue reading.
You’d be surprised at how many data scientists don’t know how to turn their probabilities into class labels. Oftentimes they will just go with 50% as the cutoff without...continue reading.
You’d be surprised at how many data scientists don’t know how to turn their probabilities into class labels. Oftentimes […] The post The Easiest Way to Create Thresholds And...continue reading.
There are plenty of methods out there for machine learning model interpretability but which is the best one? Find out how to do it right with RemixAutoML.continue reading.
There are plenty of methods out there for machine learning model interpretability but which is the best one? Find out how to do it right with RemixAutoML. The post Companies...continue reading.
The commute to my workplace is 90 minutes each way. Podcasts are my friend. I’m a long-time listener of In Our Time and enjoyed the recent episode about The Danelaw....continue reading.
This is about some academic work I did that never got published. But I think it should be out there … continue reading.
In my neverending quest to track The Popularity of Data Science Software, it’s time to update the section on Scholarly Articles. The rapid growth of R could not go on...continue reading.
In addition to monotonic binning algorithms introduced in my previous post (https://statcompute.wordpress.com/2019/03/10/a-summary-of-my-home-brew-binning-algorithms-for-scorecard-development), two more functions based on Generalized Boosted Regression Models have been added to my GitHub repository, gbm_bin() and...continue reading.
The view_df() function from the sjPlot-package creates nice „codeplans“ from your data sets, and also supports labelled data and tagged NA-values. This gives you a comprehensive, yet clear overview of your data set....continue reading.
The moon is a small well / the flowers are worth nothing / what matters are your arms / when they hold me at night (Zorongo Gitano, Carmen Linares) When I publish a post showing my drawings,...continue reading.
In my previous post (https://statcompute.wordpress.com/2019/03/10/a-summary-of-my-home-brew-binning-algorithms-for-scorecard-development), I’ve shown the different monotonic binning algorithms that I developed over time. However, these binning functions are all useless without a deployment vehicle in production. During...continue reading.
What’s in a Data Community? One of the UK’s top retailers, Sainsbury’s, knows just how much value can be derived from a community, which, when coming together in a data...continue reading.