Prepping data for #rstats #tidyverse and a priori planning
Many if not most data clean up, tidying, wrangling, and joining can be done directly in R. There are many advantages to this approach – i.e. read in data in...continue reading.
Many if not most data clean up, tidying, wrangling, and joining can be done directly in R. There are many advantages to this approach – i.e. read in data in...continue reading.
Many if not most data clean up, tidying, wrangling, and joining can be done directly in R. There are many advantages to this approach – i.e. read in data in...continue reading.
By Gabriel Vasconcelos Introduction Today we are going to talk about quantile regression. When we use the lm command in R we are fitting a linear regression using Ordinary Least...continue reading.
Many news reports scare us with machines taking over our jobs in the not too distant future. Common examples of take-over targets include professions like truck drivers, lawyers and accountants....continue reading.
The process used to generate the pdf of my evidence-based software engineering book has been on my list of things to blog about, for ever. An email arrived this afternoon,...continue reading.
Necesito para estar sentado, un arbolito en este descampado (Desarraigo, Extremoduro) From time to time I come back to experiment with this stunning photograph of Boris Karloff as Frankenstein’s monster....continue reading.
Suppose you have a dataset with many variables, and you want to check: if there are any duplicated for each of the observation replace duplicates with random value from pool...continue reading.
You’ll be pleased to know that Jumping rivers are running R training courses up and down the UK, in London, Newcastle, Belfast and Edinburgh. I’ve put together a quick summary...continue reading.
Hadley Wickham from RStudio has won the 2019 COPSS Award, which expresses a rather radical switch from the traditional recipient of this award in that this recognises his many contributions...continue reading.
The 4.3.0 release of simmer, the Discrete-Event Simulator for R, is on CRAN. Along with this update, we are very glad to announce that our homonymous paper finally appeared in the Journal...continue reading.
Hubert Baniecki created an awesome package dime for serverless HTML interactive model exploration. The experimental version is at Github, here is the pkgdown website. It is a part of the...continue reading.
Azure SQL Database has a new “serverless” mode in preview that eliminates compute costs when not in use. In this post, I’ll show how you can set up a serverless...continue reading.
XAI (eXplainable artificial intelligence) is a fast growing and super interesting area. Working with complex models generates lots of problems with model validation (on test data performance is great but...continue reading.
Microsoft Machine Learning Server, the enhanced deployment platform for R and Python applications, has been updated to version 9.4. This update includes the open source R 3.5.2 and Python 3.7.1...continue reading.
This is a reblog from the “Announcing Dash for R” announcement originally published July 10. Dash, the fastest growing framework for building analytic web applications on top of Python models, is...continue reading.
Grades are not Normally distributed. That’s not what’s seen naturally in grades and the idea is not supported by statistics. You can force grades to look Normally distributed, but doing...continue reading.
Conferences like userR & EARL are the R events to attend every year and personally, and as a company, I can’t imagine skipping one. It’s an important place to be...continue reading.
It’s been yet-another weirdly busy summer but I’m finally catching up on noting some recent-ish developments in the blog. First up is a full rewrite of the {wand} pacakge which...continue reading.
My second package implements change point testing procedures, especially those for end-of-sample change points. I demonstrate on an example computing a stock’s alpha and beta.continue reading.
Vom 09.-10. Oktober präsentieren wir von STATWORX gemeinsam mit BARC die Data University an der Goethe-Uni in Frankfurt. 2 Tage lang werden wir dort unser geballtes Data Science Wissen in...continue reading.