A checklist for choosing between #rstats packages
The paradox of choice can at times be a challenge. There are well over 10,000 packages on CRAN now (likely 16,000), and there have been suggestions on how to find...continue reading.
Our latest tool development at STATWORX: random boost, an algorithm twice as fast as gradient boosting, with comparable prediction performance. The post How to Speed Up Gradient Boosting by a...continue reading.
When this blog moved from bioinformatics to data science I ran a Twitter poll to ask whether I should start afresh at a new site or continue here. “Continue here”,...continue reading.
RStudio Connect 1.7.2 is ready to download, and this release contains some long-awaited functionality that we are excited to share. Several authentication and user-management tooling improvements have been added, including...continue reading.
This spring, I’ll be giving talks at a couple of Meetups and conferences: March, 26th: At the data lounge Bremen, I’ll be talking about Explainable Machine Learning April, 11th: At...continue reading.
Slides (in English): https://nowosad.github.io/BioGIS_19/workshop/#1. Slides (in Polish): https://nowosad.github.io/BioGIS_19/workshop-pl/#1. Continue reading.
The slides are available here. Continue reading.
Roland Stevenson is a data scientist and consultant who may be reached on Linkedin. When accessing an API or database in R, it is often necessary to provide credentials such...continue reading.
There’s a lot going on in the development version of {tidyr}. New functions for pivoting data frames, pivot_wide() and pivot_long() are coming, and will replace the current functions, spread() and...continue reading.
In my previous post, I discussed Gartner’s reviews of data science software companies. In this post, I show Forrester’s coverage and discuss how radically different it is. As usual, this...continue reading.
Labelling data is typically a task for end-users and is applied in their own scripts or functions rather than in packages. However, sometimes it can be useful for both end-users and...continue reading.
The Data Walking project was organised and written up by David Hunter at Ravensbourne University London (which you might remember … continue reading.
Co-Author: Eric Kammers Part 1 – Theoretical Background The Dynamic Mode Decomposition (DMD) was originally developed for its application in fluid dynamics where it could decompose complex flows into simpler...continue reading.
There’s been a lot of talk about “dependencies” in the R universe of late. This is not really a post about that but more of a “really, don’t do this” if...continue reading.
Version 2.1.1 of the tibble package is on CRAN now. Tibbles are a modern reimagining of the data frame, keeping what time has shown to be effective, and throwing out...continue reading.
RStudio have recently announced ‘RStudio Connect QuickStart’, a VM containing a full suite of RStudio’s pro tools, available to be trialled for a 45-day period. RStudio Connect...continue reading.
Opening the black box in complex models: SHAP values. What are they and how to draw conclusions from them? With R code example! Continue reading.
The fine folks over at @PacketTotal bequeathed an API token on me so I cranked out an R package for it to enable more dynamic investigations work (RStudio makes for...continue reading.