Causal Inference cheat sheet for data scientists
Being able to make causal claims is a key business value for any data science team, no matter their size.Quick analytics (in other words, descriptive statistics) are the bread and...continue reading.
Being able to make causal claims is a key business value for any data science team, no matter their size.Quick analytics (in other words, descriptive statistics) are the bread and...continue reading.
Despite their advantages, Dynamic Shiny Modules can destabilize the Shiny environment and cause its reactive graph to be rendered multiple times. In this blogpost, I present how to remove deleted...continue reading.
To increase revenue, customers should be offered products they may need or films they might like. In this blog post, our colleague Andreas explains how to train your own movie...continue reading.
When I started my evidence-based software engineering book, nobody had written a data analysis book for software developers, so I had to write one (in fact, a book on this...continue reading.
When I started my evidence-based software engineering book, nobody had written a data analysis book for software developers, so I had to write one (in fact, a book on this...continue reading.
Deep learning need not be irreconcilable with privacy protection. Federated learning enables on-device, distributed model training; encryption keeps model and gradient updates private; differential privacy prevents the training data from...continue reading.
A summary of common problems that my colleagues and I had when migrating R / packages to newer version.continue reading.
📌 Learn about this feature in a special live webinar and AMA on May 7, 2020 at 1pm EDT with Plotly’s CTO and cofounder, Alex Johnson. Written by: Alex Johnson, Plotly CTO Before...continue reading.
R 4.0.0 was released in source form on Friday, and binaries for Windows, Mac and Linux are available for download now. As the version number bump suggests, this is a...continue reading.
By Marek Rogala and Jędrzej Świeżewski, PhD In this article, we focus on the technical aspects of the machine learning solution that we implemented for the xView2 competition. We created...continue reading.
R is widely popular and incredibly useful for people working as Data Scientists or in companies. But you can also use R for more simple things, like creating a nice...continue reading.
I have written couple of blog posts on R packages (here | here ) and this blog post is sort of a preset of all the most needed packages for...continue reading.
“When is Mom’s birthday?” “Remind me to pick up flowers and a cake this afternoon.” “How do I get to the nearest flower shop?” “Find the best bakeries near me.”...continue reading.
A weekly Monde current mathematical puzzle that reminded me of an earlier one (but was too lazy to check): The integer n=36 enjoys the property that all the differences between...continue reading.
A new sparklyr release is now available. This sparklyr 1.2 release features new functionalities such as support for Databricks Connect, a Spark backend for the ‘foreach’ package, inter-op improvements for...continue reading.
Launch of New Course Platform After months of hard work we are really excited to launch our brand-new course platform to learn and apply data science. Together with the new...continue reading.
“…an essential part of understanding how many ties these RNGs produce is to understand how many ties one expects in 32-bit integer arithmetic.” A sort of a birthday-problem paper for...continue reading.
This is the second of two articles about our recent participation in the Pandemic Response Hackathon. Our project (CoronaRank) was one of only 5 projects out of 230 submissions chosen...continue reading.
Is GPL software overly restrictive? Software licensing can be a personal or philosophical choice, and we’re not looking to incite controversy. But, we do want to examine why more and...continue reading.