Folks, C’mon, Use Parquet
In today’s data-driven landscape, the way we store and manage data can significantly impact both efficiency and decision-making processes. While CSV files have long been the go-to format for quick...continue reading.
In today’s data-driven landscape, the way we store and manage data can significantly impact both efficiency and decision-making processes. While CSV files have long been the go-to format for quick...continue reading.
Using R as an alternative to SAS (Statistical Analysis System) offers bespoke interactivity on top of R routines. It enables effective technical handling while engaging non-technical users through interactive data...continue reading.
Appsilon’s Top 10 Picks for the Upcoming posit::conf 2023 The posit::conf 2023 event is on the horizon, and the Appsilon team is brimming with excitement as they unveil their selection...continue reading.
In this tutorial, I’ll show you how to use the Sklearn Logistic Regression function to create logistic regression models in Python. I’ll quickly review what logistic regression is, explain the...continue reading.
Sooner or later, most R programmers end up with code that no longer runs because of package updates. One way to address the problem was the MRAN Time Machine which...continue reading.
Are attending this year’s Joint Statistical Meetings in Toronto? If so, stop by booth 404 to see the latest features of BlueSky Statistics. A menu-based graphical user interface for the...continue reading.
A couple of days ago, I wrote a blog post about how GPT writes bad Pandas code. If you’ve been reading at the blog for a while, you’ve probably realized...continue reading.
I’ve updated The Popularity of Data Science Software‘s market share estimates based on scholarly articles. I posted it below, so you don’t have to sift through the main article to...continue reading.
I have a confession. I’m an American, and I was raised an American. But, I’ve been semi-nomadic for almost 10 years. A “digital nomad.” That is, I’ve lived in many...continue reading.
This is a great opportunity and I am honoured to be hosting this months T-SQL Tuesday blogging invitation. With the invitation of Steve, we have agreed to post topic on...continue reading.
For years, people have been talking about the era of “big data.” The data deluge. As early as 2010, The Economist wrote about how “quantity of information in the world...continue reading.
I’ll never forget what my manager said to me. “Do you need me to break down some doors? Do you need me to flip over some tables?” He was talking...continue reading.
Unless you’ve been living in a secluded hut on the side of a remote mountain in Nepal, you’ve probably heard major chatter about AI in the last few months. First,...continue reading.
While it sounds like the title of a science-fiction catastrophe novel or of a (of course) convoluted nouveau roman, this book by Nick Huntington-Klein is a massive initiation to econometrics...continue reading.
Recently, I wrote a blog post stating that I’m starting to use Python for most of my data science work. And I’m starting to recommend Python as the data science...continue reading.
Recently, I wrote a blog post stating that I’m starting to use Python for most of my data science work. And I’m starting to recommend Python as the data science...continue reading.
There are multiple data science languages to choose from. The most popular being R and Python. Many people will tell you to learn both. Learn R AND Python. I think...continue reading.
The first question that aspiring data scientists ask me is “what is the best data science language … which should I learn.” And almost always, this is framed as a...continue reading.
After much deliberation, I have updated my personal R code style guide to no longer mandate functions begin with uppercase letters. While I still feel there is value in being...continue reading.
In this tutorial, I’ll show you how to use the Numpy transpose function. I’ll explain what the function does, how the syntax works, and I’ll show you step-by-step examples of...continue reading.