Data Visualization – Part 1
Introduction to Data Visualization – Theory, R & ggplot2 The topic of data visualization is very popular in the data science community. The market size for visualization products is valued...continue reading.
Introduction to Data Visualization – Theory, R & ggplot2 The topic of data visualization is very popular in the data science community. The market size for visualization products is valued...continue reading.
Merging and Joining data sets are key activities of any data scientist or analyst. In this tutorial, we explore the process of combining datasets based on common columns quickly and...continue reading.
Following on from Part 1 of this two-part post, I would now like to explain how the Naive Bayes classifier works before applying it to a classification problem involving breast...continue reading.
Introduction A very useful machine learning method which, for its simplicity, is incredibly successful in many real world applications is the Naive Bayes classifier. I am currently taking a machine...continue reading.
Should I learn R or Python for data science? I am asked this question regularly, both online and in person. There is a simple answer: it doesn’t matter. There are...continue reading.
R Tutorial: Visualizing multivariate relationships in Large Datasets A tutorial by D.M. Wiig In two previous blog posts I discussed some techniques for visualizing relationships involving two or three variables...continue reading.
When I was a young boy with a wild imagination, I used to try my hand at numerous sports ranging from tennis to gaelic footbal to soccer, each with varying...continue reading.
There is a plethora of classification algorithms available to people who have a bit of coding experience and a set of data. A common machine learning method is the random...continue reading.
When someone asks you how your weekend was, you don’t start off by (potentially) boring them with the unnecessary details of how your Aunt Sally’s train was late and you...continue reading.
When someone asks you how your weekend was, you don’t start off by (potentially) boring them with the unnecessary details of how your Aunt Sally’s train was late and you...continue reading.
Stoltzmaniac is going local in today’s blog post! I dug into the City of Fort Collins open data and published my findings below. The data was surprisingly clean and laid...continue reading.
The Situation You are a consultant who has been hired by a business that sells one commodity product. On December 31st the price is $100 per unit. The business owner...continue reading.
This post aims to illustrate use of TensorFlow framework for implementing a simple Matrix Factorization (MF). MF is one of the widely used recommender systems that is especially exploited when...continue reading.
This post is dedicated to my mother – Seinfeld’s greatest fan. Seinfeld is a classic TV sitcom. It featured four main characters surrounded by relatively normal, everyday, run of the...continue reading.
Marijuana, Alcohol, and Other Drugs Continuing our Exploration of the Data After identifying the sources of crime growth, it’s time to investigate specific crime rates. This blog post addresses drug...continue reading.
A few months ago a reader point me out this new way of connecting R and Excel. I don’t know for how long this has been around, but I never...continue reading.
Getting More Granular Where we’re going In having noticed the crime rates heading up over the last few years, taking a better look seemed more important. I want to first...continue reading.
Project Background As we all know, Colorado is considered one of the scariest places on earth. Denver, CO has had an enormous influx of people over the last decade and...continue reading.
Geocode your addresses for free with Python and Google For a recent project, I ported the “batch geocoding in R” script over to Python. The script allows geocoding of large numbers...continue reading.
Plentiful high-quality data is the key to great machine learning models. But good data doesn’t grow on trees, and that …Continue reading →continue reading.