Author: Scott Stoltzman

April 10, 2020

Who is The Average Customer?

I hate to be the one to break it to you, but the average customer shouldn’t be that important to you. I’m not writing this to repeat the marketing rhetoric you hear...continue reading.

Scott Stoltzman

April 8, 2020

Ketchup, Correlation and Outliers

The classic saying “correlation does not imply causation” is still an incredibly important thing to keep in mind when doing data analysis. Spurious regressions will sneak up on you and...continue reading.

Scott Stoltzman

March 28, 2020

Principal Component Analysis (PCA) – Part 4 – Python ML – OOP Basics

Goal of this post: add principal component analysis (PCA); refactor using inheritance; convert gradient descent to stochastic gradient descent; add new tests via pytest. What we are leaving for the...continue reading.

Scott Stoltzman

March 20, 2020

Multivariate Linear Regression – Part 3 – Refactoring – Python ML – OOP Basics

Goal of this post: move beyond single linear regression into multiple linear regression by utilizing gradient descent; refactor using inheritance; reconfigure our pytest to include the general case. What we...continue reading.

Scott Stoltzman

March 7, 2020

Single Linear Regression – Part 2 – Testing – Python ML – OOP Basics

We have now entered part 2 of our series on object oriented programming in Python for machine learning. If you have not already done so, you may want to check...continue reading.

Scott Stoltzman

March 3, 2020

Single Linear Regression – Part 1 – Python ML – OOP Basics

Data scientists who come to the career without a software background (myself included) tend to use a procedural style of programming rather than taking an object oriented approach. Changing styles...continue reading.

Scott Stoltzman

February 26, 2020

Building a Data Pipeline in Python – Part 5 of N – Database, ORM & SQLAlchemy

Adding data to your database: Many people focusing on ETL will eventually be utilizing a database. We will be examining a relational database, SQLite in this case, to store and...continue reading.

Scott Stoltzman

February 19, 2020

Building a Data Pipeline in Python – Part 4 of N – Basic Reporting

Building a report that passes tests: At this point, we have seen what our data looks like, how it is stored, and what some basic tests might look like. In...continue reading.

Scott Stoltzman

February 11, 2020

Building a Data Pipeline in Python – Part 3 of N – Testing Data

Simple testing of data: columns, data types, values. In a previous post, we walked through data exploration / visualization and tests to see if our data fit basic requirements. The...continue reading.

Scott Stoltzman

April 11, 2019

100 Days of Code – Completed!

I finished the #100DaysOfCode challenge and it feels great! I will tell you a little bit about my experience. Top 5 Takeaways: Sitting down and writing code every day...continue reading.

Scott Stoltzman

March 16, 2019

Building a Data Pipeline in Python – Part 2 of N – Data Exploration

Initial data acquisition and data analysis: In order to get an idea of what our data looks like, we need to look at it! The Jupyter Notebook, embedded below, will...continue reading.

Scott Stoltzman

March 3, 2019

ETL – Building a Data Pipeline With Python – Introduction – Part 1 of N

ETL (Extract, Transform, Load) is not always the favorite part of a data scientist’s job but it’s an absolute necessity in the real world. If you don’t understand this process,...continue reading.

Scott Stoltzman

January 13, 2019

100 Days of Code – What Does it Look Like at Day 11

Stoltzmaniac Fans – It’s time for a #100DaysOfCode update. I have completed 11 days of the challenge. Let me tell you, it has been a blast and I have already...continue reading.

Scott Stoltzman

January 3, 2019

New Year, New Challenge – 100 Days of Code

Starting the 100 Days of Code (#100DaysOfCode) challenge: I am always looking to boost my coding skills, and as I watch everyone make resolutions for the year,...continue reading.

Scott Stoltzman

October 29, 2018

Looking at Fertility in R

Fertility is something people don’t typically discuss openly in the US, which isn’t a surprise because it is an incredibly personal topic. In fact, it’s really difficult to even write...continue reading.

Scott Stoltzman

March 13, 2018

Exploratory Analysis – When to Choose R, Python, Tableau or a Combination

Not all data analysis tools are created equal. Recently, I started looking into data sets to compete in Go Code Colorado (check it out if you live in CO). The problem...continue reading.

Scott Stoltzman

December 2, 2017

George Washington as a Constitutional Word Cloud

Is George Washington better looking on the dollar bill or represented by a word cloud built with the text of The Constitution of the USA? A colleague recently asked me...continue reading.

Scott Stoltzman

November 10, 2017

Simulating Probabilities – The Monty Hall Problem

Psychology vs. Probability: Anyone old enough to remember the Monty Hall problem from the old TV show Let’s Make a Deal? It’s a classic probability problem – but despite its...continue reading.

Scott Stoltzman

September 27, 2017

Microsoft Cognitive Services Vision API in R

A little while ago I did a brief tutorial of the Google Vision API using RoogleVision created by Mark Edmondson. I couldn’t find...continue reading.

