Author: Scott Stoltzman
Goal of this post: Add principal component analysis (PCA) Refactor using inheritance Convert gradient descent to stochastic gradient descent Add new tests via pytest What we are leaving for the...continue reading.
Goal of this post: Move beyond single linear regression into multiple linear regression by utilizing gradient descent Refactor using inheritance Reconfigure our pytest to include the general case What we...continue reading.
We have now entered part 2 of our series on object oriented programming in Python for machine learning. If you have not already done so, you may want to check...continue reading.
Data scientists who come to the career without a software background (myself included) tend to use a procedural style of programming rather than taking an object oriented approach. Changing styles...continue reading.
Adding data to your database Many people focusing on ETL will eventually be utilizing a database. We will be examining a relational database, SQLite in this case, to store and...continue reading.
Building a report that passes tests At this point, we have seen what our data looks like, how it is stored, and what some basic tests might look like. In...continue reading.
Simple testing of data: columns, data types, values In a previous post, we walked through data exploration / visualization and tests to see if our data fit basic requirements. The...continue reading.
Initial data acquisition and data analysis In order to get an idea of what our data looks like, we need to look at it! The Jupyter Notebook, embedded below, will...continue reading.
ETL (Extract, Transform, Load) is not always the favorite part of a data scientist’s job but it’s an absolute necessity in the real world. If you don’t understand this process,...continue reading.
Stoltzmaniac Fans – It’s time for a #100DaysOfCode update. I have completed 11 days of the challenge. Let me tell you, it has been a blast and I have already...continue reading.
Not all data analysis tools are created equal. Recently, I started looking into data sets to compete in Go Code Colorado (check it out if you live in CO). The problem...continue reading.