What you need to know about data augmentation for machine learning
Plentiful high-quality data is the key to great machine learning models. But good data doesn’t grow on trees, and that …Continue reading →continue reading.
Plentiful high-quality data is the key to great machine learning models. But good data doesn’t grow on trees, and that …Continue reading →continue reading.
Continuing to play with Julia and data visualizations. This time I decided to replicate a scatterplot created by Matt Stiles examining the relationship between a country’s average temperature and its...continue reading.
Science is hard. Why make it harder? Scientists and researchers spend a lot of time on data preparation and analysis, and some of these analyses are quite computationally intensive. The...continue reading.
Today we’re excited to announce R Notebooks, which add a powerful notebook authoring engine to R Markdown. Notebook interfaces for data analysis have compelling advantages including the close association of...continue reading.
Introduction This script utilizes the new dataRetrieval package access to the USGS Statistics Web Service. We will be pulling daily mean data using the daily value service in readNWISdata, and...continue reading.
The Python Data Analysis Library pandas provides basic but reliable Excel in- and output. However, more advanced features for writing … Read More →continue reading.
I’m pleased to announce the release of haven. Haven is designed to faciliate the transfer of data between R and SAS, SPSS, and Stata. It makes it easy to read...continue reading.
On Wednesday, Mathias Drton and I will be presenting a read paper on Bayesian model choice for singular models at the Royal Statistical Society in London. You can read more...continue reading.
What is R? I was asked at the end of my presentation on the 10th Cracow R Users Meetup that was held last Friday (30.09.2016). I felt strange but absolutely...continue reading.
This one-file project fetches Global Surface Summary of the Day (GSOD) from the National Oceanic and Atmospheric Administration (NOAA)’s HTTP server (data are also available on their FTP). See the code...continue reading.
This one-file project fetches Global Surface Summary of the Day (GSOD) from the National Oceanic and Atmospheric Administration (NOAA)’s HTTP server (data are also available on their FTP). See the code...continue reading.
Pandas Data Selection There are multiple ways to select and index rows and columns from Pandas DataFrames. I find tutorials online focusing on advanced selections of row and column choices a...continue reading.
A couple of weeks ago, I posted a map of the traffic fatalities in Arkansas in 2015. The data came from the NHTSA, and the graphic I posted was just...continue reading.
I’m planning to release ggplot2 2.2.0 in early November. In preparation, I’d like to announce that a release candidate is now available: version 2.1.0.9001. Please try it out, and file...continue reading.
Today I’m giving a talk for the Department of Physics and Astronomy at the University of Utah about careers outside academia for astronomers and physicists. Check out my slides here,...continue reading.
As a follow-up to my Primer On Universal Function Approximation with Deep Learning, I’ve created a project on Github that …Continue reading →continue reading.
Earlier this week, I published a post about song lyrics and how different U.S. states are mentioned at different rates, and at different rates relative to their populations. That was...continue reading.
Warsaw R and Data Analytics Enthusiast group is an effort that aims at integrating users of the R language in Warsaw, Poland. Our group has over 970 members at its...continue reading.
This blog hands you the R code needed to create a choropleth heatmap on top of a Google Maps plot of Amsterdam. With this plot you can easily compare the...continue reading.
Parallel Coordinate Plots are useful to visualize multivariate data. R provides several packages/functions to draw Parallel Coordinate Plots (PCPs): ggparcoord … Read More →continue reading.