Logistic regression with categorical data in Ruby
I had some fun analysing the shelter animal data from Kaggle using the Ruby gems daru for data wrangling and statsample-glm for model fitting. In this blog post, I want...continue reading.
Almost six months ago (!) I wrote a blog post about the NEISS data set, a sample of accidents reported to emergency rooms in the U.S. that are related to...continue reading.
A few weeks ago I had the great pleasure of attending the BioC 2016: Where Software and Biology Connect conference at Stanford, where I learned a lot! It wouldn’t be possible...continue reading.
New York City is a wonderful place to be most of the time but especially in September! If you live or work in the city or just want a good...continue reading.
Monte Carlo analysis is a great way to explore the impact of input variable uncertainty on the results of engineering equations, and with vector variables and distribution and sampling functions...continue reading.
I’ve just updated the instructions for building a 64-bit OpenBLAS-based Rblas.dll for Windows to reflect changes to R 3.3+ and Rtools34. Enjoy! The post Updated OpenBLAS instructions for R-3.3+ and...continue reading.
This post will probably be the last in my series about merging R and ArcGIS. Unfortunately, in August I will have to work for real and I will not have...continue reading.
This blog post demonstrates the usage of the R package dplyr. It turns out that dplyr is intuitive to the point where I probably won’t ever need to look back...continue reading.
The JSM conference in Chicago, July 31 through August 4, 2016, is one of the largest statistics conferences, with many terrific talks for R users. We’ve listed...continue reading.
I am pleased to update you on the Data Science Boot Camp we ran at the Ted Rogers School of Management at Ryerson University in Toronto in collaboration with IBM’s...continue reading.
Suppose that you would like to create a function which does a series of computations on a data frame. You would like to pass a column as this function’s argument....continue reading.
This is a brief tutorial on the cdlTools package, developed by Lu Chen and me, to download and perform some simple analysis on USDA’s cropland data layer (CDL). This tutorial...continue reading.
In this post I will introduce another toolbox I created to show the functions that can be added to ArcGIS by using R and the R-Bridge technology. In this toolbox I...continue reading.
Overview: Anomaly detection algorithms are core to many fraud and security applications/business solutions. Identifying cases where specific values are outside norms can be useful in outlier detection (as a predicate...continue reading.
Part-of-speech tagging or POS tagging of texts is a technique that is often performed in Natural Language Processing. It allows...continue reading.
When you’re dealing with natural language data, especially survey data, misspelled words occur quite often in free-text answers and might...continue reading.
Marcus Beck (USEPA) Laura DeCicco (USGS-OWI) Introduction EGRET is an R-package for the analysis of long-term changes in water quality and streamflow, and includes the water-quality method Weighted Regressions on...continue reading.
Preamble: Every few months, I try to do a clean install on my machine. I know that OS X Sierra is due out in September, but I elected to do...continue reading.
In this post I present my third experiment with R-Bridge. The plotting toolbox is a plug-in for ArcGIS 10.3.x that allows the creation of beautiful and informative plots with ggplot2,...continue reading.