Category: R

March 7, 2015

Text bashing in R for SQL

Reading Time: < 1 minute Fairly often, a coworker who is strong in Excel, but weak in writing code will come to me for help in special details about customers...continue reading.

Jeremy Jackson

March 7, 2015

Text bashing in R for SQL

Reading Time: < 1 minutes Fairly often, a coworker who is strong in Excel, but weak in writing code will come to me for help in special details about customers...continue reading.

Sara

February 16, 2015

Mapping the world with tweets

A few days ago, I collected 30 minutes of tweets all around the world. I used the twitteR and streamR packages for this. The nice thing about those tweets is...continue reading.

Sara

February 15, 2015

Bayesian network in R: Introduction

Bayesian networks (BNs) are a type of graphical model that encode the conditional probability between different learning variables in a directed acyclic graph. There are benefits to using BNs compared...continue reading.

Sara

January 27, 2015

Goodness of fit test in R

As a data scientist, occasionally, you receive a dataset and you would like to know what is the generative distribution for that dataset. In this post, I aim to show...continue reading.

Sara

January 19, 2015

stacked plot in R

Consider the following example: there is a three-stage truck maintenance pipeline. Initially, when a Truck comes to the maintenance service, it is added to the first stage and its status...continue reading.

tomizono

November 23, 2014

Calculates population growth rate λ along element changes

The previous article introduced the sensitivity and elasticity to seasonal matrix model of imaginary annual plant. Both sensitivity and elasticity are partial derivatives. This means the values can only predict...continue reading.

statcompute

October 28, 2014

Flexible Beta Modeling

library(betareg) library(sas7bdat) df1 <- read.sas7bdat(‘lgd.sas7bdat’) df2 <- df1[df1$y < 1, ] fml <- as.formula(‘y ~ x2 + x3 + x4 + x5 + x6 | x3 + x4 | x1...continue reading.

statcompute

October 26, 2014

Model Segmentation with Recursive Partitioning

library(party) df1 <- read.csv("credit_count.csv") df2 <- df1[df1$CARDHLDR == 1, ] mdl <- mob(DEFAULT ~ MAJORDRG + MINORDRG + INCOME + OWNRENT | AGE + SELFEMPL, data = df2, family =...continue reading.

statcompute

October 20, 2014

Estimating a Beta Regression with The Variable Dispersion in R

pkgs <- c(‘sas7bdat’, ‘betareg’, ‘lmtest’) lapply(pkgs, require, character.only = T) df1 <- read.sas7bdat("lgd.sas7bdat") df2 <- df1[which(df1$y < 1), ] xvar <- paste("x", 1:7, sep = ”, collapse = " +...continue reading.

tomizono

October 13, 2014

Sensitivity and Elasticity of seasonal matrix model

The previous article introduced the seasonal matrices and the population growth rate λ of imaginary annual plant. In this article, let’s try the sensitivity analysis of these matrices and the...continue reading.

statcompute

October 8, 2014

Fitting Lasso with Julia

Julia Code R Codecontinue reading.

statcompute

October 5, 2014

By-Group Aggregation in Parallel

Similar to the row search, by-group aggregation is another perfect use case to demonstrate the power of split-and-conquer with parallelism. In the example below, it is shown that the homebrew...continue reading.

statcompute

October 2, 2014

Vector Search vs. Binary Search

# REFERENCE: # user2014.stat.ucla.edu/files/tutorial_Matt.pdf pkgs <- c(‘data.table’, ‘rbenchmark’) lapply(pkgs, require, character.only = T) load(‘2008.Rdata’) dt <- data.table(data) benchmark(replications = 10, order = "elap…continue reading.

statcompute

September 29, 2014

Row Search in Parallel

I’ve been always wondering whether the efficiency of row search can be improved if the whole data.frame is splitted into chunks and then the row search is conducted within each...continue reading.

tomizono

September 28, 2014

Stage abundances, eigenvector of population matrix

The previous article introduced the seasonal matrices and the population growth rate λ of imaginary annual plant. This article focuses on the meaning of the eigenvector at first, and then...continue reading.

statcompute

September 25, 2014

Select Distinct Values with Pig

First of all, I used SQL statement with SQLDF package in R. It took ~51 seconds user time to select 12 rows out of 7 millions. Next, I used Apache...continue reading.

tomizono

August 31, 2014

Periodic matrix model for annual plant demography

Let’s challenge to build a matrix population model of annual organisms and then calculate the population growth rate λ using R. Consider a simple life cycle of imaginary annual plants;...continue reading.

Sara

August 28, 2014

Getting emotional in the absence of something: Using the Berlin Affective Word List to analyze emotional valence and arousal for nouns and adjectives.

This is something I did a while ago using the Berlin Affective Word List (BAWL).The BAWL contains ratings for 2902 German words (2107 nouns, 504 verbs, 291 adjectives). Ratings were...continue reading.

diffuseprior

June 13, 2014

Brazil’s Host Advantage

If history can tell us anything about the World Cup, it’s that the host nation has an advantage of all other teams. Evidence of this was presented last night as...continue reading.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Category: R

Text bashing in R for SQL

Text bashing in R for SQL

Mapping the world with tweets

Bayesian network in R: Introduction

Goodness of fit test in R

stacked plot in R

Calculates population growth rate λ along element changes

Flexible Beta Modeling

Model Segmentation with Recursive Partitioning

Estimating a Beta Regression with The Variable Dispersion in R

Sensitivity and Elasticity of seasonal matrix model

Fitting Lasso with Julia

By-Group Aggregation in Parallel

Vector Search vs. Binary Search

Row Search in Parallel

Stage abundances, eigenvector of population matrix

Select Distinct Values with Pig

Periodic matrix model for annual plant demography

Getting emotional in the absence of something: Using the Berlin Affective Word List to analyze emotional valence and arousal for nouns and adjectives.

Brazil’s Host Advantage

Editor Picks

Appsilon Joins the Pharmaverse Council to Advance Open-Source Clinical Reporting

R Highcharts: How to Make Interactive Maps for R and R Shiny

Categories

Platinum Sponsors

Sponsors

Buy us a coffee for $10.

Older posts