Category: R statistical package

August 11, 2020

TV Shows on the “Big 3” Streaming Services

2020 has been a tough year, and I’ve been doing my best to keep busy (and distracted from all the insanity – both at the personal and worldwide levels). Earlier...continue reading.

Sara

June 26, 2020

Flying Saucers and Bright Lights: A Data Visualization

UFO Sightings by Shape and YearEarlier last week, I taught part 2 of a course on using R and tidyverse for my work colleagues. I wanted a fun dataset to...continue reading.

Sara

May 4, 2020

Statistics Sunday: My 2019 Reading

I’ve spent the month of April blogging my way through the tidyverse, while using my reading dataset from 2019 as the example. Today, I thought I’d bring many of those...continue reading.

Sara

May 1, 2020

Z is for Additional Axes

Here we are at the last post in Blogging A to Z! Today, I want to talk about adding additional axes to your ggplot, using the options for fill or...continue reading.

Sara

April 30, 2020

Y is for scale_y

Yesterday, I talked about scale_x. Today, I’ll continue on that topic, focusing on the y-axis.The key to using any of the scale_ functions is to know what sort of data...continue reading.

Sara

April 29, 2020

X is for scale_x

These next two posts will deal with formatting scales in ggplot2 – x-axis, y-axis – so I’ll try to limit the amount of overlap and repetition.Let’s say I wanted to...continue reading.

Sara

April 28, 2020

W is for Write and Read Data – Fast

Once again, I’m dipping outside of the tidyverse, but this package and its functions have been really useful in getting data quickly in (and out) of R.For work, I have...continue reading.

Sara

April 26, 2020

In this series, I’ve covered five terms for data manipulation:arrangefiltermutateselectsummariseThese are the verbs that make up the grammar of data manipulation. They all work with group_by to perform these functions...continue reading.

Sara

April 25, 2020

U is for Useful Trick

This will be a very short post for a line of code I’ve found unbelievably useful as I analyze data for work. I’m working with datasets containing millions of rows...continue reading.

Sara

April 24, 2020

T is for Themes

One of the easiest ways to make a beautiful ggplot is by using a theme. ggplot2 comes with a variety of pre-existing themes. I’ll use the genre statistics summary table...continue reading.

Sara

April 23, 2020

S is for summarise

Today, we’ll finally talk about summarise! It’s very similar to mutate, but instead of adding or altering a variable in a dataset, it aggregates your data, creating a new tibble...continue reading.

Sara

April 22, 2020

R is for read_

The tidyverse is full of functions for reading data, beginning with “read_”. The read_csv I’ve used to access my reads2019 data is one example, falling under the read_delim functions. read_tsv...continue reading.

Sara

April 21, 2020

Q is for qplot versus ggplot

Two years ago, when I did Blogging A to Z of R, I talked about qplots. qplots are great for quick plots – which is why they’re named as such...continue reading.

Sara

April 19, 2020

P is for percent

We’ve used ggplots throughout this blog series, but today, I want to introduce another package that helps you customize scales on your ggplots – the scales package. I use this...continue reading.

Sara

April 18, 2020

O is for order_by

This will be a quick post on another tidyverse function, order_by. I’ll admit, I don’t use this one as often as arrange. It can be useful, though, if you don’t...continue reading.

Sara

April 17, 2020

N is for n_distinct

Today, we’ll start digging into some of the functions used to summarise data. The full summarise function will be covered for the letter S. For now, let’s look at one...continue reading.

Sara

April 16, 2020

M is for mutate

Today, we finally talk about the mutate function! I’ve used it a lot throughout the series so far, so it’s nice to get to discuss what it is and how...continue reading.

Sara

April 15, 2020

L is for Log Transformation

When visualizing data, outliers and skewed data can have a huge impact, potentially making your visualization difficult to understand. We can use many of the tricks covered so far to...continue reading.

Sara

April 14, 2020

K is for Keep or Drop Variables

A few times in this series, I’ve wanted to display part of a dataset, such as key variables, like Title, Rating, and Pages. The tidyverse allows you to easily keep...continue reading.

Sara

April 12, 2020

J is for Join

Today, we’ll start digging into the wonderful world of joins! The tidyverse offers several different types of joins between two datasets, X and Y:left_join – keeps all rows from X...continue reading.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Category: R statistical package

TV Shows on the “Big 3” Streaming Services

Flying Saucers and Bright Lights: A Data Visualization

Statistics Sunday: My 2019 Reading

Z is for Additional Axes

Y is for scale_y

X is for scale_x

W is for Write and Read Data – Fast

V is for Verbs

U is for Useful Trick

T is for Themes

S is for summarise

R is for read_

Q is for qplot versus ggplot

P is for percent

O is for order_by

N is for n_distinct

M is for mutate

L is for Log Transformation

K is for Keep or Drop Variables

J is for Join

Editor Picks

Augmenting RNA-Ligand Binding Prediction With Machine Learning: A Leap Towards Enhanced Drug Discovery

{Shiny.Telemetry} 0.3.0: Track User Behavior In Your Shiny Applications

Categories

Platinum Sponsors

Sponsors

Buy us a coffee for $10.

Older posts