Category: Machine learning

January 2, 2016

The Power of Decision Stumps

A decision stump is the weak classification model with the simple tree structure consisting of one split, which can also be considered a one-level decision tree. Due to its simplicity,...continue reading.

Sara

January 1, 2016

Wind Resource Assessment

This is an article we recently published on “Renewable and Sustainable Energy Reviews”. It starts with a thorough review of the methods used for wind resource assessment: from algorithms based...continue reading.

Sara

December 19, 2015

Evaluation of time series forecasting using Spark windowing

Evaluation metrics play a critical role in machine learning ecosystem. Especially for machine learning products, evaluation metrics are like the heart beats. They show how healthy the model is and...continue reading.

Sara

December 19, 2015

Evaluation of time series forecasting using Spark windowing

Christoph Glur

November 23, 2015

Shiny 1: Amazon AWS for R and Shiny

Learn how to set up an Amazon AWS Ubuntu instance on which you can install R , RStudio, OpenCPU, or Shiny Server. The post Shiny 1: Amazon AWS for R...continue reading.

Sara

September 25, 2015

hat tip: join two spark dataframe on multiple columns (pyspark)

Consider the following two spark dataframes:df1.show()+—-+——+——-+|id_a|time_a|value_a|+—-+——+——-+| 1| 1| CA|| 1| 2| CA|| 2| 1| TX|| 3| 5| NE|| 4| 6| WA|+—-+——+——-+df2.show(…continue reading.

statcompute

March 21, 2015

Ensemble Learning with Cubist Model

The tree-based Cubist model can be easily used to develop an ensemble classifier with a scheme called “committees”. The concept of “committees” is similar to the one of “boosting” by...continue reading.

statcompute

March 19, 2015

Model Segmentation with Cubist

Cubist is a tree-based model with a OLS regression attached to each terminal node and is somewhat similar to mob() function in the Party package (https://statcompute.wordpress.com/2014/10/26/model-segmentation-with-recursive-partitioning). Below is a demonstrate...continue reading.

statcompute

October 8, 2014

Fitting Lasso with Julia

Julia Code R Codecontinue reading.

statcompute

February 4, 2013

A Grid Search for The Optimal Setting in Feed-Forward Neural Networks

The feed-forward neural network is a very powerful classification model in the machine learning content. Since the goodness-of-fit of a neural network is majorly dominated by the model complexity, it...continue reading.

statcompute

January 12, 2013

PART – A Rule-Learning Algorithm

> require(‘RWeka’) > require(‘pROC’) > > # SEPARATE DATA INTO TRAINING AND TESTING SETS > df1 <- read.csv(‘credit_count.csv’) > df2 <- df1[df1$CARDHLDR == 1, 2:12] > set.seed(2013) > rows <-...continue reading.

statcompute

December 19, 2012

Generalized Boosted Regression with A Monotonic Marginal Effect for Each Predictor

In the practice of risk modeling, it is sometimes mandatory to maintain a monotonic relationship between the response and each predictor. Below is a demonstration showing how to develop a...continue reading.

statcompute

December 3, 2012

Exchange Data between Python and R with SQLite

SQLite is a light-weight database with zero-configuration. Being fast, reliable, and simple, SQLite is a good choice to store / query large data, e.g. terabytes, and is well supported by...continue reading.

quantsignals

November 12, 2012

Portfolio Trading

In finance and investing the term portfolio refers to the collection of assets one owns. Compared to just holding a single asset at a time a portfolio has a number...continue reading.

statcompute

October 8, 2012

Fit and Visualize A MARS Model

################################################# ## FIT A MULTIVARIATE ADAPTIVE REGRESSION ## ## SPLINES MODEL (MARS) USING MDA PACKAGE ## ## DEVELOPED BY HASTIE AND TIBSHIRANI ## ##############################################…continue reading.

quantsignals

September 27, 2012

E-Learning to Machine Learn

machine learningA quick post about online educational resources on machine learning. Perhaps its a sign of increasing popularity of the field that there are now several courses on machine learning...continue reading.

quantsignals

September 26, 2012

Learning Kernels SVM

Machine Learning and Kernels A common application of machine learning (ML) is the learning and classification of a set of raw data features by a ML algorithm or technique. In...continue reading.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Category: Machine learning

The Power of Decision Stumps

Wind Resource Assessment

Evaluation of time series forecasting using Spark windowing

Evaluation of time series forecasting using Spark windowing

Shiny 1: Amazon AWS for R and Shiny

hat tip: join two spark dataframe on multiple columns (pyspark)

Ensemble Learning with Cubist Model

Model Segmentation with Cubist

Fitting Lasso with Julia

A Grid Search for The Optimal Setting in Feed-Forward Neural Networks

PART – A Rule-Learning Algorithm

Generalized Boosted Regression with A Monotonic Marginal Effect for Each Predictor

Exchange Data between Python and R with SQLite

Portfolio Trading

Fit and Visualize A MARS Model

E-Learning to Machine Learn

Learning Kernels SVM

Editor Picks

How to prevent data leakage in pandas & scikit-learn ☔

Q1 2024 tidymodels digest

Categories

Platinum Sponsors

Sponsors

Buy us a coffee for $10.

Older posts