Hierarchical Clustering

In a previous post I discussed k-means clustering, which is a type of unsupervised learning method.  Today I want to add another tool to our modeling kit by discussing hierarchical clustering methods and their implementation in R. As in the k-means clustering post I will discuss the issue of clustering countries based on macro data.  … More Hierarchical Clustering

Shaving a Classifier with Occam’s Razor

I recently went through a Coursera course on Classification taught by Carlos Guestrin from U of Washington and thought it was excellent.   There was an interesting discussion on model overfitting that I thought I would share. In previous posts I discussed linear models with shrinkage parameters such us ridge and lasso regression models. Similar approach … More Shaving a Classifier with Occam’s Razor

Classification Trees

In today’s post I wanted to describe classification trees. I will concentrate on Classification and Regression Tree (CART) algorithm. As I discuss the main features of this algorithm I will contrast it with other widely used methods to construct classification trees. What A Classification Tree Looks Like: Before diving into the algorithm let’s have a … More Classification Trees

Lasso model example (LME’s Aluminium Futures Price)

In my previous post I showed a coordinate descent algorithm for solving Lasso coefficients. Lasso model is part of a family of penalized regression models that are popular in machine learning and predictive modeling. In today’s post I want to show you how this model can be used to estimate the monthly average price of … More Lasso model example (LME’s Aluminium Futures Price)

On US PMI

I have spent a month running around Toronto and Tokyo so did not get a chance to post earlier on the nasty PMI print coming out of US but decided to comment at this time so I can tie to my earlier post on ridge regression modeling. There is not much positive to say about … More On US PMI