unsupervised classification with R

unsupervised classification with R

m

January 29, 2016

Here we see three simple ways to perform an unsupervised classification on a raster dataset in R. I will show these approaches, but first we need to load the relevant packages and the actual data. You could use the Landsat data used in the “Remote Sensing and GIS for Ecologists” book which can be downloaded here.

library("raster")  
library("cluster")
library("randomForest")

# loading the layerstack  
# here we use a subset of the Landsat dataset from "Remote Sensing and GIS for Ecologists" 
image <- stack("path/to/raster")
plotRGB(image, r=3,g=2,b=1,stretch="hist")

RGBimage

Now we will prepare the data for the classifications. First we convert the raster data in a matrix, then we remove the NA-values.

## returns the values of the raster dataset and write them in a matrix. 
v <- getValues(image)
i <- which(!is.na(v))
v <- na.omit(v)

The first classification method is the well-known k-means method. It separates n observations into  k clusters. Each observation belongs to the cluster with the nearest mean.

## kmeans classification 
E <- kmeans(v, 12, iter.max = 100, nstart = 10)
kmeans_raster <- raster(image)
kmeans_raster[i] <- E$cluster
plot(kmeans_raster)

Kmeans

The second classification method is called clara (Clustering for Large Applications). It work by clustering only a sample of the dataset and then assigns all object in the dataset to the clusters.

## clara classification 
clus <- clara(v,12,samples=500,metric="manhattan",pamLike=T)
clara_raster <- raster(image)
clara_raster[i] <- clus$clustering
plot(clara_raster)

clara

The third method uses a random Forest model to calculate proximity values. These values were clustered using k-means. The clusters are used to train another random Forest model for classification.

## unsupervised randomForest classification using kmeans
vx<-v[sample(nrow(v), 500),]
rf = randomForest(vx)
rf_prox <- randomForest(vx,ntree = 1000, proximity = TRUE)$proximity

E_rf <- kmeans(rf_prox, 12, iter.max = 100, nstart = 10)
rf <- randomForest(vx,as.factor(E_rf$cluster),ntree = 500)
rf_raster<- predict(image,rf)
plot(rf_raster)

randomForest

The three classifications are stacked into one layerstack and plotted for comparison.

class_stack <- stack(kmeans_raster,clara_raster,rf_raster)
names(class_stack) <- c("kmeans","clara","randomForest")

plot(class_stack)

Comparing the three classifications:

Looking at the different classifications we notice, that the kmeans and clara classifications have only minor differences.
The randomForest classification shows a different image.

 

want to read more about R and classifications? check out this book:

follow us and share it on:

you may also like:

EORC at the Smart Forest Conference: Research meets Practive

EORC at the Smart Forest Conference: Research meets Practive

This week, our EO4CAM staff Sonja and Julian attended the Smart Forest Conference in Freising, a meeting that brings together researchers, forestry practitioners, and technology developers working at the interface of forest science and digital innovation. Over two...

EO4CAM Data Portal Launched to Support Climate Adaptation in Bavaria

EO4CAM Data Portal Launched to Support Climate Adaptation in Bavaria

New Earth observation platform provides satellite-based information for public authorities and planners A new Earth observation data portal is bringing satellite-derived environmental information closer to climate adaptation planning. On 9 March 2026, the EO4CAM data...

Demography Meets Spatial Analysis: Insights from today’s EORC Talk

Demography Meets Spatial Analysis: Insights from today’s EORC Talk

A recent EORC Talk at the Earth Observation Research Cluster (EORC) at the University of Würzburg brought together perspectives from demography, geography, and spatial analysis. Sebastian Klüsener and Tamilwai Kolowa from the Federal Institute for Population Research...

A Fresh Look of the Foyer and Seminarroom for our Events

A Fresh Look of the Foyer and Seminarroom for our Events

Over the past weeks, our foyer and seminar rooms have received a thoughtful upgrade that reflects both the spirit and the scope of our research community. The spaces now feature large-scale prints that visually showcase the diverse research topics within our group,...

Physical 3D building model of Würzburg at the EORC

Physical 3D building model of Würzburg at the EORC

As part of the "Allianz New Space Mainfranken" initiative by the Würzburg-Schweinfurt Chamber of Industry and Commerce (IHK) (see e.g. here: https://remote-sensing.org/exploring-new-space-opportunities-in-mainfranken/ ), we have established a collaboration between...

Share This