Bayesian Framework for Classification of Sensor Data

 

back to notes

 

Bayesian Analysis

Bayesian analysis is a statistical procedure which endeavors to estimate parameters of an underlying distribution based on the observed distribution.
  • Given a prior distribution,
  • collect data (samples) to get an observed distribution,
  • then calculate the observed distribution's likelihood as a function of parameter values,
  • multiply this likelihood function by the prior distribution,
  • normalize to obtain a unit probability over all possible values
  • This is called the posterior distribution.

 

   

Bayes Theorem

The aim is to find the probability that a certain label is the right classification for the current input vector x. This is called the posterior probability:

from Bayes' Theorem:

  • is the probability that (x1,...,xd) is being measured by the sensors, when we know that li is the current label.

  • is the probability that the current label is li

  • is the probability that the values x1,...,xd are being measured by the sensors

   

Maximum A Posteriori (MAP) hypothesis

To classify a certain input vector, we look for the label li that has the highest probability to be the right classification for the current input vector (x1,...,xd): The Maximum A Posteriori hypothesis for the current label is:

We can leave out since it is just a constant, and would not affect the maximum li , and we could even leave the out if we assume that all labels are equally probable: with n the number of labels. This leads to.. 

 

   

Maximum Likelihood (ML) hypothesis

The Maximum Likelihood hypothesis for the current label is defined as:

The term is often called the likelihood of (x1,...,xd) given the class li , hence the name maximum likelihood.

   

Naive Bayes (NB) classifier

The first term of the MAP hypothesis could be simplified if the assumption is made that all sensors are conditionally independent, if the correct label is given: . In this case the Naive Bayes classifier is thus:

This assumption is usually not very workable when dealing with multiple sensors, though. Conditionally independent means that we can write: .

 

   
 

Compiled by Kristof Van Laerhoven.