Revision as of 19:38, 5 May 2014 by Drunyan (Talk | contribs)


Introduction to Clustering A slecture by CS student David Runyan


Introduction


In class, we covered the simple Bayesian classifier. This form of classification falls under a category known as supervised learning. What this means is that a set of labelled data data is used to to "train" the underlying model. However, it is not always possible to have such a data set, yet we may still wish to discover some form of underlying structure in an unlabelled data set. Such a task falls under the category of unsupervised learning.

Clustering is a form of unsupervised learning. "Clustering is the problem of identifying groups, or clusters of data points in multidimensional space". For example, consider the following data set:


In class we covered various forms of supervised learning, such as classification and density estimation. These applications required the use of training data. Supervised learning, as i.e. learning using training data. Clustering, on the other hand is a form of unsupervised learning. We are not given a training set, yet wish to discover some form of underling structure in a dataset.




Back to ECE662, Spring 2014

Alumni Liaison

Ph.D. 2007, working on developing cool imaging technologies for digital cameras, camera phones, and video surveillance cameras.

Buyue Zhang