Difference between revisions of "K-means"

Latest revision as of 00:02, 28 March 2011

In statistics and machine learning, k-means clustering is a method of cluster analysis which aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean. It is similar to the expectation-maximization algorithm for mixtures of Gaussians in that they both attempt to find the centers of natural clusters in the data as well as in the iterative refinement approach employed by both algorithms.

Given a set of observations (x₁, x₂, …, x_n), where each observation is a d-dimensional real vector, k-means clustering aims to partition the n observations into k sets (k ≤ n) S = {S₁, S₂, …, S_k} so as to minimize the within-cluster sum of squares (WCSS):

{\underset {\mathbf {S} }{\operatorname {arg\,min} }}\sum _{i=1}^{k}\sum _{\mathbf {x} _{j}\in S_{i}}\left\|\mathbf {x} _{j}-{\boldsymbol {\mu }}_{i}\right\|^{2}

where μ_i is the mean of points in S_i.

For more detail information, please visit WIKI: http://en.wikipedia.org/wiki/K-means

Revision as of 00:00, 28 March 2011 (view source) Nqi (talk \| contribs) (Created page with 'In statistics and machine learning, k-means clustering is a method of cluster analysis which aims to partition n observations into k clusters in which each observation belongs to…')		Latest revision as of 00:02, 28 March 2011 (view source) Nqi (talk \| contribs)
Line 6:		Line 6:

	where '''''μ'''''<sub>''i''</sub> is the mean of points in ''S''<sub>''i''</sub>.		where '''''μ'''''<sub>''i''</sub> is the mean of points in ''S''<sub>''i''</sub>.
		+
		+	For more detail information, please visit WIKI: http://en.wikipedia.org/wiki/K-means

Difference between revisions of "K-means"

Latest revision as of 00:02, 28 March 2011

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools