• Also known as
  • Clustering

Clus­ter analy­sis describes the process of auto­mat­i­cally group­ing infor­ma­tion into log­i­cal bun­dles (clus­ters) of infor­ma­tion. This makes it eas­ier for users and soft­ware appli­ca­tions to iden­tify the con­text and rel­e­vant infor­ma­tion in big data.

Clus­ter­ing is a main task of exploratory data min­ing, and a com­mon tech­nique for sta­tis­ti­cal data analy­sis. It is used in many fields of infor­ma­tion tech­nol­ogy, includ­ing machine learn­ing, pat­tern recog­ni­tion, image analy­sis, infor­ma­tion retrieval, bioin­for­mat­ics, data com­pres­sion, and com­puter graph­ics.