problem 1: Clustering has been popularly recognized as a significant data mining task with broad applications. Give one application illustration for each of the given cases:
a) An application which takes clustering as a main data mining function.
b) An application which takes clustering as a preprocessing tool for data preparation for other data mining tasks.
problem 2: Why is outlier mining significant? In brief describe the various approaches behind statistical-based outlier detection, distanced-based outlier detection, density-based local outlier detection and deviation-based outlier detection.
problem 3: Describe the difference between K-means and k-medoids algorithm.
problem 4: Describe the efficiency of k-medoids algorithm on large data sets.
problem 5: describe the diverse dimensions in a spatial data cube.
problem 6: How to construct a data cube for multimedia data analysis?
problem 7: How to find out the similarity between documents?
problem 8: Describe how to mine spatial databases.