Q1. Distinguish between the star schema, snowflake schema and fact constellation with the help of illustrations.
Q2. Describe Data extraction, Data transformation and the Data loading associated to the ETL.
Q3. Describe the ID3 Algorithm for the decision trees.
Q4. Apply the hierarchical clustering algorithm for clustering the given eight points. Find out the clusters with their elements.
The distance function is Euclidean distance:
A1(2,10), A2(2,5), A3(8,4), A4(5,8), A5(7,5), A6(6,4), A7(1,2), A8(4,9).
Q5. What do you mean by Apriori property? How it is used by the APRIORI algorithm? Illustrate the drawbacks of Apriori algorithm?
Q6. prepare detail notes on any three:
a) Outlier Analysis
b) Decision Support System
d) Data Marts