Assignment: Data Mining and Data Warehousing.
Question 1
Compare the following with the help of examples.
(a) Star schema and snowflake schema
(b) Data cleaning and data transformation?
(c) Enterprise warehouse, data mart,and virtual warehouse
(d) OLAP and OLTP
Question 2
a) Explain three tiers of data warehouse architecture.
b) Which methods are used for efficient computations of data cubes.
c) Which algorithm is most appropriate to computeclosed iceberg cubes efficiently?
Question 3
Consider the database containing transaction data as stated in the table below. Use Apriori algorithm to find frequent itemsets where minimum support is 50%.
Transactions
|
Item Set
|
I1
|
A,B,C
|
I2
|
A,C
|
I3
|
A,D
|
I4
|
B,E,F
|
Question 4
Discuss candidate generation in Generalized Sequential Pattern (GSP) with the help of an example