Question 1: Describe why the concept hierarchies are useful in data mining?
Question 2: Consider Associations Rule below, which was mined from the student data-base at Big-university Major(X,”biology”) =>status(X,”undergrad”)
Question 3: Assume that the number of students at the university (that is, the number of task-relevant data tuples) is 5000, that 56% of undergraduates at the university major in science, that 64% of the students are registered in programs leading to undergraduate degress, and that 70% of the students are majoring in science.
a) Calculate the confidence and support of the rule?
b) Consider Rule below Major(X,”biology”) =>status(X,”undergrad”) Assume that 30% of science students are majoring in biology. Would you consider Rule to be novel with respect to Rule? Describe.
Question 4: Give the significance of data mining query language.
Question 5: What are the primitives of data mining query language?
Question 6: What do you mean by domain testing? Describe about the schematic representation of domain testing?
Question 7: With the help of examples describe about the nice domains and ugly domains?
Question 8: Describe about the testing of one dimensional and two dimensional domains?
Question 9: Describe about the closure compatibility and span compatibility?