Get premium membership and access revision papers, questions with answers as well as video lessons.

Data Warehousing And Data Mining Question Paper

Data Warehousing And Data Mining 

Course:Bachelor Of Science In Information Technology

Institution: Masinde Muliro University Of Science And Technology question papers

Exam Year:2012



YEAR 3 EXAMINATION FOR THE BACHELOR OF SCIENCE IN
INFORMATION TECHNOLOGY
DATA WAREHOUSING AND DATA MINING
DATE: APRIL 2012 TIME: 2 HOURS
INSTRUCTIONS: Answer Question One and Any other Two Questions
QUESTION ONE
a) Define the following terms. [4 Marks]
i. Data warehousing
ii. Data mining
b) Describe any six types of data that can be mined and state the type of organizations that gather
these types data [6 Marks]
c) Discuss any four applications of data mining. [6 Marks]
d) With the help of a diagram illustrate the Knowledge Discovery Process. [8 Marks]
e) Discuss six ways in which the data that has been mined can be visually presented. [6 Marks]
QUESTION TWO
a) Define the term data warehouse [2 Marks]
b) Discuss four benefits of data mining. [4 Marks]
c) Discuss any four challenges facing data mining. [4 Marks]
d) Describe any five possible discoveries from a data mining exercise. [10 Marks]
QUESTION THREE
a) Discuss six factors that lead to the growth and popularity of data mining. [6 Marks]
2
b) State and explain four ways of categorizing data mining systems. [6 Marks]
c) Define an OLAP system. [2 Marks]
d) Discuss six characteristics of OLAP systems. [6 Marks]
QUESTION FOUR
a) A grocery shop sells six items which are Bread, Cheese, Eggs, Juice, Milk and Yogurt. The shopkeeper also keeps a record of the transactions as follows.
TRANSACTION ID ITEMS
100 Bread, Cheese, Eggs, Juice
200 Bread, Cheese, Juice
300 Bread, Milk, Yogurt
400 Bread, Juice, Milk
500 Cheese, Juice, Milk
Using the Apriori algorithm find the association rules with 50% support and 75% confidence.
[14 Marks]
b) Discuss six factors that influence the selection and acquisition of data mining software. [6 Marks]
QUESTION FIVE
a) In building a decision tree, three possible attributes are considered as split attributes, the
information gain for the attributes A, B, and C are 0.97, 0.029, and 0.15 respectively. Which
attribute should be selected for the split and why? [3 Marks]
b) The table below shows the training data for classifying bank loan applications by assigning
applications to one of the risk classes.
Owns Home Married Gender Employed Credit rating Risk class
Yes Yes Male Yes A B
No No Female Yes A A
Yes Yes Female Yes B C
Yes No Male No B B
No Yes Female Yes B C
No No Female Yes B A
No No Male No B B
Yes No Female Yes A A
No Yes Female Yes A C
Yes Yes Female Yes A C
i. Find the attribute that has the highest information gain. [13 Marks]
ii. Draw the decision tree for the table above [4 Marks]






More Question Papers


Popular Exams


Mid Term Exams

End Term 1 Exams

End Term 3 Exams

Opener Exams

Full Set Exams



Return to Question Papers