Bit 3201A Data Warehousing And Data Mining (Day) Question Paper

Bit 3201A Data Warehousing And Data Mining (Day) 

Course:Bachelor Of Science In Information Technology

Institution: Kca University question papers

Exam Year:2014



1
UNIVERSITY EXAMINATIONS: 2013/2014 ORDINARY EXAMINATION FOR THE BACHELOR OF SCIENCE IN INFORMATION TECHNOLOGY BIT 3201A DATA WAREHOUSING AND DATA MINING (DAY) DATE: APRIL, 2014 TIME: 2 HOURS INSTRUCTIONS: Answer Question ONE and any other TWO QUESTION ONE
a) Define the following terms [4 Marks]
i. Data warehousing
ii. Data mining
b) Describe any four types of data that are gathered and be mined and state the type of organizations that gather these types data [6 Marks]
c) Discuss any four application of data mining [6 Marks]
d) With the help of a diagram illustrate the Knowledge Discovery Process [8 Marks]
e) Discuss six ways in which the data that has been mined can be visually presented.
[6 Marks] QUESTION TWO
a) Define the term data ware house [2 Marks]
b) Discuss four benefits of data mining [4 Marks]
c) Discuss any four challenges facing data mining. [4 Marks]
d) Discuss any five possible discoveries from a data mining exercise. [10 Marks]
QUESTION THREE
a) Discuss six factors that lead to the growth and popularity of data mining. [6 Marks]
b) Describe the various classification of data mining systems [6 Marks]
c) Define an OLAP system [2 Marks]
2
d) Discuss five characteristics of OLAP [6 Marks]
QUESTION FOUR
a) A grocery shop sells six items which are Bread, Cheese, Eggs, Juice, Milk and Yogurt. The shopkeeper also keeps a record of the transactions as follows.
TRANSACTION ID
ITEMS
100
Bread, Cheese, Eggs, Juice
200
Bread, Cheese, Juice
300
Bread, Milk, Yogurt
400
Bread, Juice, Milk
500
Cheese, Juice, Milk
Using the Apriori algorithm find the association rules with 50% and 75% confidence. [14 Marks]
b) Discuss six factors that influence the selection and acquisition of data mining software. [6 Marks]
QUESTION FIVE
a) In building a decision tree, three possible attributes are considered as split attributes, the information gain for the attributes A, B, and C are 0.97, 0.029, and 0.15 respectively. Which attribute should be selected for the split and why? [3 Marks]
3
b) The table below shows the training data for classifying bank loan applications by assigning applications to one of the risk classes.
c)
Owns Home
Married
Gender
Employed
Credit rating
Risk class
Yes
Yes
Male
Yes
A
B
No
No
Female
Yes
A
A
Yes
Yes
Female
Yes
B
C
Yes
No
Male
No
B
B
No
Yes
Female
Yes
B
C
No
No
Female
Yes
B
A
No
No
Male
No
B
B
Yes
No
Female
Yes
A
A
No
Yes
Female
Yes
A
C
Yes
Yes
Female
Yes
A
C
i. Find the attribute that has the highest information gain. [12 Marks]
ii. Draw the decision tree for the table above [4 Marks]






More Question Papers


Exams With Marking Schemes

Popular Exams


Mid Term Exams

End Term 1 Exams

End Term 3 Exams

Opener Exams

Full Set Exams



Return to Question Papers