BIT 3201A Data Warehousing and Data Mining (Weekend) Question Paper

Course: Bachelor of Science in Information Technology

Institution: KCA University

Exam Year: 2014



UNIVERSITY EXAMINATIONS: 2013/2014
ORDINARY EXAMINATION FOR THE BACHELOR OF SCIENCE IN INFORMATION TECHNOLOGY
BIT 3201A DATA WAREHOUSING AND DATA MINING (WEEKEND)
DATE: APRIL, 2014 TIME: 2 HOURS
INSTRUCTIONS: Answer Question ONE and any other TWO
QUESTION ONE
a) Define the following terms [6 Marks]
i. Data Normalization
ii. Data Binning
iii. Data cube
b) Discuss five differences between OLTP and OLAP [6 Marks]
c) Before data warehousing, data preprocessing must be carried out. Describe the five major tasks that constitute data preprocessing [5 Marks]
d) Giving examples, discuss four reasons why the data cleaning stage of the KDD process is necessary [4 Marks]
e) Discuss the causes of noisy or erroneous data in a database [5 Marks]
f) Differentiate between classification and clustering. [4 Marks]
QUESTION TWO
a) Discuss any five desired features of a cluster analysis algorithm. [5 Marks]
b) Discuss five ways in which the data that has been mined can be visually presented. [5 Marks]
c) A grocery shop sells six items: Bread, Cheese, Eggs, Juice, Milk and Yogurt. The shopkeeper keeps a record of the transactions as follows.
TRANSACTION ID    ITEMS
100               Bread, Cheese, Eggs, Juice
200               Bread, Cheese, Juice
300               Bread, Milk, Yogurt
400               Bread, Juice, Milk
500               Cheese, Juice, Milk
Using the improved naïve algorithm find the association rules with 50% and 75% confidence. [10 Marks]
QUESTION THREE
a) Define the term data warehouse [2 Marks]
b) Discuss four benefits of data mining [4 Marks]
c) Discuss any four challenges facing data mining. [4 Marks]
d) Discuss any five possible discoveries from a data mining exercise. [10 Marks]
QUESTION FOUR
a) Discuss five characteristics of OLAP [5 Marks]
b) Discuss any four factors that lead to the growth and popularity of data mining. [4 Marks]
c) Describe the various classifications of data mining systems [6 Marks]
d) Discuss any five factors that you would consider when selecting and acquiring data mining software. [5 Marks]
QUESTION FIVE
a) Using two items A and B, define the following terms. [4 Marks]
i. Support
ii. Confidence
b) In the context of association rule mining, describe the following terms [4 Marks]
i. Frequent item-sets
ii. Confident rules
c) In building a decision tree, three possible attributes are considered as split attributes. The information gains for attributes A, B, and C are 0.97, 0.029, and 0.15 respectively. Which attribute should be selected for the split and why? [3 Marks]
d) With the help of a diagram, illustrate the Knowledge Discovery Process [9 Marks]





