Dcis 202:Statistical Data Analysis Question Paper
Dcis 202:Statistical Data Analysis
Course:Diploma In Computer Science
Institution: Kenya Methodist University question papers
Exam Year:2015
KENYA METHODIST UNIVERSITY
END OF 1''st ''TRIMESTER 2015 (FT) EXAMINATION
FACULTY : COMPUTING & INFORMATICS
DEPARTMENT : COMPUTER SCIENCE
UNIT CODE : DBIT 220/DCIS 202
UNIT TITLE : STATISTICAL DATA ANALYSIS
TIME : 2 HOURS
INSTRUCTIONS
Attempt question one and any other two questions.
Question One (30 marks)
State the four steps of learning from data.
(2 marks)
The following numbers represent the time in minutes that ten students took to school on a particular day 18 34 38 22 10 92 46 52 38 29 45
Calculate the quartiles
(3 marks)
Find the interquartile range
(2 marks)
Define the following terms
(4 marks)
Sample
Population
Suppose you read an article in the local college newspaper citing that the average college student plays 2 hours of video games per week. To test whether this is true for your school, you randomly approach 20 fellow students and ask them how long (in hours) they play video games per week. You find that the average student, among those you asked, plays video games for 1 hour per week.
Distinguish the population from the sample
(2 marks)
Distinguish the population parameters from the sample statistic.
(2 marks)
The weights of students in a class of ten are:
120 130 160 170 200 210 190 190 140 calculate the standard deviation for the data. (5 marks)
The table below shows the distribution of weights of 140 students joining a certain university:
Weight (in pounds) Frequency
80-89 4
90-99 23
100-109 49
110-119 38
120-129 17
130-139 6
140-149 3
Plot a histogram for the above data. (5 marks)
State and explain the three criteria that can be used to measure central tendency.
(3 marks)
Find the standard deviation of a data set whose variance is 27.33 (2 marks)
Question Two (15 marks)
Consider the following data and answer questions that follow:
X 2 4 6 8 10 12
Y 3 6 8 10 13 14
Draw a scatter plot for the data
(3 marks)
Determine the line of best fit using the least square equation (5 marks)
Compute the Pearson’s Correlation Coefficient.
(5 marks)
Differentiate between correlation and regression.
(2 marks)
Question Three (15 marks)
Give a brief description of each of the following probability sampling techniques.
(6 marks)
Systematic sampling
Cluster sampling
Stratified sampling
The amount of protein (in grams) for a variety of sandwiches is reported here:
23 30 20 27 44 26 35 20 29 29
25 15 18 27 19 22 12 26 34 15
27 35 26 43 35 14 24 12 23 31
40 35 38 57 22 42 24 21 27 33
Construct a frequency distribution, showing cumulative and relative frequencies
(5 marks)
Draw a histogram.
(4 marks)
Question Four (15 marks)
Consider the frequency distribution given below:
Mileage Frequency
1 5
2 4
5 7
7 8
8 10
9 40
10 10
15 11
17 25
20 10
25 15
30 5
Construct a relative frequency histogram
(13 marks)
Find the mode using the histogram in (i) above
(2 marks)
More Question Papers
Exams With Marking Schemes
Popular Exams
Mid Term Exams
End Term 1 Exams
End Term 3 Exams
Opener Exams
Full Set Exams
Return to Question Papers