# MDA 5404 DATA ANALYTICS AND KNOWLEDGE ENGINEERING KCA Past Paper

UNIVERSITY EXAMINATIONS: 2018/2019
EXAMINATION FOR THE DEGREES OF MASTER OF SCIENCE IN
DATA ANALYTICS
MDA 5404: DATA ANALYTICS AND KNOWLEDGE ENGINEERING
ORDINARY EXAMINATIONS
DATE: AUGUST, 2019 TIME: 2 HOURS
INSTRUCTIONS: Answer Question One & ANY OTHER TWO questions.

QUESTION ONE
(a) Briefly describe three limitations of descriptive analytics and explain how each of them can
(b) Discuss limitations of the following descriptive analytics measures and explain which other
measures can be used to address the limitations
(i) Mean (2 Marks)
(ii) Covariance (2 Marks)
(c) Briefly explain how a combination of mode, median and mean can be used to determine
whether data is skewed or symmetric (2 Marks)
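
As an illustration of the comparison asked for in (c), the following minimal sketch applies the usual rule of thumb: mean > median > mode suggests right (positive) skew, mean < median < mode suggests left (negative) skew, and roughly equal values suggest symmetry. The function name `skew_direction`, the `tolerance` parameter and the sample lists are assumptions made for this example, not part of the exam.

```python
from statistics import mean, median, mode


def skew_direction(values, tolerance=1e-9):
    """Classify a sample using the mean/median/mode rule of thumb."""
    m, md, mo = mean(values), median(values), mode(values)
    # All three measures coincide (within tolerance): treat as symmetric.
    if abs(m - md) <= tolerance and abs(md - mo) <= tolerance:
        return "symmetric"
    # Mean pulled above the median and mode by a long right tail.
    if m > md >= mo:
        return "right-skewed"
    # Mean pulled below the median and mode by a long left tail.
    if m < md <= mo:
        return "left-skewed"
    return "inconclusive"


# Hypothetical samples, chosen only to exercise each branch.
right_tail = [1, 2, 2, 3, 10]
balanced = [1, 2, 3, 3, 3, 4, 5]
left_tail = [1, 8, 9, 9, 10]

for sample in (right_tail, balanced, left_tail):
    print(sample, "->", skew_direction(sample))
```

The rule is a heuristic only; it can fail for multimodal or discrete data, which is why the sketch keeps an "inconclusive" outcome.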
(d) Discuss the purpose of using a box plot in data analytics (2 Marks)
(e) Briefly interpret the following data analytics visualization output. (3 Marks)

(f) The following table shows the hours studied and the marks received by four students:
Study hours Marks

Use the above data set to compute the following measures for both study hours and marks. Interpret the
results in each case.
(ii) Standard deviation (2 Marks)
(iii) Covariance (2 Marks)
(iv) Correlation (2 Marks)
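
A minimal sketch of how the measures in this question could be computed in Python. The four (hours, marks) pairs below are hypothetical placeholders and are not the values from the exam table. The sketch also assumes the sample (n - 1) form of the formulas; a population (n) form would change the standard deviation and covariance but not the correlation.

```python
from math import sqrt


def mean(values):
    return sum(values) / len(values)


def sample_covariance(xs, ys):
    """Sample covariance: sum of products of deviations divided by n - 1."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)


def sample_std(values):
    """Sample standard deviation: square root of the variance."""
    return sqrt(sample_covariance(values, values))


def pearson_correlation(xs, ys):
    """Covariance scaled by both standard deviations, so it lies in [-1, 1]."""
    return sample_covariance(xs, ys) / (sample_std(xs) * sample_std(ys))


# Hypothetical placeholder data (NOT the exam's table values).
hours = [1, 2, 3, 4]
marks = [50, 60, 65, 85]

print("std (hours):", sample_std(hours))
print("std (marks):", sample_std(marks))
print("covariance :", sample_covariance(hours, marks))
print("correlation:", pearson_correlation(hours, marks))
```

A positive covariance indicates that marks tend to rise with study hours; the correlation expresses the same relationship on a unit-free scale, which makes its strength easier to interpret.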
QUESTION TWO
(a) Briefly discuss the relationship between ‘Data analytics’ and ‘Knowledge engineering’
(2 Marks)
(b) Discuss three main goals of data analytics and their importance in business enterprises
(3 Marks)
(c) Describe the difference between descriptive and diagnostic analytics. Use a practical example
(d) Discuss the steps followed to carry out factor analysis including techniques for each step
(5 Marks)
(e) Consider the following knowledge base of facts:
furniture(sink, kitchen, 1).
furniture(chair, lounge, 4).
furniture(bed, bedroom, 1).
furniture(cooker, kitchen, 1).
furniture(chair, kitchen, 4).
furniture(sofa, lounge, 1).
(i) Write a query that can be used to find how many of each item there are in the lounge
(1 Mark)
(ii) Write a query that can be used to list all the rooms without showing the furniture or
the numbers (1 Mark)
(iii) Write a query that can be used to find the number of chairs in each room
(1 Mark)
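
The facts above are written in a Prolog-style notation, and the question expects queries in that style. As a rough analogue only, the sketch below mirrors the three lookups in Python by filtering a list of `(item, room, count)` tuples. The names `FURNITURE`, `items_in`, `rooms` and `count_of` are assumptions introduced for this example.

```python
# Each tuple mirrors one fact: furniture(Item, Room, Count).
FURNITURE = [
    ("sink", "kitchen", 1),
    ("chair", "lounge", 4),
    ("bed", "bedroom", 1),
    ("cooker", "kitchen", 1),
    ("chair", "kitchen", 4),
    ("sofa", "lounge", 1),
]


def items_in(room):
    """(i) Item -> count for every fact whose room matches."""
    return {item: count for item, r, count in FURNITURE if r == room}


def rooms():
    """(ii) Rooms only, in first-seen order, ignoring items and counts."""
    seen = []
    for _, room, _ in FURNITURE:
        if room not in seen:
            seen.append(room)
    return seen


def count_of(item):
    """(iii) Room -> count for every fact about the given item."""
    return {room: count for i, room, count in FURNITURE if i == item}


print(items_in("lounge"))
print(rooms())
print(count_of("chair"))
```

In a logic-programming query the same effect comes from leaving the wanted arguments as variables and fixing the others as constants; the Python filters make that matching explicit.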
QUESTION THREE
(a) Briefly describe the meaning of the following concepts as used in data analytics and
knowledge engineering:
(i) Eigenvalues (1 Mark)
(ii) Normalization (1 Mark)
(iii) Principal component analysis (1 Mark)
(b) State and explain five knowledge engineering activities (5 Marks)
(c) Discuss four components of a knowledge-based system (4 Marks)
(d) Construct a semantic network for the following scenario: (3 Marks)
Cats, bears and whales are mammals. Bears and cats have fur, while whales and fish live in
water. Both mammals and fish are animals.
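
One way to check a drawn network is to write it down as labelled edges. The sketch below encodes the scenario as `(subject, relation, object)` triples and follows the `is_a` links upwards. The relation names `is_a`, `has` and `lives_in`, and the helpers `related` and `ancestors`, are assumptions chosen for this example rather than a prescribed notation.

```python
# Nodes are concepts; each triple is one labelled arc of the network.
EDGES = [
    ("cat", "is_a", "mammal"),
    ("bear", "is_a", "mammal"),
    ("whale", "is_a", "mammal"),
    ("cat", "has", "fur"),
    ("bear", "has", "fur"),
    ("whale", "lives_in", "water"),
    ("fish", "lives_in", "water"),
    ("mammal", "is_a", "animal"),
    ("fish", "is_a", "animal"),
]


def related(node, relation):
    """Objects reached from `node` by one arc with the given label."""
    return [obj for subj, rel, obj in EDGES if subj == node and rel == relation]


def ancestors(node):
    """All concepts reachable by following is_a arcs (inheritance chain)."""
    found = []
    frontier = related(node, "is_a")
    while frontier:
        current = frontier.pop(0)
        if current not in found:
            found.append(current)
            frontier.extend(related(current, "is_a"))
    return found


print(ancestors("cat"))
print(related("whale", "lives_in"))
```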
QUESTION FOUR
(a) Describe the difference between predictive and prescriptive analytics (1 Mark)
(b) Describe the predictive analytics process cycle. (5 Marks)
(c) Consider the following knowledge:
A lorry has a trailer as a part, has 18 wheels and has a large weight capacity.
It is driven by a driver and its speed is 100 kph.
(i) Draw a frame that represents the above knowledge. (2 Marks)
(ii) Use predicate logic to represent the above knowledge (2 Marks)
(d) Consider the following data set

| Name | Give Birth | Can Fly | Live in Water | Have Legs | Class |
|---|---|---|---|---|---|
| Human | Yes | No | No | Yes | non-mammals |
| Python | No | No | No | No | non-mammals |
| Salmon | No | No | Yes | No | non-mammals |
| Whale | Yes | No | Yes | No | Mammals |
| Frog | No | No | Yes | Yes | Mammals |
| Komodo | No | No | No | Yes | non-mammals |
| Bat | Yes | Yes | No | Yes | Mammals |
| Pigeon | No | Yes | No | Yes | non-mammals |
| Cat | Yes | No | No | Yes | non-mammals |
| Leopard shark | Yes | No | Yes | No | non-mammals |
Use the above data set to answer the following questions
(i) Identify the independent and dependent attributes (1 Mark)

(ii) Given that the split point = 3, write sample Python code to split the data set into test and
training data sets.
(2 Marks)
(iii) Write sample Python code to split the data into independent and dependent attributes.
(2 Marks)
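
A minimal sketch of the kind of code parts (ii) and (iii) ask for, using plain Python lists. It makes two assumptions that the question does not state: the rows before the split point form the training set and the remaining rows form the test set, and `Name` is treated as an identifier rather than an independent attribute. The rows are transcribed from the table as given, with capitalisation normalised; the names `ROWS`, `split_train_test` and `split_features_target` are introduced for this example.

```python
COLUMNS = ["Name", "Give Birth", "Can Fly", "Live in Water", "Have Legs", "Class"]

# Rows transcribed from the exam table as given (class labels unchanged).
ROWS = [
    ["Human", "Yes", "No", "No", "Yes", "non-mammals"],
    ["Python", "No", "No", "No", "No", "non-mammals"],
    ["Salmon", "No", "No", "Yes", "No", "non-mammals"],
    ["Whale", "Yes", "No", "Yes", "No", "Mammals"],
    ["Frog", "No", "No", "Yes", "Yes", "Mammals"],
    ["Komodo", "No", "No", "No", "Yes", "non-mammals"],
    ["Bat", "Yes", "Yes", "No", "Yes", "Mammals"],
    ["Pigeon", "No", "Yes", "No", "Yes", "non-mammals"],
    ["Cat", "Yes", "No", "No", "Yes", "non-mammals"],
    ["Leopard shark", "Yes", "No", "Yes", "No", "non-mammals"],
]


def split_train_test(rows, split_point):
    """Assumption: rows before split_point are training, the rest are test."""
    return rows[:split_point], rows[split_point:]


def split_features_target(rows):
    """Independent attributes: columns 1..4. Dependent attribute: Class."""
    features = [row[1:-1] for row in rows]  # drop Name (identifier) and Class
    target = [row[-1] for row in rows]      # last column is the class label
    return features, target


train_rows, test_rows = split_train_test(ROWS, split_point=3)
X_train, y_train = split_features_target(train_rows)
X_test, y_test = split_features_target(test_rows)

print(len(train_rows), "training rows;", len(test_rows), "test rows")
```

If the opposite convention is intended (the first three rows as the test set), swapping the two return values of `split_train_test` is the only change needed.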

