UNIVERSITY EXAMINATIONS: 2018/2019
EXAMINATION FOR THE DEGREE OF MASTER OF SCIENCE IN
INFORMATION SYSTEMS MANAGEMENT
MISM5105 PRINCIPLES OF DATA SCIENCE
DATE: APRIL 2019 TIME: 2 HOURS
INSTRUCTIONS: Answer Question One & ANY OTHER TWO questions.
QUESTION ONE
a) Differentiate between Data visualization and data formating as used in big data analytics 4Marks
b) Nearly 80% of data analysis is spent on the cleaning and preparing data. Explain 4 Marks
c) Which is the next step performed by data scientist after acquiring the data? Explain your 3Marks
d) Differentiate between Data visualization and data formating as used in big data analytics 4Marks
c) Describe the term data merging as used in data science 3 Marks
d) Subsetting can be used to select and exclude variables and observations. Explain 3 Marks
QUESTION TWO
a) 3V’s are not sufficient to describe big data. Discuss 3 Marks
b) I)In what phase of an anlytics project would you expect to invest most time and why ? 4 Marks
ii) Where would you expect t spend the least time 3 Marks
iii) Describe the general syntax for calling functions and saving the result to a variable using python .5 Marks
QUESTION THREE
a) Discuss data wangling or datamungling 6 Marks
b) Describe the main components of handoop 2 Marks
c) Describe the term data veracity 3 Marks
d) Describe how big data analytics can be used to improve health services in Kenya 4 Marks
QUESTION FOUR
Scenerio
A mideium size retail bank in kenya wants to improve its net present value and its retention rate of
customers.They want to establish an effective market campaign targeting customers to reduce the churn
rate by at least 5%. They also want to determine whether those customers are worth retaining. In addition
the wants to analyze reasons for customer attrition and what they can do to keep them. The wants to build
a data ware house to support marketing and other customer related care groups.
Required
Perform an analytic plan for the bank above case study above 15 Marks