联系方式

  • QQ:99515681
  • 邮箱:99515681@qq.com
  • 工作时间:8:00-23:00
  • 微信:codehelp

您当前位置:首页 >> OS程序OS程序

日期:2023-08-28 08:01

Assignment 1
Question 1 [Total 23 Marks]

A group of researchers are interested in studying the prevalence of obesity, diabetes, and other
cardiovascular risk factors in Subang Jaya, Selangor. To gain more insight into this question,
1150 subjects were interviewed and some of the results obtained are compiled in the data file
A1 S2 2023.xls. The columns provide the following information:

Column A: the patient ID

Column B: the level of stabilised glucose

Column C: The total level of cholesterol

Column D: the level of high-density-lipoprotein (“good” cholesterol)

Column E: the weight of the patient

Column F: the gender of the patient
Column G: the type of body frame (small, medium, large)

The data is available on the “A1 S2 2023.xls” file on the Moodle. You must use your subsample
of the survey data. Your sample will consist of 200 observations starting from the respondent
whose ID is the same as the last three digits of your student number. For example, if your
student number is 20275749, you would use individuals 749 to 948.

All tables, graphs and comments for this question should be places in the designated spaces in
the Worksheet Results.

(a) Complete Table (a). Use Countif or another method to find the frequencies for the
number of male and female patients in the sample and hence complete Table (a).
[2 marks]

(b) Display the data in Table (a) using an appropriate chart to be placed in the Graph (b)
Textbox. [2 marks]

(c) Using Countif or any other appropriate method, complete Table (c) by filling in the
frequencies of male and female patients according to their type of body frame.
[2 marks]

(d) Display the data in Table (c) using an appropriate chart to be placed in the Graph (d)
Textbox. [2 marks]

(e) Complete Table (e) containing the summary statistics for the HDL (high-density-
lipoprotein or “good” cholesterol) variable according to the patient gender.
[2 marks]

(f) Complete the grouped frequency Table for the HDL (“good” cholesterol) for female and
male patients [Table (f)]. Find the frequency and hence calculate the percentage
frequency and cumulative percentage frequency for female and male patients. [2 marks]

(g) Is the level of “good” cholesterol (HDL) different for the two groups? Use figures from
Table (e) to help you explain any differences. [3 marks]

(h) Construct percentage frequency polygons for the HDL for female and male students as one chart
as Graph (h). [3 marks]

(i) Discuss the shape of the percentage frequency polygons for the HDL levels for female
and male patients. [3 marks]

(j) List the four measures of variability from the summary statistics. Which one of the
HDL (female or male patients) shows more variability? You are required to use your
sample result to answer this question. [4 marks]

Question 2 [Total 13 Marks]

a. Based on your sample size, construct a contingency Table between the gender of the
patient and the type of body frame. [1.5 marks]

b. Who are the majority of patients and what is their probability? [1.5 marks]

c. What is the probability that the randomly selected patient is a medium body frame?
[2 mark]
d. What is the probability that a randomly selected patient is female and has a large
frame? [2 marks]
e. Using the conditional percentage and appropriate Research Question, write a short report
[about 70 words] to hospital management regarding the gender of the patient and the type
of body frame. [6 marks]

Question 3 [Total 8 Marks]
The file travellers.xls on Moodle contains a worksheet of raw data. The data have been
collected from 3999 travelers as they arrive at Kuala Lumpur International Airport. The sheet
contains the country (region) they came from and the main purpose of their visit (work, study
or tourism), so there are two categorical variables to be examined: one is Region and the other
is Purpose.

You must use your subsample of the survey data. Your sample will consist of 500 observations
starting from the respondent whose ID is the same as the last three digits of your student
number. For example, if your student number is 20275249, you would use individuals 249 to
748.

Do travelers from all regions tend to visit Kuala Lumpur for study? You are required to identify
the dependent and independent variable. Using the conditional percentage and appropriate
Research Question indicate if there is an association between region and purpose of visit to
Kuala Lumpur. [8 marks]

Question 4 [Total 6 Marks]

General Hospital's patient account division has compiled data on the age of accounts
receivables. The data collected indicate that the age of the accounts follows a normal
distribution with a population mean of 28 days and a population standard deviation of 8 days.
a. What proportion of the accounts are between 20 and 40 days old? [2 marks]
b. What proportion of the accounts are less than 30 days old? [1 mark]
c. What is the number of days in which 75% of all accounts are above? [3 marks]

相关文章

版权所有:留学生编程辅导网 2021,All Rights Reserved 联系方式:QQ:99515681 电子信箱:99515681@qq.com
免责声明:本站部分内容从网络整理而来,只供参考!如有版权问题可联系本站删除。