The data set for the assignment is about breast cancer. Some of these data have been changed for the purposes of this assignment so the results from the analysis of this file may not reflect findings in the literature or current clinical knowledge. However, please analyse these data as if they had come from a real study.
Variable
|
Label
|
Value Labels
|
Missing Values
|
id
|
|
|
|
age
|
Age (years)
|
|
|
pathsize
|
Pathologic Tumour Size (cm)
|
|
99.00
|
histgrad
|
Histological Grade
|
1=Low grade, 2=Intermediate grade,
3=High grade
4=Unknown
|
4
|
status
|
Vital Status
|
1=Dead, 2=Alive
|
9
|
HRTuse
|
Hormone Replacement Therapy Use Ever
|
0=No, 1=Yes
|
|
Background
A medical researcher has collected data from 1207 consecutive women with breast cancer in one large hospital. The researcher is interested in the characteristics of the woman (age, tumour size, histological grade, HRT use and wishes to describe them in a clear and concise way. The researcher wishes to see whether there are any associations between the characteristics (age, tumour size, histological grade, HRT use) and the vital status of the women at the end of follow-up.
Task
Your task is to analyse the data with respect to the following questions to describe the main features of the dataset "breast cancer assignment 2011.sav". You need to prepare a word processed document containing your results. The document must not be hand written. Imagine that this document will be read by people who do not have access to the data. You are expected to use full sentences for your answers, not just a few words. It is NOT NECESSARY OR REQUIRED to read about breast cancer in the medical literature, or to compare your results to results from articles in the medical literature.