Practical exercise 10
Table 1 gives the observed number of lung cancer cases in the male population in four Danish cities in the period 1968-1971 distributed over five age groups. Table 2 gives the corresponding approximate numbers of inhabitants.
Table 1: Observed number of lung cancer cases in the male population between 1968 and 1971 in four Danish cities distributed over age groups.
|
|
City |
|
|
Age group |
Fredericia |
Horsens |
Kolding |
Vejle |
40-54 |
11 |
13 |
4 |
5 |
55-59 |
11 |
6 |
8 |
7 |
60-64 |
11 |
15 |
7 |
10 |
65-69 |
10 |
10 |
11 |
14 |
70-74 |
11 |
12 |
9 |
8 |
Table 2: Approximate number of male inhabitants in four Danish cities in the period 1968-1971, distributed over
age groups.
|
|
City |
|
|
Age group |
Fredericia |
Horsens |
Kolding |
Vejle |
40-54 |
3059 |
2875 |
3142 |
2520 |
55-59 |
800 |
1083 |
1050 |
878 |
60-64 |
710 |
923 |
895 |
839 |
65-69 |
581 |
834 |
702 |
631 |
70-74 |
509 |
634 |
535 |
539 |
You may read the data from the tables into R by the command:
path="http://www.uio.no/studier/emner/matnat/math/STK4080/h14/lungcancer.txt"
lungcancer=read.table(path,header=T)
The data are organized with one line for each combination of city and age group, and with the following variables in the four columns:
city: city (1: Fredericia, 2: Horsens, 3: Kolding, 4: Vejle)
age: age group (1: 40-54 years, 2: 55-59 years, 3: 60-64 years, 4: 65-69 years, 5: 70-74 years)
cancer: number of lung cancer cases
population: approximate number of male inhabitants
In this exercise we will consider lung cancer incidence for all cities combined (the effect of city will be considered in a later exercise).
a) Compute the total number of lung cancer cases in and the approximate number of inhabitants for all cities combined. Convert the approximate number of inhabitants to person years by multiplying the number of inhabitants by four. (Why?) For help to this and the following questions, see the R-commands to the lectures from week 43.
b) Compute occurrence/exposure rates of the lung cancer incidence in the five age groups with standard errors, and make a plot of the occurrence/exposure rates with standard 95% confidence limits. Give an interpretation of the plot.
c) Compute log-transformed 95 % confidence limits and add them to the plot in b. Comment on the difference between the two sets of confidence limits.