The data from Vol. X are available through two databases. Both contain tabulated data only (no individual records), stored in the traditional form of number of cases by
sex, five-year age group and cancer. The files are in comma-separated values (CSV) format and can easily be imported into statistical
packages for further analysis. The layout of the data files and the cancer and population dictionaries are provided as text
files. Before downloading the files, please note that they may be used freely, but not
for sale or in conjunction with commercial or promotional purposes, and
any use must include appropriate reference to and acknowledgement
of the source.
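As a rough illustration of how such a tabulation might be read outside a statistical package, the sketch below sums case counts over age groups using only the Python standard library. The column names (`sex`, `cancer`, `age_group`, `cases`), the header row and all values are hypothetical; the actual layout is the one described in the accompanying text files.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample with invented values. The real column order and coding
# are defined in the layout and dictionary text files shipped with the data.
SAMPLE_CSV = """\
sex,cancer,age_group,cases
1,C16,10,4
1,C16,11,7
2,C16,10,2
2,C50,10,9
"""


def total_cases_by_sex_and_cancer(csv_text):
    """Sum case counts over all age groups for each (sex, cancer) pair."""
    totals = defaultdict(int)
    for row in csv.DictReader(io.StringIO(csv_text)):
        totals[(row["sex"], row["cancer"])] += int(row["cases"])
    return dict(totals)


totals = total_cases_by_sex_and_cancer(SAMPLE_CSV)
for (sex, cancer), cases in sorted(totals.items()):
    print(sex, cancer, cases)
```

If the downloaded files turn out to have no header row, `csv.reader` with column positions taken from the layout file would be the natural substitute for `csv.DictReader`.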
- The CI5-X summary database contains the data used in the online analysis options of this Internet application (three-character ICD-10 sites).
- The CI5-X detailed database contains the data used to produce the book. The standard three-character ICD-10 anatomical sites have been replaced by a set of 244 categories based on a combination of ICD-10 three- or four-character site codes and ICD-O-3 morphological groups (see the "Table by histological type" menu option).
Great care should be exercised when using the detailed database and when comparing incidence rates for histological subtypes of cancers. The rates for specified histological types are strongly influenced by the proportion of cases of unspecified type at a given site. The proportion of cases without microscopic verification must also be considered, since rates by histological subtype can be calculated only for microscopically verified cases. Researchers are strongly advised either to select datasets with few cases in the missing categories or to make an appropriate adjustment before comparing rates.
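The check described above can be sketched in a few lines of Python. The category labels, the counts and the function name are hypothetical. The proportional reallocation of unspecified cases is only one possible adjustment, assumed here for illustration; it is not a method prescribed by this text.

```python
def histology_quality(specified, unspecified, not_verified):
    """Summarise missing-category proportions for one site.

    specified    -- dict of case counts by specified histological type
    unspecified  -- microscopically verified cases of unspecified type
    not_verified -- cases with no microscopic verification
    """
    specified_total = sum(specified.values())
    verified_total = specified_total + unspecified
    all_cases = verified_total + not_verified

    # Share of all cases that cannot contribute to subtype rates at all.
    share_not_verified = not_verified / all_cases
    # Share of verified cases whose histological type is unspecified.
    share_unspecified = unspecified / verified_total

    # Assumed adjustment: spread unspecified cases over the specified
    # types in proportion to their observed counts.
    scale = verified_total / specified_total
    adjusted = {name: count * scale for name, count in specified.items()}
    return share_not_verified, share_unspecified, adjusted


# Invented counts for a single hypothetical registry and site.
share_nmv, share_unspec, adjusted = histology_quality(
    specified={"adenocarcinoma": 60, "squamous cell carcinoma": 20},
    unspecified=20,
    not_verified=25,
)
print(share_nmv, share_unspec, adjusted)
```

Large values of either share suggest that rates for specified subtypes in that dataset are not directly comparable with those from datasets where the missing categories are small.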