Cancer Analysis System

16/11/2022
17/10/2024
Data source
Human
Cancer registry
Access and validation

Governance details

Documents or webpages that describe the overall governance of the data source and processes and procedures for data capture and management, data quality check and validation results (governing data access or utilisation for research purposes).

Biospecimen access

Are biospecimens available in the data source (e.g., tissue samples)?

No

Access to subject details

Can individual patients/practitioners/practices included in the data source be contacted?

No

Description of data collection

CAS is one of the most detailed cancer databases in the world, covering patient demographics, clinical characteristics, treatment patterns and outcomes for nearly all patients diagnosed with cancer in England. For a detailed description of data collection, please refer to https://doi.org/10.1093/ije/dyz076
Event triggering registration

Event triggering registration of a person in the data source

Disease diagnosis

Event triggering de-registration of a person in the data source

Death
Other

Event triggering de-registration of a person in the data source, other

Patient opt-out

Event triggering creation of a record in the data source

Any hospital visit for any cancer diagnosis
Data source linkage

Linkage

Is the data source described created by the linkage of other data sources (prelinked data source) and/or can the data source be linked to other data sources on an ad-hoc basis?

Yes

Linkage description, pre-linked

CAS comprises a number of relational datasets, outlined in more detail in the data dictionary (https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_Data/file/1054575/NCRAS_ODR_Data_dictionary_v4.4.xlsx). As clinical practice continues to evolve and more detailed questions on cancer epidemiology and care arise, NCRAS seeks new sources of relevant data to enrich and complement those already collected; any updates are then reflected in new data dictionaries.
List of existing data tables that make up CAS:
- Cancer registry (This table is relational in nature, with one-to-many and many-to-many relationships. It contains three primary keys (PATIENTID, TUMOURID and EVENTID). This table can be considered the master linkage file for all other NCRAS data.)
- Cancer Pathway (The table contains a primary key (AVPID) and foreign keys (PATIENTID, TUMOURID) that can be used for patient-level linkage to other NCRAS datasets. The events contained within this dataset are sourced or summarised from multiple data feeds (the Cancer Registry, SACT, RTDS, HES, CWT and DIDs datasets), so its coverage and completeness vary according to the underlying data feed used to populate event information.)
- SACT - Systemic Anti-Cancer Therapy dataset (Patients treated with chemotherapy from April 2012 to the last available quarter before the request date. This is approximately 6 months behind the current date.)
- RTDS - National Radiotherapy Dataset (Patients diagnosed with cancer in the calendar years 1995 to the last available cancer registration year, with RTDS treatment details available from 1 April 2009 to the last available quarter before the request date. This is approximately 6 months behind the current date.)
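The key structure described in the list above can be illustrated with a minimal sketch. It assumes two small in-memory tables that share the PATIENTID and TUMOURID keys; the rows, the SITE and REGIMEN columns, and the function name `link_by_keys` are hypothetical and are not taken from the CAS data dictionary.

```python
from collections import defaultdict

# Synthetic, hypothetical rows; real CAS tables have many more columns.
registry = [
    {"PATIENTID": 1, "TUMOURID": 10, "SITE": "C50"},
    {"PATIENTID": 1, "TUMOURID": 11, "SITE": "C18"},
    {"PATIENTID": 2, "TUMOURID": 20, "SITE": "C34"},
]
treatments = [
    {"PATIENTID": 1, "TUMOURID": 10, "REGIMEN": "A"},
    {"PATIENTID": 1, "TUMOURID": 10, "REGIMEN": "B"},
    {"PATIENTID": 2, "TUMOURID": 20, "REGIMEN": "C"},
]


def link_by_keys(master, child, keys=("PATIENTID", "TUMOURID")):
    """Attach to each master row the child rows that share the same key values."""
    # Index the child table by its key tuple so each lookup is a dict access.
    index = defaultdict(list)
    for row in child:
        index[tuple(row[k] for k in keys)].append(row)

    linked = []
    for row in master:
        matches = index.get(tuple(row[k] for k in keys), [])
        # Keep every master row, even when no child rows match (a left join).
        linked.append({**row, "linked": matches})
    return linked


linked = link_by_keys(registry, treatments)
```

One tumour can carry several treatment rows (one-to-many), and a tumour with no matching rows is kept with an empty list, so the registry table remains the reference for all linked data.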

Linkage description, possible linkage

Linkage can also be performed to routine NHS data curated by other data controllers, such as Hospital Episode Statistics (HES) admitted care, HES outpatient, HES Accident and Emergency, Cancer Waiting Times (CWT) and the Diagnostic Imaging Dataset (DIDs).
Additional linkage can be done to non-routine data collections curated by NHS Digital or other data controllers, such as the National Cancer Diagnosis Audit (NCDA), Lung Cancer Data Audit (LUCADA), National Cancer Patient Experience Survey (CPES), Quality of Life of Cancer Survivors in England (Breast, Colorectal, Prostate, Non-Hodgkin’s Lymphoma), Quality of Life of Colorectal Cancer Survivors in England: Patient Reported Outcome Measures Survey (PROMS), and the Somatic Molecular Dataset.

Linked data source 1

Pre linked

Is the data source described created by the linkage of other data sources?

Yes

Data source, other

Cancer Pathway (The table contains a primary key (AVPID) and foreign keys (PATIENTID, TUMOURID) that can be used for patient-level linkage to other NCRAS datasets. The events contained within this dataset are sourced or summarised from multiple data feeds (the Cancer Registry, SACT, RTDS, HES, CWT and DIDs datasets), so its coverage and completeness vary according to the underlying data feed used to populate event information.)

Linkage strategy

Deterministic

Linkage variable

Routine linkages to other cancer registration datasets are conducted at patient and/or tumour level using the PATIENTID and TUMOURID fields. The primary patient identifier is the NHS number (a unique identifier used throughout the healthcare system in England), but date of birth, full name and address are also used for patient identification and linkage.
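A minimal sketch of deterministic (exact-match) patient identification along these lines is given below. The matching order is an assumption for illustration only: an exact NHS number match first, then an exact match on date of birth, full name and address. The field names `nhs_number`, `dob`, `name` and `address`, the function names and all records are hypothetical and do not describe the actual NCRAS matching rules.

```python
def normalise(text):
    """Lower-case and collapse whitespace so trivial differences do not block a match."""
    return " ".join(text.lower().split())


def deterministic_match(record, candidates):
    """Return the PATIENTID of the single exact match, or None if there is none."""
    # Step 1 (assumed): exact match on the NHS number when one is present.
    nhs = record.get("nhs_number")
    if nhs:
        hits = [c for c in candidates if c.get("nhs_number") == nhs]
        if len(hits) == 1:
            return hits[0]["PATIENTID"]

    # Step 2 (assumed): exact match on date of birth, full name and address.
    key = (record["dob"], normalise(record["name"]), normalise(record["address"]))
    hits = [
        c for c in candidates
        if (c["dob"], normalise(c["name"]), normalise(c["address"])) == key
    ]
    # Accept only an unambiguous match; anything else is left unlinked.
    return hits[0]["PATIENTID"] if len(hits) == 1 else None


# Synthetic, hypothetical records.
patients = [
    {"PATIENTID": 1, "nhs_number": "9000000001", "dob": "1950-01-01",
     "name": "Ann Example", "address": "1 High Street"},
    {"PATIENTID": 2, "nhs_number": None, "dob": "1960-02-02",
     "name": "Bob Sample", "address": "2 Low Road"},
]
by_nhs = deterministic_match(
    {"nhs_number": "9000000001", "dob": "1950-01-01",
     "name": "Ann Example", "address": "1 High Street"}, patients)
by_demographics = deterministic_match(
    {"nhs_number": None, "dob": "1960-02-02",
     "name": "bob  SAMPLE", "address": "2 low road"}, patients)
unmatched = deterministic_match(
    {"nhs_number": None, "dob": "1970-03-03",
     "name": "Cat Nobody", "address": "3 Mid Lane"}, patients)
```

Unlike probabilistic linkage, no similarity score is computed here: a record either matches exactly on the chosen identifiers or is left unlinked.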

Linkage completeness

Linkage has a very high level of completeness

Linked data source 2

Pre linked

Is the data source described created by the linkage of other data sources?

Yes

Data source, other

Cancer registry (This table is relational in nature, with one-to-many and many-to-many relationships. It contains three primary keys (PATIENTID, TUMOURID and EVENTID). This table can be considered the master linkage file for all other NCRAS data.)

Linkage strategy

Deterministic

Linkage variable

Routine linkages to other cancer registration datasets are conducted at patient and/or tumour level using the PATIENTID and TUMOURID fields. The primary patient identifier is the NHS number (a unique identifier used throughout the healthcare system in England), but date of birth, full name and address are also used for patient identification and linkage.

Linkage completeness

Linkage has a very high level of completeness

Linked data source 3

Pre linked

Is the data source described created by the linkage of other data sources?

Yes

Data source, other

RTDS - National Radiotherapy Dataset (Patients diagnosed with cancer in the calendar years 1995 to the last available cancer registration year, with RTDS treatment details available from 1 April 2009 to the last available quarter before the request date. This is approximately 6 months behind the current date.)

Linkage strategy

Deterministic

Linkage variable

Routine linkages to other cancer registration datasets are conducted at patient and/or tumour level using the PATIENTID and TUMOURID fields. The primary patient identifier is the NHS number (a unique identifier used throughout the healthcare system in England), but date of birth, full name and address are also used for patient identification and linkage.

Linkage completeness

Linkage has a very high level of completeness

Linked data source 4

Pre linked

Is the data source described created by the linkage of other data sources?

Yes

Data source, other

SACT - Systemic Anti-Cancer Therapy dataset (Patients treated with chemotherapy from April 2012 to the last available quarter before the request date. This is approximately 6 months behind the current date.)

Linkage strategy

Deterministic

Linkage variable

Routine linkages to other cancer registration datasets are conducted at patient and/or tumour level using the PATIENTID and TUMOURID fields. The primary patient identifier is the NHS number (a unique identifier used throughout the healthcare system in England), but date of birth, full name and address are also used for patient identification and linkage.

Linkage completeness

Linkage has a very high level of completeness
Data management specifications that apply for the data source

Data source refresh

Yearly

Informed consent for use of data for research

Possibility of data validation

Can validity of the data in the data source be verified (e.g., access to original medical charts)?

No

Data source preservation

Are records preserved in the data source indefinitely?

Yes

Approval for publication

Is an approval needed for publishing the results of a study using the data source?

Yes

Data source last refresh

Common Data Model (CDM) mapping

CDM mapping

Has the data source been converted (ETL-ed) to a common data model?

No