Development and testing of a text-mining approach to analyse patients’ comments on their experiences of colorectal cancer care

Wagland, Richard, Recio Saucedo, Alejandra, Simon, Michael, Bracher, Michael, Hunt, Katherine, Foster, Claire, Downing, Amy, Glaser, Adam W. and Corner, Jessica (2015) Development and testing of a text-mining approach to analyse patients’ comments on their experiences of colorectal cancer care. BMJ Quality & Safety, pp. 1-26. ISSN 2044-5423


Background: Quality of cancer care may greatly affect patients’ health-related quality of life (HRQoL). Free-text responses to patient-reported outcome measures (PROMs) provide rich data, but their analysis is time- and resource-intensive. This study developed and tested a learning-based text-mining approach to facilitate analysis of patients’ experiences of care and to develop an explanatory model illustrating the impact of care upon HRQoL.

Methods: Respondents to a population-based survey of colorectal cancer survivors provided free-text comments regarding their experience of living with and beyond cancer. An existing coding framework was tested and adapted, and the adapted framework informed learning-based text mining of the data. Machine-learning algorithms were trained to identify comments relating to patients’ specific experiences of service quality, and the retrieved comments were verified by manual qualitative analysis. Comparisons between coded retrieved comments and an HRQoL measure (EQ5D) were explored.
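
The record does not say which machine-learning algorithms were used. As a minimal sketch of the general idea only (train on hand-coded comments, then retrieve similar comments for manual review), the following uses a small multinomial Naive Bayes classifier written with the Python standard library. The algorithm choice, the function names, the labels "care" and "other", and the toy training comments are all assumptions made for illustration; none of them come from the study.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lower-case the comment and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(labelled_comments):
    """Count words per label from (text, label) pairs coded by hand."""
    word_counts = {}        # label -> Counter of word frequencies
    doc_counts = Counter()  # label -> number of training comments
    vocab = set()
    for text, label in labelled_comments:
        tokens = tokenize(text)
        word_counts.setdefault(label, Counter()).update(tokens)
        doc_counts[label] += 1
        vocab.update(tokens)
    return word_counts, doc_counts, vocab


def predict(model, text):
    """Return the most probable label, using add-one (Laplace) smoothing."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok in vocab:  # words never seen in training are ignored
                count = word_counts[label][tok] + 1
                score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented toy examples (NOT data from the survey).
TRAINING = [
    ("the nurses were wonderful and the care was excellent", "care"),
    ("no follow up care after my treatment ended", "care"),
    ("nobody gave me information about side effects", "care"),
    ("I enjoy walking my dog every morning", "other"),
    ("my family has been very supportive", "other"),
    ("I have returned to work part time", "other"),
]

model = train(TRAINING)
```

In a workflow like the one described, comments predicted as "care" would then be passed to manual qualitative analysis rather than treated as final.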

Results: The survey response rate was 63.3% (21,802/34,467), and 25.8% of respondents (n=5634) provided free-text comments. Of the retrieved comments on experiences of care (n=1688), over half (n=1045, 62%) described positive care experiences. Most negative experiences concerned a lack of post-treatment care (n=191, 11% of retrieved comments) and insufficient information concerning self-management strategies (n=135, 8%) or treatment side effects (n=160, 9%). Associations existed between HRQoL scores and coded algorithm-retrieved comments. Analysis indicated that the mechanism by which service quality impacted upon HRQoL was the extent to which services prevented or alleviated challenges associated with disease and treatment burdens.
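
The percentages reported above can be reproduced from the stated counts. The short sketch below does that arithmetic; the helper name `pct` is a hypothetical one introduced here, and the denominators are those given in the abstract.

```python
def pct(part, whole, digits=1):
    """Percentage of `part` in `whole`, rounded to `digits` decimal places."""
    return round(100 * part / whole, digits)


SURVEYED = 34467    # people sent the survey
RESPONDED = 21802   # survey respondents
COMMENTED = 5634    # respondents who provided free-text comments
RETRIEVED = 1688    # retrieved comments on experiences of care

response_rate = pct(RESPONDED, SURVEYED)     # share of those surveyed
comment_rate = pct(COMMENTED, RESPONDED)     # share of respondents
positive_share = pct(1045, RETRIEVED, 0)     # positive care experiences
post_treatment_share = pct(191, RETRIEVED, 0)
self_management_share = pct(135, RETRIEVED, 0)
side_effects_share = pct(160, RETRIEVED, 0)
```

Each category percentage uses the 1688 retrieved comments as its denominator, not the 5634 free-text comments overall.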

Conclusions: Learning-based text-mining techniques were found to be useful and practical tools for identifying specific free-text comments within a large dataset, facilitating resource-efficient qualitative analysis. This method should be considered for future PROM analysis to inform policy and practice. Study findings indicated that perceived care quality directly impacts upon HRQoL.

Item Type: Article
Keywords: text-mining, PROMs, quality of life, colorectal cancer, machine learning, machine learning algorithms, thematic analysis, thematic content analysis, qualitative methods
Subjects: QA75
Library of Congress Subject Areas > R Medicine > RC Internal medicine > RC 254 Neoplasms. Tumors. Oncology (including Cancer)
Library of Congress Subject Areas > R Medicine > RT Nursing
Schools/Departments: University of Nottingham, UK > Faculty of Medicine and Health Sciences > School of Health Sciences
Depositing User: Eprints, Support
Date Deposited: 20 Oct 2015 15:49
Last Modified: 04 May 2020 17:18
