Result: Challenges and solutions to employing natural language processing and machine learning to measure patients' health literacy and physician writing complexity: The ECLIPPSE study.

Title:

Challenges and solutions to employing natural language processing and machine learning to measure patients' health literacy and physician writing complexity: The ECLIPPSE study.

Authors:

Brown W 3rd; Center for AIDS Prevention Studies, University of California, San Francisco, San Francisco, CA, United States; Bakar Computational Health Science Institute, University of California, San Francisco, San Francisco, CA, United States; University of California San Francisco Center for Vulnerable Populations, Zuckerberg San Francisco General Hospital, San Francisco, CA, United States; Department of Medicine, University of California, San Francisco, San Francisco, CA, United States. Electronic address: william.brown@ucsf.edu.
Balyan R; State University of New York Old Westbury, NY, United States; Department of Psychology, Arizona State University, Tempe, AZ, United States.
Karter AJ; Division of Research, Kaiser Permanente Northern California, Oakland, CA, United States.
Crossley S; Department of Applied Linguistics and English as a Second Language, Georgia State University, Atlanta, GA, United States.
Semere W; Department of Medicine, University of California, San Francisco, San Francisco, CA, United States.
Duran ND; School of Social and Behavioral Sciences, Arizona State University, Glendale, AZ, United States.
Lyles C; University of California San Francisco Center for Vulnerable Populations, Zuckerberg San Francisco General Hospital, San Francisco, CA, United States; Department of Medicine, University of California, San Francisco, San Francisco, CA, United States; Division of Research, Kaiser Permanente Northern California, Oakland, CA, United States.
Liu J; Division of Research, Kaiser Permanente Northern California, Oakland, CA, United States.
Moffet HH; Division of Research, Kaiser Permanente Northern California, Oakland, CA, United States.
Daniels R; University of California San Francisco Center for Vulnerable Populations, Zuckerberg San Francisco General Hospital, San Francisco, CA, United States.
McNamara DS; Department of Psychology, Arizona State University, Tempe, AZ, United States.
Schillinger D; University of California San Francisco Center for Vulnerable Populations, Zuckerberg San Francisco General Hospital, San Francisco, CA, United States; Department of Medicine, University of California, San Francisco, San Francisco, CA, United States; Division of Research, Kaiser Permanente Northern California, Oakland, CA, United States.

Source:

Journal of biomedical informatics [J Biomed Inform] 2021 Jan; Vol. 113, pp. 103658. Date of Electronic Publication: 2020 Dec 11.

Publication Type:

Journal Article; Research Support, N.I.H., Extramural; Research Support, U.S. Gov't, P.H.S.

Language:

English

Journal Info:

Publisher: Elsevier
Country of Publication: United States
NLM ID: 100970413
Publication Model: Print-Electronic
Cited Medium: Internet
ISSN: 1532-0480 (Electronic)
Linking ISSN: 15320464
NLM ISO Abbreviation: J Biomed Inform
Subsets: MEDLINE

Imprint Name(s):

Publication: Orlando : Elsevier
Original Publication: San Diego, CA : Academic Press, c2001-

MeSH Terms:

Health Literacy*; Physicians*; Humans; Machine Learning; Natural Language Processing; Writing

Grant Information:

KL2 TR001870 United States TR NCATS NIH HHS; K12 HS026383 United States HS AHRQ HHS; R01 LM012355 United States LM NLM NIH HHS; P30 DK092924 United States DK NIDDK NIH HHS; R01 LM013045 United States LM NLM NIH HHS

Contributed Indexing:

Keywords: Diabetes health care quality; Digital health and health services research; Electronic health records; Health literacy; Machine learning; Natural language processing

Entry Date(s):

Date Created: 20201214; Date Completed: 20210728; Latest Revision: 20240923

Update Code:

20260130

PubMed Central ID:

PMC8186847

DOI:

10.1016/j.jbi.2020.103658

PMID:

33316421

Database:

MEDLINE

Further information

Objective: In the National Library of Medicine funded ECLIPPSE Project (Employing Computational Linguistics to Improve Patient-Provider Secure Emails exchange), we attempted to create novel, valid, and scalable measures of both patients' health literacy (HL) and physicians' linguistic complexity by employing natural language processing (NLP) techniques and machine learning (ML). We applied these techniques to > 400,000 patients' and physicians' secure messages (SMs) exchanged via an electronic patient portal, developing and validating an automated patient literacy profile (LP) and physician complexity profile (CP). Herein, we describe the challenges faced and the solutions implemented during this innovative endeavor.
Materials and Methods: To describe challenges and solutions, we used two data sources: study documents and interviews with study investigators. Over the five years of the project, the team tracked their research process using a combination of Google Docs tools and an online team organization, tracking, and management tool (Asana). In year 5, the team convened a number of times to discuss, categorize, and code primary challenges and solutions.
Results: We identified 23 challenges and associated approaches that emerged from three overarching process domains: (1) Data Mining related to the SM corpus; (2) Analyses using NLP indices on the SM corpus; and (3) Interdisciplinary Collaboration. With respect to Data Mining, problems included cleaning SMs to enable analyses, removing hidden caregiver proxies (e.g., other family members) and Spanish language SMs, and culling SMs to ensure that only patients' primary care physicians were included. With respect to Analyses, critical decisions needed to be made as to which computational linguistic indices and ML approaches should be selected; how to enable the NLP-based linguistic indices tools to run smoothly and to extract meaningful data from a large corpus of medical text; and how to best assess content and predictive validities of both the LP and the CP. With respect to the Interdisciplinary Collaboration, because the research required engagement between clinicians, health services researchers, biomedical informaticians, linguists, and cognitive scientists, continual effort was needed to identify and reconcile differences in scientific terminologies and resolve confusion; arrive at common understanding of tasks that needed to be completed and priorities therein; reach compromises regarding what represents "meaningful findings" in health services vs. cognitive science research; and address constraints regarding potential transportability of the final LP and CP to different health care settings.
Discussion: Our study represents a process evaluation of an innovative research initiative to harness "big linguistic data" to estimate patient HL and physician linguistic complexity. Any of the challenges we identified, if left unaddressed, would have either rendered impossible the effort to generate LPs and CPs, or invalidated analytic results related to the LPs and CPs. Investigators undertaking similar research in HL or using computational linguistic methods to assess patient-clinician exchange will face similar challenges and may find our solutions helpful when designing and executing their health communications research.
(Copyright © 2020 Elsevier Inc. All rights reserved.)
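
Illustrative sketch (not part of the published record): the Data Mining steps named in the Results, dropping Spanish-language SMs and keeping only SMs exchanged with a patient's primary care physician, could look roughly like the Python below. The record layout, the field names, the helper functions, and the stopword heuristic are all assumptions made for illustration. The abstract does not say how the study team implemented these steps, and removal of hidden caregiver proxies is not modeled here.

```python
from dataclasses import dataclass


# Hypothetical record layout; field names are assumptions, not the study's schema.
@dataclass
class SecureMessage:
    patient_id: str
    physician_id: str
    text: str


# Tiny illustrative word lists. A real pipeline would use a proper
# language-identification tool rather than this crude heuristic.
SPANISH_HINTS = {"el", "la", "los", "las", "que", "de", "y", "para",
                 "por", "una", "gracias", "tengo"}
ENGLISH_HINTS = {"the", "and", "to", "of", "is", "my", "for", "have",
                 "you", "i", "a", "please"}


def looks_spanish(text: str) -> bool:
    """Return True when Spanish hint words outnumber English hint words."""
    words = [w.strip(".,;:!?()\"'").lower() for w in text.split()]
    spanish = sum(w in SPANISH_HINTS for w in words)
    english = sum(w in ENGLISH_HINTS for w in words)
    return spanish > english


def cull_messages(messages, primary_care):
    """Keep English-looking messages exchanged with the patient's
    primary care physician.

    primary_care maps patient_id -> physician_id (an assumed lookup table).
    """
    kept = []
    for m in messages:
        if primary_care.get(m.patient_id) != m.physician_id:
            continue  # message involves someone other than the patient's PCP
        if looks_spanish(m.text):
            continue  # Spanish-language message, excluded from the corpus
        kept.append(m)
    return kept
```

With a lookup such as `{"p1": "d1"}`, a call like `cull_messages(messages, {"p1": "d1"})` returns only the messages between patient `p1` and physician `d1` that the heuristic does not flag as Spanish.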
