A topic modeling analysis of stigma dimensions, social, and related behavioral circumstances in clinical notes among patients with HIV.

To characterize stigma dimensions, social, and related behavioral circumstances in people living with HIV (PLWHs) seeking care, using natural language processing methods applied to a large collection of electronic health record (EHR) clinical notes from a large integrated health system in the southeast United States.

We identified a cohort of PLWHs from the University of Florida (UF) Health Integrated Data Repository and performed topic modeling analysis using Latent Dirichlet Allocation (LDA) to uncover stigma-related dimensions and related social and behavioral contexts. Domain experts created a seed list of HIV-related stigma keywords, then applied a snowball strategy to iteratively review notes for additional terms until saturation was reached. To identify more target topics, we tested three keyword-based filtering strategies. The detected topics were evaluated using three widely used metrics and manually reviewed by specialists. Word frequency analysis was used to highlight the prevalent terms associated with each topic. In addition, we conducted topic variation analysis among subgroups to examine differences across age- and sex-specific demographics.

We identified 9,140 PLWHs at UF Health and collected 2.9 million clinical notes. Through the iterative keyword approach, we generated a list of 91 keywords associated with HIV-related stigma. Topic modeling on sentences containing at least one keyword uncovered a wide range of topic themes associated with HIV-related stigma, social, and related behaviors circumstances, including "Mental Health Concern and Stigma", "Social Support and Engagement", "Limited Healthcare Access and Severe Illness", "Missed Appointments and HIV Care Monitoring", "Treatment Refusal and Isolation", "Intimate Partner Violence and Relationship Concerns", "Fear of Falling and Physical Health Concerns", "Substance Abuse", and "Food Insecurity and Resource Scarcity". Topic variation analysis across sex and age subgroups revealed no substantial difference between males and females; however, there were differences were observed among different ages. For example, "Fear of Falling and Physical Health Concerns" was notably more prevalent among older adults.

Extracting and understanding the HIV-related stigma and associated social and behavioral circumstances from EHR clinical notes enables scalable, time-efficient assessment and overcoming the limitations of traditional questionnaires. Findings from this research provide actionable insights to inform patient care and interventions to improve HIV-care outcomes.
Mental Health
Access
Care/Management

Authors

Chen Chen, Liu Liu, Prosperi Prosperi, Vaddiparti Vaddiparti, Cook Cook, Bian Bian, Guo Guo, Wu Wu
View on Pubmed
Share
Facebook
X (Twitter)
Bluesky
Linkedin
Copy to clipboard