Frame Semantic Patterns for Identifying Underreporting of Notifiable Events in Healthcare-- The Case of Gender-Based Violence
Frame Semantic Patterns for Identifying Underreporting of Notifiable Events in Healthcare: The Case of Gender-Based Violence
Categories
- cs.CL
- cs.AI
Initially Published At
2025-10-30T19:52:24Z
Authors
- Lívia Dutra
- Arthur Lorenzi
- Laís Berno
- Franciany Campos
- Karoline Biscardi
- Kenneth Brown
- Marcelo Viridiano
- Frederico Belcavello
- Ely Matos
- Olívia Guaranha
- Erik Santos
- Sofia Reinach
- Tiago Timponi Torrent
Summary
We introduce a methodology for the identification of notifiable events in the domain of healthcare. The methodology harnesses semantic frames to define fine-grained patterns and search them in unstructured data, namely, open-text fields in e-medical records. We apply the methodology to the problem of underreporting of gender-based violence (GBV) in e-medical records produced during patients’ visits to primary care units. A total of eight patterns are defined and searched on a corpus of 21 million sentences in Brazilian Portuguese extracted from e-SUS APS. The results are manually evaluated by linguists and the precision of each pattern measured. Our findings reveal that the methodology effectively identifies reports of violence with a precision of 0.726, confirming its robustness. Designed as a transparent, efficient, low-carbon, and language-agnostic pipeline, the approach can be easily adapted to other health surveillance contexts, contributing to the broader, ethical, and explainable use of NLP in public health systems.