Trustworthiness in Medical Product Question Answering by Large Language Models

Date: November 04, 2025

I gave an invited talk at the Machine Learning for Healthcare Roundtable during the Amazon Machine Learning Conference 2025, presenting our work on evaluating the trustworthiness of large language models (LLMs) in medical product question answering.

Drawing on our evaluation framework for generative shopping assistants, the talk introduced a systematic method for assessing whether LLM-generated answers about medical products adhere to FDA-approved labeling, with the goal of mitigating the risks of hallucination, misinformation, and off-label promotion.

Daniel Lopez-Martinez presenting on LLM trustworthiness and FDA-compliant question answering at the Machine Learning for Healthcare Roundtable, Amazon Machine Learning Conference 2025.

Daniel Lopez-Martinez holding the Amazon Machine Learning Conference 2025 badge with the Seattle skyline in the background.
