Clinical Evidence Synthesis AI
Developing a clinical practice guideline is a months-long manual effort, and it cannot keep pace with the evidence. This research asks whether AI can draft an evidence-based guideline far faster while staying trustworthy, by implementing the evidence methodology end to end, verifying the model's own output against source data, refusing any quantitative claim it cannot trace, and keeping a human panel in control of every recommendation.
Developing a clinical practice guideline is a slow, expert-bound process that cannot keep pace with the evidence, and many questions never get a guideline at all. This thread investigates whether the evidence-synthesis workflow can be drafted by an agentic pipeline in a fraction of the time while remaining trustworthy enough for clinical use. The organizing idea is distrust: the model's output is treated as untrusted input, every quantitative claim must trace back to source data, unsupported or direction-flipped estimates are rejected before they reach a clinician, and a human panel gates every recommendation. It is framed throughout as decision support that drafts while the panel decides, never an autonomous guideline author. The work draws on agentic LLM architectures, evidence-synthesis methodology, source-grounded generation, and rigorous evaluation against human-curated ground truth. Faculty-advised.