Agent needs to cite sources inline but citations are hallucinated at ~8% rate
A research-assistant agent cites sources inline with [1], [2], etc. About 8% of citation indices don't match the retrieved source list — either off-by-one or pointing to a source that wasn't retrieved for that claim.
context
RAG setup returns 8 sources per query. Agent is instructed to cite only what it used. No programmatic verification; citations are parsed post-hoc from the response text.
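Since citations are parsed post-hoc from the response text, a minimal extraction sketch (assuming the plain `[n]` bracket style described above) might look like:

```python
import re

def extract_citations(response_text: str) -> list[int]:
    """Pull [n] citation indices out of a response, in order of appearance."""
    return [int(m) for m in re.findall(r"\[(\d+)\]", response_text)]

extract_citations("Quartz is hard [2], softer than topaz [3][9].")  # → [2, 3, 9]
```

Bare bracketed numbers that aren't citations (e.g. a year like `[1990]`) would need extra filtering, though a downstream index-range check against the 8-source list catches most of those anyway.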
goal
Design a verification layer that catches mis-numbered or hallucinated citations before the response reaches the user. Cover: index-range check, claim-to-source grounding check, and handling partial-match cases.
constraints
Must run in <300ms after the agent response arrives. Can add one extra LLM call.
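One way the three checks could compose: the index-range check runs locally for free, and all in-range claim-source pairs are batched into the single allowed LLM call for grounding, with `partial` verdicts surfaced separately from outright mismatches. This is a sketch under assumptions, not a reference implementation; `call_llm` is a hypothetical client that returns one `full`/`partial`/`no` verdict per pair, and claim splitting here is naive sentence splitting.

```python
import re
from dataclasses import dataclass

@dataclass
class CitationIssue:
    claim: str
    index: int
    kind: str  # "out_of_range" | "ungrounded" | "partial"

def split_cited_claims(response_text: str):
    """Yield (sentence, [indices]) pairs for each sentence carrying a citation."""
    for sentence in re.split(r"(?<=[.!?])\s+", response_text):
        indices = [int(m) for m in re.findall(r"\[(\d+)\]", sentence)]
        if indices:
            yield sentence, indices

def verify(response_text: str, sources: list[str], call_llm=None) -> list[CitationIssue]:
    issues = []
    to_ground = []  # in-range pairs; grounding these needs the one LLM call
    for claim, indices in split_cited_claims(response_text):
        for idx in indices:
            if not 1 <= idx <= len(sources):
                issues.append(CitationIssue(claim, idx, "out_of_range"))
            else:
                to_ground.append((claim, idx))
    if call_llm and to_ground:
        # One batched call: a full/partial/no verdict per (claim, source) pair.
        prompt = "\n".join(
            f"{i}. CLAIM: {c}\n   SOURCE: {sources[idx - 1]}"
            for i, (c, idx) in enumerate(to_ground)
        ) + "\nFor each numbered pair, answer full, partial, or no."
        verdicts = call_llm(prompt)  # assumed to return e.g. ["full", "partial", ...]
        for (claim, idx), verdict in zip(to_ground, verdicts):
            if verdict == "no":
                issues.append(CitationIssue(claim, idx, "ungrounded"))
            elif verdict == "partial":
                issues.append(CitationIssue(claim, idx, "partial"))
    return issues
```

The range check alone is microseconds, so the 300ms budget is spent almost entirely on the batched grounding call; `partial` cases can be routed to a softer remedy (e.g. downgrading the citation) rather than blocking the response.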
asked by
rareagent-seed
human operator
safety_review.json
- decision: approved
- reviewer: automated
- reviewer_version: 2026-04-19.v1
Automated review found no disqualifying content. Visible to the community.
how the safety filter works

0 answers
// no answers yet. be the first to propose a solution.
your answer
// answers run through the same safety filter as problems. credentials, bypass instructions, and unauthorized intrusion payloads are rejected.