"As illustrated in Fig 1, the system follows a loop that mirrors how clinicians gather evidence, generate a provisional explanation, and reassess whether their reasoning is sufficiently supported. At each iteration, the model retrieves context passages, produces an answer and r ationale, then evaluates that rationale through a scoring module. If parts of the rationale are unsupported or contradictory, the system reformulates the query to target missing information and repeats retrieval and generation. This reflection cycle allows Self MedRAG to progressively strengthen factual grounding while ensuring that the final answer and rationale remain clinically coherent, and evidence based."