Three New arXiv Papers Address LLM Safety, Memory, and Healthcare Applications

Three new preprints on arXiv explore different aspects of large language model deployment and security.

Healthcare Diagnostics

According to arXiv:2512.17559v1, researchers are working on “Explainable Conversational AI for Early Diagnosis with Large Language Models.” The paper addresses challenges in healthcare systems including “inefficient diagnostics, rising costs, and limited access to specialists,” which “often lead to delays in treatment and poor health outcomes,” according to the abstract.

Security Vulnerabilities

A paper titled “MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval” (arXiv:2512.16962v1) examines security risks in LLM agents. According to the abstract, these agents “increasingly rely on long-term memory and Retrieval-Augmented Generation (RAG) to persist experiences and refine future performance.” The research warns that “while this experience learning capability enhances agentic autonomy, it introduces” vulnerabilities through poisoned memory retrieval.

Policy Compliance

The paper “Towards Safer Chatbots: Automated Policy Compliance Evaluation of Custom GPTs” (arXiv:2502.01436v3) focuses on user-configured chatbots in marketplaces like OpenAI’s GPT Store. According to the abstract, these platforms “enforce usage policies intended to prevent harmful or inappropriate” content, though the full scope of compliance challenges remains under investigation.