Three New arXiv Papers Examine Trust, Collaboration, and Reasoning in Large Language Models

Recent research explores privacy in compressed LLMs, collective false memories in multi-agent systems, and diagnostic reasoning with retrieval-augmented generation.

Three New arXiv Papers Examine Trust, Collaboration, and Reasoning in Large Language Models

Three recent preprints on arXiv explore different aspects of large language model (LLM) capabilities and limitations.

Low-Rank Model Compression and Trust

According to arXiv:2511.22099v2, researchers are examining privacy, adversarial robustness, fairness, and ethics of low-rank LLMs. The paper notes that while LLMs have “driven major advances across domains,” their “massive size hinders deployment in resource-constrained settings.” The research focuses on low-rank factorization as a compression technique to address this challenge.

Collective False Memories in Multi-Agent Systems

A paper titled “When Agents ‘Misremember’ Collectively” (arXiv:2602.00428v1) investigates the “Mandela Effect” in LLM-based multi-agent systems. According to the abstract, while “recent advancements in large language models (LLMs) have significantly enhanced the capabilities of collaborative multi-agent systems,” these systems show “susceptibility” to collective misremembering phenomena.

Diagnostic Reasoning in Healthcare

Researchers are developing end-to-end agentic RAG (Retrieval-Augmented Generation) systems for healthcare diagnostics (arXiv:2508.15746v2). The paper addresses how “integration of Large Language Models (LLMs) into healthcare is constrained by knowledge limitations, hallucinations, and a disconnect from Evidence-Based Medicine (EBM),” proposing RAG as a potential solution for traceable diagnostic reasoning.