Three New arXiv Papers Examine Trust, Collaboration, and Reasoning in Large Language Models

Three recent preprints on arXiv explore different aspects of large language model (LLM) capabilities and limitations.

Low-Rank Model Compression and Trust

The first preprint (arXiv:2511.22099v2) examines the privacy, adversarial robustness, fairness, and ethics of low-rank LLMs. It notes that while LLMs have “driven major advances across domains,” their “massive size hinders deployment in resource-constrained settings.” The work focuses on low-rank factorization, a compression technique intended to address this challenge.
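For readers unfamiliar with the technique, the following is a minimal sketch of the general idea behind low-rank factorization, not the paper's own method: a dense weight matrix W of shape d_out × d_in is replaced by two thinner factors, A (d_out × r) and B (r × d_in), so that fewer parameters need to be stored. The layer size, the rank, and all function names here are hypothetical and chosen only for illustration.

```python
def low_rank_param_count(d_out, d_in, rank):
    """Parameters needed to store W (d_out x d_in) as A (d_out x rank) times B (rank x d_in)."""
    return rank * (d_out + d_in)


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


# Hypothetical layer size and rank (assumptions, not taken from the paper).
d_out, d_in, rank = 4096, 4096, 64
dense_params = d_out * d_in
factored_params = low_rank_param_count(d_out, d_in, rank)
compression_ratio = dense_params / factored_params

# Tiny rank-2 example: applying B and then A gives the same result as
# applying the full product W = A @ B, without ever materializing W.
A = [[1, 0], [0, 1], [1, 1]]        # 3 x 2
B = [[1, 2, 3, 4], [0, 1, 0, 1]]    # 2 x 4
x = [1, 1, 1, 1]
y_factored = matvec(A, matvec(B, x))
y_dense = matvec(matmul(A, B), x)

print(dense_params, factored_params, compression_ratio)
print(y_factored == y_dense)
```

In practice the factors are usually obtained by approximating a trained weight matrix (for example with a truncated singular value decomposition), so the factored layer only approximates the original one. That approximation is one reason the trust properties of compressed models are worth studying separately.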

Collective False Memories in Multi-Agent Systems

A paper titled “When Agents ‘Misremember’ Collectively” (arXiv:2602.00428v1) investigates the “Mandela Effect” in LLM-based multi-agent systems. According to the abstract, although “recent advancements in large language models (LLMs) have significantly enhanced the capabilities of collaborative multi-agent systems,” these systems are susceptible to collective misremembering.

Diagnostic Reasoning in Healthcare

Researchers are developing end-to-end agentic RAG (Retrieval-Augmented Generation) systems for healthcare diagnostics (arXiv:2508.15746v2). The paper addresses how “integration of Large Language Models (LLMs) into healthcare is constrained by knowledge limitations, hallucinations, and a disconnect from Evidence-Based Medicine (EBM),” proposing RAG as a potential solution for traceable diagnostic reasoning.
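To illustrate what "traceable" can mean in a retrieval-augmented pipeline, here is a minimal sketch under stated assumptions. It is not the paper's system: the corpus entries are invented placeholder text (not medical guidance), the keyword-overlap scoring stands in for a real retriever, and every name is hypothetical. The point is only that each retrieved passage keeps its identifier, so an answer can cite the evidence it drew on.

```python
import re

# Invented placeholder passages for illustration only; not medical guidance.
CORPUS = {
    "placeholder-001": "persistent cough and fever may warrant a chest x-ray",
    "placeholder-002": "joint pain with morning stiffness suggests rheumatology referral",
    "placeholder-003": "fever with rash in children requires prompt assessment",
}


def tokenize(text):
    """Lowercase the text and return its set of alphabetic words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def retrieve(query, corpus, top_k=2):
    """Rank passages by word overlap with the query and return the top ids."""
    query_words = tokenize(query)
    scored = []
    for doc_id, text in corpus.items():
        overlap = len(query_words & tokenize(text))
        if overlap:
            scored.append((overlap, doc_id))
    # Highest overlap first; ties broken by id for deterministic output.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored[:top_k]]


def build_prompt(query, corpus, top_k=2):
    """Assemble a prompt whose evidence lines carry their source ids."""
    doc_ids = retrieve(query, corpus, top_k)
    evidence = "\n".join(f"[{doc_id}] {corpus[doc_id]}" for doc_id in doc_ids)
    prompt = (
        f"Evidence:\n{evidence}\n\n"
        f"Question: {query}\n"
        "Cite the evidence ids that support the answer."
    )
    return prompt, doc_ids


prompt, cited_ids = build_prompt("patient with fever and cough", CORPUS)
print(prompt)
```

A production system would replace the overlap score with a proper retriever and pass the prompt to a language model. The id-tagged evidence lines are what would let a reader check each claim against its source.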