Three New arXiv Papers Examine Language Models: Visual Alignment, Model Selection for Social Science, and Medical Imaging

Recent arXiv preprints explore LLM visual understanding, model selection guidance for researchers, and causal reasoning for medical segmentation.

Three recent papers on arXiv explore different aspects of large language model capabilities and applications.

Visual Alignment in Text-Only Models

According to the abstract of arXiv:2410.07173v3, researchers conducted “a systematic evaluation” of how well text-only large language models align with the visual world. The study incorporated “frozen representations of various language models into a discriminative vision-language frame[work],” though the abstract excerpt provided does not detail the findings.

Model Selection Guidance for Social Scientists

A second paper (arXiv:2601.10926v1) addresses a challenge facing social scientists: choosing among the “thousands of large pretrained language models” now available. The authors explore model selection using “validity, reliability, reproducibility, and replicability as guides,” though the abstract excerpt provided does not specify their recommendations.

Causal Reasoning for Medical Segmentation

The paper “Causal-SAM-LLM” (arXiv:2507.03585v2) examines a critical limitation in medical AI. According to the abstract, “the clinical utility of deep learning models for medical image segmentation is severely constrained by their inability to generalize to unseen domains.” The authors attribute this failure to “models learning spurious correlations between anatomic” features. The excerpt provided does not detail the complete methodology.