Three New arXiv Papers Address Transformer Learning Dynamics, LLM Reasoning, and AI-Generated Image Detection

Recent arXiv papers examine transformer arithmetic learning, reinforcement learning for LLM reasoning, and vision language models for detecting AI-generated images.

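The three papers cover separate problems: unexpected errors in how transformers learn arithmetic, low sample efficiency in reinforcement learning for LLM reasoning, and the difficulty of turning vision language models into reliable detectors of AI-generated images.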

Transformer Arithmetic Learning

A paper titled “Shattered Compositionality: Counterintuitive Learning Dynamics of Transformers for Arithmetic” (arXiv:2601.22510v1) examines the unexpected errors that large language models make even at scale. According to the abstract, the work “reveals the discrepancy between LLMs and humans in skill compositions” and investigates “the learning dynamics of skill compositions.”

Reinforcement Learning for LLM Reasoning

A second paper, “Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective” (arXiv:2505.17652v3), addresses the efficiency challenges of using reinforcement learning to improve LLM reasoning. The abstract states that “reinforcement learning exhibits potential in enhancing the reasoning abilities of large language models, yet it is hard to scale for the low sample efficiency during the rollout phase.”

AI-Generated Image Detection

The paper “AlignGemini: Generalizable AI-Generated Image Detection Through Task-Model Alignment” (arXiv:2512.06746v2) focuses on using Vision Language Models (VLMs) for detecting AI-generated images. According to the abstract, while VLMs are “increasingly used for detecting AI-generated images,” converting them into reliable detectors is “resource-intensive,” and the resulting models “often suffer from hallucination and poor generalization.”