Three New arXiv Papers Explore LLM Reasoning, Inference Efficiency, and Policy Applications

Three preprints posted to arXiv examine different aspects of large language model capabilities and applications.

Proactive Reasoning Models: According to arXiv:2601.22139v1, a new paper titled “Reasoning While Asking” proposes transforming reasoning-oriented LLMs from what the authors describe as a “blind self-thinking paradigm” into “proactive inquirers.” The abstract indicates current Chain-of-Thought (CoT) prompted models perform “extensive internal reasoning” even in situations where this approach may be limited.

Inference Optimization: A separate paper (arXiv:2601.21522v1) introduces a method called “Reset and Discard (ReD)” aimed at improving LLM inference efficiency within fixed computational budgets. The research focuses on “coverage@cost” as a metric rather than the traditional “pass@k” measure, according to the abstract, which defines coverage@cost as “the average number” of correctly answered questions at a given cost.

Policy Evaluation: In arXiv:2509.03827v2, researchers evaluate LLMs for social policymaking, specifically addressing homelessness. According to the abstract, the paper examines LLMs’ “potential to encode evolving social contexts and to generate plausible scenarios” for use as “tools in social policymaking.”

All three papers represent cross-listed or replacement submissions to arXiv’s AI section.