Three New arXiv Papers Explore Specialized Training Methods for Large Language Models

Researchers publish studies on domain-specific LLM training, test-time alignment optimization, and multi-agent molecular generation with LLMs.

Three recent papers on arXiv explore different approaches to optimizing large language models for specific applications.

StatLLaMA (arXiv:2601.09718v1) investigates building a domain-specialized LLM for statistics using the LLaMA-3.2-3B family as a foundation model. According to the abstract, the researchers “systematically compare three multi-stage training pipelines” to efficiently create a statistics-focused model.

LLMdoctor (arXiv:2601.10416v1) presents a token-level flow-guided preference optimization method for test-time alignment of LLMs. The abstract argues that “traditional fine-tuning methods are computationally expensive and inflexible” and positions test-time alignment as an alternative. The authors note that existing test-time methods “often rely on dist…” (abstract truncated in source).

M^4olGen (arXiv:2601.10131v1) introduces a multi-agent, multi-stage framework for molecular generation under precise multi-property constraints. The researchers address the limitation that “although large language models (LLMs) are expressive, they struggle with precise multi-objective control and numeric rea[soning]” (abstract truncated in source) when generating molecules that meet specific physicochemical property requirements.

All three papers are listed in arXiv's AI category with an announce type of “new” or “cross.”