Three New arXiv Papers Explore Specialized Training Methods for Large Language Models
Three recent papers on arXiv explore different approaches to optimizing large language models for specific applications.
StatLLaMA (arXiv:2601.09718v1) investigates building a domain-specialized LLM for statistics using the LLaMA-3.2-3B family as a foundation model. According to the abstract, the researchers “systematically compare three multi-stage training pipelines” to efficiently create a statistics-focused model.
LLMdoctor (arXiv:2601.10416v1) presents a token-level flow-guided preference optimization method for test-time alignment of LLMs. The paper addresses the challenge that “traditional fine-tuning methods are computationally expensive and inflexible,” proposing test-time alignment as an alternative approach. The authors note that existing test-time methods “often rely on dist[inct approaches]” (abstract truncated in source).
M^4olGen (arXiv:2601.10131v1) introduces a multi-agent, multi-stage framework for molecular generation under precise multi-property constraints. The researchers address the limitation that “although large language models (LLMs) are expressive, they struggle with precise multi-objective control and numeric rea[soning]” (abstract truncated in source) when generating molecules that meet specific physicochemical property requirements.
All three papers represent “Announce Type: new” or “cross” submissions to the AI category on arXiv.