New AI Research Advances Test-Time Adaptation, Search Skills, and Cross-Modal Generation

Three new research papers published on arXiv introduce distinct approaches to improving AI model capabilities across different domains.

According to arxiv.org, researchers have proposed Query-Conditioned Test-Time Self-Training (QueST), a framework that adapts large language model parameters during inference using supervision derived directly from the input query. The paper states that QueST “generates query-conditioned pairs and uses them as supervision for parameter-efficient fine-tuning at test time.” Across seven mathematical reasoning benchmarks and the GPQA-Diamond scientific reasoning benchmark, QueST consistently outperformed strong test-time optimization baselines, according to the abstract.

In a separate development, arxiv.org reports on SearchSkill, a framework that teaches language models to use search tools through “reusable search skills.” According to the paper, “the model first selects a skill, then generates a search or answer action conditioned on the selected skill card.” The system maintains an evolving SkillBank that expands or refines based on recurrent failure patterns. SearchSkill improved exact match performance on knowledge-intensive QA benchmarks and produced “fewer copied first queries, more atomic hop-focused queries, and more correct answers within a small search budget,” according to arxiv.org.

Additionally, researchers introduced V2M-Zero, a video-to-music generation approach that requires zero video-music pairs at training time, according to arxiv.org. The method achieved state-of-the-art performance with “5-9% higher audio quality, 13-15% better semantic alignment, 21-52% improved temporal synchronization, and 28% higher beat alignment on dance videos” compared to prior baselines.