Three New arXiv Papers Address LLM Safety, Trust Calibration, and Time Series Forecasting

Recent preprints introduce methods for maintaining LLM safety during fine-tuning, calibrating AI trust, and teaching LLMs to forecast time series data.

The three preprints, arXiv:2602.00038v1, arXiv:2602.00769v1, and arXiv:2602.01937v1, each take on a separate problem in building and deploying large language models.

Safety Alignment Through Low-Rank Fusion

In arXiv:2602.00038v1, researchers propose LSSF (Low-Rank Safety Subspace Fusion) to address the fragility of LLM safety mechanisms. The paper notes that “even fine-tuning on datasets without harmful content may still undermine their safety capabilities,” which points to a vulnerability in current safety approaches.

Economic Games for Trust Calibration

The second paper, arXiv:2602.00769v1, introduces a method that uses economic games to elicit trustworthiness in LLMs. According to the abstract, the research treats “maintaining calibrated trust” as essential for human-centered AI systems, with the aim of avoiding both “overtrust (e.g., automation bias) and undertrust (e.g., disuse).”

Time Series Forecasting via Temporal Distillation

T-LLM, described in arXiv:2602.01937v1, proposes teaching large language models to forecast time series through temporal distillation. The paper notes that “time series data is inherently tied to the evolution of underlying processes and can only accumulate as” time progresses, distinguishing it from vision and language domains where LLMs have traditionally excelled.

All three papers are cross-listed under cs.AI on arXiv.