
fix: numerically unstable log-odds in ORPO loss #6407

Open

Mr-Neutr0n wants to merge 1 commit into hpcaitech:main from Mr-Neutr0n:fix/orpo-log-odds-numerical-stability

Conversation

@Mr-Neutr0n

Bug

The ORPO loss (OddsRatioLoss in applications/ColossalChat/coati/models/loss.py) computes log-odds using a numerically fragile pattern:

chosen_odds = chosen_logp - torch.log(-torch.exp(chosen_logp) + 1.0001)

This has two problems:

  1. Biased constant: The magic value 1.0001 shifts the result away from the mathematically correct value, introducing a systematic bias into the loss.
  2. NaN risk: When exp(logp) > 1.0001 (which can happen due to floating-point imprecision, especially in mixed-precision training), the argument to torch.log becomes negative, producing NaN and poisoning the training run.

The mathematically correct log-odds formula is log(p / (1 - p)) = log(p) - log(1 - p) = logp - log(1 - exp(logp)).
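A minimal standalone sketch of both failure modes (the helper name and the sample values are illustrative, not repository code):

```python
import torch

# Simplified stand-in for the current pattern in OddsRatioLoss.
def log_odds_original(logp: torch.Tensor) -> torch.Tensor:
    return logp - torch.log(-torch.exp(logp) + 1.0001)

# Bias: for p = 0.5 the true log-odds is exactly 0,
# but the 1.0001 offset shifts the result to roughly -2e-4.
print(log_odds_original(torch.log(torch.tensor(0.5))))

# NaN: if rounding (e.g. under fp16/bf16) pushes logp slightly above
# log(1.0001), the log argument goes negative and the result is nan.
print(log_odds_original(torch.tensor(1e-3)))
```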

Fix

  • Clamp logp to (-inf, -eps] so that exp(logp) is strictly less than 1, preventing both a zero and a negative argument to the log.
  • Replace torch.log(-torch.exp(logp) + 1.0001) with torch.log1p(-torch.exp(logp)), which is the numerically correct and unbiased way to compute log(1 - exp(logp)).

This eliminates both the NaN risk and the systematic bias from the hardcoded offset.
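
A minimal sketch of the proposed computation (the helper name and the eps value are illustrative; the actual change is in applications/ColossalChat/coati/models/loss.py):

```python
import torch

def log_odds_stable(logp: torch.Tensor, eps: float = 1e-7) -> torch.Tensor:
    # Clamp to (-inf, -eps] so exp(logp) stays strictly below 1 and the
    # argument of log1p stays in (-1, 0).
    logp = torch.clamp(logp, max=-eps)
    # log1p(-exp(logp)) == log(1 - exp(logp)) with no biased constant.
    return logp - torch.log1p(-torch.exp(logp))

# e.g. chosen_odds = log_odds_stable(chosen_logp)
#      reject_odds = log_odds_stable(reject_logp)
```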

@Mr-Neutr0n Mr-Neutr0n requested a review from a team as a code owner February 11, 2026 18:20