NOTE: This has been proven to be unneccessary as the is calculated from at all times, and can even be derived from when calculating the loss.

As seen in the PhyLSTM flow note, in the original PhyLSTM implementation by Francesco, the is processed as

and then

Hence for the loss function

with

and similarly

\begin{align} \mathcal{L}_2 & = || \dot{B} - \hat{\dot{B}} ||^2 \\ & = \sigma_\dot{B}^2||\dot{B}_s - \hat{\dot{B}}_s ||^2 \end{align}

But the scale by the are baked into the loss weights and . However, for the third loss term

\begin{align} \mathcal{L}_3 & = || \hat{\dot{B}} - \dot{\hat{B}} ||^2 \\ & = || (\sigma_\dot{B}\hat{\dot{B}}_s +\mu_\dot{B} ) - \frac{d}{dt}(\sigma_B \hat{B}_s + \mu_B ) ||^2 \\ & = || (\sigma_\dot{B}\hat{\dot{B}}_s +\mu_\dot{B} ) - \sigma_B\dot{\hat{B}}_s ||^2 \end{align}

But in the case the third loss term becomes

In other words, when using that is acquired independently from , the must be calculated explicitly with the unscaled .