NOTE: This has been proven to be unneccessary as the is calculated from at all times, and can even be derived from when calculating the loss.
As seen in the PhyLSTM flow note, in the original PhyLSTM implementation by Francesco, the is processed as
and then
Hence for the loss function
with
and similarly
\begin{align} \mathcal{L}_2 & = || \dot{B} - \hat{\dot{B}} ||^2 \\ & = \sigma_\dot{B}^2||\dot{B}_s - \hat{\dot{B}}_s ||^2 \end{align}But the scale by the are baked into the loss weights and . However, for the third loss term
\begin{align} \mathcal{L}_3 & = || \hat{\dot{B}} - \dot{\hat{B}} ||^2 \\ & = || (\sigma_\dot{B}\hat{\dot{B}}_s +\mu_\dot{B} ) - \frac{d}{dt}(\sigma_B \hat{B}_s + \mu_B ) ||^2 \\ & = || (\sigma_\dot{B}\hat{\dot{B}}_s +\mu_\dot{B} ) - \sigma_B\dot{\hat{B}}_s ||^2 \end{align}But in the case the third loss term becomes
In other words, when using that is acquired independently from , the must be calculated explicitly with the unscaled .