PhyLSTM loss scaling

NOTE: This has been proven to be unneccessary as the $\dot{B}$ is calculated from $B$ at all times, and can even be derived from $B$ when calculating the loss.

As seen in the PhyLSTM flow note, in the original PhyLSTM implementation by Francesco, the $B$ is processed as

$B_{s} = \frac{B - μ _{B}}{σ _{B}}$ and then

\dot{B_{s}} = \frac{d B _{s}}{d t} = \frac{1}{σ _{B}} \frac{d B}{d t} = \frac{1}{σ _{B}} \dot{B}

Hence for the loss function

L = i \sum 5 w_{i} L_{i}

with

L_{1} = ∣∣ B - \hat{B} ∣ ∣^{2} = ∣∣ (σ_{B} B_{s} + μ_{B}) - (σ_{B} \hat{B}_{s} + μ_{B}) ∣ ∣^{2} = σ_{B}^{2} ∣∣ B_{s} - \hat{B}_{s} ∣ ∣^{2}

and similarly

\begin{align} \mathcal{L}_2 & = || \dot{B} - \hat{\dot{B}} ||^2 \\ & = \sigma_\dot{B}^2||\dot{B}_s - \hat{\dot{B}}_s ||^2 \end{align}

But the scale by the $σ$ are baked into the loss weights $w_{1}$ and $w_{2}$ . However, for the third loss term

\begin{align} \mathcal{L}_3 & = || \hat{\dot{B}} - \dot{\hat{B}} ||^2 \\ & = || (\sigma_\dot{B}\hat{\dot{B}}_s +\mu_\dot{B} ) - \frac{d}{dt}(\sigma_B \hat{B}_s + \mu_B ) ||^2 \\ & = || (\sigma_\dot{B}\hat{\dot{B}}_s +\mu_\dot{B} ) - \sigma_B\dot{\hat{B}}_s ||^2 \end{align}

But in the case $\dot{B}_{s} = \frac{1}{σ _{B}} \dot{B}$ the third loss term becomes

L_{3} = ∣∣ σ_{B} \hat{\dot{B}}_{s} - \frac{d}{d t} (σ_{B} \hat{B}_{s} + μ_{B}) ∣ ∣^{2} = σ_{B}^{2} ∣∣ \hat{\dot{B}}_{s} - \dot{\hat{B}}_{s} ∣ ∣^{2}

In other words, when using $\dot{B}$ that is acquired independently from $B$ , the $L_{3}$ must be calculated explicitly with the unscaled $\hat{\dot{B}}$ .

Hysteresis Compensation

Explorer

PhyLSTM loss scaling

Graph View