Plotting the loss

Hi all,

I have been training some models using different summary networks. One of the things I am interested in is the decay of the loss function, especially in the first few epochs. However, when using f = bf.diagnostics.plots.loss(history, per_training_step=True), I no longer see all training steps during the first epoch, and the loss value plotted for the first epoch does not match the value shown in the training progress output. Ideally, I would like the x-axis to start at 0, which I believe was also the case in BF version 1. Is anyone else seeing the same issue? I have attached an image demonstrating what I mean.

Hi,
thanks for reaching out :slight_smile: As far as I can tell, the mismatch between the numbers was caused by a bug that has already been fixed on the dev branch; the fix will be included in the next release.

I have taken a quick look at the code, and as far as I can tell the per_training_step functionality is currently broken (or was removed), because the history object only contains per-epoch values. Sorry about that, and about the docstring being inaccurate there. Could you open an issue on this, so that we can check whether we can/want to bring it back in version 2?
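If you want to double-check this on your side, you can inspect the history object directly. A minimal sketch, assuming fit returned a standard keras History object (as your call to the plot function suggests), and assuming a "loss" key is present:

# Each recorded metric should have one value per epoch, not per training step.
print(history.history.keys())
print(len(history.history["loss"]))  # "loss" is an assumed key; check the printed keys above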

For now, I think there are two options to get the information you need. First, if you use online training, you can use a small num_batches_per_epoch value, which will give you better resolution but might slow down training somewhat. Alternatively, you can use the following custom callback, which saves the detailed per-step history for you.

import keras


class StepHistory(keras.callbacks.Callback):
    """Callback that records the per-batch logs in a history dict after every training step."""

    def __init__(self):
        super().__init__()
        self.history = {}

    def on_train_batch_end(self, batch, logs=None):
        # Append every logged metric (e.g., the loss) to the per-step history.
        logs = logs or {}
        for k, v in logs.items():
            self.history.setdefault(k, []).append(v)

    def on_epoch_end(self, epoch, logs=None):
        # Set the history attribute on the model after the epoch ends. This
        # makes sure that the state which is set is the latest one.
        self.model.step_history = self
You would pass it to fit like this: history = workflow.fit_...(epochs=..., ..., callbacks=[StepHistory()]). If you use the BasicWorkflow, you can then access the values via workflow.approximator.step_history.history. If you use an approximator directly, you can find them at approximator.step_history.history. This history will not work with the plot function, so you will have to plot the values manually.
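As a starting point, here is a minimal sketch for plotting the recorded values manually, assuming the per-step history contains a "loss" key (inspect the dictionary keys for the exact metric names in your setup):

import matplotlib.pyplot as plt

# Per-step values recorded by the callback above (BasicWorkflow case).
step_history = workflow.approximator.step_history.history
loss_per_step = step_history["loss"]  # assumed key name; check step_history.keys()

plt.plot(loss_per_step)
plt.xlabel("Training step")
plt.ylabel("Loss")
plt.show()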


Thanks for the swift reply and the helpful info!