"Hooks" for PathSampling by hejung · Pull Request #911 · openpathsampling/openpathsampling

hejung · 2020-03-31T11:50:18Z

This builds on #755 and should only be merged after that.

It adds hook support to the PathSampling, the PathSamplingOutputHook and the LiveVisualizerHook including tests.

Please let me know if htere is anything to improve!
I also have some open questions/TODOs, in particular:

if a user sets any attribute that controls the behaviour of the hook on the PathSimulator object it is only read once at the start of the simulation and only if the hooks value for that attribute is None (see e.g. output_stream, save_frequency and storage). This is potentially confusing, as I would expect to be able to set these attributes on the PathSimulator to influence the behaviour of the currently attached hooks. I am mot really sure how to adress this. Potential solutions include removing these attributes from the PathSimulator and only set them on hook creation. Another option is to make them properties to either (re)set the respective attributes on the hooks (or at least warn the user that it could not work as intended) if he/she sets the attributes on the Pathsimulator
Previously the LiveVisualizer was called before the MCstep but visualizing the previous step. I moved the LiveVisualization to after step. This enables visualization of the step that just finished and avoids that the beginning of the next step depends on the previous one. But it also means that visualization only works if the serial OutputHook has allow_refresh=False, otherwise the plot will get refreshed away.
The step_info passed to the after_step hooks from the PathSampling is specifically tailored for the PathSamplingOutputHook, i.e. it only takes the variables the OutputHook needs (current step, total steps, total elapsed time). @dwhswenson any ideas what else we could want/need in there in the future?

dwhswenson · 2020-04-01T15:08:43Z

Great to see this! Sorry that I'm not going to be very responsive in the next week or two. Somehow I'm significantly busier on lockdown than at normal times.

The current test failure is a known issue with the most recent conda-forge release of svgwrite. It should be fixed very soon -- in principle it is fixed; needs me to reply to someone though (one of the many things on my list).

Quick comments on your questions/todos:

[RE: output_stream, save_frequency] I would expect to be able to set these attributes on the PathSimulator to influence the behaviour of the currently attached hooks.

This is a good point. I'm being quick here, so I'm not even checking the code, but what should probably happen is to treat the PathSimulator attributes as a default fallback: (1) By default, hooks don't set this; instead they get it from the PathSimulator (every hook knows its simulator, right?). This means if the user specifically changes it in simulator and never set it in hooks, it behaves as expected. (2) If a user sets it in the hook and in the simulator, I think the hook takes precedence.

What should not happen is, hook is added, implicitly takes the simulator's value. Then user changes simulator's value, and it has no effect. This would be a bug.

I moved the LiveVisualization to after step.

This seems fine; but is there also a way to ensure that the 0th step is visualized? (Maybe a before_step hook and also has some internal state so it only gets run once?) I'm not sure the current behavior, but it would make sense to show the initial conditions if you're running a simulation that takes a while to do the first step.

The caveat on refreshing output is completely fine. There's no way around that, so as long as the docstrings are clear on the topic, all good.

The step_info passed to the after_step hooks from the PathSampling is specifically tailored for the PathSamplingOutputHook, i.e. it only takes the variables the OutputHook needs (current step, total steps, total elapsed time)

Good question. As I recall, the idea was for step_info to carry all metadata that might possibly be of interest to a hook. For committors, I believe there were 4 things: snapshot number, total number of snapshots, step number (inner loop), and total number of steps per inner loop.

Is the time info actually used by anything other than the output hook? You can probably move the time info into the hook -- get initial time at before_simulation, then update at each stage.

hejung · 2020-04-06T09:35:04Z

No problem.

Concerning the points:

Atm it is exactly as you said, if it is set in the hook the hook takes precedence otherwise the hook takes the defaults from the PathSimulator. The only situation in which I see potential trouble is the following:
First user adds implicit default hooks (without value). Then user runs a round of sampling (i.e. before_simulation is executed and the hooks get their implicit defaults from the PathSimulator). If the user now changes the attributes on the PathSimulator and runs another round of sampling the hooks will not update, because they already got their (default) values when before_simulation was ran the first time.
In addition to making these attributes properties or removing them from the PathSimulator (and moving them to the hooks altogether), I think another solution would be to add a state-flag to the hook: If hook is initialized with values it should always keep them. If it was initalized without it should always take the values from the associated PathSimulator when before_simulation is run. What do you think?
Previously the first step was also not visualized. Currently the StepVisualizer2D needs/uses the MC change to visualize. It was just that the visualization for step n was ran at the start of the next iteration, i.e. directly before step (n+1). But I will look into how/if it is possible to visualize the first step too.
No, at the moment the timing info is not used anywhere else. I could move the timing to the hook, but the PathSamplingOutputHook still needs the total number of snapshots and the current number to be able to calculate the progress string, this would then be similar to the committors, where we also have total iterations and loop counter.
Something else where the time info could be useful is something I have been thinking about: Maybe it would be useful to write a "gracious kill" hook to make ops work nicer with the time limits on queuing systems. This would be hook that knows the maximum walltime and simply checks every step if we expect to finish the next step given the time the previous steps took on average. If not it would sync the storage and exit the simulation loop.

dwhswenson · 2020-04-12T12:16:01Z

First off, the error you have now should be fixed as soon as you merge in master. It's a problem with the miniconda install (they moves the web page we use for checksums).

Point-by-point:

Atm it is exactly as you said, if it is set in the hook the hook takes precedence otherwise the hook takes the defaults from the PathSimulator.

Let me sketch out in code what I meant, which I think will address the issue. Let's pretend that the simulator has a property called frequency. The hook should do something like:

class MyHook(PathSimulatorHook):
    def __init__(self, frequency=None):
        self._simulation = None
        self._frequency = frequency

    def before_simulation(self, sim):
        self._simulation = sim

    @property
    def frequency(self):
        if self._freq_val is not None:
            return self._frequency
        elif self._simulation is not None:
            return self._simulation.frequency
        else:
            raise RuntimeError("`frequency` has not been set")  # maybe return None?

That isn't how I initially implemented it, but I think it (mostly) makes more sense. It solves the problem you're thinking of, right? If you ever set it in the hook, then the hook's value is used (even if you set it in the simulator later). If you haven't set it in the hook, the simulation's value is used.

The only question is what to return when frequency isn't given on initialization and if the simulation is still not known (hasn't started yet). I'm leaning toward an error (probably best to actually subclass RuntimeError with a specialized error case), but returning None is also a reasonable option.

I will look into how/if it is possible to visualize the first step too.

If the first step can be visualized, great, but no big deal if it can't easily be done.

I could move the timing to the hook, but the PathSamplingOutputHook still needs the total number of snapshots and the current number to be able to calculate the progress string, this would then be similar to the committors, where we also have total iterations and loop counter.

I'd say move timing into the hook. For any other hook that needs it, I'm not worried about the overhead of calling time.time() once per MC step.

Maybe it would be useful to write a "gracious kill" hook to make ops work nicer with the time limits on queuing systems.

I think this sounds like a very good idea. In the long run, I want to have mid-trajectory checkpointing, because there are cases where a single trajectory might be as long as your allowed wall time. However, this would be a very useful improvement in the meantime.

If you implement this, please do a separate PR for it. It deserves a separate mention in the release notes.

hejung · 2020-04-19T17:41:02Z

Yes, that solves the problem I had in mind. I went ahead and implemented it for all hooks.

I moved the timing to the PathSamplingOutputHook, which has the added benefit that the test now also test the actual timing (and not only that we are able to print the correct string given the numbers).

codecov · 2021-02-01T21:46:52Z

Codecov Report

Merging #911 (6664456) into master (b6da5dd) will increase coverage by 0.07%.
The diff coverage is 85.16%.

@@            Coverage Diff             @@
##           master     #911      +/-   ##
==========================================
+ Coverage   81.40%   81.47%   +0.07%     
==========================================
  Files         140      140              
  Lines       15237    15362     +125     
==========================================
+ Hits        12403    12516     +113     
- Misses       2834     2846      +12

Impacted Files	Coverage Δ
openpathsampling/step_visualizer_2D.py	`17.28% <0.00%> (ø)`
openpathsampling/pathsimulators/path_sampling.py	`72.26% <68.75%> (-5.43%)`	⬇️
openpathsampling/beta/hooks.py	`100.00% <100.00%> (ø)`
openpathsampling/pathsimulators/path_simulator.py	`92.30% <0.00%> (-1.54%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b6da5dd...6664456. Read the comment docs.

hejung · 2021-02-14T17:05:26Z

@dwhswenson I would say this is ready for review. I cleaned up a bit and added the run_until_n_accepted method you suggested.

…dified storage hook to also suit pathsampling

…ties to ensure a smooth user experience; PathsimulatorOutputHook now uses regex matching to allow for some leeway in the timing output

This will make it easier to implement other approaches to running path sampling. For example, this should make it possible to: * clean up run_until_decorrelated * allow running until a certain number of accepted trajectories * use algorithms that use a replica reservoir

…ssary TODOs.

hejung · 2021-06-30T20:25:42Z

@dwhswenson Not morning anymore, but the branch is rebased on the current master and updated. But it looks like I will have to write some more test tomorrow... :)

dwhswenson · 2021-06-30T21:07:48Z

openpathsampling/beta/hooks.py

+        if self._output_stream is not None:
+            return self._output_stream
+        elif self._simulation is not None:
+            return self._simulation.output_stream
+        else:
+            raise SimulationNotFoundError("'output_stream' has not "


(Starting to glance over it, but a thought before you spend much time on remaining tests.) You might to create something like

def _self_or_sim_property(obj, attr_name): if getattr(obj, "_" + attr_name) is not None: return getattr(obj, "_" + attr_name) ... # etc.

then use it as

@property def output_stream(self): return _self_or_sim_property(self, 'output_stream')

This would allow you to test the underlying logic once, and not need to worry about creating multiple tests for each property. More complete integration test are also welcome, but this definitely seems like a motif that is being (and will continue to be) repeated.

(You could add a return_none=False to that private method to make it return None instead of erroring when that it appropriate)

dwhswenson

Mostly looks good, and thanks for the contribution!

A few mostly minor changes. In particular:

probably best to move sanity_check to its own hook
I don't think LiveVisualizerHook should be a default hook
please patch time.time in your unit tests -- this should make a big difference on performance and reproducibility

BTW, also thanks for the TODO & other comments with questions. It makes very easy to find points that should be considered.

openpathsampling/beta/hooks.py

openpathsampling/pathsimulators/path_sampling.py

openpathsampling/tests/pathsimulators/test_hooks.py

openpathsampling/beta/hooks.py

hejung · 2021-07-02T13:23:04Z

@dwhswenson All done (I think).

I moved the sample_set.sanity_check() to its own hook (which is a default hook for PathSampling) + wrote tests for that hook
LiveVisualizerHook is not default anymore, instead it is attached only upon setting the PathSampling.live_visualizer property to something different then None. I now also throw an error from the hook if the live_visualizer is not set, which should be ok since the hook is not attached by default anymore. + wrote tests for that hook
I fixed/improved the tests for the PathSamplingOutputHook according to your suggestions. Thanks for pointing me to the patch function, that solved the knot I had in my brain :)
After sleeping a night on the StorageHook.after_simulation() method I think what we want is this:

if self.storage is not None:
    self.storage.sync_all()

This should work in all cases, namely if the storage set only in the hook and also if the storage is taken from the simulation (because then it will be at hook.storage if the hook does not have a storage).

hejung · 2021-07-02T15:21:26Z

So I just realized that I did not address the save try/except clause. I would/will do it together with the StorageHook.after_step() if what I did there is not what we want @dwhswenson : Let me know what you think is best there.

dwhswenson · 2021-07-02T16:48:16Z

Looks good, as soon as the try/except clause for SimStore vs netcdfplus saving is added!

After sleeping a night on the StorageHook.after_simulation() method I think what we want is this:
if self.storage is not None:
    self.storage.sync_all()

I agree. I'm not sure why there was a thing to avoid doing a second sync_all. I'm not entirely sure what all happens in netcdfplus, but in SimStore, that just tries to save all objects in an empty list (so there's no real computational cost, and no side effect.) I would think that a second sync_all would also be idempotent in netcdf/netcdfplus.

hejung · 2021-07-03T18:00:03Z

Looks good, as soon as the try/except clause for SimStore vs netcdfplus saving is added!

Done.

I agree. I'm not sure why there was a thing to avoid doing a second sync_all. I'm not entirely sure what all happens in netcdfplus, but in SimStore, that just tries to save all objects in an empty list (so there's no real computational cost, and no side effect.) I would think that a second sync_all would also be idempotent in netcdf/netcdfplus.

That was me beeing confused from what we previously had

if self.storage is not None:
    sim.storage.sync_all()

which did not make sense to me (checking self.storage but using sim.storage)...and the tests call assert_called_once which I did not want to touch :)

dwhswenson · 2021-07-03T21:16:45Z

which did not make sense to me (checking self.storage but using sim.storage)...and the tests call assert_called_once which I did not want to touch :)

Ah, I understand. The self vs sim was probably just a slip of the fingers (or a missed change in a refactor).assert_called_once was probably because there didn't seem to be a way for it to be called more than once, but the important thing was that it should have been called at least once!

Anyway, all looks good now! Merging -- thanks again!

hejung mentioned this pull request May 4, 2020

Gracious kill hook #914

Merged

hejung force-pushed the PathSampling_Hooks branch from 779c257 to 6e0f50a Compare June 23, 2020 10:13

dwhswenson added the feature label Sep 26, 2020

dwhswenson added this to the 1.5 milestone Dec 23, 2020

dwhswenson mentioned this pull request Feb 1, 2021

Refactor PathSampling.run to have separate .run_one_step method hejung/openpathsampling#2

Merged

hejung changed the title ~~[WIP] "Hooks" for PathSampling~~ "Hooks" for PathSampling Feb 14, 2021

dwhswenson mentioned this pull request Feb 16, 2021

Strange random errors #892

Closed

This was referenced Mar 2, 2021

Use a deque to keep track of more recent time estimates #989

Closed

"Hooks" for ShootFromSnapshotsSimulation (i.e., committor) #755

Merged

hejung and others added 9 commits June 30, 2021 20:57

Added ouput and livevisualization hooks for Pathsampling, slightly mo…

eedfb6b

…dified storage hook to also suit pathsampling

Added tests for PathSamplingOutputHook

67ef5d9

implemented dwhswensons suggestion to make the hook attributes proper…

9d26d44

…ties to ensure a smooth user experience; PathsimulatorOutputHook now uses regex matching to allow for some leeway in the timing output

Hook attributes are now properties. attempt to fix tests for py2

2743025

pass information

d7aeb7e

Added run_until_n_accepted method for PathSampling. Cleaned up unnece…

9f06e8b

…ssary TODOs.

fixed tests

c639239

before_sim now takes kwargs, after_sim the hook_state

b06666c

hejung force-pushed the PathSampling_Hooks branch from 7fbc388 to b06666c Compare June 30, 2021 19:46

dwhswenson reviewed Jun 30, 2021

View reviewed changes

dwhswenson requested changes Jul 1, 2021

View reviewed changes

hejung commented Jul 1, 2021

View reviewed changes

openpathsampling/beta/hooks.py Show resolved Hide resolved

WIP: review suggestions

f839bcf

review suggestions + more tests

09b2268

try/except for stash/save to support both old and new storage

6664456

dwhswenson approved these changes Jul 3, 2021

View reviewed changes

dwhswenson merged commit 08fb2bc into openpathsampling:master Jul 3, 2021

This was referenced Jul 5, 2021

OpenPathSampling 1.5 #1031

Merged

Live Visualizer bug after hooks updates #1033

Closed

Conversation

hejung commented Mar 31, 2020

Uh oh!

dwhswenson commented Apr 1, 2020

Uh oh!

hejung commented Apr 6, 2020

Uh oh!

dwhswenson commented Apr 12, 2020

Uh oh!

hejung commented Apr 19, 2020

Uh oh!

codecov bot commented Feb 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

hejung commented Feb 14, 2021

Uh oh!

hejung commented Jun 30, 2021

Uh oh!

dwhswenson Jun 30, 2021

Choose a reason for hiding this comment

Uh oh!

dwhswenson Jun 30, 2021

Choose a reason for hiding this comment

Uh oh!

dwhswenson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hejung commented Jul 2, 2021

Uh oh!

hejung commented Jul 2, 2021

Uh oh!

dwhswenson commented Jul 2, 2021

Uh oh!

hejung commented Jul 3, 2021

Uh oh!

dwhswenson commented Jul 3, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Feb 1, 2021 •

edited

Loading