Use a deque to keep track of more recent time estimates #989

Closed
sroet wants to merge 3 commits into openpathsampling:master from sroet:better_time_estimates_master
Conversation

@sroet (Member) commented Mar 2, 2021

Today I got annoyed by the fact that OPS's estimated time remaining becomes worse the longer you run (see, for example, the table in this comment).

This fixes that (slightly) by using a deque with a default length of 10_000 MC steps, so that only the average over the last 10_000 steps is used to estimate the average step time and the time remaining.

The changes in this PR are:

  • use a deque to get a slightly better estimate of the time left (47f76d4 and babd276)
  • add a time_per_step argument to progress_string that overrides the time_per_step calculation, and add a test for this behavior (47f76d4)
  • remove nose from test_tools.py, partial progress towards Switch tests to pytest #756 (6a8485c)
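For illustration, a minimal sketch of the deque-based approach described above. This is not the actual OPS implementation; the class and method names here (`StepTimer`, `mark`, `time_per_step`) are hypothetical, but the core idea is the same: a bounded `deque` keeps only the most recent step durations, so the average no longer drifts as early, unrepresentative steps accumulate.

```python
from collections import deque
import time


class StepTimer:
    """Rolling time-per-step estimate over the last `maxlen` steps.

    Hypothetical sketch, not the OPS code: a deque with a fixed
    maxlen silently discards the oldest duration when full.
    """

    def __init__(self, maxlen=10_000):
        # Only the most recent `maxlen` durations are retained.
        self.durations = deque(maxlen=maxlen)
        self._last = None

    def mark(self):
        """Call once per MC step; records the elapsed time since the last call."""
        now = time.perf_counter()
        if self._last is not None:
            self.durations.append(now - self._last)
        self._last = now

    def time_per_step(self):
        """Average duration over the retained (recent) steps."""
        if not self.durations:
            return 0.0
        return sum(self.durations) / len(self.durations)

    def time_remaining(self, steps_left):
        """Estimated seconds remaining for `steps_left` more steps."""
        return self.time_per_step() * steps_left
```

With `maxlen=10_000`, a long run that gradually slows down (e.g. due to growing storage costs) produces an estimate that tracks the recent rate instead of the lifetime average.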

@dwhswenson (Member) commented Mar 2, 2021

The idea is fine, but can you redo this PR to build off of #911? I'm pretty sure there will be significant conflicts with that. (I really need to finish reviewing hooks-related stuff; I'm insisting on that stuff being in the 1.5 release, and that's the main thing remaining for 1.5. If you can give an eye to #755 and #911 with a mental focus on "will this work if the hooks use Dask?" that would be a big help -- the mental energy of needing to ensure that, and being the only person to check it, has been a big part of why I've been slow on that.) Plus, this kind of approach is exactly what hooks should improve: let you easily customize the behavior of something like this, with code that is more isolated from the rest of the logic.

Also, probably the single biggest (and potentially easiest) approach to solve the underlying problem would be to add a SQL index on the UUID for more tables. I believe we now do that for the UUIDs table; it would probably make sense to do that for all data object tables (or even all tables; the extra cost for simulation objects isn't much). I suspect that most of the slow-down you see now is because snapshot tables don't have a UUID index, meaning that the cost of adding to them grows linearly with the number of snapshots stored. PR (separate from this) most definitely welcome on that.
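To illustrate why the missing index causes linear slow-down: without an index, every UUID lookup scans the whole table, so the cost of each insert-with-lookup grows with the number of snapshots stored. The sketch below uses SQLite directly with an assumed table and column name (`snapshots`, `uuid`); the actual OPS storage schema may differ, and this is not the proposed patch.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE snapshots (uuid TEXT, data BLOB)")

# Without the index below, "WHERE uuid = ?" is a full table scan: O(n)
# per lookup, so total cost grows linearly with the number of snapshots.
# With the index, each lookup is a B-tree search: O(log n).
conn.execute("CREATE INDEX ix_snapshots_uuid ON snapshots (uuid)")

# EXPLAIN QUERY PLAN shows whether SQLite will use the index.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM snapshots WHERE uuid = ?", ("abc",)
).fetchone()
print(plan[-1])  # plan detail string mentions the index
```

The same `CREATE INDEX` idea applies to any data-object table that is queried by UUID; the trade-off is a small extra cost per insert to maintain the index.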

@sroet (Member, Author) commented Mar 4, 2021

Closing this for now; will rebuild off of #911 in the future.

@sroet sroet closed this Mar 4, 2021
