feat: add new move count metric #1072

zepfred · 2024-09-04T19:28:37Z

This pull request includes a new metric for counting the number of moves, which differs from the score count when using R&R moves.

triceo

I'm leaving a review before I leave for PTO.
It's OK if merged once my comments are resolved, provided that someone else sees the final PR as well.

benchmark/src/main/java/ai/timefold/solver/benchmark/impl/SolverBenchmarkFactory.java

...lver/benchmark/impl/statistic/movecalculationspeed/MoveCalculationSpeedProblemStatistic.java

...er/benchmark/impl/statistic/movecalculationspeed/MoveCalculationSpeedSubSingleStatistic.java

core/src/main/java/ai/timefold/solver/core/impl/heuristic/move/AbstractMetricMove.java

zepfred · 2024-09-06T14:27:14Z

Optionally, the metric also has sub-metrics per move type. (Some other metrics can already do that, for inspiration.)

I couldn't come up with a useful sub-metric for the move count metric. Do you have any suggestions?

Naming. Is "move count" and "move speed" the right name?

For the benchmark report, it makes sense to me to present it as "Move Calculation Speed".

On the other hand, when defining the termination configuration, I think using only count is better: moveCountLimit and unimprovedMoveCountLimit. We don't have a termination configuration such as unimprovedScoreCountLimit, but I thought it might be useful to terminate after a certain number of unsuccessful moves are executed.

zepfred · 2024-09-10T13:40:59Z

@triceo, I see some possible points of discussion beside the ones you may find during the review:

Optionally, the metric also has sub-metrics per move type. (Some other metrics can already do that, for inspiration.)

I couldn't find any useful sub-metric for the move count. We already have the move count per step.

Solver logging additionally includes move speed (move count over time), along with score speed which it already does.

I initially included it but then removed it. My point is that it may confuse the user as it is the same number when not using R&R moves. I believe that the benchmark should be used to compare move speeds, and the score calculation speed should still be the primary metric.

There is a score count related type of Termination. We need one of those for move count as well.

I considered adding unimprovedMoveCount, but I'm unsure if it's worth it. This would require changing some base classes to store the best move count and the last accepted step move count.

Does this become the new metric by which we measure solver performance, and therefore is it enabled by default? If so, documentation needs to change, replacing uses of score calculation speed by move speed.

I believe the primary metric remains score calculation speed, and we should delegate the comparison of move speed to the benchmark module. With that in mind, the performance page will not change significantly except for adding a reference to the new metric. It doesn't make sense to me to compare a configuration using R&R with one that doesn't. Hence, when comparing two configurations using R&R, the score calculation still makes sense for evaluating performance.

Christopher-Chianelli

I am okay with the move evaluation speed name, but I am concern that one of the tests, where Ruin moves are not used, have different counts for score calculation count and move evaluation counts (which smells like a bug). Otherwise LGTM.

core/src/test/java/ai/timefold/solver/core/api/solver/SolverManagerTest.java

triceo · 2024-09-16T06:24:52Z

I couldn't find any useful sub-metric for the move count. We already have the move count per step.

Some other metrics can track their values per-move. So, for example, you could have "move execution count for change moves" etc. This is what I mean. There is precedent.

Solver logging additionally includes move speed (move count over time), along with score speed which it already does.

I initially included it but then removed it. My point is that it may confuse the user as it is the same number when not using R&R moves. I believe that the benchmark should be used to compare move speeds, and the score calculation speed should still be the primary metric.

Good point on it being the same number in many cases.
Not sure on what the default should be; arguably, score calc speed is now the confusing one, because based on different moves, it changes. Move execution speed is the one that's "stable".

There is a score count related type of Termination. We need one of those for move count as well.

I considered adding unimprovedMoveCount, but I'm unsure if it's worth it. This would require changing some base classes to store the best move count and the last accepted step move count.

I do think it is worth the effort, for the same reason as I said above. Score calculation count is not stable, its definition changes based on moves used. Move execution count is stable and always means the same thing.

triceo

LGTM after the suggested changes are applied.

docs/src/modules/ROOT/pages/using-timefold-solver/benchmarking-and-tweaking.adoc

docs/src/modules/ROOT/pages/constraints-and-score/performance.adoc

docs/src/modules/ROOT/pages/using-timefold-solver/benchmarking-and-tweaking.adoc

core/src/main/java/ai/timefold/solver/core/impl/phase/scope/AbstractStepScope.java

core/src/main/java/ai/timefold/solver/core/impl/phase/scope/AbstractPhaseScope.java

triceo · 2024-09-26T07:13:44Z

There was something still bugging me about this PR. So I opened the IDE and looked at it. I know what it was now. Disabling metric collection is a very specific thing, it only ever exists for RnR. Yet, it proliferates everywhere - to every phase, to every step, everywhere.

I fixed it by only dealing with disabling metric collection when already in RnR. If we ever need it in more phases than here, we can generify it - but right now, the correct decision was to not introduce this anywhere but in RnR.

I have similarly updated the enterprise PR, which no longer needs to check whether metrics are enabled. As long as RnR doesn't trigger any metrics, nobody else (not LS, not PS, ...) needs to know anything about metrics being enabled or disabled.

Please take a look and check my thinking.

zepfred · 2024-09-26T11:19:56Z

There was something still bugging me about this PR. So I opened the IDE and looked at it. I know what it was now. Disabling metric collection is a very specific thing, it only ever exists for RnR. Yet, it proliferates everywhere - to every phase, to every step, everywhere.

I fixed it by only dealing with disabling metric collection when already in RnR. If we ever need it in more phases than here, we can generify it - but right now, the correct decision was to not introduce this anywhere but in RnR.

I have similarly updated the enterprise PR, which no longer needs to check whether metrics are enabled. As long as RnR doesn't trigger any metrics, nobody else (not LS, not PS, ...) needs to know anything about metrics being enabled or disabled.

Please take a look and check my thinking.

Looks good to me!

sonarqubecloud · 2024-09-26T11:47:58Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
75.5% Coverage on New Code
0.8% Duplication on New Code

See analysis details on SonarCloud

zepfred temporarily deployed to internal September 4, 2024 19:28 — with GitHub Actions Inactive

triceo linked an issue Sep 4, 2024 that may be closed by this pull request

Feat: Move count as an alternative to score calculation count #1038

Closed

7 tasks

zepfred force-pushed the new-metric branch from fe1052a to 489952b Compare September 5, 2024 20:32

zepfred temporarily deployed to internal September 5, 2024 20:32 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 5, 2024 20:39 — with GitHub Actions Inactive

zepfred force-pushed the new-metric branch from 5839a7f to 0661e5a Compare September 5, 2024 21:48

zepfred temporarily deployed to internal September 5, 2024 21:48 — with GitHub Actions Inactive

triceo reviewed Sep 6, 2024

View reviewed changes

zepfred temporarily deployed to internal September 6, 2024 14:47 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 6, 2024 18:18 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 6, 2024 19:11 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 6, 2024 21:31 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 9, 2024 15:13 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 9, 2024 20:54 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 9, 2024 22:59 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 10, 2024 13:19 — with GitHub Actions Inactive

zepfred requested a review from triceo September 10, 2024 13:20

zepfred marked this pull request as ready for review September 10, 2024 13:42

zepfred requested review from Christopher-Chianelli and removed request for triceo September 10, 2024 14:08

Christopher-Chianelli reviewed Sep 10, 2024

View reviewed changes

core/src/test/java/ai/timefold/solver/core/api/solver/SolverManagerTest.java Show resolved Hide resolved

zepfred temporarily deployed to internal September 16, 2024 17:26 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 19, 2024 14:44 — with GitHub Actions Inactive

triceo previously approved these changes Sep 19, 2024

View reviewed changes

zepfred dismissed triceo’s stale review via 9a93db7 September 23, 2024 13:37

zepfred force-pushed the new-metric branch from bdc48e6 to 9a93db7 Compare September 23, 2024 13:37

zepfred temporarily deployed to internal September 23, 2024 13:37 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 25, 2024 16:54 — with GitHub Actions Inactive

triceo reviewed Sep 25, 2024

View reviewed changes

core/src/main/java/ai/timefold/solver/core/impl/phase/scope/AbstractStepScope.java Outdated Show resolved Hide resolved

chore: addressing PR comments

8673c33

zepfred temporarily deployed to internal September 25, 2024 17:10 — with GitHub Actions Inactive

triceo reviewed Sep 25, 2024

View reviewed changes

core/src/main/java/ai/timefold/solver/core/impl/phase/scope/AbstractPhaseScope.java Outdated Show resolved Hide resolved

chore: addressing PR comments

0ebfa91

zepfred temporarily deployed to internal September 25, 2024 17:30 — with GitHub Actions Inactive

triceo temporarily deployed to internal September 26, 2024 07:11 — with GitHub Actions Inactive

triceo force-pushed the new-metric branch from a2bcfc4 to f7e7096 Compare September 26, 2024 07:37

triceo temporarily deployed to internal September 26, 2024 07:37 — with GitHub Actions Inactive

triceo force-pushed the new-metric branch from f7e7096 to 1bbd206 Compare September 26, 2024 08:15

triceo temporarily deployed to internal September 26, 2024 08:15 — with GitHub Actions Inactive

triceo force-pushed the new-metric branch from 1bbd206 to bbafbd9 Compare September 26, 2024 09:27

triceo temporarily deployed to internal September 26, 2024 09:28 — with GitHub Actions Inactive

triceo force-pushed the new-metric branch from bbafbd9 to 4c59a95 Compare September 26, 2024 09:36

triceo temporarily deployed to internal September 26, 2024 09:37 — with GitHub Actions Inactive

triceo force-pushed the new-metric branch from 4c59a95 to 3b591a5 Compare September 26, 2024 09:41

triceo temporarily deployed to internal September 26, 2024 09:42 — with GitHub Actions Inactive

triceo force-pushed the new-metric branch from 3b591a5 to 632b6f9 Compare September 26, 2024 09:44

triceo temporarily deployed to internal September 26, 2024 09:44 — with GitHub Actions Inactive

Don't proliferate the metric switching

71f1327

triceo force-pushed the new-metric branch from 632b6f9 to 71f1327 Compare September 26, 2024 10:02

triceo temporarily deployed to internal September 26, 2024 10:02 — with GitHub Actions Inactive

zepfred temporarily deployed to internal September 26, 2024 11:19 — with GitHub Actions Inactive

zepfred force-pushed the new-metric branch from 6368053 to 71f1327 Compare September 26, 2024 11:22

zepfred temporarily deployed to internal September 26, 2024 11:22 — with GitHub Actions Inactive

triceo merged commit c027d7e into TimefoldAI:main Sep 26, 2024
51 of 52 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add new move count metric #1072

feat: add new move count metric #1072

zepfred commented Sep 4, 2024

triceo left a comment •

edited

Loading

zepfred commented Sep 6, 2024 •

edited

Loading

zepfred commented Sep 10, 2024

Christopher-Chianelli left a comment

triceo commented Sep 16, 2024

triceo left a comment

triceo commented Sep 26, 2024 •

edited

Loading

zepfred commented Sep 26, 2024

sonarqubecloud bot commented Sep 26, 2024

feat: add new move count metric #1072

feat: add new move count metric #1072

Conversation

zepfred commented Sep 4, 2024

triceo left a comment • edited Loading

Choose a reason for hiding this comment

zepfred commented Sep 6, 2024 • edited Loading

zepfred commented Sep 10, 2024

Christopher-Chianelli left a comment

Choose a reason for hiding this comment

triceo commented Sep 16, 2024

triceo left a comment

Choose a reason for hiding this comment

triceo commented Sep 26, 2024 • edited Loading

zepfred commented Sep 26, 2024

sonarqubecloud bot commented Sep 26, 2024

Quality Gate passed

triceo left a comment •

edited

Loading

zepfred commented Sep 6, 2024 •

edited

Loading

triceo commented Sep 26, 2024 •

edited

Loading