Opened 14 years ago
Closed 14 years ago
#1355 closed defect (done)
EngineAlgorithm should cleanup itself after it stopped
Reported by: | cneumuel | Owned by: | swagner |
---|---|---|---|
Priority: | high | Milestone: | HeuristicLab 3.3.3 |
Component: | Optimization | Version: | 3.3.3 |
Keywords: | Cc: |
Description (last modified by swagner)
When an EngineAlgorithm stops, it currently keeps all its state-information (ExecutionContext and ScopeTree). This is unneccessary because when it is in ExecutionState "Stopped", it cannot be resumed (without calling Prepare, which causes the deletion of this state-information).
Problem: The state-information consumes a large amount of memory when beeing serialized, especially when the population size is large in evolutionary algorithms. This is a significant problem in Hive. When a finished algorithm contains tens or hundrets of megabytes of unnecessary information serialization, deserialization and transmission of jobs is unnecessarily slow.
Also, users who don't know the "trick" of calling Prepare before storing an algorithm, spend lots of unneccessary time on saving and loading their algorithms.
Just calling Prepare() when an algorithm stopped however would be too much, since this would cause the ExecutionTime and the ResultCollection beeing cleaned. It just would not fit the semantics of Prepare to use it for memory cleanup.
Change History (15)
comment:1 Changed 14 years ago by cneumuel
- Status changed from new to assigned
comment:2 Changed 14 years ago by swagner
- Milestone changed from HeuristicLab x.x.x to HeuristicLab 3.3.3
- Status changed from assigned to accepted
comment:3 Changed 14 years ago by swagner
comment:4 Changed 14 years ago by swagner
- Component changed from ### Undefined ### to Optimization
- Description modified (diff)
- Summary changed from Algorithm cleanup itself after it stopped to EngineAlgorithm should cleanup itself after it stopped
comment:5 Changed 14 years ago by swagner
Cleared global scope after an EngineAlgorithm is stopped in r5195.
comment:6 Changed 14 years ago by swagner
Additionally, I think we should get rid of storing a clone of the algorithm in each run. This also requires a lot of memory which probably is not really worth it. I think that the feature to show the corresponding algorithm for a run is not really frequently used.
comment:7 Changed 14 years ago by cneumuel
I agree that storing a clone of the algorithm in each run can become a huge memory-overhead. But maybe it would be enough to just make it an opt-in feature, rather than opt-out.
comment:8 follow-up: ↓ 10 Changed 14 years ago by cneumuel
The following test shows that there may still be a problem with file sizes and memory consumption after the recent changes.
- Algorithm: Genetic Algorithm (MaxGenerations: 15, PopulationSize: 2000)
- Problem: Single Objective Testfunction (ProblemSize: 2000)
- StoreAlgorithmInEachRun: false
The following numbers show file-sizes when the algorihm is stored:
- Before algorithm started: 37KB
- After algorithm paused: 97MB
- After algorithm stopped: 73MB
- After algorithm prepared: 36MB
- After clearing runs: 38KB!
The recent changes help to release ~24MB of memory when the algorithm stops, yet ~37MB can still be released by calling "prepare".
Also interesting and maybe a different problem is that by clearing the ResultCollection (which contained only 1 run), almost ~36MB of memory were released. I further investigated this issue and found that one problem might be the a SingleObjectiveTestFunctionSolution object contains a reference to the whole population. Therefore the BestSolution in a run also contains a reference to the whole population which in this case leads to a large memory consumption.
comment:9 Changed 14 years ago by swagner
Changed storing the algorithm in each run into an opt-in feature in r5203.
comment:10 in reply to: ↑ 8 Changed 14 years ago by swagner
Replying to cneumuel:
The following test shows that there may still be a problem with file sizes and memory consumption after the recent changes.
- Algorithm: Genetic Algorithm (MaxGenerations: 15, PopulationSize: 2000)
- Problem: Single Objective Testfunction (ProblemSize: 2000)
- StoreAlgorithmInEachRun: false
The following numbers show file-sizes when the algorihm is stored:
- Before algorithm started: 37KB
- After algorithm paused: 97MB
- After algorithm stopped: 73MB
- After algorithm prepared: 36MB
- After clearing runs: 38KB!
The recent changes help to release ~24MB of memory when the algorithm stops, yet ~37MB can still be released by calling "prepare".
Also interesting and maybe a different problem is that by clearing the ResultCollection (which contained only 1 run), almost ~36MB of memory were released. I further investigated this issue and found that one problem might be the a SingleObjectiveTestFunctionSolution object contains a reference to the whole population. Therefore the BestSolution in a run also contains a reference to the whole population which in this case leads to a large memory consumption.
Yes, the large file sizes of case 3 and 4 result from storing the whole population in the SingleObjectiveTestFunctionSolution and consequently also in the BestSolution result (which is contained in the results as well as in each run). We decided to store the whole population in order to be able to visualize the whole population graphically. However, this is only possible in the 2D case. If the problem dimension is larger than 2, the graphical solution visualization is not available. Therefore, we should only store the whole population in the 2D case. I created a separate ticket #1360 for this issue.
comment:11 Changed 14 years ago by swagner
Updated samples in r5205.
comment:12 Changed 14 years ago by swagner
- Owner changed from swagner to cneumuel
- Status changed from accepted to reviewing
comment:13 Changed 14 years ago by cneumuel
Tested the same algorithm with the recent changes:
- Before algorithm started: 37KB
- After algorithm paused: 105MB
- After algorithm stopped: 84KB
- After algorithm prepared: 64KB
- After clearing runs: 38KB
comment:14 Changed 14 years ago by cneumuel
- Owner changed from cneumuel to swagner
- Status changed from reviewing to readytorelease
comment:15 Changed 14 years ago by mkommend
- Resolution set to done
- Status changed from readytorelease to closed
- Version changed from 3.3.2 to 3.3.3
Implemented that the ExecutionStack (and consequently also all ExecutionContexts) is cleared in Engine, if the engine is stopped (see ticket #1333 and r5193).