Opened 13 years ago

Closed 13 years ago

Last modified 10 years ago

#149 closed defect (done)

DistributedEngine throws InvalidOperationException after running for a few hours

Reported by: gkronber Owned by: gkronber
Priority: highest Milestone: HeuristicLab 3.3.0
Component: ZZZ OBSOLETE: DistributedEngine Version: 3.1
Keywords: Cc:

Description

This happens at least with the Meta-GP engines of pspitzli. The cause is still unclear. Investigate!

Change History (7)

comment:1 Changed 13 years ago by gkronber

  • Status changed from new to assigned

comment:2 Changed 13 years ago by gkronber

  • Resolution set to fixed
  • Status changed from assigned to closed

Worked on stability of plugin-infrastructure (uncaught exceptions in an AppDomain brought down the whole process) in changesets:

  • r241: exception-handling in main-app and graceful unloading of faulted AppDomains
  • r242: automatic restart of 'service' applications
  • r243: grid-client and grid-server are service applications and should be restarted automatically.
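The idea behind r241 and r242 (contain an uncaught exception, then restart the service) can be sketched as follows. This is only an illustration in Python, not the HeuristicLab code: `run_with_restart`, `max_restarts` and `delay_seconds` are hypothetical names, and a plain function call stands in for a service running in its own AppDomain.

```python
import time


def run_with_restart(service, max_restarts=3, delay_seconds=0.0):
    """Run `service` and restart it after an uncaught exception.

    Hypothetical helper: returns the number of restarts that were needed
    and re-raises the last exception once `max_restarts` is exhausted.
    """
    restarts = 0
    while True:
        try:
            service()          # the 'service' application body
            return restarts    # clean exit, no further restart needed
        except Exception:
            if restarts >= max_restarts:
                raise          # give up and surface the failure
            restarts += 1
            time.sleep(delay_seconds)  # brief pause before restarting
```

Capping the number of restarts keeps a permanently broken service from spinning forever; whether the real restart logic has such a cap is not stated in this ticket.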

Worked on stability and reliability of the grid-infrastructure and the distributed-engine. See changesets:

  • r240: grid-server GUI improvement
  • r244: graceful shutdown of grid-client
  • r254 (#150): re-queuing unfinished jobs at the server
  • r246: bug-fixes related to r244
  • r247: deleted unused interface-class related to r244
  • r248: improvement of stability and reliability on the client-side. Introduction of a job-manager class that is responsible for sending jobs to the server and receiving results; the distributed-engine only has to process results from the job-manager.
  • r249: bugfix for incorrect sleeping-intervals, related to r248
  • r250 (#140): grid-server GUI improvement
  • r251: stability improvement of grid-client
  • r252: another bugfix related to r251
  • r253: client-side stability-improvement
  • r254: improved memory-footprint server-side (removed caching of old results)
  • r255: minor change
  • r256: fixed client-side memory-leak introduced with r248
  • r257: improve client-side memory-footprint, removed caching of serialized zipped engines for restarting jobs.
  • r258: minor change, sleep first then try to fetch result.
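A rough sketch of the client-side split described in r248 and r258: a job-manager owns submission and polling, sleeps first and then tries to fetch the result, and the engine only consumes finished results. Everything here is an assumption made for illustration (the `JobManager` class, the `submit` and `try_fetch` callables, `max_polls`); it is written in Python and is not the actual HeuristicLab implementation. Resubmitting a job from the client is a simplification, since the ticket places re-queuing of unfinished jobs at the server.

```python
import time
from collections import deque


class JobManager:
    """Hypothetical client-side job manager (illustration only)."""

    def __init__(self, submit, try_fetch, poll_interval=0.0, max_polls=5):
        self._submit = submit            # callable(job) -> job id
        self._try_fetch = try_fetch      # callable(job id) -> result or None
        self._poll_interval = poll_interval
        self._max_polls = max_polls
        self._pending = deque()
        self.results = {}

    def add(self, job):
        self._pending.append(job)

    def run(self):
        """Process all pending jobs and return {job: result}."""
        # Note: a job that never completes would be resubmitted forever;
        # a real implementation would cap the number of attempts.
        while self._pending:
            job = self._pending.popleft()
            job_id = self._submit(job)
            result = self._wait(job_id)
            if result is None:
                self._pending.append(job)  # unfinished: queue it again
            else:
                self.results[job] = result
        return self.results

    def _wait(self, job_id):
        for _ in range(self._max_polls):
            time.sleep(self._poll_interval)   # sleep first ...
            result = self._try_fetch(job_id)  # ... then try to fetch
            if result is not None:
                return result
        return None
```

With this split, the engine never talks to the server directly; it only reads `results`, so transport errors and retries stay inside the job-manager.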

comment:3 Changed 13 years ago by gkronber

r240:258 merged into the HL3 stable branch with r279

comment:4 Changed 13 years ago by gkronber

cleanup changesets r263:r265 merged into the HL3 stable branch with r279

comment:5 Changed 12 years ago by swagner

  • Milestone changed from 3.1 to Iteration 0

Milestone 3.1 deleted

comment:6 Changed 11 years ago by swagner

  • Milestone changed from Iteration 0 to Current

Milestone Iteration 0 deleted

comment:11 Changed 10 years ago by swagner

  • Milestone changed from Current to HeuristicLab 3.3.0

Milestone Current deleted
