Free cookie consent management tool by TermsFeed Policy Generator

Changes between Initial Version and Version 9 of Ticket #1950

10/26/12 12:09:14 (11 years ago)

I changed the title and description of the ticket since I couldn't reproduce the original StackOverflowException.

But I noticed that I received multiple EndpointNotFoundExceptions when I had the JobManager open, put the computer to sleep and woke it up, but with the network unplugged. A job was shown and to be refreshed automatically. Admittedly, this is expected and a rare case. The problem was that this produces new exception dialogs every second. The exception should either be swallowed and no action performed if not connected, or a state should be remembered which indicates that the computer is not connected and that the exception was already shown.

EndpointNotFoundException: There was no endpoint listening at that could accept the message. This is often caused by an incorrect address or SOAP action. See InnerException, if present, for more details.

Server stack trace: 
   at System.ServiceModel.Security.IssuanceTokenProviderBase`1.DoNegotiation(TimeSpan timeout)
   at System.ServiceModel.Security.SspiNegotiationTokenProvider.OnOpen(TimeSpan timeout)
   at System.ServiceModel.Security.TlsnegoTokenProvider.OnOpen(TimeSpan timeout)
   at System.ServiceModel.Channels.CommunicationObject.Open(TimeSpan timeout)
   at System.ServiceModel.Security.SymmetricSecurityProtocol.OnOpen(TimeSpan timeout)
   at System.ServiceModel.Channels.CommunicationObject.Open(TimeSpan timeout)
   at System.ServiceModel.Channels.SecurityChannelFactory`1.ClientSecurityChannel`1.OnOpen(TimeSpan timeout)
   at System.ServiceModel.Channels.CommunicationObject.Open(TimeSpan timeout)
   at System.ServiceModel.Security.SecuritySessionSecurityTokenProvider.DoOperation(SecuritySessionOperation operation, EndpointAddress target, Uri via, SecurityToken currentToken, TimeSpan timeout)
   at System.ServiceModel.Security.SecuritySessionSecurityTokenProvider.GetTokenCore(TimeSpan timeout)
   at System.IdentityModel.Selectors.SecurityTokenProvider.GetToken(TimeSpan timeout)
   at System.ServiceModel.Security.SecuritySessionClientSettings`1.ClientSecuritySessionChannel.OnOpen(TimeSpan timeout)
   at System.ServiceModel.Channels.CommunicationObject.Open(TimeSpan timeout)
   at System.ServiceModel.Channels.ServiceChannel.OnOpen(TimeSpan timeout)
   at System.ServiceModel.Channels.CommunicationObject.Open(TimeSpan timeout)
   at System.ServiceModel.Channels.ServiceChannel.CallOnceManager.CallOnce(TimeSpan timeout, CallOnceManager cascade)
   at System.ServiceModel.Channels.ServiceChannel.EnsureOpened(TimeSpan timeout)
   at System.ServiceModel.Channels.ServiceChannel.Call(String action, Boolean oneway, ProxyOperationRuntime operation, Object[] ins, Object[] outs, TimeSpan timeout)
   at System.ServiceModel.Channels.ServiceChannelProxy.InvokeService(IMethodCallMessage methodCall, ProxyOperationRuntime operation)
   at System.ServiceModel.Channels.ServiceChannelProxy.Invoke(IMessage message)

Exception rethrown at [0]: 
   at System.Runtime.Remoting.Proxies.RealProxy.HandleReturnMessage(IMessage reqMsg, IMessage retMsg)
   at System.Runtime.Remoting.Proxies.RealProxy.PrivateInvoke(MessageData& msgData, Int32 type)
   at HeuristicLab.Clients.Hive.IHiveService.GetLightweightJobTasks(Guid jobId)
   at HeuristicLab.Clients.Hive.HiveServiceClient.GetLightweightJobTasks(Guid jobId) in c:\HL3\trunk\sources\HeuristicLab.Clients.Hive\3.3\ServiceClients\HiveServiceClient.cs:line 1972
   at HeuristicLab.Clients.Hive.JobResultPoller.<FetchJobResults>b__0(IHiveService service) in c:\HL3\trunk\sources\HeuristicLab.Clients.Hive\3.3\JobResultPoller.cs:line 109
   at HeuristicLab.Clients.Hive.HiveServiceLocator.CallHiveService[T](Func`2 call) in c:\HL3\trunk\sources\HeuristicLab.Clients.Hive\3.3\HiveServiceLocator.cs:line 64
   at HeuristicLab.Clients.Hive.JobResultPoller.RunPolling() in c:\HL3\trunk\sources\HeuristicLab.Clients.Hive\3.3\JobResultPoller.cs:line 91
WebException: The remote name could not be resolved: ''
   at System.Net.HttpWebRequest.GetRequestStream(TransportContext& context)
   at System.Net.HttpWebRequest.GetRequestStream()
   at System.ServiceModel.Channels.HttpOutput.WebRequestHttpOutput.GetOutputStream()


  • Ticket #1950

    • Property Status changed from new to reviewing
    • Property Owner changed from ascheibe to abeham
    • Property Summary changed from HeuristicLab crashes with stack overflow exception to Improve Hive UI stability
  • Ticket #1950 – Description

    initial v9  
    1 I'm not yet 100% sure this is a Hive bug, but so far I've had two situations which involved uploading a job and then locking the computer screen while the upload was running. When I came back after some hours the first time I saw that HL was using one full CPU core without doing any work. It didn't crash, but I had to use the task manager to take it down. The other time (today) HL just crashed completely. I suspect that there may be a bug with auto refreshing jobs.
    3 The information on the crash today is pretty small unfortunately. I just found this in windows application event logs:
    5 {{{
    6 Faulting application name: HeuristicLab 3.3.exe, version:, time stamp: 0x504b2672
    7 Faulting module name: comctl32.dll, version: 6.10.7601.17514, time stamp: 0x4ce7c45b
    8 Exception code: 0xc00000fd
    9 Fault offset: 0x0000000000101f33
    10 Faulting process id: 0xe70
    11 Faulting application start time: 0x01cd8ddb2487d739
    12 Faulting application path: C:\Path\to\hl\HeuristicLab 3.3.exe
    13 Faulting module path: C:\Windows\WinSxS\\comctl32.dll
    14 Report Id: ca7b39a3-fc0b-11e1-8fab-1c6f65487b11
    15 }}}
    17 It seems that 0xc00000fd is a stack overflow exception.
    19 Possibly related information:
    20  * [ Stack Overflow exception (0xC00000FD) at winsock select]
     1The job manager in hive should be made more stable.