SeanG Posted December 5, 2011 Report Posted December 5, 2011 I have an intermittent fault which is causing some problems, and have found that both Wideserver and wideclient have errors appearing in their respective log files at the time of the fault occurring. The problem I have is that I can't generate the fault at will, and as there are multiple copies of the application in question running, I don't even know which instance is at fault. Any pointers in tracking down either the offset at fault, or the offending app instance would be appreciated :) Wideserver log file contents: ********* WideServer.DLL Log [version 6.78] ********* Blocksize guide = 4096 (double allowed) Date (dmy): 04/12/11, Time 10:04:28.948: Server name is FS 88874 Initialising TCP/IP server 88874 Initialising IPX/SPX server 88874 IPX/SPX socket() failed [Error=10044] Socket type not supported 88874 Failed to start IPX/SPX Server 88874 Initialising UDP/IP server 90574 Broadcasting service every 1000 mSecs 91089 Incoming connection Accepted ok (skt=3408) TCP 91214 Incoming connection Accepted ok (skt=3412) TCP 91292 Connected to computer "CDU" running WideClient version 6.860 (skt=3412) TCP 91401 Connected to computer "ENGINES" running WideClient version 6.860 (skt=3408) TCP 93102 Incoming connection Accepted ok (skt=3504) TCP 93258 Connected to computer "PFD" running WideClient version 6.860 (skt=3504) TCP 11838042 recvfrom failed: closing client! [Error=10054] Connection reset by peer (skt=3504) TCP 11840991 Incoming connection Accepted ok (skt=5528) TCP 11841006 Connected to computer "PFD" running WideClient version 6.860 (skt=5528) TCP 18145163 **** ERROR! Sumcheck or length fails on received socket 5528 block, len=15306 (time=36) 24203009 **** ERROR! Sumcheck or length fails on received socket 5528 block, len=32512 (time=33554432) Wideclient log file contents: ********* WideClient Log [version 6.86] Class=FS98MAIN ********* Date (dmy): 04/12/11, Time 10:05:31.171: Client name is CDU 78 Attempting to connect now 78 LUA: "C:\Applications\WideClient\Initial.LUA": not found 328 New Client Application: "FsPanel" (Id=3668) 1078 Trying to locate server: Protocol not yet decided 1078 Failed to connect: waiting to try again 2375 New Client Application: "cdu" (Id=1676) 3140 Attempting to connect now 10718 New Client Application: "FsPanel" (Id=196) 21218 New Client Application: "FsPanel" (Id=684) 32203 New Client Application: "FsPanel" (Id=1568) 42171 New Client Application: "FsPanel" (Id=1248) 48406 Server = FS 48421 Trying TCP/IP host "FS" port 8002 ... 48437 ... Okay, IP Address = 10.0.0.52 48453 Connection made okay! 49390 New Client Application: "FsPanel" (Id=1792) 49609 New Client Application: "FsPanel" (Id=1848) 49953 New Client Application: "FsPanel" (Id=1900) 50296 New Client Application: "FsPanel" (Id=1944) 50593 New Client Application: "FsPanel" (Id=2068) 50937 New Client Application: "FsPanel" (Id=2112) 51281 New Client Application: "FsPanel" (Id=2160) 51484 New Client Application: "FsPanel" (Id=1040) 51750 New Client Application: "FsPanel" (Id=2260) 53765 New Client Application: "FsPanel" (Id=2344) 46933156 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46933187 Connection closed by server! 46933187 Error on send() of 251 bytes [Error=10054] Connection reset by peer 46933187 The connection was terminated due to a time-out or other failure! 46933843 send() failed [0 bytes] after 40 retries, request depth is 3 46933875 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46933906 Connection closed by server! 46933906 The connection was terminated due to a time-out or other failure! 46933937 Attempting to connect now 46934968 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46934984 Connection closed by server! 46934984 The connection was terminated due to a time-out or other failure! 46935000 Attempting to connect now 46935015 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935031 Connection closed by server! 46935046 The connection was terminated due to a time-out or other failure! 46935046 Attempting to connect now 46935062 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935093 Connection closed by server! 46935109 The connection was terminated due to a time-out or other failure! 46935125 Attempting to connect now 46935140 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935156 Connection closed by server! 46935171 The connection was terminated due to a time-out or other failure! 46935171 Attempting to connect now 46935187 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935203 Connection closed by server! 46935218 The connection was terminated due to a time-out or other failure! 46935234 Attempting to connect now 46935250 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935250 Connection closed by server! 46935281 The connection was terminated due to a time-out or other failure! 46935281 Attempting to connect now 46935296 Trying to locate server: Protocol not yet decided 46935312 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935328 Connection closed by server! 46935328 The connection was terminated due to a time-out or other failure! 46935343 Attempting to connect now 46935359 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935375 Connection closed by server! 46935390 The connection was terminated due to a time-out or other failure! 46935406 Attempting to connect now 46935421 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935421 Connection closed by server! 46935437 The connection was terminated due to a time-out or other failure! 46935453 Attempting to connect now 46935468 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935500 Connection closed by server! 46935515 The connection was terminated due to a time-out or other failure! 46935546 Attempting to connect now 46935546 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935562 Connection closed by server! 46935578 The connection was terminated due to a time-out or other failure! 46935593 Attempting to connect now 46935609 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935625 Connection closed by server! 46935640 The connection was terminated due to a time-out or other failure! 46935640 Attempting to connect now 46935656 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935671 Connection closed by server! 46935687 The connection was terminated due to a time-out or other failure! 46935703 Attempting to connect now 46935718 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935734 Connection closed by server! 46935750 The connection was terminated due to a time-out or other failure! 46935765 Attempting to connect now 46935781 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935796 Connection closed by server! 46935812 The connection was terminated due to a time-out or other failure! 46935828 Attempting to connect now 46935828 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935843 Connection closed by server! 46935859 The connection was terminated due to a time-out or other failure! 46935875 Attempting to connect now 46935890 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935906 Connection closed by server! 46935906 The connection was terminated due to a time-out or other failure! 46935921 Attempting to connect now 46935937 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46935953 Connection closed by server! 46935968 The connection was terminated due to a time-out or other failure! 46935984 Attempting to connect now 46935984 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936000 Connection closed by server! 46936015 The connection was terminated due to a time-out or other failure! 46936031 Attempting to connect now 46936046 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936062 Connection closed by server! 46936078 The connection was terminated due to a time-out or other failure! 46936093 Attempting to connect now 46936109 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936125 Connection closed by server! 46936140 The connection was terminated due to a time-out or other failure! 46936140 Attempting to connect now 46936156 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936171 Connection closed by server! 46936187 The connection was terminated due to a time-out or other failure! 46936203 Attempting to connect now 46936218 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936234 Connection closed by server! 46936265 The connection was terminated due to a time-out or other failure! 46936281 Attempting to connect now 46936296 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936296 Connection closed by server! 46936312 The connection was terminated due to a time-out or other failure! 46936328 Attempting to connect now 46936343 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936359 Connection closed by server! 46936375 The connection was terminated due to a time-out or other failure! 46936390 Attempting to connect now 46936406 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936406 Connection closed by server! 46936421 The connection was terminated due to a time-out or other failure! 46936437 Attempting to connect now 46936453 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936468 Connection closed by server! 46936500 The connection was terminated due to a time-out or other failure! 46936515 Attempting to connect now 46936515 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936531 Connection closed by server! 46936546 The connection was terminated due to a time-out or other failure! 46936562 Attempting to connect now 46936578 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936593 Connection closed by server! 46936593 The connection was terminated due to a time-out or other failure! 46936609 Attempting to connect now 46936625 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936640 Connection closed by server! 46936656 The connection was terminated due to a time-out or other failure! 46936671 Attempting to connect now 46936687 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936703 Connection closed by server! 46936718 The connection was terminated due to a time-out or other failure! 46936718 Attempting to connect now 46936734 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936750 Connection closed by server! 46936765 The connection was terminated due to a time-out or other failure! 46936781 Attempting to connect now 46936796 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936812 Connection closed by server! 46936812 The connection was terminated due to a time-out or other failure! 46936828 Attempting to connect now 46936843 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936875 Connection closed by server! 46936875 The connection was terminated due to a time-out or other failure! 46936890 Attempting to connect now 46936906 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936921 Connection closed by server! 46936937 The connection was terminated due to a time-out or other failure! 46936953 Attempting to connect now 46936953 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46936968 Connection closed by server! 46936984 The connection was terminated due to a time-out or other failure! 46937000 Attempting to connect now 46938015 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46938015 Connection closed by server! 46938046 The connection was terminated due to a time-out or other failure! 46938062 Attempting to connect now 46938062 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46938078 Connection closed by server! 46938093 The connection was terminated due to a time-out or other failure! 46938109 Attempting to connect now 46938125 Error on client post-Connection Select() [Error=10053] Software caused connection abort 46938125 Connection closed by server! 46938140 The connection was terminated due to a time-out or other failure! 46938156 Attempting to connect now Regards SeanG
Pete Dowson Posted December 5, 2011 Report Posted December 5, 2011 I have an intermittent fault which is causing some problems, and have found that both Wideserver and wideclient have errors appearing in their respective log files at the time of the fault occurring. The problem I have is that I can't generate the fault at will, and as there are multiple copies of the application in question running, I don't even know which instance is at fault. Any pointers in tracking down either the offset at fault, or the offending app instance would be appreciated All of the errors you are seeing cannot be the responsibility of any application, but must be due to the network connection. Sumcheck errors indicate corrupted blocks, and such could be due to anything from bad memory at either end, bad network drivers, bad network interface, router, cable or switch. I've never seen such errors without there eventually being a component found faulty. Unfortunately you've only suppied log fragments -- if you'd kindly close everything down first before collecting logs it would be much more useful, as summaries are added at the end with a normal closedown. There are only two errors in the Server fragment, both in blocks from the one client, "PFD": 11841006 Connected to computer "PFD" running WideClient version 6.860 (skt=5528) TCP18145163 **** ERROR! Sumcheck or length fails on received socket 5528 block, len=15306 (time=36) 24203009 **** ERROR! Sumcheck or length fails on received socket 5528 block, len=32512 (time=33554432) And these two occur at widely spaced intervals -- 1hr 45 mins after it joined and then another 1hr 41mins later. Very strange, no? Is there something happening on that PC at such intervals? You don't bother to show the Client log for that PC, which is odd considering that's the only one of the three clients which the Server saw problems with, at least in the fragment you supplied.. The other partial log you provide is for the CDU: Date (dmy): 04/12/11, Time 10:05:31.171: Client name is CDU and the first error on that was a long way into the session: 46933156 Error on client post-Connection Select() [Error=10053] Software caused connection abort46933187 Connection closed by server! after which it didn't appear to recover. This sequence occurs a full 12 hours and 2 minutes into a session. Are you sure your server was still running FS at that time? I can't tell, you supply only partial data. You don't show any part of the WideServer log that far into your session! Regards Pete
SeanG Posted December 5, 2011 Author Report Posted December 5, 2011 Hmm, I'll have another look at the files, but I would have thought they would be complete... I'll have another look :) We have this happening on a number of sites at infrequent intervals, and all since we changed out the FS PC's to win 7 boxes. Prior to that the whole network was Win XP Pro, and we never (that I am aware of) had this problem. I'll try to grab a full set of log files (server and all relevant clients) from one which has had this fault, and start delving into network issues with the new PC's Thanks Pete, at least I have a starting point to launch the trouble-shooting from! :) SeanG
Pete Dowson Posted December 5, 2011 Report Posted December 5, 2011 Hmm, I'll have another look at the files, but I would have thought they would be complete... I'll have another look :) Well, if they are 'complete' then they're not "closed", meaning that they are not taken after FS was closed, or WideClient either. There's always a summary at the end after a normal close. The potentially important but missing log is the one from the only client for which the Server is showing as giving any errors (and only two isolated ones which would do little harm in any case), at least for the 3 hours or so covered by the server log fragment. The 12 hour session on the Client log fragment shows that your server session continued a lot longer, and the only errors that shows, after over 12 hours, are really those you might normally see if you closed the server untidily (i.e. without using the closure options in the clients so that they can close tidily too). Note that if you only get problems occurring after long periods like that, it may be indicative of a memory leak in one or other of the add-ons you are using. In fact, with FS9, there were a lot of memory leakages occurring from scenery elements such as autogen files, especially when they are placed in folders without a proper Scenery + Texture pair. I don't recall the exact details, not having used FS9 for over five years now, but i expect you can still find mention in the assorted Forums covering FS9. Regards Pete
SeanG Posted December 5, 2011 Author Report Posted December 5, 2011 I suspect that the way we shut down FS and the clients will cause the loss of the end of the logs... we just command the PC's to shutdown, so I don't know if that would cause everything to close nicely. Next time we have a report of this fault I'll close everything tidily and grab all of the log files. We have pretty much ruled out memory leaks, and we've seen more than our fair share of them in FS9 scenery over the years! Thanks for your help Pete, this is one of those frustrating problems which has been haunting us for some time <lol> SeanG
Pete Dowson Posted December 5, 2011 Report Posted December 5, 2011 I suspect that the way we shut down FS and the clients will cause the loss of the end of the logs... we just command the PC's to shutdown WideServer can do that tidily. Thanks for your help Pete, this is one of those frustrating problems which has been haunting us for some time <lol> Well, assuming the errors in the client log after over 12 hours were due to the server closing untidily, then the only 'problem' I see in what you've supplied is a recoverable one twice in over three hours, and no more. Maybe it isn't so bad? Pete
SeanG Posted December 5, 2011 Author Report Posted December 5, 2011 WideServer can do that tidily. Yeah, that's the way I do it on my sim at home... unfortunately not all of the PC's in the rack are running wideclient, so we need an alternate way of doing it anyway... Well, assuming the errors in the client log after over 12 hours were due to the server closing untidily, then the only 'problem' I see in what you've supplied is a recoverable one twice in over three hours, and no more. Maybe it isn't so bad? More information I have just found; The problem we are having is that one of the "FSPANEL.EXE" instances (FSPanels are our utils which communicate with FSUIPC then send the data out to our other apps) is randomly generating an "FSUIPC Error 13: IPC Request contains bad data" in our app log file. This is one of the apps (not sure which instance) running on the CDU machine, from where the client log came. SeanG
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now