NetXMS sever 1.1.1 crash

Started by testos, May 19, 2011, 02:20:28 PM

Previous topic - Next topic

testos

Hi.

Since I installed 1.1.1 server version, from time to time NetXMS console shows entire all nodes become to down simultaneously and do not respond to pol status. So then I restart the server and everything returns to normal ... until it happens again.

This is part of my netxmsd.log:

Quote[19-May-2011 12:48:15] Unable to create raw socket for ICMP protocol
[19-May-2011 12:48:15] Unable to create raw socket for ICMP protocol
[19-May-2011 12:48:15] Unable to create raw socket for ICMP protocol
[19-May-2011 12:48:15] Unable to create raw socket for ICMP protocol
[19-May-2011 12:48:15] Unable to create raw socket for ICMP protocol
[19-May-2011 12:48:15] Unable to create raw socket for ICMP protocol
[19-May-2011 12:48:15] Unable to create raw socket for ICMP protocol
[19-May-2011 12:48:15] Error returned by accept() system call: Too many open files
[19-May-2011 12:48:15] Error returned by accept() system call: Too many open files
[19-May-2011 12:48:15] Client session closed due to communication error (Success)
[19-May-2011 12:48:16] Error returned by accept() system call: Too many open files
[19-May-2011 12:48:16] Client session closed due to communication error (Success)
[19-May-2011 12:48:16] Error returned by accept() system call: Too many open files



Is it advisable rollback server previous version?


Best regards.

testos


I want to clarify that server daemon process does not down.
No dump files are generated  (.info and .mdmp extensions).

lomo

I'm experiencing same issue with 1.1.1 with the same messages on Debian Squeeze, compiled from sources.

Victor Kirhenshtein

I can reproduce this issue on my server. Looks like file handle leak in the server. I'll try to fix this till Monday.

Best regards,
Victor

Victor Kirhenshtein

Looks like I've found this leak (at least one of them, if there are many). Could you please replace file src/server/code/node.cpp with attached one and recompile the server?

Best regards,
Victor

testos

Okay, I've replaced the file you tell me and recompiled server.

Thanks.
Best regards.

testos

Five hours without problems, seems to look good  ::).

Best regards.

testos

After over 70 hours NetXMS server is working normally.

Best regards.