JPPF, java, parallel computing, distributed computing, grid computing, parallel, distributed, cluster, grid, cloud, open source, android, .net
JPPF

The open source
grid computing
solution

 Home   About   Features   Download   Documentation   On Github   Forums 
November 18, 2019, 07:19:41 AM *
Welcome,
Please login or register.

Login with username, password and session length
Advanced search  
News: New users, please read this message. Thank you!
  Home Help Search Login Register  
Pages: [1]   Go Down

Author Topic: Node Failover to Manual Configuration After Discovered Driver Failed  (Read 2865 times)

dave.swartz

  • Guest

Having read the following section of the node configuration I attempted to configure a failover to manual configuration:
http://www.jppf.org/doc/v2/index.php?title=Node_configuration#Interaction_of_failover_and_server_discovery

To do this I brought up two drivers, one with discovery enabled, the other not. I enabled discovery on the single node I brought up, but manually configured the node to point to the second driver (the one with discovery disabled). My expectation was the following:
Start the first driver and the second driver;
Start the node;
The node discovers the first driver and connects to it;
Stop the first driver;
The node tries to reconnect to the stopped driver for a time;
But eventually it will fail over to the manual configuration in the node (pointing at the second driver);

I am not seeing this behavior. After the first driver is stopped I see the following output. It seems to try over and over to connect to the disconnected class loaded port, but never try to failover to manual configuration.
Code: [Select]
run:
     [echo] Starting the JPPF node
     [java] node process id: 25933
     [java] Attempting connection to the class server at 10.--.---.-:11111
     [java] Reconnected to the class server
     [java] JPPF Node management initialized
     [java] Attempting connection to the node server at 10.--.---.-:11113
     [java] Reconnected to the node server
     [java] Node sucessfully initialized
     [java] Attempting connection to the class server at 10.--.---.-:11111
     [java] SocketInitializer.initializeSocket(): Could not reconnect to the remote server
     [java] Attempting connection to the class server at 10.--.---.-:11111
     [java] Attempting connection to the class server at 10.--.---.-:11111
     [java] Attempting connection to the class server at 10.--.---.-:11111
     [java] Attempting connection to the class server at 10.--.---.-:11111
     [java] Attempting connection to the class server at 10.--.---.-:11111

More detailed log below:
Code: [Select]
2011-10-24 23:13:19,038 [DEBUG][org.jppf.classloader.JPPFClassLoader.init(102)]: initializing connection
2011-10-24 23:13:19,039 [DEBUG][org.jppf.classloader.JPPFClassLoader.initSocketClient(80)]: initializing socket connection
2011-10-24 23:13:19,039 [DEBUG][org.jppf.comm.socket.SocketInitializerImpl.initializeSocket(87)]: about to close socket wrapper
2011-10-24 23:13:24,050 [DEBUG][org.jppf.node.NodeRunner.main(136)]: received reconnection notification
2011-10-24 23:13:24,050 [DEBUG][org.jppf.server.node.JPPFNode.stopNode(380)]: stopping node
2011-10-24 23:13:24,051 [DEBUG][org.jppf.server.node.remote.JPPFRemoteNode.closeDataChannel(105)]: closing data channel: socketClient=null, clientConnection=null
2011-10-24 23:13:24,052 [DEBUG][org.jppf.utils.NetworkUtils.getIPAddresses(139)]: found network interface: name:vnic1 (vnic1)
2011-10-24 23:13:24,053 [DEBUG][org.jppf.utils.NetworkUtils.getIPAddresses(139)]: found network interface: name:vnic0 (vnic0)
2011-10-24 23:13:24,054 [DEBUG][org.jppf.utils.NetworkUtils.getIPAddresses(139)]: found network interface: name:en0 (en0)
2011-10-24 23:13:24,055 [DEBUG][org.jppf.utils.NetworkUtils.getIPAddresses(139)]: found network interface: name:lo0 (lo0)
2011-10-24 23:13:24,055 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.receive(133)]: Found 3 addresses: 10.37.---.- 10.211.--.- 192.168.1.100
2011-10-24 23:13:29,065 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.receive(147)]: Auto-discovery of the driver connection information: null
2011-10-24 23:13:29,065 [DEBUG][org.jppf.node.NodeRunner.discoverDriver(200)]: Could not auto-discover the driver connection information
2011-10-24 23:13:29,065 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.run(258)]: Receive timed out
java.net.SocketTimeoutException: Receive timed out
at java.net.PlainDatagramSocketImpl.receive0(Native Method)
at java.net.PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:145)
at java.net.DatagramSocket.receive(DatagramSocket.java:725)
at org.jppf.comm.discovery.JPPFMulticastReceiver$Receiver.run(JPPFMulticastReceiver.java:245)
2011-10-24 23:13:29,066 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.run(258)]: Receive timed out
java.net.SocketTimeoutException: Receive timed out
at java.net.PlainDatagramSocketImpl.receive0(Native Method)
at java.net.PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:145)
at java.net.DatagramSocket.receive(DatagramSocket.java:725)
at org.jppf.comm.discovery.JPPFMulticastReceiver$Receiver.run(JPPFMulticastReceiver.java:245)
2011-10-24 23:13:29,070 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.run(258)]: Receive timed out
java.net.SocketTimeoutException: Receive timed out
at java.net.PlainDatagramSocketImpl.receive0(Native Method)
at java.net.PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:145)
at java.net.DatagramSocket.receive(DatagramSocket.java:725)
at org.jppf.comm.discovery.JPPFMulticastReceiver$Receiver.run(JPPFMulticastReceiver.java:245)
2011-10-24 23:13:29,083 [DEBUG][org.jppf.classloader.JPPFClassLoader.init(102)]: initializing connection
2011-10-24 23:13:29,083 [DEBUG][org.jppf.classloader.JPPFClassLoader.initSocketClient(80)]: initializing socket connection
2011-10-24 23:13:29,085 [DEBUG][org.jppf.comm.socket.SocketInitializerImpl.initializeSocket(87)]: about to close socket wrapper
2011-10-24 23:13:34,087 [DEBUG][org.jppf.node.NodeRunner.main(136)]: received reconnection notification
2011-10-24 23:13:34,089 [DEBUG][org.jppf.server.node.JPPFNode.stopNode(380)]: stopping node
2011-10-24 23:13:34,089 [DEBUG][org.jppf.server.node.remote.JPPFRemoteNode.closeDataChannel(105)]: closing data channel: socketClient=null, clientConnection=null
2011-10-24 23:13:34,090 [DEBUG][org.jppf.utils.NetworkUtils.getIPAddresses(139)]: found network interface: name:vnic1 (vnic1)
2011-10-24 23:13:34,092 [DEBUG][org.jppf.utils.NetworkUtils.getIPAddresses(139)]: found network interface: name:vnic0 (vnic0)
2011-10-24 23:13:34,093 [DEBUG][org.jppf.utils.NetworkUtils.getIPAddresses(139)]: found network interface: name:en0 (en0)
2011-10-24 23:13:34,094 [DEBUG][org.jppf.utils.NetworkUtils.getIPAddresses(139)]: found network interface: name:lo0 (lo0)
2011-10-24 23:13:34,095 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.receive(133)]: Found 3 addresses: 10.37.---.- 10.211.--.- 192.168.1.100
2011-10-24 23:13:39,101 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.receive(147)]: Auto-discovery of the driver connection information: null
2011-10-24 23:13:39,102 [DEBUG][org.jppf.node.NodeRunner.discoverDriver(200)]: Could not auto-discover the driver connection information
2011-10-24 23:13:39,103 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.run(258)]: Receive timed out
java.net.SocketTimeoutException: Receive timed out
at java.net.PlainDatagramSocketImpl.receive0(Native Method)
at java.net.PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:145)
at java.net.DatagramSocket.receive(DatagramSocket.java:725)
at org.jppf.comm.discovery.JPPFMulticastReceiver$Receiver.run(JPPFMulticastReceiver.java:245)
2011-10-24 23:13:39,112 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.run(258)]: Receive timed out
java.net.SocketTimeoutException: Receive timed out
at java.net.PlainDatagramSocketImpl.receive0(Native Method)
at java.net.PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:145)
at java.net.DatagramSocket.receive(DatagramSocket.java:725)
at org.jppf.comm.discovery.JPPFMulticastReceiver$Receiver.run(JPPFMulticastReceiver.java:245)
2011-10-24 23:13:39,114 [DEBUG][org.jppf.comm.discovery.JPPFMulticastReceiver.run(258)]: Receive timed out
java.net.SocketTimeoutException: Receive timed out
at java.net.PlainDatagramSocketImpl.receive0(Native Method)
at java.net.PlainDatagramSocketImpl.receive(PlainDatagramSocketImpl.java:145)
at java.net.DatagramSocket.receive(DatagramSocket.java:725)
at org.jppf.comm.discovery.JPPFMulticastReceiver$Receiver.run(JPPFMulticastReceiver.java:245)
2011-10-24 23:13:39,200 [DEBUG][org.jppf.classloader.JPPFClassLoader.init(102)]: initializing connection
2011-10-24 23:13:39,216 [DEBUG][org.jppf.classloader.JPPFClassLoader.initSocketClient(80)]: initializing socket connection
2011-10-24 23:13:39,226 [DEBUG][org.jppf.comm.socket.SocketInitializerImpl.initializeSocket(87)]: about to close socket wrapper

Not sure if this is just my misunderstanding, a configuration issue, or a legitimate bug. Any help is greatly appreciated!

Thanks,
Dave
Logged

lolo

  • Administrator
  • JPPF Council Member
  • *****
  • Posts: 2256
    • JPPF Web site
Re: Node Failover to Manual Configuration After Discovered Driver Failed
« Reply #1 on: October 26, 2011, 08:43:15 AM »

Hi Dave,

Indeed this is not working as expected.
I have registered a bug for this: 3428630 - Node failover isn't working as expected
I have found a way to fix this problem, which I will publish as a patch by tomorrow. I will update you in this thread. Sorry for the inconvenience.

Sincerely,
-Laurent
Logged

dave.swartz

  • Guest
Re: Node Failover to Manual Configuration After Discovered Driver Failed
« Reply #2 on: October 26, 2011, 04:05:34 PM »

Thank you very much for such a quick response! Will test out a fix when available.

Dave
Logged

lolo

  • Administrator
  • JPPF Council Member
  • *****
  • Posts: 2256
    • JPPF Web site
Re: Node Failover to Manual Configuration After Discovered Driver Failed
« Reply #3 on: October 27, 2011, 07:11:25 AM »

Hi Dave,

The patch is ready, you can downoad it from this page.

Sincerely,
-Laurent
Logged
Pages: [1]   Go Up
 
JPPF Powered by SMF 2.0 RC5 | SMF © 2006–2011, Simple Machines LLC Get JPPF at SourceForge.net. Fast, secure and Free Open Source software downloads