JPPF Issue Tracker
star_faded.png
Please log in to bookmark issues
enhancement_small.png
CLOSED  Enhancement JPPF-316  -  Improve server responsiveness when many nodes connect at once
Posted Aug 21, 2014 - updated Oct 26, 2014
action_vote_minus_faded.png
0
Votes
action_vote_plus_faded.png
icon_info.png This issue has been closed with status "Closed" and resolution "RESOLVED".
Issue details
  • Type of issue
    Enhancement
  • Status
     
    Closed
  • Assigned to
     lolo4j
  • Type of bug
    Not triaged
  • Likelihood
    Not triaged
  • Effect
    Not triaged
  • Posted by
     lolo4j
  • Owned by
    Not owned by anyone
  • Category
    Server
  • Resolution
    RESOLVED
  • Priority
    High
  • Targetted for
    icon_milestones.png JPPF 5.0
Issue description
I have observed that, when starting many nodes at once, the server can freeze for a significant amount of time before completing the handshake with all the nodes.

For instance, when starting 30 nodes, using node provisioning (so all on the same machine), the server freezes for about 50 seconds. If I start the 30 nodes in multiple steps, for instance 8 nodes then wait until handshake is done, then 8 more, etc..., then this issue does not occur.

Furthermore, turning on DEBUG-level logging in the server i observed that the freeze happens after the handshake completes for all clas loader connections, but before the handshake for the job channels starts. This needs to be investigated, as this could be very disruptive when hundreds of nodes are connecting at the same time, e.g. in the case of a driver restart.