Forum Discussion

Krzysztof_Kozlo
Nimbostratus
Aug 03, 2007

lb::reselect fails to select another node

I stripped out everything fancy and this still doesn't work. The behavior is peculiar:

rule reselect_test {
   when LB_FAILED {
      LB::reselect
   }
}

pool test {
   member 1.1.1.1:any
   member 1.1.1.2:any
}

virtual test {
   destination 1.1.2.1:any
   protocol tcp
   rule reselect_test
   pool test
   snat automap
}

When I connect, every other time the connection hangs while the LTM goes nuts trying to reconnect to the same back-end server.

Curiously, if I open another connection, it breaks the first connection out of this loop and connects to the second server.

I tested this on two 9.2.3 (build 255.0) systems and one 9.3.0 system. Same thing.

I thought LB::reselect was (a) supposed to select a _different_ node, and (b) supposed to be limited in the number of retries?

8 Replies

  • Try calling LB::detach before LB::reselect.

    LB::detach disconnects the server-side connection.
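
    A minimal sketch of that suggestion (untested; it just combines the two commands named above inside the original post's LB_FAILED event):

    when LB_FAILED {
      # drop the failed server-side connection first
      LB::detach
      # then re-run load balancing per the pool's LB method
      LB::reselect
    }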
  • Joseph_Chan_463
    Historic F5 Account
    BTW, is there a monitor checking the health of those two nodes?

    This topic also tries to do something similar:

    http://devcentral.f5.com/Default.aspx?tabid=53&forumid=5&postid=14059&view=topic

    You may wish to try LB::down, but a monitor is the proper way to do this. A monitor will watch the node and notice when it comes back up; a rule can only mark it down and forget about it.

    http://devcentral.f5.com/wiki/default.aspx/iRules/LB__down.html
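
    A rough sketch of that LB::down idea (untested; note that without a monitor, nothing will ever mark the member up again):

    when LB_FAILED {
      # mark the member that just failed as down so the next pick skips it
      LB::down
      # then re-run load balancing to choose another member
      LB::reselect
    }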
  • Deb_Allen_18
    Historic F5 Account
    LB::reselect chooses a node based on the LB algorithm for the pool, which may or may not be a "different" server. It reselects only once, but if the reselected server also fails to respond, you will loop on the LB_FAILED event endlessly unless you include some count/stop logic in your iRule.

    When you say "every other time the connection hangs", that would seem to indicate that one of your pool members is not responding. I don't see that you have any monitoring in place.

    I'm not sure why the other node isn't selected on failure, though, since you have the default LB method, Round Robin, configured.

    I'd start by applying a monitor to the pool. You should see better behaviour then. If you continue to have difficulty, post back & we can try to help further.

    /deb

  • I don't want monitoring on the pool. The whole idea is that this is supposed to be a layer-3 rule that will dynamically send users only to servers that are listening on a given port.

    One of the servers is not responding; that's correct, and it's by design. The connection hangs because the LTM loops infinitely, reselecting the same node it selected to begin with (i.e. the one that doesn't respond).

    What I want it to do is select the other node when the first one fails. That's what LB::reselect is supposed to do, but it doesn't.
  • Deb_Allen_18
    Historic F5 Account
    I'd say you need to open a Support case, then, especially if you've been struggling with this for several months without resolution.

    An iRules workaround might be to manually re-select the other server, then bail out if both are non-responsive. I've had other customers implement similar logic successfully for other reasons, but it obviously won't scale well above 2 servers:
    when CLIENT_ACCEPTED {
      set failed 0
    }
    when LB_FAILED {
      incr failed
      if { $failed > 1 } {
        # specify action if both servers failed
        reject
      } else {
        # default case would match if no pool or server selected
        switch [LB::server addr] {
          1.1.1.1 { node 1.1.1.2 0 }
          1.1.1.2 { node 1.1.1.1 0 }
          default { reject }
        }
      }
    }
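
    For pools larger than two members, a hedged variant (a sketch, untested) combines the same retry counter with the LB::down suggestion from earlier in the thread, so the failed member cannot be re-picked:

    when CLIENT_ACCEPTED {
      set failed 0
    }
    when LB_FAILED {
      incr failed
      # cap retries at the pool size (2 here) to avoid looping on LB_FAILED forever
      if { $failed > 2 } {
        reject
      } else {
        # mark the failed member down so the next pick skips it, then retry
        LB::down
        LB::reselect
      }
    }
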
    The advantage of monitoring is that a monitor looks for an expected response, rather than just a SYN/ACK, to determine if the server is healthy enough to receive traffic.

    HTH, and please let us know what you discover with Support.

    /deb

  • I'm well aware of the advantage of out-of-band monitoring. In this case, it doesn't scale: the servers in question are dynamically allocated on various ports, and we don't want to have to update the load-balancer configuration every time a server is brought up or down.

    Also, this scheme will catch servers that fail to respond within the interval window of out-of-band monitoring.

    I've had a support case open since last Friday, but only yesterday received a response, which was a suggestion to check this thread!