Tracking triggers in an iRule

Question

Hi All,&nbsp;
&nbsp;I have the following iRule which checks a data group to see if the server is marked as online or off-line. If it is marked as online, traffic passes as normal, if it is off-line,  it sends back a 503. If the proxy is online, but the back end pool is unavailable (using LB_FAILED), it sends back a 502. The pool is actually a single node, so there is no need for LB::reselect (which I don't think would work anyway). I have a tcp profile assigned to the virtual server that sets max syn retry to 1 so that LB_FAILED is immediate. &nbsp;
&nbsp;This has worked fairly well so far, except that LB_FAILED is being triggered intermittently, and I don't know why.  One request will get a 502, while another request, received within milliseconds, goes through. If I were using a built-in health check, there would be logging on member up/down, and failures to select pools. But since I am doing passive checking, there isn't much info that I can find. Is there a way to see what is causing the failures from the LTMs point of view?&nbsp;
&nbsp;Thanks,
&nbsp;Kenny&nbsp;
&nbsp;when RULE_INIT {&nbsp;
&nbsp;   log local0.info "proxystatushttp v1.0  $static::tcl_platform(os) $static::tcl_platform(osVersion)"&nbsp;
&nbsp;   set static::DEBUG 0
&nbsp;   
&nbsp;   set static::offlineFlag "offline"
&nbsp;   set static::proxyStatus proxystatus&nbsp;
&nbsp;   if { $static::DEBUG } { log local0.debug "$static::proxyStatus:\n[class get $static::proxyStatus]" }
&nbsp;   
&nbsp;   set static::privateNetworkAddresses private_net
&nbsp;   set static::externalMonitoringAddresses external_monitoring_addresses&nbsp;
&nbsp;   if { $static::DEBUG } { log local0.debug "$static::privateNetworkAddresses:\n[class get $static::privateNetworkAddresses]" }
&nbsp;   if { $static::DEBUG } { log local0.debug "$static::externalMonitoringAddresses:\n[class get $static::externalMonitoringAddresses]" }&nbsp;
&nbsp;}&nbsp;
&nbsp;when HTTP_REQUEST {&nbsp;
&nbsp;   if { [class lookup $static::offlineFlag $static::proxyStatus] } {&nbsp;
&nbsp;      if { (not [class match [IP::client_addr] equals $static::externalMonitoringAddresses]) &amp;&amp; 
&nbsp;           (not [class match [IP::client_addr] equals $static::privateNetworkAddresses]) } {&nbsp;
&nbsp;     set response "ForbiddenNOTICE: Service unavailable at this time."&nbsp;
&nbsp;     HTTP::respond 503 content $response noserver "Connection" "close" "Content-Length" [string length $response]
&nbsp;         if { $static::DEBUG } { log local0.debug "Sent HTTP Status Code 503 due to proxy status offline to [IP::client_addr]" }
&nbsp;     log -noname local0. "[virtual name] MyIP=[IP::local_addr] SrcIP=[IP::client_addr] - - $$[clock format [clock seconds] -format "%d/%b/%Y:%H:%M:%S %z"]$$ - \"[HTTP::method] [HTTP::uri] HTTP/[HTTP::version]\" 503 [HTTP::payload length]"
&nbsp;     return
&nbsp;      }
&nbsp;      else {
&nbsp;         if { $static::DEBUG } { log local0.debug "Processing HTTP request with proxy status offline from [IP::client_addr]" }
&nbsp;      }
&nbsp;   }
&nbsp;}&nbsp;
&nbsp;when LB_FAILED {
&nbsp;   set response "Server ErrorNOTICE: Site has experienced an error."
&nbsp;   HTTP::respond 502 content $response noserver "Connection" "close"
&nbsp;   log -noname local0. "[virtual name] MyIP=[IP::local_addr] SrcIP=[IP::client_addr] - - $$[clock format [clock seconds] -format "%d/%b/%Y:%H:%M:%S %z"]$$ - \"[HTTP::method] [HTTP::uri] HTTP/[HTTP::version]\" 502 [HTTP::payload length]"
&nbsp;}

the_bhattman · Answer

HI Kenny, 
&nbsp;   You might need to apply a TCP profile where you can fine tune the "Maximum Syn Retransmissions" . Here is a link to describe the various settings behind it. 
&nbsp;  
&nbsp; http://devcentral.f5.com/wiki/default.aspx/iRules/LB_FAILED.html 
&nbsp;  
&nbsp; I hope this helps 
&nbsp; Bhattman &nbsp;

hooleylist · Answer

Hi Kenny, 
&nbsp;  
&nbsp; I don't think there is anything special you're doing in the iRule which would trigger a load balancing failure and the LB_FAILED event to run.  You can check the LB_FAILED wiki page for details on when this event is triggered: 
&nbsp;  
&nbsp; http://devcentral.f5.com/wiki/default.aspx/iRules/lb_failed 
&nbsp;  
&nbsp; If this happens frequently, you could try capturing a tcpdump of the client and serverside traffic to see if the pool member is in fact not responding to LTM SYNs.  For details on using tcpdump, check SOL411: 
&nbsp;  
&nbsp; sol411: Overview of packet tracing with the tcpdump utility 
&nbsp; http://support.f5.com/content/kb/en-us/solutions/public/0000/400/sol411.html 
&nbsp;  
&nbsp; Aaron

kenny_lussier_5 · Answer

Thanks for the pointers. Tracking connections is a little tough, since there are thousands of connections to the front end, and the pool/node is a load balancer. finding the one SYN that isn't ACKd is like finding a needle in a needle stack :-)  
&nbsp;  
&nbsp; One thing to note is that I replaced some old Linux servers running Apache using mod_proxy with the LTM. We never had this issue until we went to the LTM. I am trying to figure out which of the hundreds of differences is causing the problem, and if there is a way to adjust the LTM so that it doesn't behave this way. I suppose increasing the SYN retry is an option. Would using an LB::reselect work if there is only one node in a pool? 
&nbsp;  
&nbsp; Thanks, 
&nbsp; Kenny &nbsp;

colin_walker_12 · Answer

I don't think you'd need a LB::reselect, if you just want to try again to the same server, you could use HTTP::retry.  
&nbsp;  
&nbsp; Colin

hooleylist · Answer

With a default TCP profile, TMM tries 5 times over 45 seconds to establish a TCP connection.  If that's not enough attempts you could increase the Maximum Syn Retransmissions" option in the TCP profile.   
&nbsp;  
&nbsp; However, it would probably be more effective to try to capture the failure happening in a tcpdump so you can see exactly what's failing.  I realize that's not easy when the virtual server is in production.  However, you might be able to create a test VS with a custom SNAT pool and point a test client (or front end server) at it to isolate the traffic. 
&nbsp;  
&nbsp; Aaron

Forum Discussion

Tracking triggers in an iRule

9 Replies

Recent Discussions

ASM instance creation

[ASM] - what is "Browser Challange file" ?

[ASM] - HTML5 Cross-Domain Request Enforcement - CLI command

Reverse Proxy Not Behaving

Stable Firmware for F5

Related Content

Irule Trigger two times

ASM_REQUEST_BLOCKING not being triggered in iRule

IRule to select IRule doesn't work even if the condition triggers

Tracking MAC in vCMP

F5 WAF/ASM block users that trigger too many violations by source ip/device id using the correlation logs