Advice on "persist hash" load balancing

Question

Hi, I've been reading through past post about load balancing certain requests based on part of the request data using the "persist hash" method, but I haven't been able to implement it successfully. This is on v9.2.3.
I'll use the URL of this forum-post page as an example:
http://devcentral.f5.com/Default.aspx?tabid=28&amp;forumid=5&amp;view=post
I want to pick a member node of the pool based on the forumid value in the query string, to optimize caching on the web servers. A given user will be bounced between servers as they click between forums, but all requests for forum 5 will end up at a given node. For simplicity I'd rather use the "persist hash" method so I don't have to maintain a list of the actual nodes inside the irule.
Now that I've started testing my irule, however, I find that I'm getting assigned to the same server (at least for several minutes) regardless of page, as if it were persisting based on my client IP or other data rather than what I want. 
The config:
A virtual server with one pool of two nodes, Round Robin, no default persistence profile.
My irule (I'm a TCL newbie, so feel free to point out errors):
when HTTP_REQUEST
{
if {[string tolower [HTTP::uri]] starts_with "/default.aspx"}
{
set forumid "none"
if {[string first forumid= [HTTP::query]] != -1}
{
set query [HTTP::query]
regexp -nocase forumid=(\d*) $query forumid
}
if {$forumid != "none"}
{
persist hash "$forumid"
}
}
}
I've tried to use the log local0. "something" syntax I've seen here to help debug, but I can't find where the output lands - I don't see it in the Logs section of the web UI. (I can probably optimize out the first test for the forumid substring, but I'll take care of that later). I've also tried the simpler version
when HTTP_REQUEST
{
persist hash [HTTP::query]
}
and this too seems to lock me to a single server regardless of query string, for a while before it switches. It seems like severing my TCP connection between my browser and the site (the bigip device) helps break the binding to a particular node, but even then the resulting one isn't deterministic based on the query string param.
Any advice on either what I'm doing wrong or how to better debug this?
Thanks!

dennypayne · Answer

The log function to local0 should be putting log entries in the Local Traffic section of the Logs in the GUI, or /var/log/ltm if you SSH in.&nbsp;
&nbsp;Are you using a OneConnect profile?  OneConnect attempts to "piggyback" separate connections to the servers upon an already open tcp socket to minimize bandwidth usage between BIG-IP and the servers.  That may be producing similar results to what you are seeing.&nbsp;
&nbsp;I also recall seeing on a thread here somewhere that -nocase wasn't a valid operator but I can't find it at the moment....&nbsp;
&nbsp;Denny

roger_wolfson_8 · Answer

Thanks for the tips. I tweaked some log level settings and got it logging.
I checked and OneConnect is disabled, so that's not an issue here.
I also had to tweak the regexp to conform to the tcl implementation, and got my match variable set correctly. However, the same problem is appearing where any request that hits the persist command gets routed to the same server regardless of hash value. It comes down to this:persist hash "$idmatch"
log local0. "persisting on key $idmatch"
I've also tried it without quotes in the first line. The log reads:
HTTP_REQUEST: persisting on key 1570 
HTTP_REQUEST: persisting on key 1571 
HTTP_REQUEST: persisting on key 1572
etc., yet all these requests are going to the first node in the pool. I've tried several dozen hash keys so the odds are miniscule of them all hash-modding to the same one of two servers. Requests that bypass this command seem to load-balance correctly.

deb_allen_18 · Answer

Actually, you might NEED to enable OneConnect.  &nbsp;
&nbsp;Without OneConnect enabled, only the first request in a Keep-Alive connection is parsed for persistence data, so if multiple requests are sent on the same Keep-Alive connection,  LTM will persist them all to the same destination as the first.  &nbsp;
&nbsp;A OneConnect profile with mask of 255.255.255.255 will allow parsing of all requests and serverside connections will only be re-used for the same client.&nbsp;
&nbsp;HTH
&nbsp;/deb&nbsp;&nbsp;

roger_wolfson_8 · Answer

Thanks Deb, this seems to resolve the problem! However, is this a viable option to use in a production configuration, or does it have measurable impact against our previous benefit of connection marshalling?  If I understand correctly the last thing you said, the total number of connections across all webserver nodes will now equal the total number of client connections to the Bigip device, where we're currently seeing only a small fraction of the total client connections appear on the webservers.&nbsp;
&nbsp;While this would probably be an acceptable tradeoff for the benefit we can get, I want to make sure that's an accurate picture of it, and that this isn't discouraged behavior.&nbsp;
&nbsp;Thanks!&nbsp;
&nbsp;Roger

deb_allen_18 · Answer

Hi Roger, &nbsp;
&nbsp;Glad I could help.  That's a common trap.&nbsp;
&nbsp;Without OneConnect enabled, I can't explain whatever connection pooling you saw.&nbsp;
&nbsp;However, OneConnect with either mask is viable in production, and either will be more efficient than none at all, since handshake overhead for your servers will be reduced.&nbsp;
&nbsp;With OneConnect configured with the default mask of 0.0.0.0, any idle serverside connection may be re-used for any new clientside request, significantly reducing the number of serverside connections.  &nbsp;
&nbsp;However, re-used serverside connections retain the source IP of the original client, which results in some very misleading server log entries unless you are also SNATing all connections.&nbsp;
&nbsp;Without SNAT, OneConnect with a host mask (255.255.255.255) keeps the source address info in the server logs consistent with reality.&nbsp;
&nbsp;If you're already SNATing, the 0.0.0.0 mask will result in more efficient connection pooling.&nbsp;
&nbsp;HTH
&nbsp;/deb

Forum Discussion

Advice on "persist hash" load balancing

6 Replies

Recent Discussions

SMS server with BIGIP

F5 terminal - help to run commands - disk space full

Can iRule be used to perform exception of IPI category based on Geolocation

[ASM] - what is "Browser Challange file" ?

LDAPS and renegotiation

Related Content

Would the "Match accross services" work with carp based hash persistence?

Active/Active load balancing examples with F5 BIG-IP and Azure load balancer

NGINXaaS for Azure: Load Balancing

What is Load Balancing?

Deploying F5 BIG-IP with Azure Cross-region load balancer