Dealing with long lasting outbound TCP Connections

Question

I am currently trying to resolve an issue concerning long lasting TCP Sessions. 
&nbsp;One of the balanced webservers is regularly querying a database server that is outside the loadbalanced segment. The operation triggered lasts quite a long time, since it has to process a large amount of data. During that time the TCP session is still open but idle. 
&nbsp;After one hour the session is resetted by the Big IP causing the Query to fail.&nbsp;
&nbsp;My suspicion is that this reset is caused by the wildcard forwarding server 0.0.0.0 which routes the outbound traffic back to the rest of the LAN. This server has a timeout setting in its client connect profile of 3600 seconds which would qualify as a reason. 
&nbsp;I tried bypassing that by adding a second forwarding server which only contains the one database host needed and added a longer timeout in a separate Client Protocol Profile, as well as disabling resetting timed out TCP session without, success. &nbsp;
&nbsp;I am wondering wheter I am completely thinking wrong here or that wildcard virtual server matches before the dedicated Forwarding Server matches and I don´t see a way at the moment to reverse that, if there is one at all. 
&nbsp;Currently I am thinking that adding an iRule to the wildcard server that exchanges the client protocol profil when an IP from a certain datagroup matches might be the most feasable solution. 
&nbsp;Or would there be an easier way around such a problem? &nbsp;
&nbsp;The reason why I am asking is that we do not have a real test environment and everything is run over that one productive cluster (I know it should be different, but I am already sore from argumenting with the other admins), and fidgeting around with the standard route doesn´t leave me feeling comfortable.
&nbsp;So if there is an easier solution to this problem it would be greatly appreciated.&nbsp;
&nbsp;Regards
&nbsp;Andre&nbsp;

hooleylist · Answer

Hi Andre, 
&nbsp;  
&nbsp; The more specific virtual server should be the one matched: 
&nbsp;  
&nbsp; SOL9038 - The order of precedence for local traffic object listeners 
&nbsp; https://support.f5.com/kb/en-us/solutions/public/9000/000/sol9038.html 
&nbsp;  
&nbsp; Can you check the connection table to see which VS is being matched and what idle timeout is set for the connection?  The syntax has changed slightly over the versions, but you should be able to check using 'tmsh show sys conn...'  See the man page for details on filtering the connection table entries in your version (tmsh help sys conn). 
&nbsp;  
&nbsp; Aaron

andre_12127 · Answer

Thanks for the help Aaron, its greatly appreciated.  
&nbsp;  
&nbsp; So basicly the idea of creating a forwarding VS that only relates to that one outbound host with a different client connect profile should do the trick.  
&nbsp; Ok then I will have to investigate further. I will have to talk to the database guys to see wheter they can produce the query again today. They said they would need some time, but hopefully we can arrange that sooner, since I want that off the table. Once we have that query running again, I will check like suggested and post results. 
&nbsp;  
&nbsp; Andre

nitass · Answer

if you are running 10.2.3 or later, logging reset cause might be helpful. 
&nbsp;  
&nbsp; sol13223: Configuring the BIG-IP system to log TCP RST packets 
&nbsp; http://support.f5.com/kb/en-us/solutions/public/13000/200/sol13223.html

andre_12127 · Answer

Unfortunately we are on 10.2.0 still, and at the moment it looks like an update might take some time.  
&nbsp;  
&nbsp; Have talked with the database admins, and they want to give it another go somewhere down next week. So sit tight, I will be back ;)

nitass · Answer

is this relevant? 
&nbsp;  
&nbsp; sol8049: Implementing TCP Keep-Alives for server-client communication using TCP profiles 
&nbsp; https://devcentral.f5.com/Community/GroupDetails/tabid/1082223/asg/52/aft/2163480/showtab/groupforums/Default.aspx

Forum Discussion

Dealing with long lasting outbound TCP Connections

6 Replies

Recent Discussions

F5 Rseries HA

Error when running bigip_command Playbook against LTM : Syntax Error: unexpected argument /bin/sh\n

Can iRule be used to perform exception of IPI category based on Geolocation

Can iRule mask the payload content on event logs of security

minimum tmos software version for connect CIS (openshift)

Related Content

Dealing with DDoS threats by KillNet, Anonymous Sudan and REvil

What is SMTP Smuggling and how can you deal with it?

When using F5 Distributed Cloud Platform, never deal with Site to Site IP conflicts again!

SSL Orchestrator Advanced Use Cases: Outbound SNAT Persistence

Securely connecting Kubernetes Microservices with F5 Distributed Cloud