Forum Discussion

Ross_79174
Nimbostratus
Mar 11, 2009

Help caching URLs with unique date/time stamps

Hello iRule gurus!

We have an issue on our production service where we need to cache our RSS feeds to offload our backend servers. The problem is that each RSS URL has a unique date/time stamp attached to it, so it will never "hit" in the cache since no two instances are ever the same. However, we are willing to give up some security for now and "strip" the URL so that the date/time stamp is not put into the cache key, but rather just the part of the URL that stays consistent.

The complete URL looks something like this:

/rss/Pepcom/Pepcom+June+2008?oauth_consumer_key=ced9fcdbcae5bd941f51cf82421e6413&oauth_nonce=6191575&oauth_signature_method=HMAC-SHA1&oauth_timestamp=1236751715

However, we only need this much of the URL to serve the feed properly:

/rss/Pepcom/Pepcom+June+2008?oauth_consumer_key=ced9fcdbcae5bd941f51cf82421e6413

We can drop everything from the first "&" on and still serve the RSS feed properly.
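
To sketch the intent in plain Tcl (illustrative only; the variable names are made up), the cache key we are after is everything before the first "&":

    set full_uri "/rss/Pepcom/Pepcom+June+2008?oauth_consumer_key=ced9fcdbcae5bd941f51cf82421e6413&oauth_nonce=6191575&oauth_signature_method=HMAC-SHA1&oauth_timestamp=1236751715"
    set amp [string first "&" $full_uri]
    if { $amp > -1 } {
        set cache_key [string range $full_uri 0 [expr {$amp - 1}]]
    } else {
        set cache_key $full_uri
    }
    # cache_key => /rss/Pepcom/Pepcom+June+2008?oauth_consumer_key=ced9fcdbcae5bd941f51cf82421e6413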

We would also like to keep each feed in the cache for 10 minutes before it expires.

Can anyone help?

Thanks,

Ross

3 Replies

  • Me again,

    Here is what I have started with, but from looking at the cache hits, I know the trimright is not working correctly.

    when RULE_INIT {
       set ::fifteen_minutes 900
    }
    when HTTP_REQUEST {
       if { [HTTP::uri] starts_with "/rss" } {
          persist none
          CACHE::enable
          CACHE::uri [string trimright [HTTP::uri] &]
          set cachetime $::fifteen_minutes
          pool rss
       } else {
          pool gallery
          log local0. "matches rss"
       }
    }
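
    From a quick tclsh test, it looks like string trimright strips trailing characters that belong to the given character set rather than cutting the string at the first "&", which would explain why the keys never match (made-up URI, just to illustrate):

    % string trimright "/rss/feed?oauth_consumer_key=abc&oauth_nonce=123" &
    /rss/feed?oauth_consumer_key=abc&oauth_nonce=123
    % string trimright "/rss/feed?oauth_consumer_key=abc&" &
    /rss/feed?oauth_consumer_key=abc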

    Thanks.
  • Hi Ross,

    TMM will crash if you force caching with CACHE::enable and the request does not contain a Host header (not required in HTTP/1.0) or a URI (required in all HTTP versions). This is described in SOL9617. So it would be good to add a check that the Host header value and the path both have a length:

    if { [HTTP::uri] starts_with "/rss" && [string length [HTTP::host]] && [string length [HTTP::path]] } {

    To parse the portion of the URI you mention in the first post, you can use HTTP::path to get the URI minus the query string, and URI::query to pull out just the oauth_consumer_key parameter value.

    So to get this:

    /rss/Pepcom/Pepcom+June+2008?oauth_consumer_key=ced9fcdbcae5bd941f51cf82421e6413

    You can use this:

    "[HTTP::path]?oauth_consumer_key=[URI::query [HTTP::uri] oauth_consumer_key]"

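    In the rule, that expression would go where the trimright line is now, e.g. (untested sketch, using only the commands above):

    CACHE::enable
    CACHE::uri "[HTTP::path]?oauth_consumer_key=[URI::query [HTTP::uri] oauth_consumer_key]"
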
    Lastly, 'persist none' will disable persistence for the duration of the TCP connection. If there are multiple HTTP requests over the same TCP connection, with an RSS request followed by a non-RSS request, the non-RSS request wouldn't be persisted within the gallery pool. You would want to explicitly set persistence in both cases: persist none for the RSS requests and an explicit persist command for the gallery requests.
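
    For example (cookie persistence for the gallery pool is only an assumption here; substitute whatever persistence that application actually needs):

    if { [HTTP::uri] starts_with "/rss" } {
       persist none
       pool rss
    } else {
       persist cookie insert
       pool gallery
    }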

    Aaron
  • Hi Aaron,

    Thanks for your prompt response. The string manipulations all worked and we are now happily caching RSS feeds.

    I appreciate your help!

    -Ross