Forum Discussion

cjunior_138458
Altostratus
Jul 24, 2014

Web scraping tuning issue

Hi, I have questions about how to apply Web Scraping protection. I read the great article "https://devcentral.f5.com/articles/more-web-scraping-bot-detection" written by John Wagnon and tried to implement this feature in my ASM policy. My homepage generates 125 requests on the first access to the site.


My understanding is that the "Grace Interval" value should be at least greater than my number of initial requests. During that interval ASM tests whether the client is a robot (here is my problem) and, if it detects one, penalizes subsequent accesses for the number of requests configured as the "Unsafe Interval". If it does not detect a robot, it allows navigation without checking for the next N requests configured as the "Safe Interval", and then returns to the validation flow. A rough sketch of how I understand this cycle is below.
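Here is a toy Python model of that Grace/Safe/Unsafe cycle as I understand it. This is only a sketch of the counters described in the article, not F5's implementation, and details such as what happens when the grace interval expires with no verdict are my assumptions:

    class WebScrapingCycle:
        # Toy model of the Grace/Safe/Unsafe request counters
        # (my assumptions, not F5 code).

        def __init__(self, grace=150, safe=2000, unsafe=100):
            self.grace, self.safe, self.unsafe = grace, safe, unsafe
            self.state, self.count = "grace", 0   # start by validating the client

        def on_request(self, verdict=None):
            # verdict: None while undecided, True = proved human, False = detected bot
            self.count += 1
            if self.state == "grace":
                if verdict is True:                          # human: stop checking for a while
                    self.state, self.count = "safe", 0
                elif verdict is False or self.count >= self.grace:
                    # assumption: no proof by the end of grace counts as a bot
                    self.state, self.count = "unsafe", 0
                return "allow"                               # grace requests still pass
            if self.state == "safe":
                if self.count >= self.safe:                  # safe interval used up,
                    self.state, self.count = "grace", 0      # go back to validating
                return "allow"
            if self.count > self.unsafe:                     # unsafe interval served,
                self.state, self.count = "grace", 1          # re-validate from this request
                return "allow"
            return "block"

With grace=150 the model lets my 125-request homepage burst through before any verdict is forced, which is why I sized the Grace Interval above the number of initial requests.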


My problem is that ASM performs a POST to test client interactivity, and this makes me lose navigation data in Google Analytics.


Well, my questions are: do the counters consider the request source IP or the trusted XFF header? Is the counter kept per client connection or globally? What are the ideal values for a site with my homepage's characteristics? What is the ideal "Safe Interval" value? I think 2000 is just too much for my case; am I wrong? Can anyone help me?


Thank you very much!


1 Reply

  • It really varies by use case; I don't think anyone can give you optimal settings. You might want to talk to your FSE or possibly your account manager and see if you can get some guidance on optimizing ASM for your environment. It's possible this may involve a Professional Services engagement, but if you're just looking to have bot detection optimized, the cost should be minimal. They will need to look at the traffic flow for your website to answer these questions.


    Alternatively, you can experiment with different values and see which ones get you the result you need; a rough scripting sketch for that follows.
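    If you want to script that experimentation rather than click through the GUI, something like the Python sketch below could work against iControl REST. Caveat: the /mgmt/tm/asm/policies endpoint exists, but the web-scraping sub-path and the property names here are assumptions on my part; check the ASM REST API reference for your version before relying on them.

        import requests

        BIGIP = "https://bigip.example.com"   # hypothetical management address
        POLICY_ID = "ABC123"                  # find yours via GET /mgmt/tm/asm/policies
        AUTH = ("admin", "password")          # use real credentials or a token

        def set_intervals(grace, safe, unsafe):
            # NOTE: the "web-scraping" sub-path and property names are guesses
            url = f"{BIGIP}/mgmt/tm/asm/policies/{POLICY_ID}/web-scraping"
            payload = {
                "graceInterval": grace,
                "safeInterval": safe,
                "unsafeInterval": unsafe,
            }
            # verify=False only for a lab box with a self-signed cert
            r = requests.patch(url, json=payload, auth=AUTH, verify=False)
            r.raise_for_status()
            return r.json()

        # e.g. start just above the 125-request homepage burst and tighten from there:
        # set_intervals(grace=150, safe=500, unsafe=100)

    Remember to apply the policy after changing it, and watch the event logs while you test so you can see which interval a given client is in.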