Googlebot blocked when running split tests

jeppeliisberg · January 21, 2020, 9:01am

it seems googlebot is blocked by robots.txt when I have split tests enabled. This happens when googlebot is routed to a branch deploy, where the robots.txt disallows everything. It makes sense to disallow indexing of branch deploys, but Googlebot should never be routed to anything but the master branch. split tests are really useful, but this behaviour is a major flaw!

fool · January 23, 2020, 10:55pm

We never use robots.txt, so this sounds like something your code is creating. We DO intend to disallow robots on deploy previews - that is, PR builds - but we use X-robots-tag: noindex for that. Could you let me know a branch deploy that shows this behavior so I can confirm my assertion?

jeppeliisberg · January 27, 2020, 10:29am

Hi Chris, thanks for looking into this - rather embarrassing - It was my own robots configuration that did this block - BUT I kind of expected that googlebot would always be routed to the master branch, which obviously is not the case, and I think that adds a bit of confusion to the consequences of running split tests with branch deploys.

Is this intended behaviour for branch deploys/split tests? I’d like to request that bots are always routed to the master branch deploy when running split tests.

fool · January 27, 2020, 2:07pm

If you see a way to improve the documentation around split tests to make this more obvious, please let us know and I’ll make sure the documentation team seems your suggestions.

That is how the feature is designed - we are literally serving different branches - so it is working the way we built it. We will not be changing that feature, since, well, people are intended to be able to set anything they want in each branch otherwise the feature is not very flexible

roburidge · October 13, 2023, 2:42am

Picking up on this again as this thread didn’t answer the question and I’m looking into hosting solutions for our marketing site. We would like to run a/b tests but it’s critical that the Google bot is always served the control version of a test. I can’t see any information in the docs about how bots are treated when traffic is split between variations in an a/b test.

Does Netlify already forward bots to the control or is there a solution we can apply that ensures a specific variation is always served to bots indexing our site?
@jeppeliisberg did you find a solution to this?

jeppeliisberg · October 13, 2023, 6:22am

Nope, I lived with it as is.

hrishikesh · October 15, 2023, 2:46pm

The recommended way to run Split Testing on Netlify now is using Edge Functions: Set up an AB Test | Edge Functions on Netlify (edge-functions-examples.netlify.app), where you can have more control over what you wish to do.

jeppeliisberg · October 23, 2023, 7:01am

@hrishikesh this comment seems contradictory to your docs at Split Testing | Netlify Docs - can you elaborate on how to use those bucket cookies in the example to set up a split test?

hrishikesh · October 23, 2023, 5:50pm

I’m not sure how that’s contradictory. The docs mention it’s a Beta feature, and I’m pointing you to a stable feature.

The example code has already been shared above. You can combine it with a rewrite: Edge Functions API | Netlify Docs

Something like:

if (parseFloat(bucketName) > 0.5) {
  return new URL(new URL(request.url).path, 'https://deploy-id--site-id.netlify.app')
} else {
  // same for the other deploy
}

Topic		Replies	Views
What are the SPECIFIC limitations for split testing with proxies Support proxying , split-testing	5	696	March 15, 2024
Split testing Under the Hood? Support split-testing	17	4091	July 30, 2023
Split testing with branches containing backslashes Support deployment , branch-deploys , split-testing	4	1285	February 20, 2020
Perform URL targeted split testing through new redirect options Features redirects , split-testing	1	889	June 29, 2021
Prevent google indexing dev/staging subdomain Support seo	12	4744	February 14, 2021

Googlebot blocked when running split tests

Related topics