• Resolved Timmmy

    (@timherinckx)


    Hi all,
    I’ve noticed in my server logs that the page cache preload fetches the wrong URLs for the sitemaps listed in the sitemap index.

    Sitemap.xml is fetched correctly, but the sitemaps it contains are fetched as /%3C!%5BCDATA%5Bxxxx%5D%5D%3E, where xxxx is the intended URL but with a single slash after https:. It seems like an issue with relative/absolute paths.
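
For readers following along, the logged path is easier to read once it is percent-decoded. A minimal sketch using only Python's standard library; `xxxx` is the same placeholder used in the post, not a real URL:

```python
from urllib.parse import unquote

# Path as it appears in the access log; "xxxx" stands in for the real URL.
logged_path = "/%3C!%5BCDATA%5Bxxxx%5D%5D%3E"

# Percent-decoding reveals a literal XML CDATA wrapper around the URL.
decoded = unquote(logged_path)
print(decoded)  # /<![CDATA[xxxx]]>
```

The decoded form shows that the request path carries the `<![CDATA[ ... ]]>` markup itself, not just the URL inside it.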

    Any ideas?

    Thanks in advance!


Viewing 5 replies - 1 through 5 (of 5 total)
  • Plugin Contributor Marko Vasiljevic

    (@vmarko)

    Hello @timherinckx

    Thank you for reaching out and I am happy to help!

    I’ve checked the sitemap and the website, but I cannot visit the homepage, as it always redirects me to the sitemap URL https://uniekegeboortekaartjes.be/product_cat-sitemap.xm

    Can you please let me know if you are experiencing the same thing, and whether the issue persists if you temporarily disable W3TC?

    Thanks!

    Thread Starter Timmmy

    (@timherinckx)

    Hi @vmarko,

    Thanks for the quick reply!
    Yeah, that was a mistake. I was experimenting with redirection as a workaround, but one of the failed attempts got cached. It should be fixed by now.

    The issue still persists. If I disable cache preload in W3TC, the entries no longer show up in the logs. If I change the time between checks, the change is clearly reflected in the logs. So I’m quite sure it’s the W3TC page cache preload functionality that is causing these requests.

    Kind regards,
    Timmmy

    Plugin Contributor Marko Vasiljevic

    (@vmarko)

    Hello @timherinckx

    Can you please share a screenshot of this? I am not sure how and where this is occurring.

    Thanks!

    I’m having the same issue. Here’s the log with the IP and URL hidden. The plugin keeps scanning the first page of the sitemap (the index) and never opens the sitemaps listed in it.
    Also, you can see that the plugin requests https:/www… and then requests https://www… again.

    121.XXX.XXX.113 - - [03/Apr/2024:15:43:33 +0800] "GET /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/product-sitemap.xml%5D%5D%3E HTTP/1.1" 404 28323 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:34 +0800] "GET /%3C!%5BCDATA%5Bhttps://www.XXXX.com/category-sitemap.xml%5D%5D%3E HTTP/1.1" 301 5 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:36 +0800] "GET /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/category-sitemap.xml%5D%5D%3E HTTP/1.1" 404 28323 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:37 +0800] "GET /%3C!%5BCDATA%5Bhttps://www.XXXX.com/post_tag-sitemap.xml%5D%5D%3E HTTP/1.1" 301 5 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:39 +0800] "GET /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/post_tag-sitemap.xml%5D%5D%3E HTTP/1.1" 404 28332 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:39 +0800] "GET /%3C!%5BCDATA%5Bhttps://www.XXXX.com/product_cat-sitemap.xml%5D%5D%3E HTTP/1.1" 301 5 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:41 +0800] "GET /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/product_cat-sitemap.xml%5D%5D%3E HTTP/1.1" 404 28333 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:42 +0800] "GET /%3C!%5BCDATA%5Bhttps://www.XXXX.com/product_tag-sitemap.xml%5D%5D%3E HTTP/1.1" 301 5 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:44 +0800] "GET /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/product_tag-sitemap.xml%5D%5D%3E HTTP/1.1" 404 28344 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:45 +0800] "GET /%3C!%5BCDATA%5Bhttps://www.XXXX.com/yith_product_brand-sitemap.xml%5D%5D%3E HTTP/1.1" 301 5 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:47 +0800] "GET /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/yith_product_brand-sitemap.xml%5D%5D%3E HTTP/1.1" 404 28335 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:47 +0800] "GET /%3C!%5BCDATA%5Bhttps://www.XXXX.com/discontinued-sitemap.xml%5D%5D%3E HTTP/1.1" 301 5 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:49 +0800] "GET /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/discontinued-sitemap.xml%5D%5D%3E HTTP/1.1" 404 28336 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:50 +0800] "GET /%3C!%5BCDATA%5Bhttps://www.XXXX.com/application-industry-sitemap.xml%5D%5D%3E HTTP/1.1" 301 5 "-" "W3 Total Cache"
    121.XXX.XXX.113 - - [03/Apr/2024:15:43:52 +0800] "GET /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/application-industry-sitemap.xml%5D%5D%3E HTTP/1.1" 404 28334 "-" "W3 Total Cache"
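
The log alternates between a 301 for the `https://` form and a 404 for the `https:/` form. One plausible reading, which this thread does not confirm, is that something on the server side normalizes the request path by collapsing duplicate slashes and redirects to the result. A small sketch of that assumed normalization (`collapse_slashes` is a hypothetical helper, not a W3TC or WordPress function):

```python
import re


def collapse_slashes(path: str) -> str:
    """Collapse runs of '/' into a single '/', as a path normalizer might (assumption)."""
    return re.sub(r"/{2,}", "/", path)


# Request path taken from one of the 301 lines in the log above.
requested = "/%3C!%5BCDATA%5Bhttps://www.XXXX.com/category-sitemap.xml%5D%5D%3E"

normalized = collapse_slashes(requested)
print(normalized)
# /%3C!%5BCDATA%5Bhttps:/www.XXXX.com/category-sitemap.xml%5D%5D%3E
```

Under this assumption, the single-slash URL would be a side effect of the redirect rather than a second bug. The underlying problem would still be the CDATA wrapper ending up in the requested path.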
    • This reply was modified 2 years, 2 months ago by xu wu.

    Looking at my XML file, the sitemap format generated by the AIOSEO plugin is as follows. It looks like your plugin reads the CDATA wrapper as part of the link and requests it. Can this be fixed?

    <sitemap>
    <loc><![CDATA[https://www.XXXX.com/post-sitemap.xml]]></loc>
    <lastmod><![CDATA[2024-03-20T10:43:44+00:00]]></lastmod>
    </sitemap>
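
To illustrate the suspected cause, here is a hedged sketch contrasting a naive text match with an XML-aware parse of the snippet above. How W3TC actually extracts sitemap URLs is not shown in this thread, so the regex is only a stand-in that reproduces the symptom. The `<sitemapindex>` wrapper and namespace are assumed from the standard sitemap protocol:

```python
import re
import xml.etree.ElementTree as ET

# The <sitemap> entry from the post, wrapped in an assumed <sitemapindex> root.
SITEMAP_INDEX = """<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<sitemap>
<loc><![CDATA[https://www.XXXX.com/post-sitemap.xml]]></loc>
<lastmod><![CDATA[2024-03-20T10:43:44+00:00]]></lastmod>
</sitemap>
</sitemapindex>"""

# Naive extraction: grabs everything between <loc> and </loc>, CDATA markup included.
naive = re.findall(r"<loc>(.*?)</loc>", SITEMAP_INDEX)

# XML-aware extraction: the parser resolves CDATA sections to plain text.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}
root = ET.fromstring(SITEMAP_INDEX)
parsed = [loc.text.strip() for loc in root.findall("sm:sitemap/sm:loc", NS)]

print(naive)   # ['<![CDATA[https://www.XXXX.com/post-sitemap.xml]]>']
print(parsed)  # ['https://www.XXXX.com/post-sitemap.xml']
```

The naive result, once percent-encoded and treated as a relative path, would look like the requests in the log. The parsed result is the URL the sitemap author intended.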
    • This reply was modified 2 years, 2 months ago by xu wu.

The topic ‘Page cache pre-load – wrong urls fetched’ is closed to new replies.