Skip to content

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

  • Unsolved

    crawl error

    Recently we start having random error messages about crawling issue:
    2024-08-30 edweek:Ok
    2024-08-29 marketbrief:Err. advertise: Err, edweek:Err, topschooljobs:Ok
    2024-08-23 edweek:Ok
    2024-08-22 marketbrief:Err. advertise: Err, edweek:Err
    2024-08-21 topschooljobs:Ok, edweek:Ok
    2024-08-15 marketbrief:Ok. advertise:OK
    2024-08-13 edweek:Ok
    2024-08-12 marketbrief:Ok
    2024-08-08 marketbrief:Ok, advertise:Ok
    2024-08-03 edweek:Ok, topschooljobs:Ok
    All for 2024-07 - are Ok Yesterday I set 2 more crawls for the same sites (edweek and marketbrief) and I get a morning email about original edweek site is ok (still have some problem but crawl occurs and all is fine) but for test crawl for the same site "EW Test" I just got error email.
    Also I suppressed ALL email communications and frankly surprised by this email. Can you please check what is wrong with a crawler or stat collection or I don't know who produced the issues.

    Product Support | | DTashjian
    0
  • Unsolved

    robots.txt crawl error

    I'm trying to setup a campaign for jessicamoraninteriors.com and I keep getting messages that Moz can't crawl the site because it can't access the robots.txt. Not sure why, other crawlers don't seem to have a problem and I can access the robots.txt file from my browser. For some additional info, it's a SquareSpace site and my DNS is handled through Cloudflare. Here's the contents of my robots.txt file: # Squarespace Robots Txt User-agent: GPTBot User-agent: ChatGPT-User User-agent: CCBot User-agent: anthropic-ai User-agent: Google-Extended User-agent: FacebookBot User-agent: Claude-Web User-agent: cohere-ai User-agent: PerplexityBot User-agent: Applebot-Extended User-agent: AdsBot-Google User-agent: AdsBot-Google-Mobile User-agent: AdsBot-Google-Mobile-Apps User-agent: * Disallow: /config Disallow: /search Disallow: /account$ Disallow: /account/ Disallow: /commerce/digital-download/ Disallow: /api/ Allow: /api/ui-extensions/ Disallow: /static/ Disallow:/*?author=* Disallow:/*&author=* Disallow:/*?tag=* Disallow:/*&tag=* Disallow:/*?month=* Disallow:/*&month=* Disallow:/*?view=* Disallow:/*&view=* Disallow:/*?format=json Disallow:/*&format=json Disallow:/*?format=page-context Disallow:/*&format=page-context Disallow:/*?format=main-content Disallow:/*&format=main-content Disallow:/*?format=json-pretty Disallow:/*&format=json-pretty Disallow:/*?format=ical Disallow:/*&format=ical Disallow:/*?reversePaginate=* Disallow:/*&reversePaginate=* Any ideas?

    Getting Started | | andrewrench
    0
  • Unsolved

    crawl error crawling crawl

    Hello,
    I don't understand why MOZ crawl only the homepage of our webiste https://www.modelos-de-curriculum.com We add the website correctly, and we asked for crawling all the pages. But the tool find only the homepage. Why? We are testing the tool before to suscribe. But we need to be sure that the tool is working for our website. If you can please help us.

    Product Support | | Azurius
    0
  • Unsolved

    crawl error

    Hi Moz crawler keep failing on my site with the error showing : Our crawler was banned by a page on your site, either through your robots.txt, the X-Robots-Tag HTTP header, or the meta robots tag. I'm not sure what am I missing out.. this is my robots.txt.. i don't think Im missing anything else.. https://www.wearefutureheads.com/robots.txt can the support team help ?

    Moz Pro | | teikh
    0
  • Unsolved

    crawl error robots.txt

    Hi all, im facing an issue where moz crawler is unable to crawl my site. The following error keeps showing Our crawler was banned by a page on your site, either through your robots.txt, the X-Robots-Tag HTTP header, or the meta robots tag. This is my robots.txt file : https://www.wearefutureheads.com/robots.txt I'm not sure what else am I missing.. can anyone help

    Product Support | | teikh
    0
  • Unsolved

    crawl error url issue

    I recently recieved a slew of content crawl issues via Moz for URL's that I have never seen before For example:
    Standard URL: https://skilldirector.com/news,
    Newly identified URL: https://skilldirector.com/news?offset=1469542207800&category=Competency+Management). Does anyone know where the URL comes from and how to fix it?

    Moz Pro | | HannahPalamara
    0
  • Unsolved

    roger-bot crawl error

    Hi, We're trying to get MOZ to crawl our site, but when we Create Your Campaign we get the error:
    Ooops. Our crawlers are unable to access that URL - please check to make sure it is correct. If the issue persists, check out this article for further help. robot.txt is fine and we actually see cloudflare is blocking it with block fight mode. We've added in some rules to allow rogerbot but these seem to be getting ignored. If we use a robot.txt test tool (https://technicalseo.com/tools/robots-txt/) with rogerbot as the user agent this get through fine and we can see our rule has allowed it. When viewing the cloudflare activity log (attached) it seems the Create Your Campaign is trying to crawl the site with the user agent as simply set as rogerbot 1.2 but the robot.txt testing tool uses the full user agent string rogerbot/1.0 (http://moz.com/help/pro/what-is-rogerbot-, rogerbot-crawler+shiny@moz.com) albeit it's version 1.0. So seems as if cloudflare doesn't like the simple user agent. So is it correct the when MOZ is trying to crawl the site it uses the simple string of just rogerbot 1.2 now ? Thanks
    Ben Cloudflare activity log, showing differences in user agent strings
    2022-07-01_13-05-59.png

    Moz Pro | | BB_NPG
    0

  • 403 errors crawl error

    My wordpress website has 162 crawl 403 errors. Based on what I read it means that the server is denying crawlers to access the pages. The pages itself will load so guessing it's just an issue with crawlers only. How do I go about fixing this issue?

    On-Page Optimization | | emrekeserr3
    0
  • Unsolved

    crawl error crawl errors

    Moz is being blocked from crawling the following site - https://www.cleanchain.com. When looking at Robot.txt, the following is disallowing access but don't know whether this is preventing Moz from crawling too? User-agent: *
    Disallow: /adeci/
    Disallow: /core/
    Disallow: /connectors/
    Disallow: /assets/components/ Could something else be preventing the crawl?

    Moz Pro | | danhart2020
    0
  • Unsolved

    crawl error crawl in progress crawl stalled

    The latest crawl on my site was the 4th Jan with a current crawl 'in progress'. How do i cancel this crawl and start a new one? I've been getting keyword ranking etc but no new issues are coming through. Screenshot 2022-05-31 083642.jpg

    Moz Tools | | ClaireU
    0
  • Unsolved

    25 404 error crawl error

    Hi Community, has anyone else had a 404 error reported by Moz, where the end of the domain is /%25s? The error comes from my blog home page https://kaydee.net/blog/ But when I look at the source code, I can't see anything that has a space at the end of the URL. I wonder if it is to do with the WordPress search? Thanks in advance for any insight.

    Moz Pro | | kaydeeweb
    0
  • Unsolved

    performance metrics crawl error

    I am getting an error:
    Crawl Error for mobile & desktop page crawl - The page returned a 4xx; Lighthouse could not analyze this page.
    I have Lighthouse whitelisted, is there any other site I need to whitelist? Anything else I need to do in Cloudflare or Datadome to allow this tool to work?

    Product Support | | bhsiao 0
    1

Looks like your connection to Moz was lost, please wait while we try to reconnect.