• Resolved Hadar Cohen

    (@hadarhacohen)


    I am maintaining the following site for my customer, and have some questions about Google Search Console, “Not Indexed”. Currently I have 554 no indexed URLS, most before I started the maintenance activity.
    1. Google is detecting my login’s to WordPress as an admin, and list these longin’s in the “Excluded by ‘noindex’ tag” search results, for example:

    https://www.moshe-kaminim.co.il/wp-login.php?redirect_to=https://www.moshe-kaminim.co.il/?attachment_id%3D8274

    Why is that and how can I avoid it?

    2. Is there an easy way to purge all the 404 results from the results list, beside asking removal one by one?

    3. It looks like google is detecting some of the media images files, and mark them as “discovered not crawled”, for example:

    https://www.moshe-kaminim.co.il/category/chimneys_info/

    Is there a way to avoid scanning the media?
    And/Or – is there away to make the media as a valid result and not “discovered not crawled”?

    Many ThankS

    The page I need help with: [log in to see the link]

Viewing 2 replies - 1 through 2 (of 2 total)
  • @hadarhacohen Thanks for your message. We’ve identified several concerns and responded to each below:

    1. Google is detecting my login’s to WordPress as an admin, and list these longin’s in the “Excluded by ‘noindex’ tag” search results, for example:

    https://www.moshe-kaminim.co.il/wp-login.php?redirect_to=https://www.moshe-kaminim.co.il/?attachment_id%3D8274

    Why is that and how can I avoid it?

    Whereas the login page contains a ‘noindex’ tag, this page being found and crawled is being caused because Google is finding a link to the login page somewhere on your site. Googlebot by nature attempts to crawl a page that it finds a link to.

    Typically, the link causing it is one being added by your theme. Sometimes in the form of something like a widget that has a login link contained within. By no means is this the only thing that might cause a login link, but just the most common way we’ve seen them be accidentally added.

    2. Is there an easy way to purge all the 404 results from the results list, beside asking removal one by one?

    No, not really. Google does have other options for how to handle them which are described here.

    It’s really a good idea to examine each of the 404’s though, if it’s feasible. For example, an example of why might be if the 404 is coming from another site, but that the traffic itself is valuable and it should be addressed with a redirect to another specific page.

    3. It looks like google is detecting some of the media images files, and mark them as “discovered not crawled”, for example:

    https://www.moshe-kaminim.co.il/category/chimneys_info/

    Is there a way to avoid scanning the media?
    And/Or – is there away to make the media as a valid result and not “discovered not crawled”?

    The example URL you provided appears to be a media attachment page. In most cases, the setting for Yoast SEO>Search Appearance>Media for ‘Redirect attachment URLs to the attachment itself?’ should be set to ‘Yes’, which addresses these types of URLs by redirecting the page to the image.

    By default, WordPress will create attachment pages for media that you upload, which is essentially a page with only the content of the image. This is viewed by many search engines as being thin content.

    ​Unless you have a specific reason to set it otherwise, we would recommend ensuring that the attachment URL redirect setting is set to redirect to the attachment. To set attachment URLs to redirect to the attachment itself you would need to perform the following actions:

      • In WordPress go to Yoast SEO>Search Appearance>Media
      • Set ‘Redirect attachment URLs to the attachment itself?’ to Yes
      • Click ‘Save Changes’

    ​We also have more information on this in the following article: https://yoast.com/media-attachment-urls/

    Thread Starter Hadar Cohen

    (@hadarhacohen)

    Devnihil – Thanks for the reply,

    Regarding the 404 – I am trying to clean the list by inspecting each URL and submitting it to google for removal.
    For some cases, Google is saying the removal was approved. However, I still see the specific URL in the Google’s “Not Indexed” list.
    Should I expect Google to remove it from the list completely? When will this happened / what should I do to see the list actually become shorter and on one day disappear?

Viewing 2 replies - 1 through 2 (of 2 total)

The topic ‘Google Search Console “Not indexed” results’ is closed to new replies.