I suspect the normal crawl errors work fine for most issues but the news specific issues are gone. As I covered at Search Engine Land weeks ago, those specific errors included these errors Article disproportionately short, Article fragmented, Article too long, Article too short, Date not found, Date too old, Empty article, Extraction failed, No sentences found, Off-site redirect, Page too large, Title not allowed, Title not found, Uncompression failed, and Unsupported content type. This was also removed from the Google help document. … [Read more...] about Google Search Console Crawl Errors For News Publishers Gone
Ability to download all crawl error sources. Previously, you could download a CSV file that listed URLs that returned an error along with the pages that linked to those URLs. You could then sort that CSV by linking source to find broken links within your site and had an easy list of sites to contact to fix links to important pages of your site. Now, the only way to access this information is to click on an individual URL to view its details, then click the Linked From tab. There seems to be no way to download this data, even at the individual URL level. (Update 3/17/12: This detail is still available from the API-based crawl errors feed.) 100K URLs of each type. Previously, you could download up to 100,000 URLs with each type of error. Now, both the display and download are limited to 1,000. Google says “less is more” and “there was no realistic way to view all 100,000 errors—no way to sort, search, or mark your progress.” Google is wrong. There were … [Read more...] about Google Webmaster Tools Revamps Crawl Errors, But Is It For The Better?
We've been tracking the big AMP crawl error and AMP issues in search over the past week. Yesterday Google confirmed it was a bug on their end and said they would fix it. Today they documented the issue on their data anomalies page and RankRanger is showing AMP content coming back in the search results this morning. … [Read more...] about Google AMP Crawl Error Bug Fix Is Here
When the sitemaps.org alliance was first announced, not everyone was sure that Microsoft was truly on board to build a relationship between their search engine and site owners. However, since then, they’ve begun processing Sitemaps, launched a webmaster portal, and now added several substantial features. In contrast, Yahoo!’s interest in this relationship seems to have waned as they’ve turned their attention to (arguably equally important) support for developers with SearchMonkey and BOSS. (They haven’t launched new material features for Site Explorer in a year). Google still leads the way, with a fairly robust toolset and regular blogging, responsiveness in their discussion forums, and conference appearances, but Microsoft’s latest launch shows that they’re interested in strengthening their relationship with site owners as well. … [Read more...] about Live Search Webmaster Center Gains Crawl Error And Backlinks Reports
Google's John Mueller said on Twitter that serving too many 5xx errors, the internal server types of errors, can cause Google to slow the crawling of your web site. If you are seeing an increase in crawl errors, that is not a huge concern, but if you are serving up tons of 5xx errors that means something is significantly wrong with your web site and Google doesn't want to pound it with crawls. … [Read more...] about Google: Many 5xx Errors Can Slow Crawling