Automated visual inspection
Visual search engines will be at the center of the next phase of evolution for the search industry, with Pinterest, Google, and Bing all announcing major developments recently. How do they stack up today, and who looks best placed to offer the best visual search experience? Historically, the input-output relationship in search has been dominated by text. Even as the outputs have become more varied (video and image results, for example), the inputs have been text-based. This has restricted and shaped the potential of search engines, as they try to extract more contextual meaning from a relatively static data set of keywords. Visual search engines are redefining the limits of our language, opening up a new avenue of communication between people and computers. If we view language as a fluid system of signs and symbols, rather than fixed set of spoken or written words, we arrive at a much more compelling and profound picture of the future of search. Our culture is visual, a fact … [Read more...] about Pinterest, Google or Bing: Who has the best visual search engine?
But, more problematically, it also means it’s becoming impossible to diagnose why a website has dropped in rankings. Google ultimately does not want us to understand how its ranking algorithm works, because there will always be people who manipulate it. We now suspect that core signals are rolled out so slowly that SEOs won’t even realise when Penguin or Panda has refreshed. For this reason, it makes it crucial that we understand how well our website is performing at all times. This blog post is intended to show you how to do a comprehensive content audit at scale, in order to find any gaps which may lead to rankings penalties. Essentially, there are five types of content gaps a website may suffer from. I’ll explain each one, and show you how you can find every instance of it occurring on your website FAST. 1. Internally duplicated content Internally duplicated content is the daddy of content gaps. Duplicating optimised content across multiple pages will cause … [Read more...] about Does your website suffer from one of these five content gaps?
A patent granted to Google today explores Web spam and the manipulation of documents and links on the Web. It describes how the rankings of pages may be influenced if they are identified as "manipulative." The identification of manipulative documents, how they might be grouped together, and how they could be treated by the search engine is described in some detail. That treatment might include removal of pages from the search index, reductions in rankings for pages, and possibly a change in how quality scores (PageRank) are calculated for links from manipulative pages. The patent was filed almost 4 years ago, on December 10, 2003, and wasn't granted until today. A good number of papers and patent applications have been published since then on Web spam, and have explored more detailed approaches, but this patent is interesting in that captures some aspects of how Google may have been detecting and fighting Web spam over the past few years (and may still be). Here's the patent: … [Read more...] about Google Patent on Web Spam, Doorway Pages, and Manipulative Articles
Yesterday I noticed a very large number of new patents listed in the USPTO assignment records for Google from IBM, and made note of them in a post, Google Acquires Over 1,000 IBM Patents in July. I didn’t expect or anticipate the interest that my post would stir up, though I probably should have, given what seems to be an increased amount of litigation directed at Google involving patent infringement claims, with Apple taking on HTC and Google, Oracle and Google disputing use of Java in Android, Purple Leaf taking exception to Checkout, and other suits. Given the interest in the IBM patents in a number of places on the web and some conversations I had, I thought it might be a good idea to provide the list of patents that Google acquired earlier this month. Google acquired a number of additional patents from IBM earlier this year and last year as well. I included those in my February post, Google Patents, Updated and Google Self Driving Cars Get Jumpstart from IBM Patents. In … [Read more...] about Google’s New Patents from IBM