Does Having a Large Google Index Equate to the Certain Success of Your Website or Blog?

We are all looking to find favor with the Google Gods, right?  The article below will teach you how find your Google Index and will outline the metrics that are truly important in evaluating your success on the web. 

What?
Whoa, let’s back up just a step - what is Google’s index anyway?  Google’s index is all of the unique pages on all of the websites and blogs out there that Google knows about and has explored (with Googlebot or its "web spiders").  Would you believe that there are 1 trillion unique URLs out there?  And, that’s only the URLs that Google has found so far. 

The Right Questions
So why do we care about Google’s index?  Good question.  Finding out the number of pages that Google knows about and has spidered on your own sites will you how many pages Google has indexed for you.  Unfortunately that is all it tells you.  While this is good to know - the number of pages you have indexed is no magic number by any means.  Does having more pages indexed equate to Google recommending those sites to searchers more often?  While perhaps this seems logical the answer in my opinion is likely not.  A case could certainly be made for the opposite viewpoint, but to my knowledge there is no evidential proof for this position.  It is important to note that not being found in the index at all is not good.  This means Google does not know your site exists and therefore cannot recommend you to searchers.  If you have a concern about this, please visit Google’s Help Center on the subject.   

Let’s Be Logical
Think about the following scenario… Website A contains only 10 pages in total.  Website B on the other hand contains 1,000 pages in total.  It follows that website B will have many more pages in Google’s index simply because more pages exist right?  Right.  Now let’s talk about what those two websites contain.  Website A contains 10 comprehensive pages that are updated regularly about protected sea turtles in the Sea of Cortez in Mexico.  Website B contains one page about the same subject that has not changed in over a year and 999 pages about real estate, travel, vacationing, recreating, rentals, airports, govt policy and everything else you might want to know about Mexico.  Website A is a well respected educational resource and is linked to often, website B is not.  Despite website Bs sheer volume of additional pages which site do you think Google will recommend for those searching for information about endangered sea turtles in the Sea of Cortez?  The site with lots of pages and little relevant information or the site with few pages and lots of valuable and current information?

Quality Not Quantity
Finding Google’s favor has a lot less to do with how many pages are in your index (and thus existing in your website or blog) and more to do with how good those pages actually are.  Are they fresh, on topic, and chocked full of value?  Excellent - then you have a good chance of getting them recommended.  While having a large Google index might feel like something to brag about - take a step back and ask yourself if those pages are bringing you a return of value.  This is a game of quality, not necessarily quantity.  Just because you have lots of pages indexed for a particular site does not automatically mean the site is any more valuable to you than your other sites with fewer pages in the index.  I will say however that if you have lots of good quality content, more seems to be more in the game of search.  Quality first though - always! 

Tread Carefully
Be wary of sales tactics that use a scenario where a large index = smashing success.  Many systems can auto generate lots and lots of pages that offer little human value and therefore won’t add to helping you get recommended for the keywords and keyphrases that are important to your business.  The real metrics you should be concerned with are much more tangible: how many visitors does your site get, how many terms are the search engines recommending you for, what pages within your site are getting you noticed, how many leads is your site generating, how sticky is the site (how long do visitors stay), etc.  These are the metrics that will help you measure the business success of your site.  Ask yourself where the real value is - in tangible results, or in subjective statistics?

Finding Your Index
In the event any of you would like to know what your index looks like for your website or blog (it is nice to know - you may find pages on your sites that are old and need to be deleted or cleaned up, you may find that your meta data needs some work or you may find that you don’t exist - yikes!), you can find out by doing the following:

  • Go to Google
  • In the search field enter site:url.com where the url is your blog or website

There are a few specifics you should know about, for instance (also - no spaces, important!):

  • site:abc.com will include every page on abc.com including anything in any of your subdomains (such as blog.abc.com)
  • site:www.abc.com will include only those pages on your www
  • site:blog.abc.com will include only those pages on this subdomain

Here is a bit more information about advanced operators from Google for those of you wishing to learn more.

Most importantly - it’s not an exact science by any means!  Google does the best it can to give you the facts and they don’t always add up exactly.  That’s right, nobody is truly perfect 


You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.

AddThis Social Bookmark Button

7 Responses to “Does Having a Large Google Index Equate to the Certain Success of Your Website or Blog?”

  1. Bill French Says:

    While quality typically trumps quantity (no disagreement there), it’s important that we recognize the debate was never [solely] about quantity; at its root, it is about thoroughness, and at a higher level, it’s about size because without size, you can’t achieve a sustainable marketing model for leveraging the long tail. Having a thorough and complete representation of all your content in Google is far better than having only a fraction of your content in Google, regardless of whether the pages are focused on whatever axis your subject expertise happens to align with.

    “While having a large Google index might feel like something to brag about - take a step back and ask yourself if those pages are bringing you a return of value.” - Kinetic Knowledge Author

    There’s one thing that’s certain about this statement - if content cannot be found, it certainly won’t provide any value to you or your prospects. ;-) But I have to ask - if you have pages that you’ve published, and they aren’t in the search indices, why did you publish them? Furthermore, you don’t get to decide with ones get indexed and which ones get passed over - what if many of the pages that aren’t indexed are key value pages that you want prospects and future customer to find?

    Unfortunately, the writer of this article bypasses any discussion of (or requirement for) creating a sustainable marketing model that depends on the shear number of posts that [each day] capture a few long-tail hits on every post, thus creating a continuous flow of new people to your brand. If you’re blogging to reach the long tail, by definition, you must have lots of ways to reach the many markets-of-few in a way that will sustain the effort for doing it in the first place. A small index footprint is unable to achieve this - a large footprint makes it possible, but not a certainty. Suggesting that it’s okay to not achieve a thorough and sizeable index footprint dismisses the guidance (outlined here - http://blogsite.com/public/blog/198211) from an array of SEO experts that have studied this exact requirement.

    The writer of this post also fails to embrace a growing and serious implication of poor indexing performance - custom search. Google’s CSE (custom search engine) solution depends on your pages being in the Google index, otherwise, like Google [worldwide], your own search engine will always be an incomplete version of reality. When visitors use your CSE, they expect that it contains 100% of everything you’ve published. If Google (itself) is unable (or unwilling) to index 30% or 40% of your content, your search engine will provide a fraction of the benefit. Perhaps one of the pages that aren’t indexed is the difference between getting a new customer and not.

    In my view, I don’t think it’s wise to advise anyone that anything but comprehensive and thorough indexing is a key business objective. Indeed, as the author concludes, nothing is perfect, but tolerating anything less than what’s possible and reasonably feasible, is simply an excuse for not improving the SEO craft.

  2. Kristen Veraldi Says:

    Hi Bill,
    Thanks so much for taking the time to add your valuable input. These are just my opinions - I certainly support multi-faceted views about the subject. My objective here is not to overlook the value of having pages indexed by Google at all. It’s quite clear that having quality content indexed is a healthy phenomena and certainly something to strive for as a business objective.

    The only point to the lay person is to make it clear that this is not an end all be all for measuring the business success of a site. In fact, I believe it to be a very poor measure of overall success. The truly valuable measures, as suggested above are those that will help an individual get closer to measuring a return on their investment. Pages indexed is simply a starting place to give you an idea of possibilities. I’ve seen many sites with realtively small indexes competing well above sites with comparatively large indexes in terms of actual recommendations served out by Google. Generating leads and sales is the name of the game. And of course this is only one viewpoint in the vast sea of web space out there.

  3. Real Estate Blogging Success Story: Santa Clarita Realtor®

    http://realestateblogsites.com/2008/07/14/real-estate-blogging-success-story-santa-clarita-realtor%c2%ae

    Real Estate Blogging Success Story: Two months, 25 posts, 1st page in Google and new business for Phoenix AZ Realtor®

    http://realestateblogsites.com/2008/07/22/real-estate-blogging-success-phoenix-az-realtor%c2%ae

  4. Kristen:

    Thanks for allowing this opportunity to add some thought on this subject that I find so passionate. ;-)

    >>> Generating leads and sales is the name of the game. <<<

    I don’t disagree that for any business to function and create a sustainable revenue model, it needs customers, and leads (by definition) “lead” to customers. However, the purpose of a business blog solution is to compete for attention, and one requirement for attention is to create a lot of ways to grab attention. Tolerating a plan that grabs only fraction of the possible ways to grab attention can be justified on many dimensions - too costly, not enough time, lack of skills — these are a few good reasons to lower your indexing performance expectations.

    Making the decision to tolerate constrained indexing [however] should not be based on the reasoning you put forth (i.e., sites with small index footprints [sometimes] work well too). This is [at best], a tenuous justification given the fundamental reason why businesses typically decide to participate in the conversational web. They choose this path to garner attention in non-traditional ways. Suggesting that (a) it’s not necessary, or (b) it still might work - are each inappropriate business drivers.

    As you point out (and also not in debate), creating lots of attention that is directed to one or a few [un-indexed or un-findable] page(s) is possible. However, it also requires a completely different (and a highly proactive) marketing approach that is only achievable through advertising mediums that are unlike blogging and more like traditional marketing. In any case, this approach is typically far more expensive than business blogging alternatives, even expensive ones like MyST Blogsite.

    Suggesting that it’s ok to embark upon a “conversational” advertorial strategy that doesn’t get all your pages indexed, is not a good strategy at all. Indeed, why publish knowledge about anything if it cannot be found in search engines?

    Consider this example —

    If you have something important to say at a large gathering, would you want to whisper your message into the crowd, or shout the message so everyone can hear it, or broadcast it so they may all consider your viewpoint? Business blogging is specifically about this metaphor.

    Some real data …

    In a microcosm we can easily perform empirical tests to understand this subject more deeply. Ironically, you reference a common customer (Artur Ciesielski) and the blog ArizonaApartmentInvestor.com. Artur is a prolific writer with excellent and deep insight in the Phoenix real estate market. In addition to ArizonaApartmentInvestor.com, he also has another domain (RealestateConfluence.com), another Wordpress-based site by Real Estate Tomato. This relationship is far more interesting for two reasons – (i) he also has MyST Blogsite™ (PhoenixMarketTrends.com) and an integrated multi-domain search system (MyST VS™). Setting aside all of the vendor-competitive angles we could discuss here ;-) as well as the fact that some of these domains are very new, let’s look only at the data related to index footprint and attention opportunities. Google shows the following index footprints for each of the domains (at this moment):

    - ArizonaApartmentInvestor.com: 71 pages
    - RealestateConfluence.com: 234 pages
    - PhoenixMarketTrends.com: 4,470 pages

    If we then look at the search engine traffic from Artur’s MyST/VS™ system we can glean a number of patterns. Without disclosing confidential data, I’ll speak only in terms of percentages and at a very abstract level.

    ArizonaApartmentInvestor.com represents about 1.5% of the total pages in Artur’s findability arsenal (an aggregate impression in Google of about 4,775 “places” that have been indexed). Even with a collection of biased visitors using Artur’s MyST/VS™ search solution, recommendations to ArizonaApartmentInvestor.com from that search engine are rarely higher than 4% of the total traffic. In contrast, the search-generated traffic to RealestateConfluence.com, is generally three to five times higher. This is predictable since the number of indexed pages is a little more than three times higher.

    This is just one data point and there are always additional biases that must be considered because the sites are probably designed to serve different audiences. However, having many (or few) opportunities to be found (even in a microcosm such as this) clearly has an impact on traffic patterns.

    Personally, I can’t find any logical reason to spend the grueling effort to write something great with the risk that it not be indexed; search engine marketing is simply too crucial for small businesses. Justifying the risk of writing great copy which ostensibly lay dormant in a desk drawer, and occasionally discovered only when a visitor happens to open the right drawer, is not an ideal search marketing strategy.

    In my view, investing in a content strategy requires two fundamental tasks - (i) writing the content, and (ii) making prospects aware of the content. As such, I think businesses do (and should) think this way about their brand messaging and how they invest in their content.

    Cheers! –bf

  5. Bill,

    Thanks for your view.

    Does it make sense to compare two blogs for potential pages and topics indexed when the blogsite is so much older with so much more published content?

    Per Artur’s comment on the wordpress blog, “ArizonaApartmentInvestor.com has been up less then 2 months and it’s still being completed; so far the results are page one placement for many of our target terms, direct contact from readers and 3 ready and qualified buyers. I like those numbers.”

    You believe the blogsite is working well and no argument here, but the Wordpress Blog seems to have generated some business so it’s working too.

    Chris

  6. Bill French Says:

    “Does it make sense to compare two blogs for potential pages and topics indexed when the blogsite is so much older with so much more published content?”

    Not really, and that’s why I specifically refrained from comparing Artur’s long-standing blogsite to his really new (still fresh paint) Arizona Apartment Investor blog with Kinetic Knowledge. It would be like comparing apples to fermented wine. ;-)

    Instead (as my comments clearly say), I compared the really new blog with Real Estate Tomato to your service in terms of index size and the traffic each attracts based on respective page index penetration. What the data shows is that traffic generated from Artur’s MyST/VS search to his Kinetic site is about 1/3 that which his Tomato site generates. This is approximately the same ratio of index footprint distributions between these two sites; hence there’s a correlation and likely causal relationship between index penetration and traffic. If the Kinetic-produced blog were to expand to say three times the footprint of the Tomato site, I would expect the search traffic from Artur’s MyST/VS solution to mimic that shift in indexed content.

    While the anecdotal evidence that his Kinetic blog certainly positive, the debate in this thread is about justifying small index footprints, and I don’t see any evidence to support a position that weak indexing is still okay.

    bf

  7. Your debate, not ours.

    Seems what you call weak indexing is working quite nicely. We’re just happy he’s got competitive indexing for important topics and new business, despite any conclusions or debate anyone might pose.

    Thanks ;-)

Leave a Reply