Archive for December, 2009


The typical question that gives many SEO professionals a bit of a gut-check is, “How long will it take for you to show me results?” Typically, the answer will vary based on the age of the site in question and the level of competition for high volume keywords, if boiling it down to two major factors.

However, there are considerably more elements to a fully-fledged answer. These are also split into new vs. redesigned or reformatted (URL) Web site.

When dealing with a brand new domain, it will likely be difficult to gain the necessary “trust” for rankings for broad competitive terms for at least three months, but often closer to six to nine months. This isn’t always as clear cut — if you have and continue to put out useful content, and if you’re fortunate to benefit immediately from authoritative and consistent links (or are willing to cheat), you could see results sooner.

Of course, if your technical infrastructure doesn’t support good crawling of your pages and understanding of your content, you’ll be dead in the water before you even start.

When dealing with either redesigns or post-launch SEO, the best thing to do is never give, or accept, a response “from the cuff.” Setting unrealistic expectations will only lead to disappointment. Not only should people take time to project the best and worst possible scenarios, but these projections should be updated post-launch at least once, as crawling and indexing patterns get established.

How Quickly Do Search Engines Crawl/Index a Redesigned Site?

Recently, we helped relaunch a fairly large site with thousands of pages. A major directory level was gone, and it wasn’t possible to implement all redirects prior to launch.

Within 12 hours, Google had indexed and was returning well over 1,000 of the new pages in its results. We started seeing a few dozen new pages in each directory level off the root that we were monitoring, within about four hours after launch. I can’t claim as dominant of a performance by Yahoo, and look forward to our first Bing-monitoring, which has to be better than the last.

The catch to this process is that, at the same time, the search engines are ridding themselves of any no-longer used URLs, through new robots.txt files, 404 errors, and in a pattern that appears to indicate an “as we can” programming attitude. They will also continue to index those with rankings, fortunately.

If the site relies on a lot of home page rankings for top keywords, then a redesign usually won’t hurt, even if the new home page URL now resolves to the root instead of a prior directory-appended version, based on a number of sites.

How Difficult is it to Maintain Current Rankings After Launch?

For new domains, brand terms should rank nearly immediately after the first crawl — maybe not first however, as Rosetta experienced after launching and finding Apple and the language software ahead. A unique brand name should have relatively no problem ranking number one very soon.

Benchmarking is crucial to understanding how post-redesign crawling behavior is treating a Web site. Monitoring “cherry” keywords, such as the top 10 to 50-plus branded and non-branded referring terms, and measuring them by search engine, can give you insight into this.

This is also an important high-level indicator of how much the search engines initially trust your new domain structure. It happens a lot quicker when you’re shifting URLs on the same root domain, than when you’re moving to a new one.

How is Traffic Affected After Launch?

The classic answer to this question is often unfortunately accurate. There’s always going to be at least a slight drop in traffic, even with SEO.

On a percentage basis, this can equal dozens, or even hundreds of traffic-driving rankings. The better you do with maintaining your high-volume keywords, the more likely the dip may be being caused by other factors as well, including an unexpected or seasonal dip in branded traffic.

The best ally in monitoring post-launch success rate is, of course, analytics. As long as it’s been properly configured, very important data can be gleaned from 404 reports and high-bounce pages or referrers. Google Webmaster Tools and Yahoo Site Explorer are important allies in the quest for new or continued rankings.

Lastly, level of implementation is probably the single worst enemy to SEO success. Lack of SEO implementation is so dangerous because even what seems like the smallest de-prioritization during a launch cycle can often be a very big wrench in the spokes.

In a partnership with an SEO provider, or in dealing with your own SEO team, you have to remember the “fear of flying” analogy. If you’re afraid of flying, simply ask your captain if he’s planning to get you there safely. Chances are very likely that any partner you entered into an arrangement with, and even more so an internal SEO team, is just as concerned about ranking and performance as you are.

For a new site, the first goal should be to rank for brand and potentially some local or other modified long tail searches. Give the process time and continued attention, and rankings should be visibly rising after six months.

For redesigns, the primary first goal should be to maintain current rankings for high-traffic terms immediately post-launch. Next, establish that your optimization is in place and start working to develop trust in the new pages and understand how the search engine sees them. From this, you can either increase or decrease your expectations for incremental success over the old site.

Frank Watson Fires Back

Interesting mate — length of time it takes to get ranked is always something you get asked, and it is a hard one to answer. Competitiveness, domain age, and brand level are the three things I look for at the start — after that it becomes an endless endeavor.

Using existing analytics can help, but many times you’re implementing them as well. One of my clients put bonus conditions on getting ranked on front page — left him after six months, then a couple months later many of the bonus terms popped into place. They just don’t get the idea of time in our industry.

Google changes faster than you can blink right now. Just some buzzwords that will bug you in 2010:

  1. Google Caffeine

  2. Google Real Time search

  3. Google Personalized search results

  4. Google Living Stories

  5. Google Phone

I could go on like for a while. It’s not my intention to confuse you even more though. I want to shed some light on the near future of Google, search and SEO. It’s a daunting task as I’m overwhelmed as well.

Thus I picked 30 resources on Google, search & SEO changes in 2010 from around the search industry and some main stream sources.

These articles explain some or several of the upcoming changes.

Google Caffeine

Google Real Time Search

Google Personlized Search

Google Living Stories

Google Phone

Google Breadcrumbs

Misc. Resources

Courtesy: 30 Resources on Google, Search & SEO Changes in 2010

We all love the holiday season, and so now, here are a few reasons to love it even more! A few weeks ago, we announced a set of powerful, flexible, and intelligent features. Today, at SES Chicago, Phil Mui announced additional features that build on these same themes to make your life as an analyst easier. We hope you’ll enjoy them.

Annotations

Do you ever wonder about an inexplicable change in your traffic? Or forget exactly when you launched something, or who was responsible? After scratching your head, did you have to chase down different departments in your company or go digging through old emails to get an answer?

For instance:

  • Was that dip in traffic because the servers went down?
  • When did the new display ads campaign launch?
  • Who’s responsible for the checkout page redesign and when did it go live?

Running around asking everyone from marketing, IT, and product doesn’t scale. More and more large companies are using Google Analytics, so we wanted to cut down on the mileage you need to cover to account for everything that happens to your website and online marketing.

This week, the wild goose chase is over — you can now easily denote unexplained dips or spikes and figure out “what happened” with the launch of Annotations in Google Analytics.


Annotations allows any user with access to a Google Analytics profile to leave shared or private notes right on the over-time graph. Building upon the concept of bringing Intelligence to data, Annotations complements existing anomaly detection by capturing the tribal intelligence of your company, which tends to be the most expensive and easily lost resource of all. A simple note from a colleague can save hours of real work (and frustration) for an analyst who is tasked to explain a usually dry set of numbers. This short video will show you how to use Annotations.

Taking its usefulness even further: Annotations can become your central repository, or logbook, for all online marketing and website design actions within your business. So even if you have multiple marketing teams, agencies, or webmasters, or if you have employee churn or other disruptions, you can always see which events may have caused conversions to increase or decrease. No wonder this has been one of the top requested features in Analytics for such a long time!

Custom Variables Now Available In Advanced Segments


Custom Variables provide you the power and flexibility to customize Google Analytics and collect the unique site usage data most important to your business. In Google Analytics, not only are you able to define multiple custom variables, each custom variable is a name-value pair and can be assigned one of 3 scopes: page, session, or visitor. Each custom variable name and each value is an arbitrary string defined by you pertinent to your business needs.

When we announced Multiple Custom Variables in October, the only way to view metrics on these Custom Variables then was to open the standard “Custom Variables” report in the Visitors section. This week, a user can create an advanced segmentation based on any key, value, as well as key-value combination of all Custom Variables. In other words, if you’ve created a Custom Variable such as “Logged In Member”, you can also create an advanced segment based on that variable and see it across all of your reports.

The ability to create visit segments based on Custom Variables is critical in maximizing the full potential of Custom Variables. Users can now slice and dice their metrics by decorating their site traffic with the appropriate key-value pairs.
Custom Variables Available In Custom Reports
You can also create Custom Reports with any of the key or value dimensions associated with any Custom Variable. Now, you can see how a segment defined by Custom Variables behaves along any of the metrics available in Google Analytics.

New Analytics Tracking Code Setup Wizard

One of the more daunting tasks in setting up analytics on any site is to manually configure the tracking code for specialized situations, such as multiple subdomains, cross-domain tracking, mobile web tracking, PHP sites, campaign tagging, etc.

Well, fear no more. When you create a profile, you’ll notice a new tracking code setup wizard in Google Analytics. This wizard automatically generates the appropriate tracking code according to the setup options specified by you.

New Version of The Analytics API

Later this week, there will be a separate announcement about a set of very exciting features to our Analytics API. Here’s a little preview: Support for Advanced Segmentation will now be available through the API.

In addition, new data dimensions and metrics will be made available, including those in our recently announced features.

How search engines could get granular

One area that is worth looking at for SEO in 2009 (for me at least) is page segmentation. Now this approach really isn’t new and I came across papers as far back as 1997 and beyond. But unsurprisingly most IR methods don’t just appear over night. The big three, (of search) have each had various research papers and patents dating back as far as 2003-4. It just seems to have some traction and is sensible as well.

Essentially page segmentation is when a search engine looks to break a given web page down into its component parts. They could analyze a web page and assign various relevance or importance scoring for different regions of a page. Some of the methods include fixed-length page segmentation (FixedPS), DOM, (DomPS) and location based and white spaces (vision based or VIPS) and even a combined approach (CombPS).

Page segmentationAs with many IR methodologies they try to improve the signal to noise ratio. In this case by hopefully identify the noisy segments; resources can be focused on the relevant areas of a web page. Furthermore most people do tend to understand web pages in a segmented or structured view. When you arrived at this page did you instinctively know where to find the main content? Aware of common locations for navigation and other elements? Banner blindness? You get the idea.

Advantages of page segmentation

The main advantages are increased relevance and streamlining processing elements. Search engines hope to use page segmentation to be able to asses a more finite understanding of a given pages relevancy, but also (theoretically) be capable of dealing with multi-topic pages, semantically related or not.

The second advantage, processing and resource management, can be achieved as they could define site templates in an attempt to only crawl/index the relevant parts of the page and not the boilerplate elements.

Now, while there are a few ways of going about it, what’s important here is that such systems are sensible not only from a relevancy perspective, but could also help crawling and indexing resource management.

One has to imagine new ideas at the big three will be tempered in a volatile economy. Once a template has been established, indexing a site on a regular basis could be far easier on a search engine (and site owner as well). Just have a little ‘template bot’ crawl a few pages now and again to ensure the profile is unchanged.. but I’m rambling now…

Another implementation (as noted by the Google patent) could be pages that have a number of listings that are geographic in nature. As search for ‘stone oven pizza, Toronto’ could produce better results as larger listings of pizza shops in Toronto could be segmented and digested by more finite parameters than normal.

The text associated with the smallest hierarchical level surrounding a business listing may be associated with that business listing” – Patent; Document segmentation based on visual gaps

Segmenting the page

The nuts and bolts I shant trouble you with (links later as always) but it varies from code analysis (DOM) approaches to vision based. The main idea is establishing common (boilerplate) segments of a web page… And from there the systems can be set to even more granular levels to find an optimal rate (playing with the dials).

Boilerplate type elements can include; Headers, Footers, Navigational elements found throughout a site or a single page. When looking at multiple pages common elements can also be identified such as the phrase ‘Copyright 2009’ for example. Within a piece of content common boiler plate elements such as a copyright notice or navigational links (Home, ‘Contact Us’) can also be identified and disregarded as needed. The same can be said of advertisements and other blocks of information found throughout a website.

By disregarding such boiler plate text during indexation the search engine can also attain greater relevance and save processing resources.

Now, obviously we’re a content centric bunch in the SEO world and so understanding how they might look to identify ‘content’ areas of the page is paramount (more later on the why). Elements often cited are;

  1. Number of images in the block
  2. Size of the images
  3. Number of links
  4. Anchor text length
  5. Number of words
  6. Length of form elements
  7. Text formatting elements (<strong> <text> <i> <em>
  8. Other code elements (<table> <p> <hr> <ul> <td>)
  9. Background color of a node (or child node)

Also the size and position of the block can give added signals as to where the core content of the page resides. In most situations for search optimizers, we deal in this space. This is where we really play – this isn’t 03 and site wide footer links sucked long ago.

Now it’s not all dill and pickles…there are a few potential issues with page segmentation systems.

Problems

Not all people value the same parts of a page given the different types of sites out there. For example some may look at the stories on the home page of a social media site while others may look to the latest comments… because of this setting the thresholds and valuations of segments is problematic. Even the boilerplate concepts suffer from this (as the ‘Top 10’ and comments are on every page at a site like Sphinn)

Inferring interest

One might also have varied feelings of importance to navigational elements when not directly on a page of interest (thus editorial areas can also vary). As with the above example, navigating to the ‘search marketing’ segment may be of greatest interest to me. What of a side panel element of ‘related topics’ which may or may not be of great importance to the viewer? The point being that it’s not a golden egg entirely.

As you can see, adoption still has a few mountains to traverse, but even in a limited capacity there are uses… one really jumps to mind (and shame on you if you weren’t already asking)…

Linked implications

Yes, that’s right… you know we just had to end up here right? One of the things that really drew me into this topic more and more over the last year is the potential implication for links. One would have to imagine the link dominant search engines would welcome a system that could potentially provide more granular levels of link valuations (on site and off).

To begin with, page segmentation can help bolster link analysis methods Multi-topic pagessuch as page rank, HITS and their ilk. Or so the story goes. Consider a page with a variety of semantically or not so related content, complete with links (internal or external). Traditional analysis tells us that the page is treated as a whole and thus link relevance can be effected from a lack of focused theme. If search engines can begin to break out blocks of information, independent of the whole, new valuations can be had for links from within a single document. In short there could be more link juice to go around.

Another interesting element would be the ability to build links to a multi-semantic page with diverse anchor texts. Many times in SEO one creates target page(s) built around terms and builds related links to that page. This has always made ecommerce SEO a struggle between clicks to purchase and SEO readiness as far as structural elements and ‘landing pages’ are concerned. Page segmentation methods mean we could build more diversified link profiles to a given page (such as a main category page in the case of the ecommerce example).

Think of links from a block-to-page level and page-to-block (instead of say PageRank which is page-to-page). One can see how greater relevance from link analysis can be had.

Now this can be a doubled edge sword in that block level link analysis (such as BLPR) could play into the valuation of links. This could mean wholesale devaluation or dampened of certain link types. This could include;

  1. Advertising blocks
  2. Blog comment links
  3. Header/footer links
  4. Site wide links
  5. Forum signatures

As you can imagine, this would put a high premium of editorial (diverse) links within content. The ‘boilerplate’ models could also easily pick out mass paid link programs and article marketing links as common (unnatural) elements.

Spam busting

I think it is interesting that much of what has been looked at in page segmentation, to me, has some interesting implementations in spam detection. For starters more complex template and content analysis is bound to turn up many boiler plate websites such as those produced by web spammers. On a granular level, poorly generated content for spam sites could in itself create boilerplate text right within the content.

Beyond that certain template systems employed by spammers over and over can be identified and cross referenced with other factors (link spam analysis, IPs, whois) to profile spammers. And speaking of link spam, these systems could also identify common locations which boilerplate link texts are showing up for a given link profile. Ultimately any IR system should have some spam detection capabilities and these methods satisfy that on a few levels.

As mentioned above some elements of block level link analysis could be used to identify linking schemes in concert with existing methods. Consider large scale paid blog reviews or article marketing campaigns where the template changes from site to site, blog to blog, but the main content (once identified) contains identical anchor texts/author bios. Analysis on a page by page basis wouldn’t be as effective as a block by block analysis.

What could it mean to your SEO?

And so what does any of this mean to you? Well, to be honest we don’t know if any of the big three implement page segmentation concepts on any level, though Microsoft certainly has had a strong addiction over the years. As with many search applications, the end user experience is a running concern. Many of the adaptations we’d make with such methods in mind would ultimately make for a better and more concise end user experience. At very least we can improve usability and prepare for potential search evolutions all at the same time.

Some key take-away could be;

  1. Create distinct blocks when constructing pages and ensure it is obvious where the content it.
  2. Use CSS to define text types; when possible ensure content text face is unique
  3. Use tables or unique div background colors for child (related) topics within content.
  4. Present content in semantic page formatting; title’s, italics and bold to segment concepts

Ultimately if such systems did gain traction it would become increasingly possible to rank a single page for a variety of terms, beyond the abilities we currently have for targeting. One instance one might find for these concepts are ecommerce applications and varied product lines (though semantically related). Let’s look at the following example;

Ecommerce segmentation
*courtesy Wave Shoppe – Hawaiian clothing

In an instance such as this formatting and targeting the text within each of the blocks becomes of great importance. We could also consider alternating the background colors of the product nodes to define them as unique segments. You can also ensure the upper and lower segments are properly targeted as a parent or child node of the over-all presentation.

For all we know search engines could look at top performing pages in a query space and analyze them for semantic block and other segmentation variables to create new signals for other pages in the set (query space). We can say that there is interest, potential and even potential advantages from processing and spam detection stand points. What we can’t say is how deep or valuable page segmentation will be to search engines in the future.

So far Microsoft and Yahoo seem to have the most interest, although I wouldn’t count Google out as what I read of ‘block-level-pagerank’ seemed promising. I wouldn’t go changing how you optimize just yet, but tuck this one away. It is something to watch for in 2009… just in case.

Resources: SEO implications of Page Segmentation concepts

Follow

Get every new post delivered to your Inbox.

Join 1,002 other followers