Friday, October 18, 2013

Ecommerce Product Pages: How to Fix Duplicate, Thin & Too Much Content

Content issues plague many sites on the web. Ecommerce sites are particularly at risk, largely due to issues that can stem from hosting hundreds or thousands of product pages.
Typical issues with ecommerce product pages are:

  • Duplicate content.
  • Thin content.
  • Too much content (i.e., too many pages).
Left unchecked, these issues can negatively impact your site's performance in the SERPs.
If you run an ecommerce site and you've seen traffic flat-line, slowly erode, or fall off a cliff recently, then product page content issues may be the culprit.
Let's take a closer look at some of the most common content woes that plague ecommerce sites, and recommendations on how to can fix them.

Duplicate Content

There are typically three types of duplicate content we encounter on ecommerce sites:
  • Copied versions of the manufacturer's product descriptions.
  • Unique descriptions that are duplicated across multiple versions of the same product.
  • Query strings generated from faceted navigation.

Copied product descriptions

A large degree of ecommerce resellers copy their generic product descriptions directly from the manufacturer's website. This is a big no-no. In the age of Panda, publishing copied or duplicated content across your site will weigh your site down in the SERPs like a battleship anchor.

How to fix it

The solution here is to author original product descriptions for every product on your site. If budget is an issue, prioritize and get fresh content written for your highest margin product pages first and work backwards.

Unique yet duplicated product descriptions

With many ecommerce sites, site owners have authored original product descriptions, which is fantastic. Where they run into trouble is they sell multiple versions of the same product (different sizes or colors or materials, etc), and each product version has a different page/URL with the same boilerplate description.
Now even though this content is technically unique to your site (it's not copied from somewhere else), it's only unique to a single page. Every other page it lives on is considered duplicated content.

How to fix it

The solution here is to concentrate multiple product version pages to a single page, with all the different product options listed down the page. Or you can position them as a list in a drop down menu, like Zappos does.
Product Dropdown Nike Lunarglide
Once you combine all pages to a single page, 301 redirect the other URLs to that single page, in the event they've attracted links and/or accrued link equity. The redirects will also help Google sort out the true version of your product page, and can help with any potential crawl budget issues.
Depending on the ecommerce platform you're using, concentrating multiple versions of a product page to a single URL can be difficult or impossible. If that's the case, think about moving to a SEO-friendly platform, like Magento or Shopify.

Faceted navigation issues

Many ecommerce sites host category pages with a range of filters to help users easily navigate their site and drill down to specific products, like this Weber Grill page on Home Depot.
Home Depot Faceted Navigation
A faceted navigation menu like the one above can create dozens if not hundreds of query strings that are appended to the URL, thereby creating duplicate versions of the same page. Faceted navigation can be a fantastic UX feature for consumers, but can problematic for SEO.

How to fix it

There are a few ways to prevent searches engines from indexing duplicate content from faceted navigation:
  • Block faceted pages via Robots.txt file.
  • Parameter handling via Webmaster Tools.
  • Add self-referential canonical tags (rel="canonical") Note: this may help Google distinguish original from duplicate content, but it won't address crawl budget issues.

Thin Content

Even if a site has 100 percent unique product descriptions, they can often be on the thin side (i.e., a few bullets of text). Now, product pages with light content can still rank well where domain strength helps supersede potential thin content issues.
But most sites don't have the backlink profiles of Amazon or Zappos, and I like to think in terms of risk/reward. Thickening up descriptions makes sense because:
  • It can reduce any risk that thin content issues might negatively impact SERP visibility
  • It adds more content for engines to crawl, which means more opportunities for your page to rank for a wider basket of search queries.
  • It freshens up your page, and freshening up your content can definitely pay dividends with Google.
To audit word count for every page on your site, crawl the site with Screaming Frog and looking for potential trouble spots in the "Word Count" column.
Word Count Audit

How to fix it

Some of the ways you can address thin content on your ecommerce product pages include:
  • Enable (and solicit) user reviews and feedback. User-generated content is free and helps thicken up your content with naturally-written text (not "SEO" content). This additional content can help improve potential relevancy scoring, time on page, user engagement levels, and can help the product page rank for a broader basket of search queries. Also, user reviews offer social proof and can improve conversion rates as well.
  • In the previous example, I spoke about condensing multiple versions of the same product to a single page. Doing this would also help thicken up that pages since you'd list all the different dimensions, size variations, colors available to consumers.
  • Write some additional, original content. You can hire a writer to help thicken up these pages with additional features and benefits, or you can do it yourself. Again, given it could be very costly to thicken up every product page on the site, you can prioritize your highest margin products first.
  • Pulling in mashups of links/text of similar products, product accessories, special offers and recently viewed items is another way to add more content to a page, and a tactic many larger ecommerce sites use like Amazon.com.
Amazon Product Mashups

Too Much Content

Saying that a site has "too much content" may sound contradictory to the issue of having content that's too thin. But when I say an ecommerce site may have too much content, I'm really talking about two distinct issues:
  • Too many product pages.
  • Improper handling of paginated product pages.
And specifically how having too many pages of low value content can cause PageRank and crawl budget problems.

Too many product pages

This is really an addendum to the duplicate content issues posed by faceted navigation or hosting multiple versions of the same product on different pages.
Aside from low value content concerns, hosting a mass of duplicated product pages dilutes your site's PageRank or link equity, which weakens its overall ranking power of your important content.
The other issue pertains to your site's "crawl budget" (i.e. how deep/how many pages Googlebot crawls each time it visits your website). If a large percentage of your site if comprised of duplicate or low value content, you're wasting your budget on junk content and potentially keeping quality pages from getting indexed.

Improper handling of paginated product pages

Another concern of hosting "too many pages" is not handling pagination correctly. Often times, ecommerce sites can have product categories containing hundreds or thousands of products that span multiple pages.
Pagination Issues
Like duplicate product pages, excessive paginated results rob link equity from important pages and can hurt your crawl budget.

How to fix

Some of the ways to address equity dilution or crawl budget issues that can stem from too many product pages include:
  • Rel=next, rel=previous: This markup tells Google to treat ecommerce product listings spanning multiple pages in a logical sequence, thus consolidating link equity (rather than diluting it) with all pages in the series.
  • Canonicalization: It's effective for consolidating link properties (thus solving equity dilution), but it won't solve potential crawl budget issues, since Googlebot will still crawl all your dupe content.
  • "Noindex, follow": If your goal is to optimize crawl budget and keep duplicates or pagination out of the index, use brute force and block Googlebot via robots "noindex, follow" meta directive.

Monday, October 14, 2013

The Causal Nexus of SEO



Dominos Falling
There are some aspects of online marketing that play a huge role in the bigger picture, but aren't as easy to see. Things like emotion, motivation, awareness, and relationships can be hard to gauge with our usual metrics.
But sometimes the effects of an action aren't evident right away. There are times when we can't associate cause and effect directly. But everything we do in SEO fits into a much bigger chain reaction and we might not able to see every piece.
When something doesn't fit our typical measurements it may be easy to write it off entirely. There's actually a word for that: floccinaucinihilipilification.
The exact definition of floccinaucinihilipilification from Dictionary.com is "The estimation of something as valueless." It's actually the longest non-technical word in the English language. Sorry, antidisestablishmentarianism.
Aside from being a semi-useful piece of party trivia, floccinaucinihilipilification is actually a great description of one of the most frustrating aspects of modern SEO. There are just so many things that are easy to dismiss because they are outside of our usual expectations for results.
Search is evolving to a point where we get much less instant gratification. Things take a much less linear path than they did in the past; you can't just walk across the room and turn on the light anymore. You have to use a Rube Goldberg machine to do it.

The Social Part of Social Media

In social media you can measure friends, followers, retweets, circles, referral visits, and sales through unique promotions. There are all sorts of fantastic metrics for judging how well a social campaign is performing. Of course not every one of those translates to visits, or dollars.
If a comment on your wall doesn't result in a sale or if a retweet doesn't improve rankings, then does it matter? Yes.
With social media it's also about the prospect of exposure. It may not be as clearly measurable when someone shares something on Facebook and one of their friends sees it and later searches the brand name. It's not always obvious when retweeting someone's post and getting a "thank you" leads to that person clicking on the Tweeter's site in the SERPs because they recognize the name.
If social media efforts aren't directly impacting your rankings, or the traffic numbers aren't approaching search engine referral proportions, that doesn't mean the campaign isn't working.
A comment on a wall may not mean much on its own. But a comment may lead to a new fan that may lead to a new sharer, who could grow to be an evangelist if the relationship is cultivated.
While direct leads are a possibility from social media, there's more to it than that. It's access to a huge and active audience if you're willing to play to the crowd.

Simple, Single Links

Links are probably one of the hardest places to deal with all of the changes in the last year. Links have been both the salvation and devastation of too many websites.
Bought links, links with keyword anchor text, easy, cheap, unlimited links weren't supposed to work, according to the rules. But they did. So forget the rules, people made money. Except now, best case scenario they don't work as well and worst case, they can tank a site.
So now links mean a totally different thing. They aren't as easy to get any more. They don't necessarily go to the pages where products live and links that go to different kinds of content don't always work the same way.
Links with your URL as anchor text probably won't move a site up for its head terms as quickly as a hand full of links brandishing keywords used to. So now maybe it's a about getting a link from a small community organization instead of 150 directories. But those little links are a much bigger deal now.
It's never going to be the same, but this is where we live now. A link from a person's enthusiast site for a how-to guide may not seem as effective as syndicating an article across 300 sites, but it's real. Things that are authentic may take longer to feel.

Trust and the Human Factor

Google has shown a continued effort to become scary close to emotional intimacy with the preferences of its users.
Authorship is one indicator of Google's improving efforts to identify individuals as entities. Public signs point toward their increased attempts to incorporate that information into how they evaluate websites. This interest in using real people's association with websites to determine trust, should be more than enough to pique our in getting onboard early.
On the other side, Google also seems to be trying to figure out which sites people trust through their own choices and patterns. That means visitor loyalty isn't just important for repeat sales, the signals it sends can be beneficial for SEO.
Some loyalty is measureable. Getting people to want to return to a site is measurable. We can see when the percentage of repeat users goes up.
We can measure how many people come to a site through subscription based newsletters or email marketing. We can measure when people become regular commenters or forum posters.
But it's hard to measure where those relationships start. Was the first time they came to your site searching for what you sell? Or is it possible it's because they knew you before they needed what you sell?
It isn't always as clear cut as which search word brought you the most visitors, or what was the last click before the sale. Sometimes that sale was months in the making based on a chain reaction that couldn't be tracked.

A More Convoluted Path

Each action that creates a positive connection has value even if it falls outside of our traditional data tracking.
We absolutely have to evaluate numbers, show correlation and prove ROI. That's the job of anyone working in SEO. But trying to optimize within the new system, we've had to get more creative.
It may take time for an initial action to produce a desired end result and there may be 10 steps in between instead of 3. But that doesn't mean it isn't worth it.
So don't immediately floccinaucinihilipilificate an effort in which direct results are a little ambiguous. There may be more at play than is immediately evident.
There's a time to give up on something that isn't working, sure. But make sure you're not comparing more slow-burning efforts to the precedents of the past.
At this point, shortcuts are getting shut down more and more every day, and the long way is about the only option left. So yes, an action might not lead to more rankings, traffic, or sales directly, but that doesn't necessarily mean it didn't work; it may simply be the first domino to fall.

Top Search Result = Poor Ad CTR [Study]

Advertising network Chitika released a study today that showed how ad click-through rates on a website vary when users come to that website from Position 1 in the organic search results versus other positions. Data showed the highest CTR on ads in a website occured when users found the site from Position 10 in the SERPs.
ctr-by-referring-position
As a follow up to Chitika's study last summer that showed how rankings yielded traffic, Chitika said this is a stark contrast in terms of ad performance.
"What is clear from the data set is that although the first position of a Google search result drives the most search traffic, an average visitor coming from that link is the least likely to convert into an ad click," according to Chitika.
Chitika said the reason why Position 10 might be driving the most ad CTR on a site could be due to unsatisfactory results.
"When a user scrolls down and clicks on a link at Position 10, it is more likely that they have not found what they were looking for, increasing the probability of that person clicking on an ad related to their search query," Chitika said.
Chitika said that marketers shouldn't necessarily be vying for 10th position on every keyword, but that in terms of driving ad revenue, it's not a bad place to be overall.
google-results-page-rank-average-traffic-share-chart
"On a popular search term, 2.4 percent of potential visitors still represents a sizable audience, and by being the number 10 result, it's likely a site will see higher ad revenues," Chitika said in its report. "However, for lower volume or specialized search terms, ranking as high as possible will help in attracting the largest audience, since the proverbial 'pie' of users on those terms is already fairly small, along with the potential revenue impact of higher visitor CTRs."
So what's a marketer to do with this data? Cristian Potter, a data solutions engineer at Chitika, said it's important to note that this report examines aggregate traffic trends, and may not apply to groups of sites.
"Hitting the sweet spot requires some analysis of an individual site's traffic, for example, understanding how users are finding the site, and how certain campaigns have impacted actions undertaken by users on the site itself," Potter said. He added that this research can serve as a "as a point of reference in plotting metrics and key performance indicators."
While the data seemed to show an interesting relationship, sites that go after ad revenue have a seemingly delicate balance of providing a great user experience and making money. Not having the most relevant content (Position 10 versus Position 1) and subsequently driving users away through an ad doesn't seem like a great idea, either.
Potter agreed.
"User experience is always a key consideration when it comes to deciding on the number and placement of ad units," Potter said. "This also ties in with expected CPM on each ad unit - it should be worth the site's while to place an ad in a prime position. However, this study was solely behavior focused. The characteristics of the one or more ad units on each site within the sample will have varied considerably."

New AdWords Estimated Total Conversions Tracks Consumer Purchases Across Devices

Starting today and over the next few weeks, Google AdWords will roll out a major reporting update to conversion tracking called Estimated Total Conversions. This feature provides estimates of conversions that take multiple devices to complete and adds this data to the conversion reporting we see today.
Following the launch of enhanced campaigns this year, search advertisers have combined mobile and desktops with the ability to further modify bids by mobile and other targeting factors. One gap in reporting and comprehension of the campaigns effectiveness has been the limited data on how consumers are navigating and converting via multiple device options.

What is a Cross-Device Conversion?

What is a Cross-Device Conversion
Consumers constant connectivity has enabled them to browse, shop, and interact with businesses on the go and from multiple devices.
A September 2013 Google study found that more than 90 percent of multi-device consumers move sequentially between several screens like mobile to desktop, or mobile to tablet to complete a transaction online. Google found that a high percentage of converters actually jumped from desktop to desktop too, presuming a work desktop to home desktop computer.

How Estimated Total Conversions Works

Measuring AdWords Conversions in a Multi-Screen World
Google calculates cross-device conversions for a particular advertiser based on how their customers convert when they are logged in. They then use this as the basis for extrapolating out to the complete data set to form an estimate of what total conversions that cross devices might look like. This data is only used in aggregate and not personally identifiable.

What's Next?

Estimating conversions across devices (estimated cross-device conversions) is only the beginning and one conversion type Google intends to measure.
In the future Google plans to incorporate other conversion types such as phone calls and store visits where advertisers are hungry to gain new insights into how their advertising is working.

Link Building 101: Competitor Analysis

Link Building 101 Competitor Analysis
Link building is something anyone can accomplish. There's no great secret, just hard work, creativity, and determination to get links that matter.
When you're looking for some practical link building opportunities that will help you find and acquire quick, yet quality, links, there are five "quick wins" you should explore at the beginning of a link building campaign:
  1. 404 Pages and Link Reclamation
  2. Competitor Analysis
  3. Fresh Web Explorer/Google Alerts
  4. Local Link Building
  5. Past/Current Relationships

Competitor Analysis/Backlink Profile

Competitor analysis is an integral step in any link building campaign. Why? Because running a backlink analysis on a competitor:
  • Teaches you about the industry:
    • Gives you a sense of which sites within the vertical are providing links
  • Helps you understand your competitors, including:
    • Their link profile, and why they're ranking
    • Their strategies used to acquire links
    • Their resources that didn't acquire many links
Gives you a list of obtainable links (if they can, why not you?)
Competitor backlink analysis is great – you get the initial research into the industry done, it helps you understand the competition, and it gives you a tidy list of high opportunity links.
So, let's dive into the how of competitor backlink analysis:
  1. Make a list of competitors
    • Direct
    • Indirect
    • Industry influencers
    • Those ranking for industry money keywords
    • Watch fluctuations – who's winning and who's losing
  2. Take those competitors and run their sites' through a backlink tool previously mentioned (OSE, Majestic, Ahrefs, CognitiveSEO, etc.)
  3. Backlink Analysis
  4. Download the top 3-4 competitors' backlinks into CSVs. Combine into a single Excel sheet, removing duplicates, and find obtainable quality links already secured by competitors.
Step 2 and 3 were previously covered in "Link Building 101: How to Conduct a Backlink Analysis", and step 1 is pretty self-explanatory.
To recap the advice for these steps:
  • Don't phone-in the list of competitors. Spend time doing research and investigation, giving yourself a well thought out and understood list of potential competitors.
  • Information you should be examining in a backlink analysis:
    • Total number of links
    • Number of unique linking domains
    • Anchor Text usage and variance
    • Fresh/incoming links
    • Recently lost links
    • Page Performance (via top pages)
    • Link quality (via manual examination)
  • Additionally, think creatively while looking through competitors' backlinks. Think about:
    • Which resources/pages performed well
    • Which resources/pages performed poorly
    • Commonalities in competitor's link profiles
    • Differences in competitor's link profiles
    • Strategies likely used to acquire links

How to Find Obtainable Quality Links

So, that takes us to Step 4: downloading competitors links into CSVs, combining in Excel, and drilling down into the data to find worthwhile links and insights.
Honestly, SEER has done an amazing job of writing a very easy to follow guide for Competitor Backlink Analysis in Excel.
To summarize their steps, you:
  • Download CSVs of competitor's backlink portfolios (‘Inbound Links' will give you a list of all the pages linking, ‘Linking Domains' will give you only the domains).
    • Note: if you're unfamiliar with your own (or client's) backlink portfolio, you may wish to include their backlink portfolio in this process for reference.
    • Using OSE don't forget to filter to the whole domain:
Pages on this root domain export to CSV
  • Open the CSVs and combine (copy and paste) all the data into a single Excel sheet.
  • Filter down to clean URLs, keeping the originals intact.
    • Move Column J (target URL) to Column P (to be the last column)
Move Column
    • Delete Column J (the now empty column)
Delete Empty Column
    • Duplicate the URL and Target URL columns on either side
Duplicate URL Target URL columns
    • Remove http:// and www. from both column A and column P - select the column, click control+H (find and replace shortcut), type in what you want to find (http:// and www.) and replace them with nothing (by leaving the second line blank).
Remove http and www
    • You might want to rename column A and P at this point - call them bare URL and bare target URL, or whatever you so desire (in the SEER article they were called ‘clean').
  • Remove duplicates
Remove Duplicates
    • Make sure it's only for column A (bare URL) and P (bare target URL)
Remove Duplicates URL
Notice the check mark on "My data has headers". This is important to keep your data from being jumbled up. Anytime you're removing duplicates make sure this box is checked.
This will give you a complete list of stripped URLs next to the full URL linking (along with the rest of the important information provided by OSE) and a list of full target URLs next to a complete list of stripped target URLs.
Note: you'll still likely have a lot of duplicate URLs in column A (the linking URLs) at this point. This is because there's multiple links on the same page going to different landing pages – which is potentially important information (shows a competitor acquired multiple links per page).
If you'd like to delete these multiple link pages/URLs to reduce data noise, highlight column A, and run ‘Delete Duplicates' again - making sure to have the ‘My data has headers' box is checked:
Remove Duplicates Bare URLs
Now, you'll be down to unique URLs (pages, not domains if you've used Inbound Links) linking to competitors. If you're looking for only referring domains, you should start back at step 1 and download a CSV of referring domains, as opposed to all links.
At this point, you're still dealing with a lot of data, so you'll want to filter it further. I recommend filtering by domain authority to see the most authoritative links first.
Filter Domain Authority
This will make your list ordered from highest domain authority to lowest – pretty useful information. Keep in mind however that the domain authority is thrown off by any subdomains hosted on a popular site – example.wordpress.com, example.blogspot.com, etc.
So, don't take the domain authority as absolute – you'll need to verify.
There's also a few other filters you can use to find interesting data:
  • Page Authority (PA)
  • Anchor Text
  • Number of domains linking (shows best ranking pages - don't get stuck on home pages)
Take time and play around with the data. Look through the top DA's (manually excluding anything artificially inflated), then PA's, check out top performing pages via number of domains linking, and even play around with filtering the anchor text.
This should be the fun part - the analysis. You've filtered the data down to a semi-digestible level, and should start taking advantage to find insights and understand your competitor's links.
Remember, any links your competitor has should be considered fair game for yourself. Once you've determined quality links from domains you haven't secured, look into the link and pursue it appropriately.

More Insights

If you're looking for an even better (and more advanced) deep data insights you can move all this information into pivot tables. Simply select all rows, click over to the insert tab, and select ‘Pivot Table':
Insert Pivot Table
Once here you have the option to choose which fields you'd like to further examine:
Pivot Table Fields to Add
Playing with this data should reveal potential insights, although we're getting a bit beyond Link Building 101.
Furthermore, if you want to really dive into pivot tables (or excel in general), I can't recommend Annie Cushing enough. Check out her Moz article "How to Carve Out Marketing Strategies by Mining Your Competitors' Backlinks".