SEO and Validation

Posted by SEO Dave on August 26th, 2006 at 10:20am

This post is based on a thread (Re: My Google Page Rank is 4) in the NG alt.internet.search-engines and a request for help in the comments at SEO Expert page.

Original poster asked about his 3-4 month old site www.cre8ivemind.com, with a PR4 home page, but no pages indexed in Google despite using Google site map.

Confirmed with the Google toolbar the home page was indeed PR4.

PR4 Screenshot
Click for large PR4 screenshot

Confirmed with a link:http://www.domain.com/ search in Google that the site had backlinks (5 backlinks) did the same with Yahoo, though Yahoo has changed it’s link: format recently and sometimes it doesn’t work (used to work with any site including deleted ones!)! Yahoo showed 113 backlinks.

Google Backlinks Screenshot
Click for large Backlinks screenshot

Confirmed with a site:http://www.domain.com/ search in Google that the site had no pages indexed in Google! Did the same with Yahoo, Yahoo showed 8 pages indexed.

Google Site Search Screenshot
Click for large Site Search screenshot

So the site has a PR4 home page, backlinks, but no pages are indexed in Google. 113 backlinks on Yahoo should be more than enough to index this site, so as an expereinced SEO consultant I know something is wrong.

A very quick check of the home pages code (view source with Internet Explorer or View Page Source with FireFox) didn’t immediately show any problems. The site was created using the WYSIWYG (What You See Is What You Get) editor Frontpage and so there’s a lot of extra code (makes it harder to check manually for serious code errors).

On this first manual check I saw Flash code which matched up with the sites navigation (bad for spidering/SEO), but also found corresponding text links to the same internal pages, so if there are no major code errors this site should be spiderable, also found this meta tag-

<meta NAME=”Robots” CONTENT=”All”>

This is telling all search engines to index this page and follow all links, BUT this is also the default for all search engines (they do by default follow and index), so pointless adding it. You should check this though just in case you added an exclusion like noindex or nofollow, see more Robots meta tag options at Meta Tags Optimization

It’s also a good idea to check for a robots.txt file by looking at http://www.domain.com/robots.txt This site didn’t have a robots.txt file, so this can’t be a source of the problem.

I couldn’t see anything else obviously wrong with the code, but it’s messy code so not easy to follow. This is when I use a validator specifically the W3C Validator.

Result: Failed validation, 45 errors

Anyone not familiar with validation may think 45 validator errors sounds bad. The vast majority of pages online today (August 2006) have numerous validator errors, yet work fine in the main browsers and the search engines can index them (check the top 10 ranking sites for almost any SERP, most pages won’t validate). Basically the browsers and the search engines can handle many validator errors, but they can’t handle ALL code errors, so we must remove the serious ones you find manually or via a validator.

We could try to remove all validator errors so we are 100% certain the code is serious error free.

Most of the validator errors reported are like these

No DOCTYPE found! Attempting validation with HTML 4.01 Transitional.
there is no attribute “LEFTMARGIN”.
there is no attribute “HEIGHT”.
required attribute “ALT” not specified.
value of attribute “ALIGN” cannot be “”; must be one of “TOP”, “MIDDLE”, “BOTTOM”, “LEFT”, “RIGHT”.
element “EMBED” undefined.

they are not going to cause problems, generally anything mentioning an attribute isn’t going to cause spidering problems.

There was however two validator errors involving the head and body tags that concerned me. These are important tags and to have an error associated with them is serious.

Line 22 column 6: end tag for element “HEAD” which is not open.
Line 23 column 59: document type does not allow element “BODY” here.

So if you ever find an error like the above two investigate and fix.

Looking at the code again I find a complete set of both tags (one each for opening the head and body tags and one each closing the tags) both in the right place. Hmm weird, I expected to find an extra closing head and body tag!

I then noticed just above the head closing tag some incorrectly formatted code for this Doctype (1st line below)-

<META name=”verify-v1″ content=”YG0wFdONXpRc175v3+BUnh67nykv+vqzaH/H5B1z27o=” />
</head>
<BODY LEFTMARGIN=0 TOPMARGIN=0 MARGINWIDTH=0 MARGINHEIGHT=0>

We use / there when using XHTML (which this page clearly is not), so +vqzaH/H5B1z27o=” /> should be +vqzaH/H5B1z27o=”> since it’s not the right format for this Doc type (HTML 4.01 Transitional is what the validator is working with)

I’m not familiar with that meta tag, so I’d delete (along with 90% of the other ones used!).

I don’t think this would cause problems with spidering, but best to fix it (could be the cause, who knows for sure how Google treats this sort of thing). By fixing this one error both the body and the head validator errors will be eliminated.

The head/body errors though potentially devastating I have a feeling Google would be able to handle them (several browsers versions/types handle the code fine, as does Yahoo which indexed 8 pages), so I think the problem is in another area.

There’s one other (set of) potentially serious validation errors-

Line 90 column 48: document type does not allow element “A” here.
Line 90 column 85: document type does not allow element “A” here.
Line 90 column 146: document type does not allow element “A” here.

with this code-

<a href=”http://www.cre8ivemind.com”><br>
<a href=”http://www.cre8ivemind.com”><a href=”http://www.cre8ivemind.com”>Home</a> l <a href=”http://www.cre8ivemind.com/about-us.htm”>About-us</a> l

The first bit-

<a href=”http://www.cre8ivemind.com”><br>
<a href=”http://www.cre8ivemind.com”>

Shouldn’t be there (causes over a dozen potentially serious validator errors).

The author has probably been dragging and dropping links and/or deleting them in a WYSIWYG type editor, ahh, Frontpage :-).

Sometimes when you delete a link in the WYSIWYG editor it only deletes the anchor text and not the full link. A little dragging and dropping later and you get the sort of code mess above. Not sure how Google etc… will handle this, so not the sort of error you want to leave to chance. This is the sort of error that could cause spidering problems, though shouldn’t prevent indexing of the home page (so not THE cause).

There is nothing else wrong with the code from a spidering perspective, so if the above isn’t the cause of the problem next is to look at possible penalties.

Didn’t check for penalties, IE duplicate content problems etc… since this post was about validation and SEO. That said the content isn’t very good, it’s mostly a list of keywords and so not of great importance. So might be we are seeing a site penalized for keyword stuffing. If it is that I’m impressed Google caught it (assuming it’s an automated penalty).

When I get some time I’ll take another look to see if I can find the source of the problem.

Popularity: 2%

Under Free SEO Advice+ SEO

7 Comments for SEO and Validation

  • 1. Manish Tuli  |  September 9th, 2006 at 4:14 am

    Hi,
    First of all thanks for helping me, & give me the advice, i got indexing in google yestarday, I did lots of work, like link exchange & work on codes, now W3C Validator show only 19 errors, & all thess errors for flash file & java script, anyway final think is that i got indexing in google & My main keyword is “freelance web designer Mumbai” from this my site is coming on 1st page in google,

    Thanks again,

  • 2. SEO Dave  |  September 10th, 2006 at 9:41 am

    Glad I could help.

    David

  • 3. CIO Jerry  |  January 14th, 2007 at 11:35 am

    For the benefits of future readers, the results from using Google webmaster tools. Inserting the tag under is one of the two methods to verify ownership of a site before Google attempts to analyze the site for you. The other method is to upload a file to / with a specific name.

  • 4. Asish Kumar Das  |  February 17th, 2007 at 7:22 am

    Hi,

    I’m enquiring about google validation code for my site.

    Now, I’m using the same from my old site, which is as under

    My question is, Should I use the old code in new site?

    If not, where can I get a new code?

    Need your kind help.

    Regards,

    Asish

  • 5. Iry  |  April 2nd, 2007 at 5:05 am

    Hi David,

    I come across your advice for Manish and I wonder if you could check out my blog as well.

    Here’s is my scenario. I created a blog called My Dark Circles Blog 3 months ago and I’m targeting “dark circle under eyes” keywords.

    I’ve tried to include these “keywords” in most of my posts, including title and mention the keywords few times in the content. Yet my blog is no where to be found in top 50 google search results.

    I’ve also try to validate my page though http://validator.w3.org/ but fail to validate. I hope you can give some advice about my blog.

    ( I wonder if my blog file size is too big and take long time to load that affect the SEO?)

    Thanks.

  • 6. Ravi Karandeekar  |  April 14th, 2007 at 6:52 am

    Thanks, reading in your post about the number of errors was some sort of consolation for me. My w3c report says 275 errors! But i don’t know anything about html/xhtml/css. How can i correct the important errors? Yes, i have stated w3c tutorials but how could i survive till i am able to do it own my own?

  • 7. Custom Ecommerce Web Development India  |  January 18th, 2008 at 9:27 am

    To do seo perfectly for a site w3c validation is urgently required because search engine preffer sites code should be error free.

Leave a Comment for SEO and Validation

Required

Required, hidden

RSS Comments Feed RSS Comments Feed

Related posts to SEO and Validation

Page size and keyword density

Having a discussion at alt.internet.search-engines under the thread Re: Links and PR with Borek, and decided I'd put my answer here instead of on the...


Translations

Popular Articles
Categories
Adverts

Tags
Blogroll
Social Network
Meta
Recent Articles