|
Search Engine Optimization Technical TipsThis page contains technical how-to information, tips and advice on how to adjust for server and design issues that may be negatively impacting your search engine optimization efforts. It has long been known (we admit that it may only be based upon coincidence at times) that certain server issues seem to impact search engine ranking and that when "repaired" the ranking significantly improves. This page discusses ways to address those that we feel are going to have a negative impact upon your ranking.
You have multiple domain names pointing to the same content.Usually your first consideration for purchasing multiple domain names may be to cover your product category with similar names and possibly the purchase of commonly misspelled variations. Or you may just want to deprive your competition of some other nice domain names. But you own a bunch of them, and you have had each of them pointed to your main web site. This is usually considered spam because you're trying to index multiple website domain names that point to the same physical content on the server. This is common where DNS services are used to "map" multiple sites to the same server files. Most search engines have the ability to save and check file size and the ability to save and check long text strings. They will notice duplicate content and only index one site while throwing the rest out. How to correct this problem.By using what we refer to as an IP-funnel you can have the domains go to your production site, but without the problems associated with duplicate data for each domain. Below are directions to correct the domain spam problem so that a search engine does not view your multiple sites as deceptive or misleading.
How to move a site to a new hostIf you are moving your site to a new IP address or ISP this procedure will help minimize the downtime and confusion during DNS propagation.
How to configure a 301 Permanently Moved actionIn the above example the IP Funnel page should transfer to the production site via a 301 Permanently Moved action. A 301 is also useful in resolving typographical errors in links from other sites, as well as creating shortcuts. In UNIX you may also choose to use it to handle improper case specifications in URL's. How to correct this problem.In a UNIX/Linux (Apache) environment you would modify the .htaccess file to include the command:
RedirectPermanent /techtips.htm http://www.bruceclay.com/SEO-tech-tips/techtips.htm In a Microsoft IIS environment you would normally open the control panel, select "Home Directory", and select "Redirection to a URL". In both cases you would be well advised to include a custom 404 page. The 301 Moved action (where you do not specify a from page name as in the "/" example above) causes http://www.old-domain.com/xyz.htm to be sent to http://www.new-domain.com/xyz.htm -- specifically requiring that the page names match or a 404 will result. It is possible to intercept and alter the 404 to be a 301 thru programming in IIS or Apache.
How to configure a 404 File Not Found actionIn any case, you might have a case where the page name is invalid and instead of having a basic page displayed you would like to see a custom page (usually with some navigation). This is easily accomplished in all systems. How to correct this problem.In a UNIX/Linux (Apache) environment you would modify the .htaccess file to include the command(s):
ErrorDocument 404 /404.htm In a Microsoft IIS environment you would normally right-click on the appropriate Web service icon in the IIS Management Console and select Properties. Select Custom Errors then select the 404 error, and then select Edit Properties. In the Message Type list select Edit URL being careful (!) to specify the error recovery page URL.
Tables are causing long navigation lists to be seen as the first contentYour site is developed in "tables" and by the time the search engines encounter your "main content" it is too far down the page for the spider to consider it important. Generally, the text at the top of the page is the more important text, and sometimes site design pushes the main body content hundreds of lines from the top of the page source file. A normal web page usually has a header, a nav bar that is usually on the left side of the page, and the main content is on the right. The search engines usually look at many page attributes, i.e., title, description, and at least the first 200 words of your content after your opening <body> tag. The engines do spider your whole page but if your nav bar lists many products, the search engine may not encounter your main body within the first 200 words. How to correct this problem.This technique is also known as the "table trick". We include our description as an example for our clients. Most of the search engines will read a table a certain way. They will find the opening <table> tag and look for the first "table row" <tr>. They will begin to read each "data set" <td>"data"</td> inside the "table row" from left to right until they find the closing </tr> tag. They will try to keep going until they find the closing </table> tag. They will continue until they have crawled the entire page. Your "main body content" is usually where you would have most of your keyword phrases and the "relevant" body copy that you would want the search engine to index. Knowing that the spider will often try to figure out your "theme" within the first 200 words of your site you would want them to see the relevant text as soon as possible. The table technique will "push" your left-side navigation (etcetera) bar down below your body content and "pull" up your body content so it will usually be within the first 200 words.
The site is frames-based -- What do I do?Search engines are having increased problems with trying to spider a "frames based" site. We believe there will be continued problems and we STRONGLY recommend a redesign to a non-frames environment. Note that the URL included in the frames pages are being indexed instead of your invoking frames page, so if you include content from another site in your frames page you are not getting ANY credit for that content... you are simply causing the other site to get spidered. It does not help you in the least in this case. If the content is from your own site and you need to use frames then there is a solution that will help by re-establishing the frames environment for the site. How to correct this problem.In the interim, adding this to the top of each page included in a frame will result in having that page detect that it is being loaded outside of the frame and it will re-establish the frame around this page:
Server Side Include TipsYour pages are .htm or .html and your server does not recognize your files when you're using SSI (server side includes). OR you are using SSI commands but you are afraid that the .shtml commands are harmful to your search engine optimization efforts. OR your site is entirely .html extensions, but you suddenly want to add SSI commands for tracking or promotional purposes. How to correct this problem.If you're on an Apache (linux) server, your ISP will have to edit the httpd.conf file to include your extensions. They usually know what to do, but the problem is that they will need to restart the apache server which will impact other sites.
If you're on a MS server (IIS) your ISP will have to possibly edit the registry to include your additional file extensions. IIS 6.0 will include a new node that is named Web Server Extensions. Your ISP should be familiar with how to do it. The same problem applies, your ISP has to stop IIS, make the change, and restart IIS. For additional information on MS servers please view the following links to the Microsoft Knowledge Base. IIS 6.0: Definition of Term Web Service Extensions
Oops, your IP is either dirty or virtualAbout 3% of all web sites "own" a private IP number, with the remainder being on virtual, or name-based, servers. Although only 3% are dedicated IP's, we have seen that in many instances well over 90% of the top-50 results in the search engines are sites having dedicated IP numbers. This was so strange that we have repeatedly validated these findings, and have found that switching a site from a virtual IP to a dedicated IP number alone has caused significant ranking increases. Of course, the web is so dynamic that this could be coincidence, but we do not think so. Likewise, we have found that there are "dirty" IP c-blocks, ranges of IP numbers that have been tarnished by spammers and left to be reassigned to unsuspecting sites. If your site is in the range of the spammers IP, then you are equally penalized. We have likewise found instances where simply moving a site has caused the ranking to improve. How to correct this problem.First, check your server for problems: This tool will perform a SOCKET TCP/IP Read for your site, a Request Read, and a Browser-type Get, comparing various header and source data to see if the site is the same and error free for each method.
The purpose of the Check Server Page is to check the configuration of your web server for errors that could create problems for search engines. This information is very helpful since search engines often reduce web site rankings due to web server errors they encounter. At the very least, even if you encounter a common error that will not cause you to be dropped from the index, a "cleaner" site is likely to rank above you in the search engine results if all else is equal. The following is a listing of the common codes produced by web servers:
Second, if there are ANY errors reported then correct them. The tool has a help file describing many remedies for each section of the report. For instance, error with robots.txt means that you do not have one or it is corrupted. Add one to the httpdocs directory (same as your home page) even if it is empty. Dirty IP list, contact your ISP and complain. Redirects, your server may be mis-configured... read the bottom of the report since the remedy is well defined right there.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
![]() |
This site is owned and operated by Bruce Clay, Inc. |