|
|
|
The Unfair Advantage Book on Winning The Search Engine War
The examples we use in this book are real... they are not theory. Most, in fact, have been taken from our own experience with very successful web sites and web pages that we have developed for ourselves and our clients -- as well as other examples of actual circumstances that we have found on the world wide web. Several researchers and web designers within our company have spent thousands of hours accumulating this information over the past three years and update it the first of every month. It is written from a we point of view in an effort to reflect the collective effort that has gone into the original text as well as the continuous updates that keep this book on the cutting edge of today's competitive search engine positioning war. We mention this so that you do not underestimate the degree of time and ongoing research that continues in order to produce for you this valuable, effective, and up to date source of information. Please realize there is no such thing as a magic silver bullet that will vault you to the top of the search engines... if there was, everyone would know it, they would do it, and it would cease to work because of saturation. Fortunately for you, this is not the case. Instead, success on the Internet comes from doing many tiny little things exactly right... and the process is an ever changing science that this book and our Newsletter updates continuously reveal. If you wish to be successful on the Internet, expect to spend some time learning the secrets contained in this book. The rewards will be worth it... as you will become armed with an arsenal of tools that will leave your competition in the dust. Be sure to read the book completely before you start constructing or making changes to your web site and be aware that you may already know some of these techniques... especially in the beginning because we start with the most basic and then move on to the more complex techniques. If you already know the basics, your knowledge will give you an added advantage because you'll already have that experience to build on. This will enable you to comprehend the value of the more subtle, yet most valuable refinements contained in this book - as well as give you a foundation on which to build this additional knowledge and expertise. Rest assured, you will learn many tricks and techniques that you did not know... By keeping an open and intuitive mind you will be more likely to find those subtle changes that will make a HUGE difference in your web site's positioning on the Search Engines. On the other hand, if everything in this book is new to you... that's ok too. We are starting from the beginning with the basics. Assuming you are comfortable with your computer and have some idea of what the Internet is about... and aren't afraid to experiment with HTML documents (web pages) you will succeed... And... even if you are an Internet Beginner this stuff will make a tremendous degree of sense to you once you begin to familiarize yourself with the Internet. In fact, you'll even have a leg up on most of the so-called pros. This book, in actuality, is divided into three sections. Section One - Chapters One through Five - general section. In Chapters One through Five you will learn all of the known tricks and strategies that have been used with search engines over the past three years. This is important because all of them, in one form or another, still appear on pages listed with virtually every SE. It is also important because some of them can cause your pages to be penalized or even banned from certain SE's. Section Two - Chapter 6 - we get into the specifics of what is currently working on each of the 8 major search engines. Section Three - Chapters 7 and 8 - talks about the best search engine software tools as well as gives you tips on web design aspects such as frame style web pages, java script, and cgi generated pages -- as well as some parting information that we feel is important enough to include in this book.
Chapter One - Titles & Keywords
Although this is the most basic information.... It is impossible to exaggerate the importance of your web page's <TITLE> tag content in designing your web site. The first rule to remember is: Reason #1: Most SE's look first for keywords that are contained in the Title tags of your web page. Reason #2: Remember that you are also attempting to appeal to a person who is seeking your information, product or service in addition to attempting to appeal to the search engine (SE) indexes. When a search is requested through any search engine, the SE looks for the search word(s) contained within the <TITLE> tags first -- and usually gives preference to web pages that have that specific word(s) within the <TITLE></TITLE> tags. Therefore, it usually imperative that you insert the search words (AKA, keywords) that your potential customers are likely to use when looking for your service between the <TITLE> your title here </TITLE> tags in your HTML code. For example, if you own a Bed & Breakfast in Hanalei Bay, Hawaii - Island of Kauai that is called Kiluhana Inn do not use that as your title. If you do, your business will be handicapped in a search and buried by knowledgeable competition.
A better title would be:
Here's why: Reason #2. Some search engines and many directories give slight priority to alphabetically correct web pages. Although this is a secondary consideration, whenever possible, try to start your <TITLE> with a letter that starts early in the alphabet.... and of course whenever possible an A. To summarize: Make sure every word in your title is one that is likely to be used by a person when doing a keyword search for your business or service. AND... unless the name of your business is prominently recognized -- something like KODAK Film, it does NOT belong in the Title tags. Secret: On some engines, it may be worthwhile to use multiple <title>'s in your page to try to increase the relevancy. There are also many examples of people using very long titles to achieve the same thing. Here is the format for that trick, but be aware, you should only use this technique if you find it working for other pages that you are competing against for top positioning. <title>Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii</title><title>Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii</title><title>Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii Bed & Breakfast Kauai - Hanalei Bay & Beach - Hawaii</title> If you use this, put everything all on ONE line if your text editor will allow it. TIP: Some HTML programs such as Microsoft's FrontPage will put your <title> tags AFTER your <meta> tags. You don't want this to happen because your results with the engines will be much better if your <title> tag appears directly after the <head> tag on your page. For example: <html><head><title>Put your title here</title>. FrontPage 97 and earlier versions will attempt to put the tags back in the old order the next time you edit the document, so you may want keep an eye on this. Frontpage 98, however has corrected this problem. Warning - Extremely long titles may cause some browsers to crash and will be difficult for your visitors to use in their bookmarks. Also, most of the engines will quit reading after a set amount of characters are reached. Next step: BEFORE you design your web page (site), make a list of every possible search word (keyword) and phrase (keyphrase) that your potential customers might use when looking for your information, product or service. Don't waste your time starting a web site until you do this. It is critical. Don't forget to include the common misspellings of your keywords and synonyms and pay particular attention to noun phrases. Next, use these search words and phrases to find your competition on the top eight search engines. Once you find who you will be competing with for the Top Ten positions in your keyword search, scour THEIR web pages for more search words that you may have overlooked ...by the way, these are called keywords and from here on, we will refer to them as such. Tip: If you want to be in the Top Ten of a search engine, find out who is on the Top Ten of each of these SE's and then build a better page. Later we will show you how to analyze your competition and then make your page go one better . Simply put, you must work these keywords into your body text according to the specifications of each search engine.
For example, if your service is a location sensitive offering, then be sure to mention the location in the text at every opportunity... For instance, if your motel is in the city of Port Angeles... a normal sentence might read: Here's an example of some keywords that we put into a client's page that vaulted him to the top of many Hawaii specific B&B searches:
Example A
Example B Secret #1 -- Some SE's will favorably place repeating keyword text and some SE's will ignore it or, worse, penalize you for it. In Example A we used a relatively normal statement type sentence and jammed it with keywords that a human SE quality control inspector would not be too likely to object to... (as in the case of YAHOO!) nor would a programmed SE computer ignore (as in the case of AltaVista) .... In Example B we simply repeated the main keyword phrases. Our research has shown that some SE's are programmed to ignore, or penalize, keyword text that is repeated within a tagged section on your page. (Notice that in this case, these keywords are enclosed in <H6></H6> tags). In Internet terms, this technique is called spamdexing or keyword stuffing . More about the ethics of this later... suffice it to say for now that there are acceptable and unacceptable ways to do this... and, remember that your business survival on the Internet may depend on you using this technique in an acceptable fashion with certain SE's. Secret #2 -- Here's a trick that you need to know about because, sooner or later, you will see it -- and unless you know about it you may become confused about what is going on. The trick is referred to as invisible text and is called such because the actual text is invisible to the visitor to your site... even though the SE sees it and indexes the invisible words.
A typical example of invisible text is often found at the top or bottom of a web page. The source code can look something like this...
A word about Keyword Placement Therefore, be sure to Headline your page with text that the search engines can recognize as relevant to what your are offering... AND do it early in your page before the images whenever possible. Remember, it is these keywords (search words) and key phrases through which people will find you on the Internet. We will refer to these <TITLE> & text keywords frequently throughout this book. However, for now, suffice it to say that you should apply the correct keyword density within the your web page as well as in your <TITLE> if you expect to be found in the top ten list of any given SE search. Since this is SO critical, be sure to spend the crucial time needed to select your keywords carefully. Pick words that you feel others would use if they were looking for your service or product. Get online and search (using your keywords) the SE's for the sites that come up in the Top Ten; ...and be sure to keep careful notes as to the results. Later we will show you how to use your competition's web pages as tools to build yourself a better page. Remember that when you submit your web page(s) to the SE's, you will need to put the best keywords first, the second best next and then down the line with the rest. Also remember, your title must be relevant to these keywords and your web site has to match both the content between your <TITLE> tags as well as the keywords that you're using in your text. Another concept for you to grasp is Keyword Density . This is the number of times your keyword(s) appear in relation to the other words on your web page. For instance, if your page only had one word of text, say... Chicago , the keyword density would be 100%. If on the other hand the only text on your page was Eat at Chicago's finest seafood restaurant Then the Keyword Density of Chicago would be 20% because each word on the page represents 1/5th of the entire text, or 20% (SE's ignore common words such as the, at, of, etc.) Theoretically, your page that said only Chicago verses the Eat at Chicago's... page would be given a higher rating due to a higher keyword density. Although you can see this is a simplistic example, you get the idea I'm sure. Here's the point. When designing your web site, be aware that keyword density can play a MAJOR factor in regards to how well (or bad) your page is represented by the SE's. It can be to your advantage to develop entry pages to your web site that are ethically and appropriately loaded with the correct density of carefully selected and intelligently assembled keywords... then use these entry pages as a portal to your other pages containing the rest of the information you want your site visitors to access. In Chapter 6 we specify to what degree each specific SE is weighting keyword density and the range of keyword density mix that is optimum for each SE that is factoring in this ingredient. In Chapter 7 we will point you toward the best software program we have found for calculating keyword density -- a must for any serious webmaster. Caution: Do not use stop words or dead weight words in Titles: a keyword or key phrase that has become so common on the Internet that search engines either ignore it or return hardly relevant results when they are used. Some examples include Homepage, Home Page on the WWW, web, webpage, and in some cases, even the word sex. For instance, if you are in the business of webpage development the TITLE -- Webpage Design and Development -- will more likely land your pages in a design or development search than in one for webpage services. Your efforts would be far better served if you find a niche group of businesses that you wish to market to and then design your web services page to land into their keyword search! ...because, simply put, it is almost pointless to attempt to score high in a stop word keyword category. Here are some obvious stop words: the, of, that, is, to, etc.. In addition to titles, you should also be aware that SE's tend to ignore these certain frequent and/or recurring words that commonly appear on web page text in order to save storage space or to speed up searches. If your business happens to be in a stop word category, we suggest you wrap your keywords in quotation marks and/or use capitalization or uppercase letters as such words are more readily findable through the SE's.
For the novice we must explain that Meta Tags are undisplayed text written into your HTML document intended to describe your page to the SE for the purpose of cataloging the content or description of your page. There is some debate as to whether or not they should be included in your HTML document since some SE's openly state that they ignore <META> tags. However, since several Search Engines claim to, and do , support them, our view is that they should be included in your HTML document... however you must be careful how you use them. The trick is to make them useful for the SE's that support them while avoid being penalized by the SE's that do not support them. For instance, AltaVista, and HotBot use <META> tags when they retrieve their search results. In some cases, the SE will use your <META> description as the summary for your web page. Regardless, we know of no SE that will penalize you for using <META> tags as long as you use them properly. However, be careful that you do not repeat your keywords too many times. In some cases, one to three times is the limit. (For better specifics, see Part Two, Chapter Six pertaining to each individual SE because there are exceptions). There is no clear standard to this rule, the best we can tell you is to study the individual SE recommendations that we outline in Part Two, Chapter Six
Here is an example of where your <META> tags should be placed in your HTML document and an example of how they might look: In this example you see the <META> tags we used for a client to describe his web site dealing with precisely what the contents suggest. In the <META> description we wrote as many keywords in Plain English sentences as possible and limited the content to just less than 250 characters (counting all spaces and periods). We did this because this description is used by some SE's as the summary for this web site. That means that people will see it... and use it to decide whether or not to click on this link. We limited it to less than 250 characters because SE's seldom, if ever, display more than 250 characters as a summary... and sometimes they display fewer characters.
The reason we offer the <META> tag description as a summary is some SE's will otherwise pull a random paragraph from your text as a summary if there is no <META> description. In some cases that means that the person who is searching could get a nonsense description of a web page... for instance, here is an actual summary example taken from Alta Vista from a page that did not use a <META> description... Now, we are sure that this company did not want click to go home as its summary but that is what they got because they had no <META> description. I'm sure you can see the point. Would you click on a URL with that description?
On the other hand, here is an example of a page using a <META> description that was pulled from Alta Vista using the exact same search In this case, Alta Vista used the first 150 characters of the <META> description and truncated the description... however, this is by far superior to having a summary selected from the page at random. For your information, here is the complete <META> description below. <META name="description" content="Sun Seeker Scuba Diving Charters! Welcome aboard for Scuba Diving or Whale Watching on the Sun Seeker Yacht in Hawaii on the Big Island. Come to Kona for overnight excursions and scuba diving with Paul Warren on his Dive Boat"> As you might, accurately, conclude... you should put the most important part of your description first because the SE may not use all of it, as was the case in the example above. On the other hand, the <META> keywords are seen only by the SE's. Getting back to our previous example above... In this case we were careful to use each word no more than the currently acceptable number of times and we alternated between capitalized and non-capitalized words in order to hedge against going unnoticed if the SE happens to show a preference based upon how the person enters the keyword into the search engine. In addition, the meta keyword tag area is a good place to put misspellings of your keywords that are not found in the body of the page. Contrary to what some people think, simply installing <META> tags is not usually enough to insure top positioning for your web page(s). However, it is useful when the SE uses it as a summary for your page. Using a <META> description gives you some degree of control over how the search engine presents your link to the viewing public. Remember that <META> tags and the keywords in them are used like mini indexes of your web site. Plan to put effort into finding and using words and phrases for your <META> tags that are simple. Be sure to use words and combinations of words that the AVERAGE person would use. Warning: Do not try to fool the search engines with words & phrases that are unrelated to the content of your web site with <META> tags. It is not only unethical, it can work against you because some search engines may refuse to even list your site if you use unrelated keywords or repeat the same keywords within the meta tags too many times. You must be aware that some companies have sued because they found their trademarked words hidden within other sites <META> keyword tags or by using other means to deceive competitor's customers. At this time the lawsuits are not settled, but many people believe this is highly unethical, and at some point in the future perhaps even illegal. Also, keep in mind that too many times is set by them, not this book. In other words, do your homework... check on your competition and see what the successful page designers are doing -- then follow their examples so that you do not get squeezed if (when) the SE's change the rules. Later, we will give you specific <META> tag guidelines for the Top Eight SE's. For now, you have simple, yet precise, info on how to use them effectively for the SE's that support <META> tags -- while reducing the chance of being penalized by the SE's that do not support their use. Keep in mind that the rules can change. Therefore, it will be in your best interest to constantly monitor the top ten pages of several keyword specific searches in the various SE's to see if the pages that are coming to the top are using <META> tags. There may come a time when word frequency and repeating keywords more than once could be penalized. Watch for it! ...and if you would like a second opinion as to whether or not your <META> Tag keywords and descriptions are properly written, here is a free utility that will check them for you... go to: http://www.northernwebs.com/set/setsimjr.html and enter your URL's before you register your web pages with the SE's
Chapter Three -- Tricks & Illusions In this Chapter, we will present what we call the Tricks & Illusions of web page design -- techniques that are being used -- or have been used in the past -- to improve positioning on the SE's. Before we continue, allow me to make something perfectly clear. We may not agree with many of them of these so-called tricks. However, if the keyword category that your company is competing in is very competitive, you may need to use some of these techniques if they are working and being used by your competition. Therefore, we feel it is only fair to alert you to every known trick that has ever been used to improve positioning on the search engines. Frankly, in today's search engine climate, many of these techniques can hurt you more than help you... it depends entirely upon which SE you are submitting your web pages to. At the very least, you must know what has been used, and what is being used, on the Internet to improve positioning on the SE's -- and unless someone points these techniques out to you, you will find yourself wondering how certain pages have climbed into the top ten lists... and in other cases, you may wonder why your page was penalized. Later on, in Chapter 6, we explain on a case by case basis when to, and when not to, use the various techniques that we outline in this section. This is a VERY important section, so read carefully. Enough said... here we go behind the doors of Internet Web Page Design... Trick #1 - Keyword spamming and Keyword stuffing a.k.a. spamdexing You probably have already seen this, or at least figured it out, but this technique consists of repeating keyword(s) over and over in text -- usually at the top of the page and/or at the bottom of the page in very small letters, i.e. <font size=1> or headline <H6>. In addition, spamdexing can also be found in some <META> tags and even <TITLES> -- in order to find it however, you will need to check the Source code. In Netscape you can do this by clicking View then Document Source. In Microsoft Explorer you can access the Source code by clicking View and then Source. By the way, these are the only two web browsers that you should be using. You should definitely use the latest versions or else you will be missing what the web really looks like -- and you will not be able to determine what most Internet users are seeing when they view your web pages.
An example of spamdexing could look like this: In the past, some people have gone overboard with it and included totally irrelevant keywords in an effort to appear in every search regardless of the topic. This is stupid! ...and it will no longer work. In fact, chances are that using irrelevant words will sooner or later get your domain banned from most search engines. In addition, repeating keywords over and over on most search engines will cause your pages to be penalized or rejected by the search engines. A more acceptable way to stuff keywords today involves the technique of working them into regular sentences as much as possible up to the point that your keywords and phrases blend to create the correct keyword density mix. Note: On some search engines you may find pages that are ranked high and DO include repeating keywords. As is the case with Alta Vista, these are older pages -- maybe even two years old. The pages have not yet been removed by the search engine but if they were submitted today they would likely be scored low, or rejected altogether or they may get your site banned. Be careful not to emulate very old pages. When given a choice, select the most recently indexed pages to emulate when analyzing your competition for the correct winning formulas. Note: On some pages, if you check carefully, you will see literally hundreds of keywords & phrases printed in visible or even invisible text across the bottom of the web page -- This is called Tail Tagging. It is another way of Keyword stuffing (spamdexing). By doing so, you increase the keyword density of your web page. This is sometimes effective but generally not at this current time. Again, it depends on the search engine. Trick #2 - Invisible or Semi-visible text We have already touched upon this one, however, we will now expand on our explanation and show you how this is sometimes done effectively. If your goal is to add keywords, phrases, or sentences that you want the SE's to see yet be invisible to the actual page viewer -- invisible or semi-visible text will accomplish this for you. Simply put, by setting the paragraph font <font color=#??????> the same or almost the same color, as the background color <BGCOLOR= #??????>, the text will blend into the background and appear invisible. For instance if you want white, replace the #?????? with #FFFFFF . If you want yellow, use #FFFF66, and so on. To get semi-visible text, you can offset one of your colors a little bit making it invisible to the viewer without letting on to the SE that the text is not visible. For example, set the white background color to <BGCOLOR="#FFFFFF"> then, to make your text slightly off white, use <font color="#FEFEFE" >. For some page designs, you may want to use <background="image.gif"> instead. This trick will work there too. Just keep in mind that if the background image is too big in file size, your text may be visible while the image loads. The two techniques can be combined to prevent this, just remember to keep the value in the <BGCOLOR> tag slightly different than the one in the <font color> tags. Warning: You must be CAREFUL using hidden text some engines, for example, may automatically reject any pages that it finds having the same background color as the font color . Google (and in some cases AltaVista) will also reject pages and possibly ban sites that have ANY hidden text IF one of their human Spam Task Force members view them manually regardless of how close the background color is! For a complete color chart check the help contents of your HTML editing program or go to: http://tanega.com/java/color3.html and there you will find a very handy and easy to use HTML color chart complete with HEX codes. If you want to see if one of your competitors is using this technique, hold down the left mouse button and drag it along the top or bottom of the web page. If invisible text is present, dragging the mouse across it with the left button held down will render it visible. Another way to make sure you have viewed everything on the page is to click your browser's edit/select all function and all of the text will be selected.
A pointer page is a page that uses keyword density or other techniques to score high in a given search and has one or more Links that all point to your main page. Here is an example... In this page there are only four viewable words... they are; Sailing Kauai, Sail Hawaii and this page links to the client's main page -- giving each word a density rating of 25%, which normally is extremely high... in fact, the word sail actually has a density rating of 50% and you will also notice that the <TITLE> matches the text exactly, thereby increasing all the more that this page will score high in a search for sail hawaii or sail kauai. The idea behind a pointer page is to optimize different techniques for a specific search engine or a specific keyword that you are working on. You will likely need to make multiple pointer pages to optimize your results. For example, http://www.mysite.com/page1.html, http://www.mysite.com/page2.html and so on. Please refer to Chapter 6 for details on each specific engine. NOTE: if you put images into a pointer page, you must make them tiny (file byte size wise, that is). In this case the images that we included in this page totaled less than 6K. If you put images that are any larger, the page will take too long to load and you will likely alienate your customer (by making them wait) before they even see your main page. Here's a trick that most search engines hate and we do not like to use, however, it is has been successfully used in the past and you might encounter it. We will therefore explain what it is. If you ever click on a link and notice a page loading that automatically then loads another page (without any action from you) you have encountered a redirect page. The redirect page may score high on a search because, again, it takes advantage of the correct mixture of keyword density.
In any case, the HTML code contains the line... Since this page has only one viewable text word Hawaii, the keyword density is 100% Hawaii. Hopefully you will not need to resort to tactics such as this. However, if any particular SE allows it -- and your competitor uses it -- you may need to design a redirect page out of self defense. You can do so by copying the HTML code above and simply replace the info on the page with your own info -- then register your redirect page(s) with the appropriate Search Engine that is allowing it. Most search engines will not allow you to submit a page with a meta refresh redirect, so make sure and read the details in Chapter 6 on each engine. Trick #5 - Make your keywords work double duty Here is a standard technique that you can use to increase the effectiveness of some of your keywords. Whenever appropriate, add an s to the end of the word. For example, if you are in the book business, and one of your key phrases is book sales, then add an s to the end of book. By doing so, your page will be found for both book and books - since, obviously, some people will use the plural when searching while others will use the singular. In any case, your page will qualify to be found by the SE. The same goes for words such as beach / beaches etc. Trick #6 - Program your site to be found even when people are looking for something else OK, here's an area where you need to use some good judgment.
Let's suppose that you own a Motel in Port Angeles, Wash... Nearby is the Olympic National Park, the Sol Duc Hot Springs, the ferryboat to Victoria, hiking, fishing, etc. You should program your page to come up high on a search for the following keyword searches. Here's why. People who are looking for information on these activities are likely to need lodging close by. If your Motel happens to be the gateway to other services, then most people will not object if they find you while looking for info on the other guys. Again, this is a real life example -- this is precisely what we did for a client recently and guess what. The Hill Haus Motel comes up in the top 10 on many of the above searches on many engines. Now that you know how to do this... Please, please, please, do not attempt to put unrelated, irrelevant keywords into your pages in an attempt to better your position and be careful about using other companies' trademarks. Here's a STORY as to why -- Not too long ago, a very aggressive company in Florida sold snow crabs. They quickly became the Kings of spamdexing by repeating every city, state, county, national park & monument, football teams and colleges -- as well as many commonly used words like, sex, nude, pictures, adult, women, software, erotica, gay, naked, etc. -- into their pages and repeated each word literally hundreds of times. The result -- they showed up on almost every search on several SE's. To make matters worse, they had at least 50 different URL's... so all of their pages monopolized the first 30 or so positions on any given search. Did it work? Sort of -- it got them to the top of the search engines but since people were not looking for that kind of info, it created a backlash toward the company. Eventually, they disappeared from the SE's... they were either kicked out or their technique did not produce significant sales. The point is... be appropriate when you program your HTML documents (web pages). After all, we are giving you The Keys to the Vault and you will find it unnecessary to flagrantly cheat in order to get noticed when you appropriately apply the information you are learning here. OK? Trick #7 - Using the <!...comment line> You may or may not know this, but anything that you put into a tag that starts with an " ! " (exclamation point) is invisible to the viewer. This is called a comment tag and it looks like this <!-- your comment here -->
This is an ideal place to put additional hidden keywords into your document to increase the page's keyword density. Here's an example: This trick has worked in the past, however it does not currently work on Excite, AltaVista, HotBot, WebCrawler and Lycos at this time -- but some of the other, less popular, engines may make use of it and we mention it so that you will recognize the technique should you happen across it. It is possible that you may find a page description on an Excite search that appears to show it has indexed either the comments tag information or the JavaScript code imbedded in the web page. At one time, Excite had a bug that sometimes allowed pages to score highly with this technique. Supposedly, that bug is now fixed and any such pages that remain are likely left over from a time gone past when this technique was working. Trick #8 - ASCII, Numerical, and Alphabetical text order The top Search Engines no longer consider ASCII, numerical and alphabetical listings with the same importance as in the past -- especially since Yahoo quit listing alphabetically as their default method of choosing which pages received top billing -- however some of the minor SE's may still give preference to this type of order. If ever you should find this to be the case, choose a web page <TITLE> that starts with an A or better. Better than an A you say? Yes... as a matter of fact there are many ASCII characters and numbers that are much better than an A. When alphanumerical order is used, ASCII characters and numbers appear before alphabetical letters.
Here's the normally accepted order... Therefore something like...... !!#1-A -- should be listed before anything that starts with an A . However, remember that you will want a person to click on your link... therefore you will have to decide what works best from a computer vs. person compromise perspective. Something like...... "A-1" Hot Tubs could be very effective, considering that " (quotation mark) is second only to ! (exclamation point) on the ASCII chart. Keep in mind that search engines are getting smarter and site reviewers are aware of the alphabetical tricks. If you use a <TITLE> that is alphabetically privileged, then you had better make your web page reflect claim to such a <TITLE>. Trick #9 - Turn Images from liabilities to ASSETS with the SE's The problem with images in your web page is that the SE's do not index them at all. Therefore, your Logo may say what you are, who you are, and even state a benefit but the SE's don't show that. In fact, if your image loads higher on your page than your text... your page is automatically handicapped in a BIG way!!!!! This is a mistake. Do not load images higher on your page than keyword text for reasons that we have stated previously in this book.
However, when you do put images in your page, make them work for you, not against you. Here's how. Always include the <ALT="here is a list of my keywords"> in your <IMG SRC="image"> tags. Here's an example of one that we have used. The reason for doing this is because some SE's (at this time only Lycos, AltaVista and Google) index the <ALT="keywords"> -- by using the <IMG ALT> tag to contain your keywords, you will give your web page additional keyword / phrase help. In many cases it will improve the position of your web page(s) on the SE's -- and it is a simple thing to do. Trick #10 - The secret of the Phantom Pixel Now that you know how to use the <IMG ALT=> to your advantage, We'll tell you about one of the sneakiest tricks we have found on a web page.... the phantom pixel. Suppose you had an image that was so small it could not be seen. Imagine an image that is only one pixel square... tiny -- and that single pixel is WHITE against a WHITE background.
What's the point you ask?
Wrong. Now, let me ask you another question. What if you loaded the <IMG ALT=> with TONS & tons of keywords and phrases? Trick #11 - The ole Bait & Switch Technique This technique has been very popular. If you have ever seen a page that is in the Top 10 but you can find no reason whatsoever in the source code* for the page to have earned a Top 10 position, then you may have found a page that is using a bait & switch approach to scoring high in that particular search engine. Here is how it is done. A page is submitted to the search engine that is designed specifically to appeal to that particular search engine. In some cases, it is even a nonsense page that is laden with all of the components that a search engine robot looks for in a specific keyword/keyphrase category. Then, once the page is indexed and linked by the search engine, the page is switched to the REAL page that the company wants you to see. We have heard of instances where competitors have helped out their competition by resubmitting the competition's switched page -- and in such cases the page will fall like a stone in the ratings. Remember, we are not advocating such tactics... we are only reporting what is taking place in the competition for SE positioning. Trick #12 - The Food Technique Here is what we feel is one of the sneakiest and most complicated trick of all. Basically, the Food Technique is a sophisticated variation of the bait & switch... with one very important difference. Here is how this works. When a SE robot comes visiting to index a website they first identify themselves with a calling card and an IP number. Some companies have gone to the trouble of identifying each of the IP numbers of certain SE robots and when a certain robot, like AltaVista, comes calling they feed the robot the page that they want that robot to see (by using a server side CGI script). However, when a normal visitor clicks on a link to visit the page, the CGI script dishes up the consumer version of the page... the real page. When you encounter this technique you will see nothing in the source code that indicates why the page is ranked high on the search. In addition, if you were to help the company by resubmitting their page it would be unlikely that your efforts would have any significant effect on the page's positioning... in other words, the page would most likely remain somewhere high in the rating. The bad news is that unless you have the server side resources and available expertise to write such a CGI script in order to accomplish this magic you are unlikely to be able to match this particular slight of hand. The good news is that this technique is mostly being used in only the extremely competitive categories. In addition, you can still be successful provided that you apply the successful formulas in this book to your real pages up front. Also, please keep in mind that one reason we explain this technique is so that you will not think yourself crazy in the event that you encounter a website that is highly rated with no visible means of having earned such a rating in the viewable source code. Trick #13 - The JavaScript Redirect Technique
This trick is closely related to Trick #12, but much easier to identify. It is used primarily on AltaVista because they no longer will index pages that have the <meta> refresh tag in them. What this script does is redirect the browser to a different page once the script on the page loads. The bad part about this script is it won't rank as good as a page that utilizes Trick #12 because of the additional JavaScript code that is enclosed in the page, forcing the text to be indexed lower in the source code. But if you must have a replacement for the <meta> refresh tag, this will work. Note: Search engines really do not like this technique because of abuse in the past and many will view it as spamming. We do not know of any of them that automatically check for it, but because of the location.replace text inside, they could easily ignore pages that have it. If a human editor sees this technique, they will likely remove the page from the index. Trick #14 - The Hidden Form Tag The <input> Hidden type tag is part of a form that is generally used for pages that use Forms to collect information. On occasion you will see some pages using this tag to try and increase the number of hidden keywords in a page. This doesn't work with all SE's. If you want to try this one, attempt to format your keywords in an English sentence to avoid any penalties for spamdexing.
The HTML code is: Note : Make sure you place it outside a real form or it may interfere with your real hidden tags.
There is one form tag that will get indexed that we have been testing with. It isn't actually hidden text, but it does get indexed at on some engines like Excite and Google. The part of the form tag that will get indexed is the text following the form <option> tag. This is the text that you see inside of some of those drop-down forms and menus. Here is the html code for an example: Note: This works for a Perl redirect script, not a JavaScript drop down form. This trick closely resembles Trick #10, the Phantom Pixel but the sole purpose is to get more of your pages indexed into some of the difficult search engines, such as Excite and AltaVista. Several of the deep search engines have begun limiting how many pages you can directly submit to their add url feature or they have limited how many pages you can submit per day. To help explain how this will work to your advantage, first we must discuss how a search engine spider crawls or travels through the web. When a search engine is out crawling the web, it will often start with pages within its own index and check to see if there have been any updates made to those pages. When the engine's spider visits a page and it discovers new links, the spider will attempt to index those new link's urls too, and add them to the engine. To make this work to your advantage, it is a good idea to put links to ALL of the pages you want indexed on EVERY page of your site. That way, no matter how the spider finds your site, it is only one link away from any page. The first thing you might say if you have a large site is I don't want to fill all my pages with links!, that's where the hidden pixel trick #10 comes into play. By using the small hidden pixels on your pages, you can add many links without affecting the look of your web pages, and still reap the benefits of getting more pages indexed.
Here is an example on how to put in several hidden links: The above example would only take up 4 pixels of viewable space on your web page. Don't forget, you want to use an image that is 1 pixel wide by 1 pixel tall, and it either needs to be the same color as the background, or transparent and make sure and set the border to 0 so that the blue outline doesn't show up. It's also important to not have any spaces between the >< parts of these links, they could show up on the page as a blue _. We recommend NOT using the height and width attributes for the image tag (width=1) because that could be used by a search engine to ignore links like this. If you do wish to use the height and width attributes for the img tag, make them something other than 1. An added benefit is some engines that measure link popularity will score your pages higher! Link popularity is a term search engines use to measure how popular a web site is based on how many other web pages or sites have links to them. In case your site has a large number of pages, then the above method may not be totally practical. In that case, build a single table of contents or site map file that has links to all of the pages you want indexed on your web site. Then on each of your web pages, put a hidden link, like the example above, to that table of content file. This helps tremendously in getting more pages into engines like Excite and AltaVista!
Chapter Four - How to analyze your competition's pages before you design your own Earlier we mentioned that you must use either the latest versions of Netscape or Microsoft Explorer to browse the World Wide Web -- Forget the reasons, just believe us. If you are serious about making money on the Internet, then you must use professional tools -- like it or not, everything else in the realm of Web Browsers is second rate. That said, we will tell you how to analyze your competition so that you can build yourself a web page (and web site) that goes one better and will significantly improve your chances of making the Top Ten Hit List in your selected search fields.
First, -- be systematic -- start with a single search engine... Then enter, one at a time, each of your keywords and keyword phrases into that SE's search function in order to see what is currently the Top Ten hit list in each particular category. Next... click on the first URL at the top of the list... go to that website. Once their page has loaded try to determine what caused it to land on top. Look for keywords, invisible text, etc... Use the information that you have learned so far to analyze their page. Viewing the Source Code Next... you need to look at the code behind their page. Click View, then click Source. Here you will see if the page contains Meta descriptions, Meta keywords, <IMG ALT=> tags, spamdexing, and so on. If the page is particularly well done (format wise), save the source code to a file. You may want to use it later for a template to build your own page(s). In fact, every time you find a page that works well with a Search Engine, save it in a separate file Be sure to keep track of your research as you go. Perhaps the best way to do this is to open the source code file that you save and write your notes at the top of the HTML document. Keep track of which SE it appeared on top with and what you think caused it to make the top of the list. After you have finished the top ten in one keyword search, go to another keyword search with the same SE and repeat the process. After a while you will notice that some pages keep popping up in a variety of searches. Those are the ones that are well done. They are the ones you should use as a template. You've found the pages to beat and you've captured the code for the ones that are working! ...and you even know why they are working..... Congratulations!
In highly competitive areas, what you see isn't always what the search engine indexed. Some businesses are using the Bait and Switch Trick #11 or the Food Technique Trick#12 and you won't be looking at the correct source code. In those cases, you might find a page that doesn't seem to be using the same number of keywords or format that the other pages seem to have. In that case disregard that page and analyze the pages that appear to be similar to one another and score closely. Here are some clues that often are associated with pages of this type:
If one of these four clues applies to the page you're studying, it's best that you skip that one and analyze a different page. It's important to note that these indications don't mean that those sites are using special tricks, it just means they MIGHT be. Now remember, we are being systematic. Completely analyze one SE before you move on to the other... and then, repeat the process until you have finished all of them. Very soon you will become your own expert on what works and what does not work... because you now know what you are looking at -- as well as what you are looking for!!! NOTE: here's a good tip. While you are doing your research, browse the web with your images turned off. That way the pages will load much faster! ...and if at any time you want to see the images on the page you simply click the images button. With Netscape, you can turn the images off by clicking OPTIONS then make sure the Auto Load Images is un checked (click it to check/uncheck it). Doing this will considerably speed up doing your research. When you want to see the images, simply click the Images button on the top bar of the main screen. This is how we frequently view the Internet and we recommend it. With Microsoft Explorer, click VIEW then OPTIONS then un check show pictures. If you decide that you want to see the pictures, follow the same path and re check show pictures then go back to the main screen and click refresh ....and, yes, Netscape does work easier.
Chapter Five - How to set up several entrances to your Internet Storefront Once you start analyzing your competition, you will find that the true Internet professionals have several ways for you to find them. In other words, they have many web addresses for the same page or similar pages... In essence, they have multiple side doors into their main site.
This is easier to do than you might think. Even if you only have one actual page, you can create the illusion that you have several and get them all listed separately. Here's how. Ok, so now you have four addresses that all go to the same exact page. This will work with some search engines. However, be forewarned that this technique is not something that the searchengines like. It does happen quite often by accident when a search engine spider finds a link and follows it similar to the above example. By the way, this will not work with YAHOO!, so don't even try it. OK, here's a way to get as many URL listings as you can handle. By having your own domain, you can have sub-pages. Design every sub-page as a potential entry page with links to all of your other pages. Just make sure you give each page a different unique <TITLE> AND you need to vary the content. Then register all of the sub-pages along with your main page.
Example:
Note: Avoid naming your pages in what appears to be a consecutive order (like the ones above with the 2,3,4,5), this makes it appear obvious that the pages are copies. Instead name your pointer or entry pages with unique files names like products.html, sample-products.html, items.html etc. OR you can have an UPPER level domain such as: Here the sky is the limit (well, almost). You can have as many listings as you have URL's. It could literally be hundreds if you want and... this way you can design unique pages to fit each individual SE criteria. This can make you extremely successful on the Internet. This technique worked wonders in the past, but now in 2000 the engines are starting to watch for duplicate pages. It’s important that you don’t use exact duplicates of your pages. The titles, meta descriptions and the page content needs to be a little bit different for each one. ( Note: AltaVista does seem to be limiting the number of pages you can have listed to around 400 or so per directory, with total site page numbers somewhere around 5000 (subject to pending changes). Yes, it is a fair amount of work but if you do only a dozen pages you are likely to bury your competition in most cases. Note: The trend we have been seeing over the last couple months is that the search engines are giving higher scores for the main home page (http://www.domain.com). As of April 1st 2000, AltaVista, HotBot, and Excite appear to be doing this. For these search engines you may need to get numerous domain names. Preferably with your keywords in the domain name (www.keyword-keyword.com) and if possible each should have unique IP addresses to score highly in the more competitive categories. Again, when you research your competition, you will find that some of the professionals are already doing this... and your ability to compete on the Internet will depend on your willingness to level the playing field and go one better than your competition. Remember, each page must have its own name, title, URL address and keywords to work... and, believe me, it will work like MAGIC!!!!!
Chapter Six - The In's & Out's of the TOP EIGHT Major Search Engines Note: This section uses many examples of pages that are currently listed at the top of the various Search Engines. Please note that these examples may change from week to week even when the techniques that are being used remain the same. This is due to the fact that the SE's are continually updating their indexes and because new pages will frequently outdo the previous position holders -- and, to a lesser degree, because the Search Engines can change their relevancy sorting without notice. Ok... now that you know the secrets of the very top professionals in this business, let's discuss which search engines, what they are, how they work, and the best way to effectively register your web page(s) with them. First of all, you need to know that over 95% of all people on the World Wide Web (WWW) use only eight search engines. These are the only SE's that you should focus on. However, if you decide you want to register with others, you may... but, we are telling you that your results will probably not be worth the effort it takes to analyze these engines ...because hardly anybody uses them. However, if you decide that we are wrong -- you now have the info you'll need to analyze them and make intelligent choices regarding how to design your page(s) to fit with just about any search index or catalog out there. Suffice it to say, if you do a good job with just these eight , your site will be successful... provided that people are interested in the topic, product, or service that you are offering. For example, our company's very first web site still draws over 5000 visitors per day and we are only doing SOME of the things right -- and we have only registered with these eight SE's. Tip : As you begin the registration process of your web pages / site(s), it will be helpful if you open a text editor and list your URL's. This way you can accurately copy & paste these URL's into the registration forms -- decreasing the chance of typos and mistakes. In addition, as you develop descriptions (some catalogs require them), write them in your text editor first and when they are finished you can copy & paste them into the registration forms. Likewise, as you register your sites... keep a record of dates with each SE, as well as vital info in your text editor. That way you will be able to recall what you registered, with whom, and when. This log will become valuable later when you draw conclusions about how long it took to be indexed by each SE as well as what worked and what did not work with each web page on any particular SE.
As you may have gathered already... here is what we consider to be the TOP EIGHT search engines and Internet catalogs: Now, we will talk about the specifics of each one, where to find them and how to register with them... as well as many shortcuts and tips.
Directory Size: 1,120,808 + sites (source- Open Directory http://www.dmoz.org) To register with the Lycos Open Source Directory, go to the directory you would like to be found in and click on the Add Web Site link at the bottom of the page, or submit at directly at the Open Source Directory -- http://www.dmoz.org. To register with the OLD Lycos Search Engine (T-Rex) which is used for most overseas versions of Lycos, go to -- http://www.lycos.com/addasite.html. To register with the current search engine at Lycos, submit to alltheweb.com at http://www.ussc.alltheweb.com/add_url.php3 The old Lycos (T-Rex) has one of the most friendly submit pages on the Internet. Be sure to read the information on this page. You will also find that Lycos has a very user friendly way to find out if your URL is in their catalog... it is on the same page as the submit page and easy to use. New Subscribers: Play close attention to which search engine your optimizing your page for at Lycos. Most of the information below is specific to the Lycos T-Rex Search engine which at this time (5/31/00) only powers the overseas versions of Lycos. They are now using Fast (alltheweb.com) to power the search results for the US version of Lycos.
What's New
Lycos joins Goto affiliate program The Lycos search results page is now so busy that a search newbie would have a very hard time figuring out what is going on. The mixture of search results from so many different sources gives Lycos the appearance of a confused metacrawler. The only non-paid search results, in most cases, now appear below the fold (i.e., you'll have to scroll down to see them with most screen settings). Lycos deserves some credit however -- at least they give the Goto search results a slightly different appearance than the non-paid search results.
Fast claims worlds largest index Google, however, could likely dispute their claim since they're very close, and possibly larger, than Fast. And since Google updates more often than Fast they may also have the freshest index. http://www.fast.no/
Fast - more pages give better results In the case of the example above, besides getting the #1 listing, we also got #3, #4, #7 and #30 (it was an honest accident). In our analysis we found that the most significant factor was that each of these pages happened to be heavily crosslinked. This indicates to us that the number of links to your main pages, even from your own site, do in fact make quite a substantial difference. By the way, we noticed a similar effect at AltaVista this month as well. The more pages you have indexed, the better the sites appears to rank.
Fast powers Lycos results
The more generic a keyword that you're working with, the more important it is to be listed within the Direct Hit Popular Results or within the ODP Directory. If your keywords or keyphrases are more unique, then it's also important to be listed in the Fast engine.
What does Fast index?
Fast doesn't index - Fast doesn't support the meta description tag, instead it will display approximately the first 255 characters that it finds at the top of your document in the body text. Fast also appears to be using link popularity as a portion of it's algorithm. We'll be conducting more tests at Fast during June to learn more of the exact details on this engine. To submit to the Fast search engine, go to http://www.ussc.alltheweb.com/add_url.php3 One method of building a page for T-Rex engine powering overseas versions of Lycos. Lycos completely surprised us on the 18th of March. We had changed a few of our doorway pages on several sites in February while doing some maintenance work. To our pleasant surprise that let the Lycos spider to discover some of our VERY old pages that we built back in July of 1998 and, guess what -- they indexed them! Taken alone, that doesn't sound like much of a big deal. However, what was interesting was that those pages were designed with a very high keyword density which worked pretty good back in 1998. Guess what?... the old pages are back in the money again. In fact we are scoring number 1 through 6 for some of our competitive single keywords. Even though this page worked great, your best chance for getting into the Lycos engine is through submitting to http://www.dmoz.org (Open Directory Project). Be sure to read the submission information found on the page at http://www.dmoz.org/add.html.
Keywords in URL'S
The 1% difference If you go to Lycos UK -- http://www.lycos.co.uk -- you can see the scoring percentages. We did some testing with a single unique keyword only appearing once on the page in the title. The page that had the unique keyword at the first of the title consistently scored 1% lower than the one that had it as the third word. Assuming all other factors to be equal, if the keyword is not in the first position your page will likely score 1% higher. Note: This only works on the Web Sites part of Lycos search. The directory and popular result listings don't appear to favor this strategy. Note: New readers - The Lycos search engine that we are talking about is their original search engine called T-Rex. Lycos currently uses the ODP directory database and also has listings from Direct Hit in it’s search results. Search results from T-rex show up after the ODP and Direct Hit listings in the overseas versions of Lycos and are not as important unless your working on a unique keyphrase. This means if you want to get into Lycos with your listings, right now the best way is to submit to ODP at http://www.dmoz.org
Pictures Added to Search Results If you're interested in promoting your site on Lycos, you had better start by submitting to the appropriate category at the Open Source Directory ( http://www.dmoz.org ) and only worry about working on search terms for the Lycos engine if they are not found in the ODP Categories.
Lycos' Top 10 Web Page Search is powered by Direct Hit After three months of testing we have learned that it is indeed beneficial to submit your site to directly to Direct Hit by using the add URL page at http://www.directhit.com/util/addurl.html. This may actually be a faster way of getting your site listed at Lycos than submitting to the Lycos search engine because Direct Hit updates much faster. The pages we've submitted have only gained position in the top 10 while none of them have slipped lower. Submitting directly to Direct Hit may become even more important as they launch their own private label search engine service similar to Inktomi's. At present, Direct Hit is staffing up in what appears to be preparation for an IPO in the not to distant future. Look for them to become an increasingly important player in the search engine game.
Overview Since the search engine results only show up after all of the Open Source Directory links, we don’t recommend spending much time optimizing your pages for the Lycos Search Engine. That is…unless the keyphrase or keyword that you are working on does not show up in the Open Source Directory - only then should you spend the time on optimizing for the search engine itself. This is a similar strategy that we recommend for Yahoo - which is the model that Lycos has based their last change. To identify whether you’re looking at the Directory or the Search Engine results, they are separately identified using the Yahoo terminology. Directory results are labeled in the top of page headings as Web Sites and search engine results are labeled as Web Pages. Note that at this point in time (3/30/00), the overseas versions of Lycos, like Lycos UK are still using the Lycos search engine without the Open Source Directory results. If you’re working in one of these markets, then by all means work on optimizing your pages for the search engine.
ODP listings at Lycos Pages from the Lycos search engine are now listed below the directory listings, or on subsequent pages under the heading of Web Pages. The actual search engine listings still appear to be old, even though Lycos was spidering heavily during the month of April. Wired news Article on the change - http://www.wired.com/news/news/business/story/19164.html So far, this change to the Open Source Directory has not been implemented on the other versions of Lycos - such as http://www.lycos.co.uk If the search phrase matches one of the top level categories in the Open Source Directory, Lycos will display first the top level categories, then the Web Site listings, and at the very last the search engine listings. Our recommendations are to concentrate on the Open Source Directory for your primary keywords and keyphrases. Unless the search term is unique and not found in the Open Source Directory, you'll not likely get much traffic from even a top 10 listings in the search engine itself. To add your site to the Open Source Directory, do a search for your best keywords. Lycos has a link called Add WebSite at the bottom of the directory results. This is a worthwhile effort as the Open Directory is now used by Netscape Netcenter, HotBot and the Dogpile Meta Search Engine. Keep in mind that the Open Source Directory Editors are volunteers, and will vary in their opinions on what is to be added to their categories. In some cases they may even be your competitors! But, that's ok -- You can also be an editor, signup at http://www.dmoz.org/about.html Note: Do not submit your pages to more than one site that uses ODP, we suggest submitting directly to ODP in the appropriate category.
Lycos T-Rex (UK) Search Engine Only Recomendations
Also you may wish to experiment with different machine names than www for optimum ranking. In case you are wondering, here is an example of a machine name Many top10 pages on Lycos are using the following techniques. They utilize very simple leader pages as gateways to the site's main page. They have the keyword in the <title>, once in the <h1> tag at the very top of the page, and very little other text on the page, with perhaps one more occurrence of the keyword within the body text. These pages are only about 1-2k in size. Keeping the html file size small with this method is important. After we believe that it became more important to have one of the keywords or keyphrases towards the bottom of the page within a hyperlink .
Recommendations Here's another technique that is being utilized successfully. Some pages are listed TWICE under slightly different URL's. Lycos accepts http://www.yourserver.com and http://yourserver.com as TWO different sites. Please understand, these are not copies of the same page... they are exactly the same page; just different versions of the URL address. This is proof that submitting multiple URL address versions of a Lycos friendly web page can earn you multiple positions with virtually no extra effort. The specific keyword used in a search did not necessarily need to be in the Title tag. Lycos is obviously recognizing synonyms. In other words, the word hawaii might pull up something with kauai in the title and the word accommodations would pull pages with condo in the title.
Technique Summary for Lycos T-Rex And, as with most of the other SE's, one of the best pieces of advise that we can offer is to carefully analyze the search topic [keyword(s) / phrase(s)] that you want to appear in and closely study the HTML codes of the top pages in that particular search. Next, use the page code for a template and, by inserting your own information you are likely to be competitive in a Lycos search in that particular category.
AltaVista - http://www.altavista.com
Indexed Pages: Approx. 150 Million + To submit a page to Alta Vista, go to: http://www.altavista.com/av/content/addurl.htm Note: The submit form is at the bottom of the page, careful not to be fooled by the advertising banners that sometimes appear to be a submit form.
What's New
AltaVista adds GoTo sponsored links
LookSmart Listings now affecting AltaVista Search Since LookSmart is now allowing up to 5 URL's from the same domain (at $199 each.) it may be worth getting multiple pages listed at LookSmart for each of your primary keywords. Based on how those listings are now affecting rankings at AltaVista and other LookSmart partners like MSN, if LookSmart will actually put your keywords in the title it might be very worthwhile. For this amount of money, we would suggest calling the submission in at (877) 512-5665 to make sure the editor understands what keywords you are looking to get into those titles. For more than 5 URL's, LookSmart has a special Subsite Listings program which might offer some discounts for more URL's. Basic Submission of one URL is now $99 (8 week lead time) and Express Submit runs $199 (48 hour lead time). Note that LookSmart frequently runs discounts on their Express submit package, so get on their email list and then watch for email notification of their specials. Before you submit your site to LookSmart, spend some time thinking about how the LookSmart title is going to affect your search results at different engines such as AltaVista, MSN, Iwon etc. Because you don't have complete control over what LookSmart selects as your title, it's possible that a LookSmart listings could hurt you more than help you. If LookSmart fails to include your keywords in the title, your home page may actually rank worse in these other engines than it did without the LookSmart listing. You would be well advised to consider using an alternate domain name especially for LookSmart to avoid this possibility.
AltaVista Pushing LookSmart Directory
AltaVista focusing on least popular of two word phrases For example, do a search for space elephants. You'll see the top page is www.fourelephants.com , which is also the top page for the single search phrase elephants. Our assumption is that when you see AltaVista ignoring a word in a phrase like this, there probably isn't a term vector for it and the engine is going into some kind of default mode. On the most basic level, term vector means groups of links about specific topics. Otherwise, to create a term vector for space elephants you would need to create some web sites about that topic and crosslink them. Next, you'd have to get some of the sites listed in Yahoo, ODP and the other directories. At some point AltaVista will discover them and learn about the topics they focus on and create a term vector for that phrase. Then when trying a space elephant search, those pages should show up -- theoretically replacing the page that's currently ranking tops on that particular search.
Subpages getting dropped As we've mentioned in earlier issues, AltaVista has a dupe detector which we believe may be catching these pages in its claws -- even though they're not actually duplicate pages. The indications are that subpages must now display a degree of file size randomness and content in order to remain solidly in AV's main index. Another possibility is that AV is dropping the pages because they appear to not be focused on a specific topic or theme. Keep in mind this is only a theory -- but also remember that focused topic specific pages tend to do better on search engines in general. In any case, designing your pages with this in mind will tend to help you and certainly won't hurt you regardless of why AV is dropping pages. AltaVista Launches Raging Search http://www.ragingsearch.com The big difference is that this version is free of ads and none of the typical portal clutter found on most engines these days.. While this may look like AltaVista's response to Google's recent popularity, it's more likely intended to promote sales of their search engine software technology, Search Engine 3.0, and related services. We think Raging Search will serve as a test bed for new technologies yet to be implemented. It's logical to assume their search engine technology and services have major revenue stream possibilities that they will be taking advantage of in the future.
AltaVistas tight spam policy AltaVista has implemented a very tight spam filter in their add url system. This system frequently (improperly) labels many pages or sites as spam. We've been attempting to reverse engineer their filter in an effort to figure out exactly what they are classifying as spam. We believe we are in the ballpark now and hopefully these tips will help you out and even squelch some of the urban myths that are spreading around. This submission filter DOES NOT reject sites for spamming based on any one of these individual items alone:
This submission filter will very likely trigger a human review of the page or automatically give you a too many urls message if:
AltaVista's new filtering system is quite aggressive. And, as with all SE circumstances on the net, it's subject to change without notice. If you exercise patience and focus on building highly optimized pages with relevant content -- while avoiding the don'ts we mention above -- you are unlikely to run into trouble with AV's spam police.
AltaVista overcomes rejection of question mark in URL's So far, we don't know if AltaVista will spider a link containing such characters on it's own. We believe they are accepting them only when they are submitted manually.
AltaVista's Help Page Reveals Interesting Information The new help pages that you should review are at... http://doc.altavista.com/adv_search/ast_haw_index.shtml
Here's some highlights... This means the more links to your site the better your chances that Scooter, AltaVista's crawler, will find it on it's own -- without you submitting it. It doesn't mean that you have to have a popular site to get indexed, just that a large number of links to your site increases the statistical probability of Scooter finding your pages on it's own. On page http://doc.altavista.com/adv_search/ast_haw_titles.shtml we found the following statement... In the ranking rules that determine which pages will appear near the top of a list of matches, the HTML title is the most important element of the page. Even though we've already told you how important titles are we have to admit it's darn nice to see AltaVista confirm it in writing -- thereby eliminating any doubt. And finally, we actually have a search engine that defines exactly what they see as Spam Some barriers to being indexed are due to the misbehavior of a handful of webmasters who have tried to fool search engines into ranking their pages high on lists of matches and including them as matches to queries they aren't appropriate for . This is one kind of behavior is known as spamming. Spamming degrades the value of the index and is a nuisance for all. Irrelevant content (spam) is the number one problem that all search engines are faced with. If you push that button you will have problems. The logic that leads people to try such tricks is rather bizarre. I figure everybody searches for the word sex. I don't have any sex at my site, but I want people to stumble across my site. So I'm going to put the word sex, three thousand times as comments. And any time that anybody searches for sex, my pages will show up first. People have actually tried that. They have tried doing the same kind of thing in the backgrounds of their Web pages. They have also created page after page of text that is in the same color as the background color so visitors won't see the words, but search engine crawlers will. They have tried everything imaginable to fool search engines. If being found via search engines is important to your business, be very careful about where you have your pages hosted . If the hosting service also hosts spammers and pornographers, you could wind up being penalized or excluded simply because the underlying IP address for that service is the same for all the virtual domains it includes. ( The information above was found at http://doc.altavista.com/adv_search/ast_haw_spam.shtml) That last very informative paragraph tells you why you must have your own unique IP address for each of your domains. You should never settle for a virtual domain that shares IP addresses. Doing so will most certainly squelch your ability to succeed at getting top listings on the search engines. Additionally, you should also make sure the IP numbers you are using wasn't previously banned by a SE before you started using it. Although unlikely, it is possible that when (if) someone else had a domain on it they may have played games with the engines, got banned and then released the tainted IP number(s) back into the pool. Admittedly it may be difficult to find out if you have a bad set of IP numbers. Regardless, you should be aware that the previous use of the IP could be a problem. If you find yourself being ignored by the engines you can send a very polite message stating your suspicion and explaining your situation. Chances are such a request will be unique enough that it will stimulate a response -- maybe even open up a dialogue and help get you listed.
The following page may have the most important clue of all - Content counts; content near the top of a page counts more than content at the end. (Note: we've found that this statement isn't always true at AltaVista, depending on what algorithm they are using at the time, the text at the bottom of the page or the middle of the page may be the most important.) In particular, the HTML title and the first few lines of text are the most important part of your pages. If the words and phrases that match a query happen to appear in the HTML title or first lines of text of one of your pages, chances are very good that that page will appear high in the list of search results. Say you want to put your resume on the Web. Keep this rule in mind: Don't put your name first. You aren't trying to be found by people who already know you. You want to be found by people who have never heard of you. So don't waste any letter in the HTML title on your own name. The first word should be resume. After that, list your main qualifications and the kinds of jobs that you are looking for. Put the same kinds of things in the first lines of text . That's what will come up as the default as the description in match list, and it's also an important position for ranking. And the most important thing they wrote... Above all, remember that AltaVista does not reward web pages that practice useless repetition; AltaVista only counts each unique word twice. Source: http://doc.altavista.com/adv_search/ast_haw_query.shtml This is a very, very important statement that, for us, brings up even more questions. We've stated for some time that short pages seem to work best at AltaVista and to only repeat the words a few times, once in the title, once in the <h1> tag and if possible a link with the keyword in it. This statement suggests that AltaVista may be ignoring one of those words in the body. The final page in the How AltaVista Works is very insightful and basically tells you exactly how to score on this engine... AltaVista bases its ranking on both static factors (a computation of the value of a page independent of any particular query) and query-dependent factors. It values:
These static factors are recomputed about once a week, and new good pages slowly percolate upward in the rankings. Note that there are advantages to having a simple address and sticking to it so others can build links to it, and so you know that it's in the index
Query-dependent factors include: Keep in mind that in any query, rare words count more than common words. If someone searches for fruit and pomegranates, pages with the word pomegranates will appear at the top of the list (a technique known as inverse document frequency). Hence you should use specific terms on your pages, in your anchors, and in your METAtags, not general ones that won't give you any advantage. Be specific whenever you can. What we got out of the above information is that you do want to put the keyword in the title because you get a boost, but you don't want to put the keywords in your meta tags. Why? Because if you do, and if the engine only counts up to two keywords, then it's only going to count the first word in the title, the second one in the meta tags and ignore the one in the <h1> tag at the top of your page. If it misses the keyword in the <h1> tag at the top of your page then you don't get that little boost from the <h1> tag. That's why you see most of the top 10 pages without any meta tags. The third keyword that we suggest in the anchor text is good for boosting the score on the page that being pointed to. (So don't point it at your competitor ;-)
AltaVista Going Deeper On Title Data
Testing results with the <noframes> tag
What will AltaVista index if I submit an html page that uses frameset tags? The spider will index the contents within the <noframes> tags just like it would on a none frames page. In fact, you can leave the title and meta description off of the top of the page, and move it down to the <noframes> section and AltaVista will still use them in the same way. AltaVista will unlikely follow the links that are within the framesource.
Can I use the <noframes> tag on a page that doesn't use frames to hide text?
Does AltaVista care where on the page I put the <noframes> tag?
Title with One Keyword
The Real Names link on AltaVista Real Names will not allow you to register words that aren't directly related to your company or that are too generic. If Ford wanted to register the word Trucks then they would likely be turned down, but they could likely register Ford Trucks as a phrase and get away with it. If an exact match isn't found in the Real Names registry when someone clicks on one of these links, then the user is sent to a Real Name search page at http://www.realnames.com and they can choose from the directory. Not all of the names in the directory are actually registered, the index was originally seeded using Yahoo's directory. So if you find your site in Real Name directory and you haven't registered, that is where they found your page. Real Names is trying to get Netscape and Microsoft to include Real Names technology into their browsers, so in the future you may be able to type in Ford Trucks and be automatically sent to a Ford Truck page without knowing their domain name or url. Recent business partnerships with companies like Network Solutions may prove interesting.
Title Tags
Technique Summary
Inktomi - http://www.inktomi.com
Inktomi raises prices, adds partners There is a slight difference in the pricing between Network Solutions and PositionTech, both charge $30 for the first URL but prices differ for each additional URL, PositionTech changes $10 for each additional URL where Network Solutions changes $15. However PositionTech charges $30 for the first URL for each domain, and an additional $30 for each additional domain. Network Solutions doesn't appear to charge for domains after you've paid the first $30.
Paid inclusion program hiccups Inktomi is now ranking LookSmart reviewed sites higher than they were previously. However there is a new wrinkle in the fabric -- Inktomi is also replacing the web page's HTML title and Meta description with the LookSmart directory editor's rewritten title and description. If the editor did a poor job (like they often do) you may not like the title and/or description as it will now appear in the Inktomi index. Keep this fact in mind when you submit pages into Inktomi's paid inclusion program if those are the same pages that are already listed in LookSmart. In such a case, the LookSmart description will override your page's actual description that is hardwired into the HTML source code. In other words, you may want to consider this fact carefully when deciding which pages to submit to each index. You may find it unnecessary to submit the same exact pages into both indexes. Although LookSmart reviewed sites do tend to stick well in the Inktomi index they are not reindexed as often.
http://www.positiontech.com (POC recommended for excellent service & support)
Inktomi Search/Submit Service Launches The Inktomi Search/Submit service launched November 15th with their first Channel Partner, Position Technologies, Inc. of St. Charles, Illinois. Network Solutions is expected to be launching their version of the service in January. The new service offers a solution for webmasters who are experiencing problems getting their pages to stick in the Inktomi database. It also insures that pages will be routinely spidered -- something that is likely to help webmasters who are constantly adjusting pages in order to achieve optimum search engine ranking. The primary advertised features are:
We tried out the new service early this month on some test pages to let you know what to expect. The best thing is that Position Technologies does exactly what they say they will do, there are no hidden surprises here. We also learned...
The Results The URL's that are submitted via the Position Technologies service go into the IFD database within 48 hours. They are not added to the Long Term Database (LTD) until Inktomi has a full index -- which is approximately every 30 days. Some search partners, like HotBot, only show the LTD results if two duplicate results for the same URL are found. So the changes that are made to pages enrolled in the service, AND already in the LTD won't show up on HotBot until the LTD is updated. If the pages that are enrolled through the service are not already in the LTD, then you should see them at HotBot in 48 hours. You can however force HotBot to show you both database results if you search for a keyword on your site and click on the see results from this site only link. That will also allow you to see the date the pages were spidered. The old date for a given URL is the entry from the LTD and the newer date is from the IFD database. Some search partner's like MSN will show both the Long-Term and the IFD database information, resulting in two listings in the search results. Position Technologies has several other tools in regards to submission services that tie into the Inktomi Search/Submit service. Position Pro is their general submission service that covers all the other major engines and also includes position reporting services. To play the game we'd advise you to consider utilizing the Search/Submit services at Position Technologies. It does give you definite advantages that are otherwise unavailable to non-subscribers. We'll be using it for most of our own important pages.
Note these Important Changes on the Horizon Pages submitted to Free addURL forms at search partner sites will be penalized in ranking. Yes, that means that if you submit a brand new URL to Inktomi, that page will be ranked lower -- that is until Inktomi's spiders find links from other sites to it. Once the spider finds it, the penalty will be lifted. The logic is that any page with existing links to it is probably more popular (and therefore more relevant) than a page that is simply submitted. Pages that are discovered by their spider after following a link on another site will not receive this penalty. Pages that are already indexed in their Long Term Database (LTD) and get resubmitted are likely to be slapped with the penalty factor until the engine rediscovers links to that URL from other sites. And, (naturally) pages added via the new (paid) Search/Submit service will not be subject to this penalty. Based on this information, we will NOT be resubmitting ANY URL's from our sites that are already indexed at Inktomi. It should also be noted that Inktomi doesn't actively deep crawl sites. So, in order to avoid the penalty factor you'll need to obtain a link from a page on another domain that is already in the index. Then you'll need to wait and let the spider find that new link on its own. Do NOT resubmit the page containing the new link. Inktomi will likely be removing the Free addurl feature at their search partner sites in the future. Incoming Link-Popularity is a determining factor on how important your pages are. Pages with low or no importance will be penalized. Inktomi is dropping new pages that are submitted to it via the Free addurl services if they fail to receive some click throughs. We don't have enough data on this yet to tell you how many clicks are enough to get the pages to stick. However, it is important that your URL's receive traffic in order to keep them in the (free) index. Inktomi may substitute the description and title from LookSmart listings for a URL if its spider finds the page in the LookSmart listings. Our experience has been that pages we have entered in the Search/Submit service appear to be overriding the LookSmart listing at this time. Pages not in the Search/Submit program will likely have their title and description altered to match LookSmart's listing for that page. There are no guarantees how long MSN, HotBot or others will continue to use the Inktomi database. But at this point in time there is no doubt that a high ranking listing in Inktomi is a valuable marketing asset not to be overlooked. The Inktomi Search/Submit service is fast becoming a necessity for both high rankings and for stability for your pages in the Inktomi database. We suspect you might want to plan this into your 2001 marketing budget.
Inktomi Keyword Density Experiments Anyway, we're now able to make changes to pages and see the results of our tests within 48 hours. That's fantastic. In the world of search engine optimization there's nothing better than to get the results of your tests QUICKLY. It's simply the best way to find out if you're making the correct optimization changes. Not having to wait months to get reindexed is a tremendous asset. A few people have noticed that on Inktomi based engines like HotBot, Canada.com and others that the title and description in the search results aren't matching what's on their pages. After closer inspection, it is clear that Inktomi is replacing the page's title and description with the one from LookSmart. Since Inktomi powers LookSmart's search in the Web Sites section, and also spiders their directory your LookSmart listing it is going to affect your Inktomi listings. On the plus side, it means that you're going to be less susceptible to the shuffle that occurs every few weeks at Inktomi because you'll be in the main cluster with the other LookSmart listed sites. It might also mean that you won't have to pay the submission to Inktomi when that program is announced since LookSmart is already paying to get the sites listed in their directory spidered. On the negative site, it's very likely that your title and meta description won't be what you want since they're using the ones created by the LookSmart editors instead of those actually written into your pages. Our opinion is that LookSmart might actually increase their express submission sales if they left webmasters' titles and descriptions alone. At the very minimum, we think they should leave the titles as is provided that they are relevant to the page content. We know some people who are flaming mad -- customers with a bad taste in their mouth from having had their title changed after paying for a review. To us it seems like an unnecessary antagonization of a paying customer. Sure, if the title is inappropriate or not relevant (people still do that? ...why?), then LookSmart can have them re-title and resubmit. Doesn't that sound reasonable?
Inktomi Background For instance, they can access different portions of the Inktomi database as well as respond differently to the scoring algorithms than other sites using the same Inktomi engine. In fact, one site, goto.com even sells the top positions listed on its site.
Inktomi now powers the following private labeled sites: Note: Submissions to free add Add urls links will not rank as well as those that use paid URL feature at http://www.positiontech.com until the Inktomi spider finds links to your pages. Then they should rank just as highly, but won’t be spidered every 48 hours like they would with the paid submission features. If you are listed in any one Inktomi powered site, then chances are your site can also be found in the other engines that are also powered by Inktomi. In this discussion we are only covering the major players – MSN, HotBot, Yahoo, and NBCi (Snap). We don't believe the other engines generate enough traffic to be relevant to your marketing efforts. One possible exception -- Goto.com receives decent traffic due to their recent advertising campaigns. However, keep in mind that to be listed well on Goto.com you must pay for a listing in competitive categories. Otherwise, there is no other way to outrank a paid listing at Goto.com.
HotBot - http://www.hotbot.com
Search Engine: Hot Bot - http://www.hotbot.com
An Important Difference A clear indication that the top 10 results are from Direct Hit are as follows:
Your search results are going to be dramatically different depending on which service is providing the top 10 results for HotBot. In case you're unfamiliar with Direct Hit, it's a relatively new search engine service that scores search results based on the traffic that a site receives. It does this by basically counting the clicks that a site's listing gets in the search engine, similar to banner ads. Theoretically, to influence Direct Hit to increase your score, a large number of different IP addresses (visitors) need to click on your link at HotBot and spend some time before returning to the search engine. These visits should probably be random in nature, as opposed to scheduled daily visits. There is probably a way to automate this, but to date we haven't heard of anyone with that capability. If someone were to build such a system it would also need to have the ability to present different cookies to the search engine to properly represent a human visitor. Direct Hit does have a submit page on their site, which may be useful for getting your site into their engine quicker. http://www.directhit.com/util/addurl.html Direct Hit's client list contains HotBot, AOL, ICQ, ZDNET, Apple, Lycos, LookSmart, and Microsoft / LinkExchange. Expect some of these companies to use similar technology in the future. HotBot and Direct Hit have been tracking traffic to individual sites since September 1998 and most recently they have also been monitoring traffic from the search results at Lycos. This information is compiled and used for some of the Direct Hit algorithm. Look for this new user information to be tied to their banner server at some point in the future as well. It's a very good source of information for them to compile user preferences. In fact new features are going to include gender and age profiling. This means that sometime soon the top 10 search results at HotBot and other Direct Hit powered sites will be served up using a formula that takes into account the site visitor's age and gender. This information will be stored in a cookie on the search engine user's browser. More than likely this will also be tied into other advertising methods that will generate different ads and search results for each users search profile, age, gender and location.
More Tips Home pages, http://www.domain.com, appear to get a small boost as compared to individual pages like http://www.domain.com/page.html , but the grouping features that HotBot uses may be influencing this observation.
Direct Hit - http://www.directhit.com A key component of the Direct Hit technology appears to be the new search results are not direct links to the web site that appears, but a logging system that HotBot is using which tracks click throughs to the web sites. This feature appears to be to unique to HotBot, we haven't observed this at other Inktomi powered engines This means that HotBot is likely now tracking how many hits it is delivering to each website, how many unique and return visitors the site gets and the length of time a visitor spends at each site. This information is likely compiled and used for some of the Direct Hit algorithm. Look for this new user information to be tied to their banner server at some point in the future as well. It's a very good source of information for them to compile user preferences. We gave Direct Hit a call to discuss their new technology and spoke to Gary Cullis, Chairman and co-founder of Direct Hit. According to his executive bio, Gary's been writing code since age 12, and he is likely the driving force behind this new technology. We also asked if repetitively clicking on a site's listing would increase its popularity ...or if each click had to be from a unique visitor to make a difference. Gary replied they had factored that into their system. He said that repeatedly clicking on a link wouldn't make a difference in the popularity. In our discussion we also learned that it does matter how long a visitor stays at the site before returning to HotBot. If people just visit your site then immediately hit the back button, it could have a negative effect on your Direct Hit popularity. Theoretically, to influence Direct Hit to increase your score, a high number of different IP addresses (visitors) need to click on your link at HotBot and spend some time before returning to the search engine. These visits should probably be random in nature, as opposed to scheduled daily visits. There is probably a way to automate this, but to date we haven't heard of anyone with that capability.
Inktomi/HotBot Scoring Our studies have shown that regardless of the same score, these pages vary wildly in the keyword densities and designs. We have seen in some cases the keyword density vary from .67% to 17%, all on the first page! And yes, we were very careful to make sure we were looking at real pages... We did notice some trends, however, that you might use to your benefit...
Inktomi, the universal search engine... Currently, the primary submission avenue to the Inktomi search engine index is through HotBot. Some of the sites outside the US, like Canada.com also have a submission form that we believe is also tied to the Inktomi index. By submitting to HotBot you avail your site to the extended search results for the new Yahoo, and NBCi. This is clearly an extra advantage. We suggest submitting your site to any of the Inktomi engines that have their own submit URL form. Therefore, even if your pages do not score well on HotBot, you should submit them anyway in order to increase your chances of being found by the engines and indexes that are using the Inktomi Index.
Technique Summary Since Hot Bot forces their users to be very specific in order to turn up meaningful search results -- and for you to have a fighting chance to come up on top of their search list -- you will need to pay close attention to adding very specific keyword phrases when you design your pages for HotBot. In addition, HotBot is case sensitive so you may want to include alternative capitalization in your keywords or body text. <META> keywords can affect your positioning on Hot Bot. You should use them, however, there is no evidence that repeating them will improve your page's position -- except for lower case / upper case repeats; (i.e. Bed Breakfas" & bed breakfast). Most importantly, make sure that you include all of your pertinent keywords in your <META> keywords description just once. <META> descriptions are used by Hot Bot for the summary description on the search results. More importantly, at this point in time it is the hottest place to put your keywords. Make sure you put your keyword/s in this tag for the best score on the Inktomi engine. Be sure to put your summary in the meta description to force Hot Bot to meaningfully describe your site. Otherwise, it will grab the first words on your page to use as the summary. Unlike other SE's that only use Meta descriptions found after the <TITLE> tag, Hot Bot uses them regardless of whether they are placed before or after the <TITLE> tags. However, in order to maintain consistency with your page designs, we recommend that you place your meta's AFTER your <title> tags... this way they will work for all SE's that use <meta>'s. Remember: It is important to use both the meta keyword and the meta description tag together. HotBot appears to ignore meta keyword tags if used without the meta description. Here's an interesting trick somewhat unique to Hot Bot. If you have registered the same page twice with a different URL, Hot Bot may list the second registered URL as an alternate and it will take up a position on the search page that is returned. This means that if your page takes the, say, #9 position -- your alternate will take the #10 position. This is important if you are in the top ten on a search because it means that you can bump your competition down on the list by registering the same page two different ways. Example: http//yourdomain.com and http://yourdomain.com/index.html are actually the same URL address returning the same page... but by registering it two different ways, you can take an extra position on the search engine -- and, theoretically, you have two more opportunities to register this same page by using the www in each of the above URL's. That would expand this page's presence & positions to four spaces in the search. In summary... META tags are extremely important to Hot Bot... be sure to use the description as well as the keyword meta's in order to give your page(s) a chance to be returned high on a HotBot search. Keyword density is also important to Hot Bot. Many top pages are short on words and laden with image links. By making your images the links to the other pages on your site, you maintain (increase) keyword density while refraining from diluting your page with words that are not supportive to the central theme of your web page.
Google - http://www.google.com
Indexed Pages: Approx. 575 million, links to 1 billion plus
Google now indexing PDF files Users at Google will soon have the ability to click on a Text Version link that will display text conversion of the files, very similar to what their Cached link does with HTML files. David explained that the formatting of the pages still is being worked on and that it would be enhanced as they release new versions. Another interesting note is that the engine will also crawl links within .pdf files to other .pdf files as well as to other normal HTML URLs. At this time, Google has only released the feature on two of their servers and they are in the process of installing it on the rest of them. We asked how the text in the title would be determined and David replied that it's something they are still working on. He also said that the Title of the page in the search result will likely vary as they enhance the software to get the best results. This feature also opens up new possibilities for Google for providing services to corporate intranets and other outsourced search services.
SEGuru/Google Webcast Notes
*Note: Search Engine Matrix is an Adult Webmaster Community. If you are easily offended by Adult content, be aware that this site deals with sexually related websites and adult material.
Penalties: Google's cofounder, Sergey Brin, when asked at the Search Engine Strategies conference, How does Google feel about webmasters that spam their engine?, he charmingly replied, We kill them. Of course, everybody laughed. Then Sergey smiled and casually stated that Google does not worry about spam. He further explained that Google never bans a site and never removes a page from their index. Sergey's demeanor, charm, and sense of wit in the face of this emotionally charge issue won the hearts of many professional marketers and webmasters that day. Over the next two years, Google's wise and far-sighted tolerance attracted the world-wide support of Internet professionals who, in turn, become a driving force behind Google's phenomenal growth in popularity. This grassroots ground swell of professional opinion that Google represented the good-guys facilitated their meteoric rise from ground-zero to become the world's largest index of web pages -- over one billion strong -- in a period of less than two years. Ok, flash forward to now, present day, March 2001. Google is now considered one of the big winners -- perhaps even the most likely survivor -- in the search engine wars. They're now one of the most important engines for webmasters to focus on in their quest for high rankings. Our research is showing that, generally speaking, Google delivers more traffic from a number one ranking than any other engine. The sad chapter in this story is, however, that Google, being so impressed with its own success, has forgotten those who've contributed so much toward making them successful. Alas, webmasters (with a commercial motive) no longer seem important to Google. And, the rules have changed!
Consider this paragraph from Google's Terms of Service page:
According to the above paragraph and the broad statements within it, all commercial web sites seem to be in violation of their terms of service. Another interesting but obviously out of date statement on their sites says: The highlighted statement above is actually not true. They are, in fact, now banning sites from their index and preventing them from being found. Google now exercises editorial control over what they believe to be non-appropriate pages for their index -- which, of course, they have the right to do. However we also believe they should publicly state in their written materials that the results are edited and some web sites are not included in their index. The point we feel you should realize is that the people that operate Google, do not care about your web site, or your sales, or how you rank, or how the changes they make may affect you. Stated another way, they no longer need you, the webmaster, to supply them with their content. Ok, fine. The question is, are they unaware that without you, the webmaster, their support will decline? ...do they not see what happened to InfoSeek when they finally told all of the webmasters that made it so immensely popular to piss off? ...do they not understand that their popularity is very much the result of being the safe harbor for webmasters in a search engine storm? Of course, only time will tell. However, recent executive and technical decisions at Google do not paint a rosy picture in respects to the quality of the relationship between engine and webmasters. Now that Google has lost its innocence and discovered that their engine can be manipulated and their algorithms are not totally spam free, they feel that Internet marketers are taking advantage of them. Behind the scenes they are fighting back while at the same time trying to maintain a friendly public face to webmasters. In any case, over the past few months several distinct patterns have emerged. And, as a web site promoter you may not like them but you should remember this is Google's ball game and they get to make the rules. Your choice to play, or not to play, by these rules is, well, your choice. Our job is to bring you the information. Part of that information is this... Google is now showing a penchant for selectively passing the death sentence over sites without trial and without explanation. In other words, if you commit any one of many actions they no longer like they may permanently ban your site from their index. According to Google, anything that you do to increase your web site's ranking is not allowed. If, for some reason they review your web site or you get their attention for some other reason, the following penalties could apply (and, your mileage will probably vary -- greatly!)
Things that might get YOU banned at Google, but not necessarily your competitor... By that we mean that some people get by with using these strategies without any problem whatsoever while others get permanently banned. We very strongly suspect the difference in treatment is the result of how these strategies are applied and, to a lesser degree, the personal mood of the search engineer that happens to be reviewing the site that's making use of them.
Based on evidence so far, however, we believe it is possible they may ban sites for using them sometime in the future. Therefore, just to be safe, we recommend that you avoid using redirects regardless of the type. This includes Javascript redirects, meta refresh tags, and server redirects. Thus far, these are the strategies and optimization techniques we've learned may get you into hot water with Google. We'll be attempting, over the next few weeks, to obtain further clarification from Google on exactly what their policy is toward commercial sites -- particularly those that use the various promotion techniques. Google does worry about spam. The good news is, they haven't yet started killing the webmasters.
Google Search Tips
Stop Words
Excluding words
Phrase search
Site Only
Links
Search Page Titles Only
Search URL's Only By the way, just in case you're still wondering whether or not it pays to have your keyword in the URL? ...it sure does in this case! For fun, try a search for something like allinurl:cellphones -- quite possibly THE most accurate search you can do for something like cell phones. We believe Google really likes the outgoing links with the keywords in them. If you're just starting to build pages for Google, the flowers page above is a good example of what it likes for popular single keywords. Keep in mind that you should also get listed in ODP and Yahoo in order to score well on Google. Additionally, it will help if you have incoming links from other relevant sites whenever possible. Along with the announcement of the new larger index, some of the new pages they indexed happened to be our test pages. This information could be invaluable to you when building your pages specifically for the Google engine. And, it will soon be even more important once Yahoo completes their already announced switch to Google. What does Google index?
What doesn't Google index?
Other important facts about Google...
Breaking out of the Google Cache Some web site owners have concerns regarding the use of the cached page feature. First on the list is the matter of up-to-date content. The cached results at Google are often old versions of your pages dating back 30 to 90 days old at times. If you are displaying content that is time or price sensitive, a visitor may be receiving and acting on out-of-date information. Conceivably, someone might purchase a product that is no longer available for sale on your web site. Google's cache feature is not perfect. For instance, when caching a frames based page the cached result is a blank page. Your framed site will not show within the cached result. We don't know how many people make use of this feature at Google, but for a frame based site it could be costing you visitors. There is also the issue of copyright. The fact that they do display a version of your web page on their site without asking permission may make it hard to enforce other copyright violations. This aspect has not yet been addressed in copyright cases that we're aware of but it may be an issue if you allow Google to display a copy of all of your web pages but selectively enforce another copyright violations somewhere else on the web. On the other hand, if you have an unstable web server that frequently is unavailable, this feature might come in handy for people looking for information from your site. In any case, there are two ways to prevent Google from displaying the cached results if you desire to do so. One method is provided by Google themselves and the other is one that we built in Javascript and will share with you. The Google method is called the Google no-archive tag. Simply add the following line to your HTML documents... <META NAME="GOOGLEBOT" CONTENT="NOARCHIVE"> Place the tag in the <head> section of all web pages you don't want cached. This prevents the engine from listing the cached link on the search results page. By the way, this tag has no effect on Google's ability to crawl or index your page, it merely tells their engine not to display the cached link in the search results. There is no penalty for use of this tag according to Google's Search Engineers and our own research verifies this.
The other method is a Javascript redirect that you can put in your page. When this script runs, it looks for the word cache in the browser's address window and if it finds it, the page automatically redirects to your home page: If you're not concerned about users seeing older content or about protecting the copyrights of your pages, then don't bother with these tags. Adding them will neither help nor hurt your search rankings at Google. For more information on Google's cache feature, go to... http://www.google.com/remove.html#uncache
History Since the early days in 1998, Google has received excellent reviews by those in the media and generated the interest necessary to get funding to make them a player. In June of 1999 Google was chosen as the fall-through search engine for Netscape search and is a premier search engine on the Netcenter Portal. This resulted in a gigantic increase in traffic for the new engine, and put them on our radar screen as a search engine with enough traffic to make it worth while to work with. Google now receives around 3-4 million searches per day. With that kind of traffic, it's a site you should try to rank highly on.
How does it work? Google has a full text index of around 100 million web pages, but actually covers somewhere around 300 million pages due to the way it analyzes links pointing to other pages. You can occasionally see some of the pages that it knows about but hasn't indexed. Those pages are the ones that don't have a title or the Cached link available in the search results.
Page Scoring Details To increase page ranking on Google it is essential that you have either a few links from important sites with your keywords in those links, or lots of links from less important pages, also with the keywords in those links. It is important for Google to know about every link there is to your site, so take the time and submit pages from important sites that are linking to your pages. Remember that Google is very slow at indexing pages so the links pointing at your site are must still be in place when Google finally decides to index your page. By the way, Google doesn't index the meta description or the meta keywords on your pages. Instead it selects text from the document surrounding the search phrase if possible and highlights the keyword in the search results. When Google does index a page it keeps a copy of what it indexed stored on their server. It's important to make certain that pages with sensitive information have a disclaimer in regards to such things as Prices good till..x date.. Keep in mind that Google is very slow to update it's cached pages. Therefore your old prices could still be showing up for a long time on their server.
Summary
Northern Light - http://www.northernlight.com
Indexed Pages: Approx. 110 Million
What's New Northern Light shuns robots. Recently Northern Light banned Webposition software from accessing it’s site.
Northern Light will index:
Northern Light doesn't index:
These tests were conducted using a seldom occurring keyword and placing just one occurrence of the word in different sections of a similar pages. The one page that consistently ranked highest (99%) was one that had the one keyword in the title and the lowest (82%) had the one keyword in the html link. The highest scoring page had the keyword as the 4th word in the title. The next highest scoring page (92%) had the keyword in the very first position in the title. 3rd highest scoring position was the page with the keyword at the top of the document (73%) and the next one, which tied (73%) had the keyword in a link towards the top of the document. Please note that these scores are for a search keyword that no other documents in the index were using except for these test pages. If we were testing with a more commonly used word, then the scores would have been MUCH lower. Of the 20 documents that were indexed, each one also had another unique word in the same body text position. When we did a search for this keyword, all 20 documents ranked 99%, but they were listed in the same order that they were spidered (oldest time the page was spidered ranked highest). This may not sound very important, but if you are fighting it out for that the very top position that little tidbit may move you from #2 to #1. Note: It appears that the more documents that a site has indexed in Northern Light the higher up it will rank in the Custom Search Folders on the left hand side of the search results.
Background on Northern Light We didn't find any data on how many visits Northern Light is receiving, but with the amount of press that it is has been getting, it will only be a matter of time before it starts forging strategic alliances. It's also surprising that they haven't been selling banner ads on the site, which will likely change sometime in the future. One unique feature that is new to the search engine field is their Custom Search folders, which are dynamically generated during a search. For instance, a search performed on hawaii accommodations generated Custom Folders on several topics such as Bed & Breakfasts, Personal Pages, and our personal favorite www.bestofhawaii.com. Clicking on our entry in the custom search folders, we were presented with a listing of all the pages that we have indexed there, and then it broke down even further, showing that it had another folder for Bed & Breakfast. In other searches we found that the custom folders would generate an index for just the commercial (.com) sites or just the .edu sites. It's an interesting feature that no one has matched yet. It appears to us that a site has to have a number of documents relating to the search term indexed to be able to appear in a Custom Search folder.
The Good News… Note: We don't necessarily recommend loading your titles like this for Northern Light, but we wanted to show you this example to let you know it will not reject your pages for doing it. Northern Light says they don't assign any additional relevancy to meta keywords, but they may be indexing them. We didn't find a lot of pages using loaded titles like this.
Technique Summary
Excite - http://www.excite.com
Indexed Pages: Approx. 250 Million - Scheduled to extend to 800 million
What's new? In addition Excite continues to show a boost if the site is listed in the (LookSmart) directory. No major changes that we've noticed in the algorithm as most all of our own sites continue to be ranked about as they were before the update.
Link Popularity at Excite Link popularity is very important. It's apparent to us that Excite is awarding bonus points in relevancy for sites that are listed in the major directories. Having your site listed in ODP and Yahoo is very important and it's clear that being listed in LookSmart is also very helpful. In fact, domains with a LookSmart listing (Excite Category Match) are often in the first page of the Excite search results. In our opinion, this alone justifies paying the fee for the LookSmart express submission. In regards to outside pages pointed at your target page, it helps to have text within the links of these pointer pages (but don't call them that, call them information pages). Whenever possible, put your keyword phrase within the links pointing to your target site. Here's an example of an ideal link on outside sites pointed at your target site... <a href="http://www.domain.com"> Keyword phrase </a>. Excite does still spider pages -- but they only appear to spider the home page. On some of our test sites we've achieved a top 10 result with a single home page that was also listed in the major directories. Keyword density is much less a factor these days at Excite although it does still matter when all other things are equal, especially when the outside link popularity is similar for the competing sites. Excite removes per day submission limit. Remember that they can change their policy at any time and without notice but you should at least be aware that the old 25 pages limit is probably no longer be valid. Obviously, if you abuse this non-limit policy you risk repercussions. So, proceed cautiously at your own risk until further notice.
Excite adds LookSmart paid submission to add url page Excite's categories, historically, didn't seem to be updated often with new site content from LookSmart. Now, with what appears to be some sort of cross promotion between LookSmart and Excite, the time between updates will likely be reduced. We called Excite and talked to Kelly Distefano about the new arrangement. She assured us that sites submitted via the Express Listing service would show up in the Excite category search in approximately one to two weeks after the listing shows up in the LookSmart directory. Minimum Requirements:
Excite's categories have been completely re-configured. Searching the category section now returns many more listings from LookSmart. Scoring appears to favor sites that are in a category with the keyword, or a portion of a keyphrase and the keyword in the title of the reviewed web site. Searching the directory for generic or popular single keywords returns category results first, then an alphabetical sorted directory listing next.
What will Excite index if I submit an html page that uses frameset tags? Excite will NOT index the content from the frames you link to on this page at the same time. However, it MAY come back later and attempt to read the content on the frameset pages -- but it is unlikely to actually list those pages. There is some evidence that Excite does pay attention to these pages in determining the rank of the home page. So if you're trying to score on hawaii then the subpages of your site need to have Hawaii in them, preferably within a <a href> keyword </a> link.
Can I use the <noframes> tag on a page that doesn't use frames to hide text?
Does Excite care where on the page I put the <noframes> tag?
Keyword Density Recommendations Our best two keyword phrases are scoring around 79%, which is about the best we can do without a directory listing for a domain in Excite at this time. A page ranking at this score needs to have about a 4.5% keyword density on the first word which is the same as the above page description (10 keywords/222 total words) and about a 2.25% (5 Keywords/222 Total Words) density for the second keyword. Both keywords need to be in the title, but not at the front of the sentence as stated before. If your title starts with the keyword it will cost you about 1% we have found.
Document Statistics
Excite likes domain names with keywords
Previous Testing
Here's an example of the code for our test; Note that text in this area won't show up in the browser window as viewable text, so this is even better than hiding text with the same color font as the background. Text placed within the style tag appears to be equivalent to having the same text in the body of the page. However, using these tags allows you to place your keywords up very close to the top of the document. Will this crash a browser? Not that we can tell, we've tried it in Netscape 3 and 4, and Internet Explorer 4 and 5 without problems -- but remember, this isn't proper html code. We would suggest testing your pages before you submit to insure they look right to a site visitor. . By the way, this tag has also been confirmed to work on AltaVista and it may also work on other engines as well. Just keep in mind that only Excite and AltaVista have been proven to index text within this tag set so far.
Our Strategy Excite favors root directories (home pages with upper level domain URL's). Frequently it drops subdirectory pages from its index. You can work at getting those subpages listed, but you will have more luck by focusing on your root directory pages. So what do you do if you are working on multiple keywords? Get more upper level domains. That's the key lately to having your pages stick in the Excite index. Since Excite prefers to index only the root URL we've gone to a strategy that incorporates having several upper level domains for the same company, topic, service or product. Yes, it is more expensive - but the traffic at Excite, WebCrawler, AOL and Netscape is well worth it in most cases. However, before you go to this expense try getting into their reviewed directories.
Is Excite ignoring subdirectory page submissions?
Unfortunately we will not accept subdirectory pages through the submission process. At this point we are trying to keep our index to a minimum so we can test new systems and searching techniques. If you submit your site and it meets our guidelines (please see: http://www.excite.com/Info/listing.html) then it should get indexed in approximately two to three weeks. At that point there is a possibility that another of our spiders may come across your site and index subsequent pages, but there is no guarantee of that. If you would like to submit your site please go to our Add URL link on the Excite home page. On the following link please be sure to type in the full URL for your site (i.e. http://www.yoursite.com/ ). If you have any other questions please let me know. Thank you again for your interest in Excite!
Mike Gardner
Frankly, we are not surprised. In fact, we expect that other search engines may even follow this pattern in one form or another. So what can you do? At this time, we advise you to follow the suggestions listed in these articles and don't waste your time submitting subdirectory pages to Excite. And, if you don't yet have an upper level domain for your commercial web site, then now is the time to get one. Excite apparently uses two types of spiders.
Our tests show that pages found by the second spider (the link finding spider) tend to stay in the index for only a set amount of time -- somewhere around two months. After that Excite appears to dump them out. This explains the disappearing pages that you may have noticed the past several months. It is also interesting to note that these disappearing pages have a tendency to show back up when the link spider finds them again. The cycle causes about a 15 to 30 day period when these found pages are gone from the index. Therefore, your best strategy is to keep an eye on the timing of the spider's activities so you can schedule new pages to appear at the appropriate intervals. It can also greatly help to use something like WebPosition to identify dates when your pages are indexed. That way you can design an alternative set of pointer link pages that can be placed in the path of the crawling spider at about the same time the older ones are due to fall out. By using this kind of alternating approach, you should experience good success in keeping your pages indexed on Excite.
The Invisible Link Technique By invisible we mean, links that a visitor will not see but the Excite spider will see -- and follow ...so the additional pages can be added to the Excite index. The trick is simple and consists of adding a transparent .gif that is 1 x 1 pixels in size on your page. Each image becomes a link to another page that you want the spider to visit and index -- for example: <a href="pagetoindex.html><img src="images/cleardot.gif" width="1" height="1" border="0"></a> You can put as many of these on your page as you want. They are invisible to the site visitor but the spider will see them. By doing so you significantly increase the chances that the rest of your web site will make it into the Excite index.
How to get your web pages to Stick in Excite We originally set up a series of links in an effort to get more pages indexed by the crawler. After a few months from when we implemented this, we discovered to our surprise that these pages stayed in the index, when others tended to drop out after one or two months. What we did is put a hidden link to get Excite's crawler to find pages on its own, without us having to submit them and run the risk of violating the 25 page submission limit that Excite claims. This hidden link on the home page of a different web site led to a list of links pointing to our pages that we were attempting to get indexed by Excite. The pages were nothing more than a simple html page that had links with descriptions. Note: There is a reported 25 page submission limit, but Excite may discover many more pages on its own and index them. This is not violating any of their rules. The second thing we did is make sure that these pages would always be fresh when Excite crawled them. There are two ways you can do this, by updating the pages on a monthly basis (we would suggest around the first of the month) or by putting some dynamic content within them. The way we did it was to get a CGI script that returns the server date and time every time the page is loaded. It's important that this text is indexable by the robot. It won't work with a java script clock, or other client side methods generating this changing information. Other methods that you could use would be a banner or image rotation script, a random quote script. Regardless of what method you use the pages should appear fresh each time the spider checks them. We placed our clock text within the title tag of the web page so we could tell also on what date and time the engine indexed the page. If your interested in adding the Perl time and date script to your pages, a free copy is available from Matt's script archive, http://worldwidemart.com/scripts/textclock.shtml Your web server has to support server side includes (SSI) to use this script.
Excite doesn’t keep all pages
Excite appears to keep pages in its index better if they are updated often. Here's a tip, put something on the page, like a quote for the day, or a time and date feature so that the page appears new any time Excite spiders the page. This may help keep your pages in the index.
Submitting your site to a channel
Testing results
During some testing we did we also learned that Excite only recognizes the first 71 characters in the title tag. However, we have also noticed that the multiple title tag technique does seem to be working very well in some cases. We have also confirmed that Excite doesn't index the .alt text, meta tags, or the <!-comments -- > tag. It will index information that is outside of the regular <html> tags for instance
Keywords do get indexed here, before the html tags begin.
Link Popularity
Keyword in the URL and/or domain name We are currently studying two-word keyphrase search results on Excite and we will be publishing the results in the near future book revisions and newsletter -- time depending upon how long it takes Excite to index them. At this time it appears that Excite reacts differently for multiple keyword searches. We've been studying 40 pages that are in the index for custom software and have learned some things that can help get your pages ranked higher. Here are some tips...
Excite Techniques Excite definitely gives preference to pages that have the keyword in the url. For example, if your keyword is hawaii, then you will get a bonus if you use it in your url like this http://www.hawaii.com/hawaii/hawaii-tours.html We have also seen usage of multiple <title> tags that were used with pages that had virtually no text in the page at all, that did very well. Don’t start your title text line with the keyword as the first word, we found it works better if it's the third or fourth word in the text. Excite does not index or use your web page's <META> keyword tag. Excite will simply ignore the Meta's keyword and index your page(s) according to the Excite criteria.
Most importantly, Excite is looking for keywords in viewable portion of your web page! -- Put your page creation efforts into the following areas. Another Tip: Excite understands synonyms of your keywords, so make use of them to avoid spamdexing penalties.
Additional Insight into Excite When asked if stuffing meant repeating words (like real estate, real estate, real estate...) or simply reusing a word throughout a page, Alex responded, "...repetition like in the first example (repeating words). Of course we try to look for all kinds of repetitious abuse, but like we said before some will inevitably slip by. This is primarily due to the fact that we do not want to suppress good pages that we thought were spammed, when they truly were not. We sway to the side of the spammer since we would rather have a few stuffed pages in the top results than to suppress good results that were not. Being redundant (although possibly annoying to read) should be fine, where being repetitive is not." According to Excite, you should design your page to be theme oriented -- and the search results that we researched indicate that Excite has found a way to enforce their wishes. Here is the general rule to follow with Excite. If the web site visitor cannot see the text, then Excite ignores it . This includes META's, comments, and .alt text. As we mentioned before, Excite is also scoring pages high that are mostly listings and links.
WebCrawler - http://www.webcrawler.com
Indexed Pages: Approx. 2 Million
Overview
Submitting to WebCrawler We strongly encourage you to submit your main URL (also known as a site's root or index URL) to us, as well as your site's main subsidiary pages, and not every single document of your site, especially if it is a matter of hundreds or even of thousands of URLs. Please do not submit more than 25 URLs for any given server. Cases in which WebCrawler receives disproportionate numbers of submissions for the same Web site will be subject to automated and/or human review, and will be processed as WebCrawler sees fit. http://webcrawler.com/Help/GetListed/HelpAddURL.html While we don't know where this magic number for 25 came from, we have been hearing rumors that Excite was using some kind of 25 per day rule. Since the change to WebCrawler was made earlier this year, this appears to apply to both Excite and WebCrawler. We have seen no penalties, for submitting more than 25 per day, but to avoid human inspection you should avoid submitting more than that. You will likely find that WebCrawler is even more difficult than Excite when it comes to getting your pages indexed. WebCrawler prefers finding documents that are the root of the main site or the index.html page in a subdirectory, for example: http://www.server.com and http://www.server.com/keyword .
Indexed Pages: Millions If not registered in 2 weeks after using the free registration, contact them at url-support@yahoo-inc.com (make sure and use the original email address used when submitting and list the date when you submitted your first request)
Whats New Is this a good deal? Maybe. But, we're not thrilled about the fact you don't have any control over what the sponsored listing says. Right now, Yahoo is using the same title and description as the site's standard Yahoo listing. As you may already know, oftentimes Yahoo's selection for your title and description leaves a LOT to be desired. Paying big bucks yet having no editorial license over your own site's description sucks, seems unusually restrictive. Anyway, if you have a deep pocket full of money to test it, then go ahead and give it a try (then be sure to call us and let us know how it worked out). BUT, be aware that it may be difficult to track the traffic coming from your sponsored listing. By that we mean that you may not be able to tell it apart from incoming traffic generated by your standard listing because Yahoo doesn't appear to offer any special tracking URLs or other options that would allow you to measure results. They do, however, use a different redirection system for the sponsored sites, so you might be able to find different referrer information in your logs -- at least, that's what we'd recommend watching for. Our opinioned belief is that the Most Popular listings will get more traffic than the Sponsored Sites. It's a good bet this program will go bust unless they change it to allow advertisers control over how their listings are described and how they're presented on the pages. They also need to enable tracking. http://sponsoredsites.yahoo.com/ http://help.yahoo.com/help/us/sponsored/
Yahoo - No more freebies for commercial sites Frankly, it's no big change. They were, in all practicality, ignoring free submits anyway. They will, however, continue to accept some listings in noncommercial categories of the directory.
Specifically, the two directory areas that will only accept paid submits now are: Even if the category that you want to submit to isn't one of the above, we still suggest using the Business Express service to make sure your site gets listed while you're still alive. In a related news story at MSNBC, Yahoo also explains that they are considering paid ranking similar to the Goto model according to Srinija Srinivasan, vice-president in charge of the directory. Read more of his comments at http://www.msnbc.com/news/491082.asp More details and FAQ's can be found at http://docs.yahoo.com/info/suggest/faq.html
Yahoo directory search results show popularity Remember now that it's important to notice how Yahoo is sorting the sites for the directories. It shows you which category is pulling the most traffic for your search keywords on that site. You may also notice that Yahoo is currently having some problems with this new system -- most notably, they are frequently timing out during a search. We believe they're still having problems accommodating the major load of day-to-day traffic the site routinely receives.
Business Express Opened up Probably the best (and certainly the quickest) way to get into the Yahoo directory is to use Yahoo!'s Business Express Service, which offers guaranteed consideration of your commercial website within seven (7) business days. This program requires a $199.00 USD one-time, non-refundable processing fee per submission. However, if you are submitting a site offering adult content and/or services, the fee for the Business Express Service is $600.00 USD. All adult sites must be submitted to the most appropriate category under Business and Economy/Shopping and Services/Sex Recently, Yahoo also changed their Business Express paid submission feature to require customers to register for a free Yahoo account before they can submit their web sites. Looks like someone in the marketing department is salivating over the opt-in marketing potential of site submitters. Regardless, we feel that Yahoo is pushing it by making you fill in information twice. After all, every additional form - aka roadblock a customer has to hurdle increases the odds they will lose the sale. Ordinarily, the Business Express submission process, with its hoops to jump through, is a good example of what not to do. Typically, requiring so much information will turn a hot-to-purchase customer off. However, a Yahoo listing is a must for any company trying to establish an Internet presence. So, we guess they can attach unusual strings in the process. Our belief is that if they were trying to sell anything other than a Yahoo listing this process would be costing them. It still may, time will tell.
Business Express Service Their acceptance criteria, found on their agreement page, sheds light on the subject of what kind of site makes it into the Yahoo! directory. We've reviewed the statement to highlight certain critical issues that likely effect your standard free submissions as well: From http://www.yahoo.com/info/suggest/terms.html Yahoo recently removed the requirement for a web site to have real time credit card processing for the $199 Business Express Service. The new minimal requirements are as follows - The site must be a commercial web site based in the United States with the official business name visible to any visitor to that site. Editors note: Put your business name and contact data on the bottom of every page
These criteria are minimum requirements only, and Yahoo!, in its sole discretion, may consider other criteria before accepting or rejecting a site.
Additonal Recommendations
Is it worth it?
Web Page Searches
Directory Registration Process
First Step: You must understand how Yahoo operates However, getting listed well on Yahoo is actually darn easy... provided you know how to do it. However, before you make any attempt to master the Yahoo positioning puzzle, you must throw out all of the search engine rules and learn exactly where Yahoo is unique and how it works. Ready? ...here goes. The first thing you must realize is that, in a keyword or keyphrase search on Yahoo one of four things will happen...
It is important to grasp that consumers using the Yahoo catalog do not generally understand exactly how Yahoo really works. Many people do keyword searches in the Yahoo catalog without any particular notice of how Yahoo returns results. Frequently their search yields no Yahoo results -- and they find themselves in the Inktomi index. Guess what... most of them do not even notice they are in the Inktomi index -- they just go on with what the search delivered without a clue they are actually looking Inktomi instead of Yahoo. Ok, so what strategy works best with Yahoo? Surprisingly the best working strategy with Yahoo is pretty sure fire PROVIDED THAT you understand how Yahoo works. Here are some more insights you must grasp before you submit any pages to Yahoo.
This, of course, means that no one will find you unless your company name happens to include one of the keywords/phrases that people might search for you under. This is bad news for most companies, unless of course, people are searching for you by company name. Only then are you very likely to appear on top. The problem is that very few people are going to look for PXY, Corp. if they happen to be searching for the tape management backup systems that you happen to sell. Lastly, they prefer that you have your own upper level domain. If your company's site resides on a sub-domain of an ISP and/or web hosting service, it is difficult to get Yahoo to list you. If your company's site resides on a sub-domain of a large ISP -- like AOL, freeyellow, geocities, earthlink, etc. -- then it is nearly impossible to get a new submission accepted. They simply ignore you.
Here's what you do. Note : Do not attempt to get Yahoo to change an already cataloged web site listing. If Yahoo has ever changed a listing (we know of no listing they have ever changed in spite of numerous requests), it probably involved far more time and trouble than it was worth. In some cases, Yahoo just removes the listing when a change is requested! It is easier to just get a new one and let the old ones ride (it is also probably easier to stop El Nino). Start by... creating a unique site and name your company something that utilizes your keywords. Referring back to our example above, we could -- for web page purposes only -- name our company something like Tape Management Backup Systems, LTD. with a logo that displays your TMBS, Ltd. and the words, Tape Management Backup Systems LTD. incorporated somewhere within the Logo. This will prove to the Yahooligan who is inspecting your site that we are indeed a company named TMBS, Ltd.. Now, how do we determine which category to register in? Here's how. Let's do a keyword search for tape management. (At the time of this writing, the #1 ranked company is a bad link. This presents an opportunity to purchase this position from the ISP or Hosting service that controls that sub-domain. Note: whenever you find a top rated Yahoo page in YOUR keyword category that no longer exists you should contact the owner of that domain and negotiate your way into that listing spot. Then, put a redirect on it to your main web site before Yahoo removes it from their catalog. This is a quick way to get a good listing on Yahoo.)
Onward with our search results... You should also note that each of the keywords are contained (and highlighted) in the company name and/or the description of the site. This proves that a keyword search can work well for you, provided that Yahoo happens to list categories and companies that match your keyword search. Such is not the case for all keywords and you should check before you begin your efforts so as not to waste your time trying to score high on a search that will only default to the Alta Vista search engine anyway. By examining this search results it becomes obvious that we want to be registered in the very first listed category... the one labeled: Business and Economy: Companies: Computers: Software: System Utilities: Backup ...simply because this is the category that comes up first for the keyword search tape management . Getting your company in the first category will almost insure a front page listing for most businesses in the Yahoo catalog... especially if your keywords are in the name of your company. OK... here's how you properly register from within that category. Click the category link... Business and Economy: Companies: Computers: Software: System Utilities: Backup After the next page loads you will see an icon at the top that says Add URL (the middle orange button after the Yahoo logo). Click the Add URL icon . A page titled Suggest A Site comes up with instructions on how to suggest a site. Once you start to read through the gobbly-gook they have on this page and the other pages they refer to, you'll wonder how anyone suggests a site to Yahoo without help! Regardless, go ahead and look at their recommendations. Once you've gone through their instruction nightmare you can skip it in the future. Ok, read the stuff ? Yes? Well alright then, -- now you are ready to push the Proceed to Step One button. Go ahead, click it... it won't explode. The next page comes up with the correct category already filled in. That's why you must start the Suggest a site process from the correct category page ... the one you want to be listed in. From here it is easy to follow the directions on the page. When you are finished, click the Proceed to step two button. Go ahead, practice. Nothing happens unless you push the submit button at the end of Step 4 . Step two gives you a chance to suggest an additional category that your company may belong in. It is self-explanatory and, in our example, the answer will depend on the information below. Therefore let us proceed. Backing up a little we want to make something perfectly clear. Do NOT submit anything to Yahoo until you are crystal clear exactly what two categories you wish to be listed in. And you should use the Add URL from your first choice category only . Your second choice category will be a fill-in that should be copy & pasted once you've selected it -- and then, is only completed at the time you are submitting your web site. That means that you do NOT submit until you have examined all of your options and you have chosen your top two categories. Got it? So, in an effort to choose the next category, let's do a keyword search from the Yahoo front page for tape backup systems. This time we get a category titled: Business and Economy: Companies: Computers: Hardware: Peripherals: Storage: Tape Drives ...and, if tape backup systems is what we sell then, obviously, that is the second category that we want to be listed in. Now is the time to copy and paste that second category choice into a notepad and save it for inclusion in the step two fill-in box. Since Yahoo will only list you in two categories, you can now initiate your suggestion process and, when finished, you are now done with that newly created company's web site submissions. The review process can take up to a month to complete. If you do not appear in the Yahoo index within two weeks, write them a nice letter of inquiry sort of as a reminder and to find out if you did anything wrong . If you still do not appear in 30 days, then write them another very nice letter of inquiry. Be sure to state the specifics (time, date, company name/title, page description, etc.). Chances are you will get (nicely) indexed -- provided you did everything right and you show a little patience. Now, here's a tip that will practically guarantee your submission will be accepted by Yahoo provided you do everything else right. Register and submit under an upper level domain name that reflects your keyword chosen company name . (example: tapemanagement.com or, tape-management.com ). That is likely to convince Yahoo that your company's name is indeed Tape Management Backup Systems, Ltd. and they will list you with your keywords automatically in the listing. Reality check : The domain registration will cost you $100 with InterNic. The domain hosting fee will be less than $30 per month. How much do you think a good listing on Yahoo is worth? If your budget can't handle an average of $35 per month for a very sweet listing on Yahoo then maybe you should reconsider being on the Internet. That's about as good as it gets, advertising-wise! Be aware that Step Three includes contact info, company name, email, phone, fax, address , etc, so be SURE to maintain a record of exactly what you tell them and make certain that the company name you give them matches the company name/title of your submitted web page. It is important to keep a record of your exact submission details especially if you plan on following our instructions for multiple keyword listings on Yahoo and/or if you already have listings on Yahoo .
Obtaining Multiple Listings on Yahoo Yahoo's policy? ...one listing -- and Yahoo gets to pick the category. That's if they list the company at all. Once you submit your site they reserve the right to change your title, description, category, or ignore you completely, all at the whim of some Yahoo Surfer that likely knows nothing whatsoever about your customers. Frankly, if the public knew how arbitrary and inconsistent Yahoo can be we doubt the directory would be so popular -- a phenomenon we attribute to arriving first on the scene and having a memorable name
Yahoo submission strategies...
Yahoo looks for links to pages that are already in it's directory.
Submitting a Multiple Listing Yes, this is a lot of work and something that we do ONLY for companies that stand to gain much from their Internet presence -- and from having their listing(s) in Yahoo. Overall, our experience has been that it's well worth the trouble because, unlike search engines, once you make it into the directory you are basically in forever. Good positioning can even become a salable asset. After all, Yahoo is the most visited site on the Internet.
More Yahoo Tips
Updating & Resubmitting pages to Yahoo
Another Submission Option Yahoo makes it difficult to find a email address to contact if you are having problems getting submitted, the only Official one that we have found is url-support@yahoo-inc.com . Wait at least two weeks before contacting them regarding submission failures. Make sure and include the date of your submission and if possible use the email address that you used during the original submission. The alternate would be to save the email address from a previous successful submission and use that one to contact them with. Note: It make take a MONTH or two for Yahoo to reply to this email address, but it does work, we have sent mail to it and confirmed that this is the right address.
Time Sensitivity?
One final note regarding Yahoo
Open Directory Project - http://www.dmoz.org
Indexed Pages: Approx. 1.5 Million
History For many, those first three characters in the GnuHoo domain name were very important, as they are the trademark of the Free Software Foundation or FSF for short. The GNU is associated with open code software like Linux, but GnuHoo wasn't giving away the software directory program code. After getting heat from the Open Source Community they changed their name to NewHoo in June 1998 and thereby avoided a trademark infringement issue. The directory continued to grow to upwards of 100,000 listings and started drawing attention from several of the other portal sites. Finally in November '98 Netscape purchased NewHoo for an undisclosed amount, renamed it the Netscape Open Directory and placed it under the umbrella of the free software movement of Mozilla.org. At this time Netscape actually did make the directory open . They created a free use license that allowed individuals and organizations to make copies of the directory. Things started happening pretty fast after that. Next AOL bought Netscape. Then AOL/Netscape dumped Excite as their search engine replacing it with a combination of the ODP and Inktomi. During the summer of 1999, amid a wave of acquisitions and mergers, other portals started liking the idea of not having to pay for, and administer, a directory. So, they too began incorporating the free ODP data into their sites as well. Today, ODP boasts more than 1.5 million sites, 22,000 editors, and 200,000 categories. The ODP database is used on more than 20 search engines and directories, including AltaVista, Netscape Open Directory, Netscape What's Related, Lycos, HotBot, Dogpile, and Thunderstone. It's also the default search for Netscape's browser. We generally equate a good listing in ODP to be worth about 1/4 to 1/2 as much traffic as a site listed well in the same category at Yahoo. This varies quite a bit depending on the topic area, but with all the engines using this directory it's not one to be overlooked.
How is ODP Used at Different Portals? Another reason that results may vary is because each site using ODP data can update their copy of the database whenever they want. In most cases you'll find they update on a monthly basis even though ODP itself often updates more frequently. You have the option to submit your sites from many of the different licensees sites but we recommend that you only submit from one to avoid duplicate submissions. We have not observed any advantage gained from submitting at any particular licensee site over another. Here's a brief rundown on each of the individual ODP sites; Netscape uses ODP extensively throughout their sites:
AOL -The search engine that AOL users see is http://aolsearch.aol.com/index.adp which gives the visitor an option of browsing the ODP directory or searching. Search results are composed with the following sections, from top to bottom of the page: The second search site is the one used by most web based users and can be found at search.aol.com, which now appears to be identical to aolsearch.aol.com. Although we haven't experimented with this one much, the use of Inktomi's search engine to index ODP pages appears to follow some of the same rules that we are used to at sites like HotBot. Keyword in the title, meta description, meta keyword and body text and URL are all important. On just a few sample pages, keyword density appears to be running around an even 2-3% for two word phrases. AOL appears to implement the new releases of the ODP data quicker than most of the other portals using ODP. AltaVista has incorporated ODP as their primary directory on their home page at http://www.altavista.com. When conducting a search at AltaVista for a single keyword, they present some matching categories from ODP at the bottom of the search results page. They also have added a Categories button at the top of the page which also takes you to their version of ODP. Lycos in April of '99, Lycos replaced their main search engine with ODP categories and selected web pages at the front of their search results listings. The Web Pages old Lycos search results now don't show up until the end of the ODP listings or unless the search phrase isn't found within ODP. The Lycos directory on the home pages is also powered by the database at ODP. Google in March 2000, Google.com augmented their search engine by indexed sites found within the ODP directory and using the directory structure to help make their search more relevant. Sites in the ODP directory have a definate advantage on this search engine. HotBot incorporates ODP as their home page directory and includes results from the ODP categories at the top of the search result listings.
DogPile ( www.dogpile.com) incorporates links from their home page to their version of ODP and also includes ODP listings in their search results. The search results at DogPile are found in the following order:
ODP Submission Guidelines Before submitting, remember that ODP is powered by human volunteers who operate in most cases according to a somewhat lose set of guidelines. One of which is sometimes referred to as WWYD or What would Yahoo do? This means basically that you shouldn't try any tricks that wouldn't work at Yahoo. The editors at ODP are often well versed in search engine promotion techniques and will frown upon your trying to use AAA or !AAA in your title (ODP's listings are alphabetical) unless you can really make a case that is the real name of your company or site. Even then be prepared for the editor to ignore you if you try that trick.
Other rules
Chapter Seven - High Tech Pages and Search Engines By using a frames page in a very specific way, it's possible to send the <no frames> section of the page you want indexed to the search engine and the frames content to the browser (humans). The <no frames> section would be your optimized source code for the engine. The <no frames> section DOES need to have logical text that is relevant to what is being displayed in the browser window -- just in case a human reviews the page manually. The <no frames> area of the page is basically invisible to users unless they use the view source option of their browser. Just remember to keep the content relevant to the page the humans will see. Refrain from making templates for the <no frames> area of the pages. AltaVista is trying to detect pages that are very similar -- the pages that only swap out a keyword or two. The content of your <no frames> sections needs to vary somewhat. Ideally each would be somewhat unique and focus on the topic that the page was built for. By the way, if your customer base consists of a significant portion of WebTV users you should test you pages on a WebTV browser. WebTV tries to combine frames and this can sometimes make a mess out of the page. That's the largest single browser demographic that currently doesn't support frames well. Also keep in mind that shopping cart pages on a secure server that are wrapped in frames from a non-secure server won't show the closed-lock icon on the browser window. You need to break out of the frame when leading people to the secure pages or they will understandably think the page is non-secure -- even though it is. This may cost some sales because that secure lock icon IS looked for by a lot of consumers. Computer programmers who design fancy pages may hate to hear this, but what we said for "Frames" also goes for JavaScript pages. The more Java, the less your chances of showing up high on a search engine search. Technical reasons again, but if your sales depend on being found, skip Java for now. There are plenty of other ways to spruce up your pages. In addition, our studies have shown that complex JavaScript & Frames in general both reduce sales. In the near future we will make available to you a report that explains why. For a good example, take a look at Yahoo, the highest traffic site on the web. Do they use frames, Java, or huge graphics? They waited till 1998 before changing their background color to white! See what they believe is state of the art - http://www.yahoo.com Almost all search engines will reject dynamically generated pages if they have extended characters in the URL (except for Lycos and Inktomi). This is primarily due to the fact they are worried of getting into what they call robot traps where there may be no end to the number of links that a script or program generates. If the URL contains a ? , % or other similar characters, they will probably not index your site, and most likely will not crawl these types of urls with a spider. In some cases such as at Inktomi and AltaVista, you might be able to get the engine to index a page with these characters when submitted manually, but the spider won’t follow it links with the characters in them. A work around is to build Pointer Pages using regular static html with links to the target page. If you attempt to use the <meta> refresh tag within the pointer pages, be aware that some engines will try to index your targeted page, not the page that you submit. There are ways around this problem, but they are quite complex. |
|
© 2003-2009 paultclark.com
|