Posts Tagged ‘reference’

Search Engine Optimization-Searching by Means of Subject Directories

Tuesday, July 15th, 2008

Think back to the library card catalogue analogy.  In the old card files, and even in today’s computer terminal library catalogues, you find information by searching on either the author, the title, or the subject.  You usually choose the subject option when you want to cover a broad range of information.

Example:  You’d like to create your own home page on the Web, but you don’t know how to write HTML, you’ve never created a graphic file, and you’re not sure how you’d post a page on the Web even if you knew how to write one.   In short, you need a lot of information on a rather broad topic–Web publishing.

Your best bet is not a search engine, but a Web directory like the Open Directory Project,  Google Directory or  Yahoo.  A directory is a subject-tree style catalogue that organizes the Web into major topics, including Arts, Business and Economy, Computers and Internet, Education, Entertainment, Government, Health, News, Recreation, Reference, Regional, Science, Social Science, Society and Culture.  Under each of these topics is a list of subtopics, and under each of those is another list, and another, and so on, moving from the more general to the more specific.

Example: To find out about Web page publishing from Yahoo, select the Computers and Internet Topic, under which you find a subtopic on the Wide World Web. Click on that and you find another list of subtopics, several of which are pertinent to your search: Web Page Authoring, CGI Scripting, Java, HTML, Page Design, Tutorials.  Selecting any of these subtopics eventually takes you to Web pages that have been posted precisely for the purpose of giving you the information you need.

If you are clear about the topic of your query, start with a Web directory rather than a search engine.  Directories probably won’t give you anywhere near as many references as a search engine will, but they are more likely to be on topic.

Web directories usually come equipped with their own keyword search engines that allow you to search through their indices for the information you need.

Important note:  Search engines and  Web directories are being integrated in interesting ways.    For example, if you use the Google search engine and one of the results happens to be found in the Google’s Directory (which is based on the dmoz directory), Google will offer you a link to that section of the directory. Meanwhile, if you conduct your search in the Google directory, Google will order the results according to PageRank, which is   Google’s all-important measure of  “link popularity.”

SEO:Keyword Searching

Monday, June 30th, 2008

This is the most common form of text search on the Web.  Most search engines do their text query and retrieval using keywords.

What is a keyword, exactly?  It can simply be any word on a webpage.  For example, I used the word “simply” in the previous sentence, making it one of the keywords for this particular webpage in some search engine’s index.   However, since the word “simply” has nothing to do with the subject of this webpage (i.e., how search engines work), it is not a very useful keyword.   Useful keywords and key phrases for this page would be “search,” “search engines,” “search engine methods,” “how search engines work,” “ranking” “relevancy,” “search engine tutorials,” etc.  Those keywords would actually tell a user something about the subject and content of this page.

Unless the author of the Web document specifies the keywords for her document (this is possible by using meta tags), it’s up to the search engine to determine them.  Essentially, this means that search engines pull out and index words that appear to be significant.  Since since engines are software programs, not rational human beings, they work according to rules established by their creators for what words are usually important in a broad range of documents.  The title of a page, for example, usually gives useful information about the subject of the page (if it doesn’t, it should!).  Words that are mentioned towards the beginning of a document (think of the “topic sentence” in a high school essay, where you lay out the subject you intend to discuss) are given more weight by most search engines.   The same goes for words that are repeated several times throughout the document.

Some search engines index every word on every page. Others index only part of the document.

Full-text indexing systems generally pick up every word in the text except commonly occurring stop words such as “a,” “an,” “the,” “is,” “and,” “or,” and “www.”  Some of the search engines discriminate upper case from lower case; others store all words without reference to capitalization.