Posts Tagged ‘searching’

Searching For Multiple Words

Tuesday, July 15th, 2008

Use the word OR to require that one or the other term be found in the search results. For example, giants OR baseball (include a space on each side of the OR) lists sites containing “giants” or “baseball.” You can combine AND, OR, AND NOT by using parentheses. For example, to find documents that contain the word giants but not either the word baseball or football type giants NOT (baseball OR football). You could also type this giants -(baseball OR football). Note: You cannot begin a search with a “-” term. You must put some other search term first.

Searching For Excluded Words

Tuesday, July 15th, 2008

Use the minus sign (-) before a word or the word NOT to require that it not be found in the search results. For example, giants -baseball (include a space between the first word and the - symbol) or giants NOT baseball lists sites containing “giants” but not “baseball.” Some engines like AND NOT (two words) or ANDNOT (one word) better than just NOT.

Searching For Required Words

Tuesday, July 15th, 2008

You can type the plus sign (+) or the word AND before a word to require that it be found in all of the search results. For example, giants +baseball (include a space between the first word and the + symbol) or giants AND baseball returns all listings that contain “baseball” and “giants” but not necessarily together.

Searching for an Exact Phrase

Tuesday, July 15th, 2008

To require that an entire phrase be found in a search,enter quotes (” “) around the terms. For example, “giants baseball” returns listings where the words “giants” and “baseball” appear together and in that order, either in the title, the URL of the Web site, the description, the keywords, or the document. If no sites are found that contain both terms, sites that contain either term will be displayed.

Search Engine Optimization-Searching by Means of Subject Directories

Tuesday, July 15th, 2008

Think back to the library card catalogue analogy.  In the old card files, and even in today’s computer terminal library catalogues, you find information by searching on either the author, the title, or the subject.  You usually choose the subject option when you want to cover a broad range of information.

Example:  You’d like to create your own home page on the Web, but you don’t know how to write HTML, you’ve never created a graphic file, and you’re not sure how you’d post a page on the Web even if you knew how to write one.   In short, you need a lot of information on a rather broad topic–Web publishing.

Your best bet is not a search engine, but a Web directory like the Open Directory Project,  Google Directory or  Yahoo.  A directory is a subject-tree style catalogue that organizes the Web into major topics, including Arts, Business and Economy, Computers and Internet, Education, Entertainment, Government, Health, News, Recreation, Reference, Regional, Science, Social Science, Society and Culture.  Under each of these topics is a list of subtopics, and under each of those is another list, and another, and so on, moving from the more general to the more specific.

Example: To find out about Web page publishing from Yahoo, select the Computers and Internet Topic, under which you find a subtopic on the Wide World Web. Click on that and you find another list of subtopics, several of which are pertinent to your search: Web Page Authoring, CGI Scripting, Java, HTML, Page Design, Tutorials.  Selecting any of these subtopics eventually takes you to Web pages that have been posted precisely for the purpose of giving you the information you need.

If you are clear about the topic of your query, start with a Web directory rather than a search engine.  Directories probably won’t give you anywhere near as many references as a search engine will, but they are more likely to be on topic.

Web directories usually come equipped with their own keyword search engines that allow you to search through their indices for the information you need.

Important note:  Search engines and  Web directories are being integrated in interesting ways.    For example, if you use the Google search engine and one of the results happens to be found in the Google’s Directory (which is based on the dmoz directory), Google will offer you a link to that section of the directory. Meanwhile, if you conduct your search in the Google directory, Google will order the results according to PageRank, which is   Google’s all-important measure of  “link popularity.”

How many times should you use keywords in a press release?

Tuesday, July 15th, 2008

As a guideline, in a press release that’s 500 words, we’ll use the phrase 2-4 times. We’ll also use variations of that keyword phrase. Search engines are smart enough that when documents are identified as being authoritative for a particular concept, the presence of an exact match keyword phrase will often be accompanied by related phrases. Keyword research will give insight not only on the phrases people are actually searching on but also related phrases.

Building and using a Sitemap

Monday, June 30th, 2008

The purpose of a sitemap is to enable search engines to index all the pages on a site being optimzed. Some search engines like Google recommend that you include a sitemap to speed up the indexing process reduce the risk of pages being skipped.

A sitemap can help more pages be listed, because not all search engines will go more than 2-3 link levels deep.

A secondary, but useful purpose of a sitemap is to assist visitors in finding their way around the site. Building a sitemap is a tedious chore and many sitemaps are neglected and go out-of-date as a result. So, after many hours of intense searching; we have found a wonderful solution.

SEO:Concept-based searching

Monday, June 30th, 2008

Excite used to be the best-known general-purpose search engine site on the Web that relies on concept-based searching.  It is now effectively extinct.

Unlike keyword search systems, concept-based search systems try to determine what you mean, not just what you say.  In the best circumstances, a concept-based search returns hits on documents that are “about” the subject/theme you’re exploring, even if the words in the document don’t precisely match the words you enter into the query.

How did this method work?  There are various methods of building clustering systems, some of which are highly complex, relying on sophisticated linguistic and artificial intelligence theory that we won’t even attempt to go into here.  Excite used to a numerical approach.  Excite’s software determines meaning by calculating the frequency with which certain important words appear.  When several words or phrases that are tagged to signal a particular concept appear close to each other in a text, the search engine concludes, by statistical analysis, that the piece is “about” a certain subject.

For example, the word heart, when used in the medical/health context, would be likely to appear with such words as coronary, artery, lung, stroke, cholesterol, pump, blood, attack, and arteriosclerosis.  If the word heart appears in a document with others words such as flowers, candy, love, passion, and valentine, a very different context is established, and a concept-oriented search engine returns hits on the subject of romance.

SEO:The Problem With Keyword Searching

Monday, June 30th, 2008

Keyword searches have a tough time distinguishing between words that are spelled the same way, but mean something different (i.e. hard cider, a hard stone, a hard exam, and the hard drive on your computer). This often results in hits that are completely irrelevant to your query. Some search engines also have trouble with so-called stemming — i.e., if you enter the word “big,” should they return a hit on the word, “bigger?” What about singular and plural words? What about verb tenses that differ from the word you entered by only an “s,” or an “ed”?

Search engines also cannot return hits on keywords that mean the same, but are not actually entered in your query. A query on heart disease would not return a document that used the word “cardiac” instead of “heart.”

SEO:Keyword Searching

Monday, June 30th, 2008

This is the most common form of text search on the Web.  Most search engines do their text query and retrieval using keywords.

What is a keyword, exactly?  It can simply be any word on a webpage.  For example, I used the word “simply” in the previous sentence, making it one of the keywords for this particular webpage in some search engine’s index.   However, since the word “simply” has nothing to do with the subject of this webpage (i.e., how search engines work), it is not a very useful keyword.   Useful keywords and key phrases for this page would be “search,” “search engines,” “search engine methods,” “how search engines work,” “ranking” “relevancy,” “search engine tutorials,” etc.  Those keywords would actually tell a user something about the subject and content of this page.

Unless the author of the Web document specifies the keywords for her document (this is possible by using meta tags), it’s up to the search engine to determine them.  Essentially, this means that search engines pull out and index words that appear to be significant.  Since since engines are software programs, not rational human beings, they work according to rules established by their creators for what words are usually important in a broad range of documents.  The title of a page, for example, usually gives useful information about the subject of the page (if it doesn’t, it should!).  Words that are mentioned towards the beginning of a document (think of the “topic sentence” in a high school essay, where you lay out the subject you intend to discuss) are given more weight by most search engines.   The same goes for words that are repeated several times throughout the document.

Some search engines index every word on every page. Others index only part of the document.

Full-text indexing systems generally pick up every word in the text except commonly occurring stop words such as “a,” “an,” “the,” “is,” “and,” “or,” and “www.”  Some of the search engines discriminate upper case from lower case; others store all words without reference to capitalization.