Basic Computer Notions Web searching

Introduction

Quality control

Good Housekeeping Seal of Approval

On the Internet, nobody knows you're a dog
New Yorker, 1993 Jul 5, p. 61
The authoritativeness of a page can be assessed in terms of

Of course it's not just the Internet that can lie.
Some call-centre workers in India not only perfect American accents, but they claim American names and life stories if asked personal questions.
("Hi, I'm from Chicago", story by M. Landler, New York Times, seen in Montréal Gazette, 2001 Mar 22)

Subject directories

dmoz screenshot Subject directories provide human-generated hierarchical classifications of Web sites. These classifications are browsable (and maybe also searchable).

Subject directories do not include all (or even most) Web sites, just those which have somehow been identified as useful or relevant. There may be limited consistency in such evaluations. Explicit site ratings may be provided.

The selection of sites may be restricted to specific topics (e.g., health or medicine).

Some subject directories contain a lot of out-dated material and dead links (but then so do some search engines).

Examples of Subject directories

Search engines Screenshot of Alta Vista from Digital, 1996 October

Search engines provide the ability to search for text strings or words within large collections of text and addresses.
These collections are automatically generated by robots (or spiders) which mechanically traverse the Web and collect text.
The text is automatically converted by an indexing engine to a format which can be efficiently processed by a search engine.

Many search engines began as research projects and evolved into commercial operations including ‘portals’ and supported by advertising, paid services, etc.

There is strong competition and rapid change.

Examples of search engines

Effective searching

Based in part on Effective Web Searching: an introductory slide show by Karen Campbell, 1997 March.

Planning a search

Traecy time rule: How much time is the answer worth?

If searching via a slow Internet connection (e.g., modem and telephone line), consider turning off your browser's graphics. Or do the search later where you will have direct Internet access (e.g., at school).

Do you require a quick answer or a comprehensive answer?

What other resources do you have?

Using the tools

Screwdriver used as crowbar

Start with a subject directory unless you have very specific query.

Know the engines

Common methods

The following are less popular than they used to be:

Relevance ranking

Virtually all Web search engines attempt to rank hits by relevance. There are various methods for estimating relevance.

Look only at the first 20 or 30 hits to judge whether the search has succeeded.

If it hasn't, try different search words, or try a different search engine, or try a subject directory, or try Medline, or ...

A good search strategy for some search engines is to include several terms in your search, including synonyms.

Keep track of your results

Further information

Safe surfing

Remember the cautions about opening attachments, Word macros, etc.

Set the security level of your Web browser to refuse any ActiveX control or plug-in which is not signed by an entity you trust. ActiveX controls and plug-ins have complete access to your machine, unlike Java applets which are restricted to a sandbox (details).

Java (and Javascript) have also suffered from security problems in various versions of various browsers; use up-to-date software and the latest security patches, and/or disable Java and Javascript.


Bacon home page
R. Funnell
Last modified: Wed, 2007 Mar 7 13:41:03