Help With

How To

Settings & Misc

Help Wurd



The PWP@att.net Search Spider

The search spider is a program that indexes pages at home.att.net for use in your site search feature. From what we understand, the search spider will start at whatever default page is seen at http://home.att.net/~emailid and will follow all HREFs off that page that are in the home.att.net domain. Then, it will follow all HREFs to pages on home.att.net on each of those pages and so on until it exhausts all of them.

The default page that is highest in the priority will be the one that is used as the starting point. If a page isn't linked from the starting point, it won't be indexed.

The current list of default pages (in order of priority) is:

  1. index.html
  2. index.htm
  3. home.html
  4. home.htm
  5. personal.html
  6. resume.html
  7. my_business.html
  8. my_assoc.html
  9. wsb.html
  10. store.html

For example, if you have an index.html and an home.html page, the index.html would be the spiders starting point and the home.html would not be spidered unless there is a link to it from a page that is spidered. Also, if you don't use one of the default pages shown above, your site won't be spidered unless the spider follows a link from another site.

Some additional info on the search spider:

If you have a subset of pages that aren't linked from the starting point page, consider linking to the main subset page using an HREF in a comment. If you have a lot of individual pages that you want spidered, but aren't individually linked on any page, make a page with all of those links on it (aka Site Map) and link to THAT page in an HTML comment. You could do the same type of thing with pages that are otherwise linked only via javascript. We've provided a sample link to a site map within an HTML comment below:

<!-- This is a comment and won't be displayed.
<a href="http://www.wurd.com">WURD</a>
-->

Here are some additional points about PWP Search:

If you fear that your site isn't being indexed:

  1. Make sure that you have search enabled in your profile:
    http://publish.att.net/cgi-bin/profile
  2. Make sure that the first indexable page in your site (i.e., the page that is retrieved from the URL http://home.att.net/~emailid/) contains links to the other pages in your site.
    Some common misconceptions on site indexing:
    1. It is the first_indexable_file that is retrieved, not some file that users send out the URL for as their homepage. e.g., if they send out the link http://home.att.net/~jims.stuff/jims-stuff.htm as their homepage in e-mail, but there isn't an indexable file for http://home.att.net/~jims.stuff/, or it doesn't contain links to the rest of their site, (especially a link to /~jims.stuff/jims-stuff.htm) their site won't be indexed as they might like.
    2. The spider doesn't crawl off-site. Ever. It doesn't index other hosting providers' content. If the user's have links going off-site, or are linked to a domain redirector to stealthily display their PWP content, it won't work.
    3. The spider won't index files in the user's PWP space that aren't linked to from their first indexable page. It's an HTTP spider, not a disk spider.
  3. There is no automated process to update their indexing preference in the user's PWP profile. If site search is turned on, then the user explicitly did it. If site search is turned off, then either they never turned it on, or they explicitly turned it off.
  4. It is theoretically possible for the user to cause the state of the search to be out of sync between the publishing server preferences and the search server start files, although it would be very difficult to do, in practice. If the user's site really, truly isn't being indexed, as verified by checking the points in 1 and 2 above, then the user should check the search preference in the PWP profile:
    1. If it isn't what they desire (the nasty double clicker or simply forgetful user), then they should change the preference to what they want.
    2. If the state is what they think it should be, then they should toggle it once, wait for the form to submit, count to ten, and toggle it back to what they desire.

Back to top

Need Additional Help?

If you can't find the answers you need, please try:

  1. The help file for the application you are using.
  2. Our FAQs.
  3. The AT&T Worldnet Help Newsgroups.
Note: AT&T DSL Service support is available at http://dslhelp.att.net