Index

A

-abspath 2-8
aliases for file paths
-prefixmap 3-7
-allold 2-3
-auth 2-14
and inetsrch.ini 3-2
authenticating
for indexing sites 3-2
proxy servers 3-24
secure paths 3-2
with HTTP_PROXY 3-25
Authfile
in inetsrch.ini 2-14

B

-bulksize 3-13

C

caching
of downloaded documents 2-11
capabilities
default 1-5
-casesen 2-20
CGI environment variables
for proxy authentication 3-25
HTTP_NOPROXY 3-25
HTTP_PROXY 3-25
-cgiok 2-15
-charmap 2-26
-cmdfile 2-8
-coll 3-18
-collection 2-8, 3-13
collection servicers (collsvc)
and Verity spider 3-12
arguments 3-13
collsvc utility 3-11
examples 3-17
shared library variable 3-11
collections
incompatibility map 1-11
meta versus universal 1-11
no optimizing 2-27
optimizing 4-7
style files 2-11
updating 1-10
upgrading 1-11, B-1
collsvc arguments
-bulksize 3-13
-collection 3-13
-delquery 3-13
-delschedule 3-14
-indexers 3-14
-indexmode 3-14
-interval 3-14
-logfile 3-14
-loglevel 3-15
-maxmemperproc 3-15
-mergemode 3-15
-mergers 3-15
-pidfile 3-15
-squeezeschedule 3-16
-tmpdir 3-16
-common 2-26
-connections 2-12
control file
for -prefixmap 3-7
customizing
Last-Modified date 3-22

D

-date 3-19
date formats
for Last-Modified date 4-10
-datefmt 2-26
-debug 2-26
default capabilities 1-4
-delay 2-12
-delquery 3-13
-delschedule 3-14
direct-access hosts 2-13, 3-25
discontinued options
-allold 2-3
-jobs 2-3
-nodns 2-3
-noupdate 2-3
-topicset 2-3
disk cache 2-11
document viewing
indexed secure paths 3-2
-domain 2-15

E

environment variables (CGI)
for proxy authentication 3-25
HTTP_NOPROXY 3-25
HTTP_PROXY 3-25
error messages A-1
examples
collection servicers (collsvc) 3-17
indexing tasks 4-1
Last-Modified date 3-23
prefix mapping 3-8- 3-10
using -abspath 3-7
using -auth 3-3
using -prefixmap 3-7
using -prunedir 3-4
vsdb utility 3-20
Exchange
indexing with Verity 1-2
-exclude 2-20
extract
limitations B-4
upgrading collections B-2

F

failed URLs
using -restart 2-7
file path resolution
to aliases 3-7
file system indexing
known MIME types 2-31
unknown MIME types 2-30

H

-header 2-12
-help 2-8
-host 2-16
-hostcache 2-13
hosts
using direct access 2-13, 3-25
HTML documents
and Last-Modified date 3-21
HTTP_NOPROXY 3-25
HTTP_PROXY 3-25

I

imported collections
and -resync 2-7
-include 2-20
incompatibility
of collections 1-11
-indexclude 2-21
-indexers 2-9
collsvc utility 3-14
indexing
and disk cache 2-11
and meta tags 3-5
file paths 3-7
job workflow 1-9
Microsoft Exchange 1-2
ODBC databases 1-2
secure paths using -auth 3-2
topics 4-5
workflow 4-2
indexing task
examples 4-1
status reporting 3-18
-indexmode 3-14
-indinclude 2-22
-indmimeexclude 2-23
-indmimeinclude 2-23
inetsrch.ini
and upgrading collections B-3
Authfile entry 2-14
supporting use of -auth 3-2
Information Server
and proxy access 3-24
internationalization
useful options 2-26
-interval 3-14

J

job syntax
vspider 2-2
-jobs 2-3
-jumps 2-16

L

-language 2-26
Last-Modified date 3-21
customizing 3-22
determining 3-22
example 3-23
how it is used 3-21
valid formats 4-10
-license 2-9
licensing 1-5
limiting paths
during directory walking 3-4
during web crawling 3-3
-locale 2-26
localization
useful options 2-26
-logfile
for collsvc 3-14
logging options (vspider)
-verbose 2-26
-debug 2-26
-trace 2-26
-loglevel
for collsvc 3-15

M

-match 3-18
-maxdocsize 2-24
-maxindmem 2-9
-maxmemperproc 3-15
-mergemode 3-15
-mergers 3-15
meta collections
and searching 1-11
not supported 1-5
upgrading 1-11
versus universal collections 1-11
meta tags
adding a field 3-5
indexing 3-5
meta2uni.pl
arguments B-5
editing B-5
-metafile 2-25
MIME types
and file system indexing 2-29
and web crawling 2-29
for file system indexing 2-31
indexing unknown 2-30
setting 2-28
-mimeexclude 2-24
-mimeinclude 2-24
mktopics
using 4-5
mkvdk
upgrading collections B-2
using with -nooptimize 4-7
-msgdb 2-26

N

new options 2-4
-nodns 2-3
-noflock 2-9
-nofollow 2-16
-noindex 2-9
-nooptimize 2-27
-noproxy 2-13
-norobo 2-16
-nostorage 2-10
-noupdate 2-3

O

ODBC
indexing with Verity 1-2
optimizing
with mkvdk 2-27, 4-7
overriding Last-Modified date 3-22

P

-parent 3-19
-pathlen 2-17
Perl
and upgrading collections B-5
where to get it B-5
persistent searches
and squeezes 3-11
persistent store
and platform dependence 1-3
-pidfile 3-15
prefix mapping 3-7
using the control file 3-7
-prefixmap 2-10
-print 3-19
providing Last-Modified date 3-22
-proxy 2-13
proxy access
and Information Server 3-24
proxy servers
authenticating 2-13, 3-24
specifying 2-13, 3-24
-proxyauth 2-13
-prunedir 2-17
-purge 2-27

R

-refresh 2-6
-refreshtime 2-18
-repair 2-27
-reparse 2-18
reporting
Verity spider status 3-18
vsdb 3-18
-restart 2-7, 2-11
status of reparsed URLs 3-19
restarting
status of reparsed URLs 2-7
-resync 2-7
-retry 2-14

S

searching meta collections 1-11
security
indexing secure sites 3-2
setting MIME types 2-28
multiple parameter values 2-28
using the asterisk (*) 2-28
shared library variable
collection servicers (collsvc) 3-11
-squeezeschedule 3-16
-start 2-6
-status 3-19
of URLs for a restart 3-19
-style 2-11
style files
default location 2-11
fields for meta tags 3-5
-submitsize 2-11
-summary 2-11
synchronization
for imported collections 2-7
with -noindex 2-9
syntax
vspider job 2-2

T

-temp 2-11
-timeout 2-14
-tmpdir 3-16
topics
how to index 4-5
-topicset 2-3
using mktopics instead 4-5
-trace 2-26

U

-unlimited 2-18
updating collections
fewer documents 1-10
upgrading collections
and inetsrch.ini B-3
from meta collections 1-6, 1-11
using a Perl script B-5
using meta2uni.pl B-5
with extract and mkvdk B-2
URLs
parsed in a restart 2-7
utilities
collection servicers (collsvc) 3-11
mktopics 2-3, 4-5

V

-verbose 2-26
Verity spider reporting 3-18
examples 3-20
viewing documents
indexed from secure paths 3-2
-virtualhost 2-19
vsdb arguments
-coll 3-18
-date 3-19
-match 3-18
-parent 3-19
-print 3-19
-status 3-19
vsdb utility 3-18
examples 3-20
vspider
and direct access 3-25
and proxy access 2-13
job syntax 2-2
licensing 1-5
proxy access 3-24
vspider options
-abspath 2-8
-auth 2-14
-casesen 2-20
category map 2-5
-cgiok 2-15
-charmap 2-26
-cmdfile 2-8
-collection 2-8
-common 2-26
-connections 2-12
-datefmt 2-26
-debug 2-26
-delay 2-12
discontinued 2-3
-domain 2-15
-exclude 2-20
for internationalization 2-26
for localization 2-26
-header 2-12
-help 2-8
-host 2-16
-hostcache 2-13
-include 2-20
-indexclude 2-21
-indexers 2-9
-indinclude 2-22
-indmimeexclude 2-23
-indmimeinclude 2-23
-jumps 2-16
-language 2-26
-license 2-9
-locale 2-26
-maxdocsize 2-24
-maxindmem 2-9
-metafile 2-25
-mimeexclude 2-24
-mimeinclude 2-24
-msgdb 2-26
new 2-4
-nodns 2-3
-noflock 2-9
-nofollow 2-16
-noindex 2-9
-nooptimize 2-27
-noproxy 2-13
-norobo 2-16
-nostorage 2-10
-pathlen 2-17
-prefixmap 2-10
-proxy 2-13
-proxyauth 2-13
-prunedir 2-17
-purge 2-27
-refresh 2-6
-refreshtime 2-18
-repair 2-27
-reparse 2-18
-restart 2-7, 2-11
-resync 2-7
-retry 2-14
-start 2-6
-style 2-11
-submitsize 2-11
-summary 2-11
-temp 2-11
-timeout 2-14
-topicset 2-3
-trace 2-26
-unlimited 2-18
-verbose 2-26
-virtualhost 2-19

W

what's new 1-6
workflow
indexing job 1-9




Copyright © 1998, Verity, Inc. All rights reserved.