|
|
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML//EN">
<html>
<head> <title>Microsoft Index Server Guide: Catalogs</title> <meta name="FORMATTER" content="Microsoft FrontPage 1.1"> <meta name="GENERATOR" content="Microsoft FrontPage 1.1"> </head>
<body bgcolor="#FFFFFF"> <!--Headerbegin--><p align=center><a name="TOP"><img src="onepix.gif" alt="Space" align=middle width=1 height=1></a> <a href="default.htm#Top"><img src="toc.gif" alt=" Contents" align=middle border=0 width=89 height=31></a> <a href="sechelp.htm"><img src="previous.gif" alt="Previous" align=middle border=0 width=32 height=31></a> <a href="reghelp.htm"><img src="next.gif" alt="Next" align=middle border=0 width=32 height=31></a> </p> <hr> <!--Headerend--><p align=left><a name="Catalogs"><font size=6><strong>Catalogs</strong></font></a></p> <p align=left><!--Chaptoc--></p> <blockquote> <p><a href="cathlp.htm#TheDefaultCatalog">The Default Catalog</a> <br> <a href="cathlp.htm#multiple">Multiple Catalogs</a> <br> <a href="cathlp.htm#MovingandRemovingaCatalog">Moving and Removing a Catalog</a> <br> <a href="cathlp.htm#PhysicalProperties">Physical Properties</a> <br> </p> </blockquote> <hr> <!--ChaptocEnd--><p>A <em>catalog</em> is the highest-level unit of organization in Microsoft Index Server. Each catalog is a completely self-contained unit, containing an index and cached properties for one or more scopes (virtual roots). In this version of Index Server, queries cannot span multiple catalogs.</p> <hr> <h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="TheDefaultCatalog">The Default Catalog</a></h1> <p>The initial catalog location is determined during setup and stored in the <a href="reghelp.htm#IsapiDefaultCatalogDirectory">IsapiDefaultCatalogDirectory</a> registry entry. If the <a href="idq-vars.htm#CiCatalog"><strong>CiCatalog</strong></a> parameter is not specified in an .idq file, then this default catalog is used. Intially, the default catalog contains an index of all virtual roots with read access, but the scope of the catalog can be <a href="adminhlp.htm#AddVRoot">modified</a> through administrative pages.</p> <hr> <h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="multiple">Multiple Catalogs</a></h1> <p>There are two reasons to create more than one catalog: to distribute queries, and to support virtual servers. Because it is impossible to create a query that spans more than one catalog, careful consideration of the consequences must be made before creating multiple catalogs. The ability to query “everything” is lost. Use of default catalog support is also impaired: no support for multiple IP address-specific <em>default</em> catalogs is provided.</p> <p>Physically dividing the set of virtual roots between a number of catalogs will improve query performance, but only if most queries are restricted to a subscope (one or more virtual roots) of the Web site. This improvement occurs because the number of false hits (hits outside the query scope) will be reduced.</p> <h3>Creating an Additional Catalog</h3> <h4>To create a catalog</h4> <ol> <li>Create a directory named Catalog.wci at the desired location. </li> <li>Set the appropriate <a href="sechelp.htm#CatalogPerm">catalog permissions</a>.</li> <li>Then make appropriate modifications to the <a href="idq-vars.htm#CiCatalog"><strong>CiCatalog</strong></a> parameter of .idq files to point to that location.</li> </ol> <p>For example, if you put the catalog in D:\Vdom2\Catalog.wci, the <strong>CiCatalog</strong> is set to D:\Vdom2. The <strong>CiCatalog</strong> specification should not include the Catalog.wci directory itself. The initial query against the catalog will start the indexing process. You may want to <a href="adminhlp.htm#AddVRoot">modify the set of virtual roots</a> covered by the new catalog.</p> <h3>Associating a Catalog with a Virtual Server</h3> <p>By default, a catalog is not associated with a specific virtual server, and only virtual roots <em>without</em> a specific IP address are added to the catalog. To associate a catalog with a specific virtual server, add an entry under the <a href="reghelp.htm#MaxFilesizeMultiplier">IsapiVirtualServerCatalogs</a> subkey. The name of the value is the IP address of the virtual server, and the value is the catalog location. For example, a catalog located in the root of drive G: that contains virtual roots accessible only from IP address 123.45.67.89 would look like: ...\CurrentControlSet\Control\ContentIndex\IsapiVirtualServerCatalogs\123.45.67.89 = G:\. After making this registry change, stop and restart Internet Information Server. Then, issue a query against the catalog for 123.45.67.89.</p> <p>The previous procedure will add virtual roots (specific to the indicated virtual server) to the set of scopes covered by the new catalog. All virtual roots without a specific IP address will be available as well. Note that by default, common roots are indexed in all catalogs. Also, be sure to update the .idq and .ida files if you have changed them.</p> <h3>Multiple Catalogs for Multiple Virtual Servers</h3> <p>The following example shows how to set up multiple catalogs for a configuration containing multiple virtual servers. </p> <p>In the following example, assume that you want to set up multiple catalogs for the following scenario:</p> <ul> <li><p align=left>The IP address of the virtual server is 123.45.67.89.</p> </li> <li><p align=left>The following virtual roots are set up for 123.45.67.89</p> <table border=1 cellpadding=5 cellspacing=0 width=75%> <tr><th align=left valign=bottom width=10%><font size=2>Virtual Root </font></th><th align=left valign=bottom width=33%><font size=2>Physical Path</font></th><th align=left valign=bottom width=15%><font size=2>Permissions</font></th></tr> <tr><td valign=top width=10%><font size=2>/</font></td><td valign=top width=33%><font size=2>D:\Vserver2\Root </font></td><td valign=top width=15%><font size=2>Read</font></td></tr> <tr><td valign=top width=10%><font size=2>/Scripts1</font></td><td valign=top width=33%><font size=2>D:\Vserver2\Scripts</font></td><td valign=top width=15%><font size=2>Execute</font></td></tr> <tr><td valign=top width=10%><font size=2>/Reports</font></td><td valign=top width=33%><font size=2>D:\Vserver2\Reports</font></td><td valign=top width=15%><font size=2>Read </font></td></tr> </table> </li> </ul> <ul> <li><p align=left>Microsoft Index Server data will be put in the directory D:\Vserver2_index. Better performance can be achieved by having Microsoft Index Server data on a different drive than the drive that contains the corpus, but for illustration purposes this example puts it in the same drive.</p> </li> <li><p align=left>Sample files installed by Microsoft Index Server will be copied and modified to work with the virtual server 123.45.67.89.</p> </li> </ul> <h4>To set up multiple catalogs for multiple virtual servers</h4> <ol> <li><p align=left>Create a directory D:\Vserver2_index\Catalog.wci . If D: is a Windows NT File System (NTFS) drive, set an ACL on it giving full access to SYSTEM and Administrators accounts only. For the highest security, Microsoft recommends installing Index Server data on an NTFS drive.</p> </li> <li><p align=left>Add an entry in the registry under the key<br> HKEY_LOCAL_MACHINE\System\CurrentControlSet\Control\contentindex\<br> ISAPIVirtualServerCatalogs\123.45.67.89=d:\vserver2_index<br> <br> Note that key value does not include the directory Catalog.wci.</p> </li> <li><p align=left>Copy all the query forms(.htm) files from the default installation to D:\Vserver2\Root\Samples\Search. (Create the directory hierarchy if necessary.)</p> </li> <li><p align=left>Copy all the query script and administration files (.idq , .ida, and .htx) files from the default installation to D:\Vserver2\Scripts\Samples\Search. (Create the directory hierarchy if necessary.)</p> </li> <li><p align=left>Modify the <strong>CiCatalog</strong> variable specification in all the .idq and .ida files in D:\Vserver2\Scripts\Samples\Search to point to D:\Vserver2_index. Make sure it does not have any comments in it. (It should look like <code>CiCatalog=d:\vserver2_index</code>. Again note that catalog.wci is not included in the specification).</p> </li> <li><p align=left>Modify the <strong>CiTemplate</strong> parameter in the .idq and .ida files to point to /Scripts1/samples/search. Note the change from “Scripts” to “Scripts1” for the virtual root.</p> </li> <li><p align=left>Modify the .htm files in D:\Vserver2\Root\Samples\Search to point to the appropriate .idq/.ida files. You may have to change the path from “/Scripts/Samples/Search” to “/Scripts1/Samples/Search”.</p> </li> <li><p align=left>Issue a query by connecting to http://123.45.67.89/samples/search/query.htm. This will start indexing for the virtual server 123.45.67.89</p> </li> </ol> <p>You can of course customize the query forms to restrict access to specific scopes. You may also have to modify the administration forms separately.</p> <hr> <h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="MovingandRemovingaCatalog">Moving and Removing a Catalog</a></h1> <p>It is safe to move a Catalog.wci directory to a new location. Similarly, to delete a catalog you need only delete the Catalog.wci directory. It is a good idea to update the relevant registry entries (<a href="reghelp.htm#IsapiDefaultCatalogDirectory">IsapiDefaultCatalogDirectory</a> and <a href="reghelp.htm#MaxFilesizeMultiplier">IsapiVirtualServerCatalogs</a>) as well. Internet Information Server or Peer Web Services must be stopped before you remove or move catalogs. It is recommended that you review all your .idq files to make sure there are no references to nonexistent catalogs.</p> <hr> <h1><a href="#TOP"><img src="up.gif" alt="To Top" align=middle border=0 width=14 height=11></a><a name="PhysicalProperties">Physical Properties</a></h1> <p>The catalog is physically stored in a user-specified location. A directory named Catalog.wci is created there. Index server creates a number of files in this directory, including: </p> <div align=left> <table border=1 cellpadding=5 cellspacing=0 width=100%> <tr><th align=left valign=bottom width=15%><font size=2>File</font></th><th align=left valign=bottom width=85%><font size=2>Purpose</font></th></tr> <tr><td valign=top width=15%><font size=3><tt>000</tt><em><tt>nnnn</tt></em><tt>.prp</tt></font></td><td valign=top width=85%><font size=2>Holds cached properties. The name is the version of the property cache. Each modification to the property-cache schema increments this number by one.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>0001</tt><em><tt>nnnn</tt></em><tt>.ci</tt></font></td><td valign=top width=85%><a href="indexhlp.htm#ShadowIndex"><font size=2>Shadow</font></a><font size=2> and </font><a href="indexhlp.htm#MasterIndex"><font size=2>master</font></a><font size=2> indexes. Each index has a unique number <em>nnnn</em>.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>0001</tt><em><tt>nnnn</tt></em><tt>.dir</tt></font></td><td valign=top width=85%><font size=2>Directory used to quickly search the similarly named shadow index or master index.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>cicat.hsh</tt></font></td><td valign=top width=85%><font size=2>Quickly converts a path into an internal identifier used throughout the index.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>CiCL0001.*</tt></font></td><td valign=top width=85%><font size=2>Contains a log of files that need to be </font><a href="filtrhlp.htm"><font size=2>filtered</font></a><font size=2>.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>CiFL</tt><em><tt>nnnn</tt></em><tt>.*</tt></font></td><td valign=top width=85%><font size=2>Maps documents to the most recent index for a given document.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>CiPS0000.*</tt></font></td><td valign=top width=85%><font size=2>Describes the record format of the property cache.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>CiPT0000.*</tt></font></td><td valign=top width=85%><font size=2>Maps ActiveX property descriptors to internal identifier.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>CiSL0001.*</tt></font></td><td valign=top width=85%><font size=2>Contains a log of in-use files that need to be </font><a href="filtrhlp.htm"><font size=2>filtered</font></a><font size=2>.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>CiSP0000.*</tt></font></td><td valign=top width=85%><font size=2>Contains a list of physical scopes covered by this index.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>CiST0000.*</tt></font></td><td valign=top width=85%><font size=2>Maps security </font><a href="glossary.htm#ACL"><font size=2>Access Control Lists (ACLs)</font></a><font size=2> to internal identifier.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>CiVP0000.*</tt></font></td><td valign=top width=85%><font size=2>Contains a mapping between physical and virtual paths.</font></td></tr> <tr><td valign=top width=15%><font size=3><tt>Index.*</tt></font></td><td valign=top width=85%><font size=2>Master list of indices.</font></td></tr> </table> </div> <p>The Catalog.wci directory, and everything under it, will not be indexed by Index Server even if it is accessible through a virtual root of the Web site.</p> <!--Footerbegin--><hr> <p align=center><a href="default.htm#Top"><img src="toc.gif" alt=" Contents" align=middle border=0 width=89 height=31></a> <a href="sechelp.htm"><img src="previous.gif" alt="Previous" align=middle border=0 width=32 height=31></a> <a href="#TOP"><img src="up_end.gif" alt="To Top" align=middle border=0 width=32 height=31></a> <a href="reghelp.htm"><img src="next.gif" alt="Next" align=middle border=0 width=32 height=31></a></p> <hr> <p align=center><em>© 1996 by Microsoft Corporation. All rights reserved.<!--Footerend--></em></p> </body>
</html>
|