Microsoft Windows NT Working Set Tuner README File April 15, 1998 1. Description: ---------------- Windows NT (32-bit) Working Set Tuner (WST) for x86, Mips, Alpha, and PowerPC platforms. The working set of a program is a collection of those pages in the program's virtual address space which have recently been referenced. As the working set size decreases, memory demand decreases. Since it is advantageous to minimize the memory demand, the working set tuner DLL is developed to measure code working set, and produce an ordering of the functions (procedures) within the code which will be small, if not minimal, in a paging environment. In any practical case there are probably many optimal assignments of functions to pages. Although this method does not claim optimality, it will produce results at least as good as a programmer could achieve by hand in a reasonable amount of time. This method consists of two parts: 1) Data Collection: Data is collected in C programs by using the attributed profiling hook provided by the Microsoft C/C++ Compiler (compiling with the "-Gh" option). Data is not collected in assembler language programs. A hook routine in the working set tuner DLL, WST.DLL, is called before each C/C++ function is called. The hook examines its environment to determine the identity of the called function. A bit is set in a bitmap every time a function is called within the specified time segment. A new bitmap is used (new snapshot) for each new time segment: smaller time segments will cause higher frequency of snapshots which will produce better working set (with some diminishing point of return...). The collected WST data is dumped upon the program's termination or via WST's dump facility called WstDump.exe. Functions' symbols in the program's symbol table are used by the WST.dll to match function names to bits in the bitmap. 2) Data Reduction: Two post-processing tools, WstCat.exe & WsTune.exe, process the collected data and produce an ordering of the functions (packing list) which, when placed into pages, will result in a smaller working set of pages in memory compared to its original working set size for the activity measured. The packing list, created using an ordering algorithm, can be used by the linker to perform the actual packing within the executable's object modules. 2. WST Files: -------------- o wst.ini -- WST initialization file. o wst.dll -- WST DLL. o wst.lib -- WST library file containing WST entry point. o wstdump.exe -- Dump utility for dumping/clearing WST data, and stopping data collection at any time. o wstune.exe -- Data analysis utility for creating the packing list from collected data. o wstcat.exe -- Concatenation utility for multiple data files. o penter.lib -- An empty library containing a dummy WST entry point. This can be used instead of WST.lib to link with an executable object module which was compiled with the -Gh option, so the application can run without recompiling and without any of the WST overhead. 3. Using the Working Set Tuner Dll: ------------------------------------ o Create the WST directory in the root directory of C: drive. (c:\wst) o Place WST.INI in the WST directory of C: drive and list all the DLL/EXE images to be tuned as follows: NOTE:- WST.INI must exist for WST to run. [EXES] Name(s) of module(s) to be tuned. Each name should be on a new line. A semicolon ';' in front of a name removes it from the list. [PATCH IMPORTS] obsolete - no longer used [TIME INTERVAL] Elapsed time in milliseconds between recording snapshots (if left blank, a default time interval of 1000 is used). This section may include an optional entry 'MaxSnaps=n', where n is the maximum number of snapshots to take (the default maximum is 3200). o Sample WST.INI file: [EXES] myapp.exe [TIME INTERVAL] 1500 MaxSnaps=256 o Attach WST.DLL to the application process: - Recompile your EXE/DLL using the "-Gh" and "-Zi" or "-Z7" compiler options. (NOTE - on Mips platform, use "-Od" to disable compiler optimization.) - Link it with WST.LIB using the options "-debug", either "-debugtype:coff" or "-debugtype:both", and "-pdb:none" or "-pdb:". - This method will cause all the C functions in the recompiled sources to be accessible to WST.DLL. NOTE:- The win32.mak file in the Win32 SDK will set these options correctly for your application if you build it with the tune=1 switch. I.e. "nmake -all tune=1". NOTE:- If symbols have been removed from your exe or dll, WST will try to locate the symbols files (DBG or PDB files) in the current directory, then in the path specified by the environment variables "_nt_symbol_path", "_nt_alt_symbol_path", and "systemroot", in that order. E.G. if the DBG file of your exe is in c:\symbols\exe, set _nt_symbol_path to c:\symbols. o Place WST.DLL in your SYSTEM directory (i.e. ..\nt\system32) or in the same directory with the application being tuned. o If this is the only run that is to be considered for tuning, clear any existing .TMI and .WSP files in the C:\WST directory. o Run your application(s). o Data will be collected for all the modules specified in the [EXES] section of the WST.INI. o Exit the application to dump the WST data, or run WstDump.exe to dump the data. See section 4 "WST Data" for details. o If you have multiple TMI & WSP files, run WstCat.exe to concatenate data from multiple runs. See section 5 "Using WstCat" for details. o Run WsTune.exe to generate the packing list (.prf file) based on the collected WST data. See section 6 "Using WsTune" for details. o Use the packing list to direct the linker to order the modules as specified in the packing list (.prf file). See section 8 "Linking with the Packing List" for details. 4. WST Data: ------------- o WST data can be captured in two ways: 1) Upon termination of the application WST data is dumped automatically. 2) Using the dump utility, WstDump.exe, WST data can be dumped at any time. See section 7 "Using WstDump" for details. o Data files are created in C:\WST directory. o WST data is dumped into data files using the application and its DLLs names with .TMI & .WSP extensions. No data is dumped for modules which lack symbol information in the COFF format. o If TMI & WSP files exist in the c:\wst directory, new data files with .T?? and .W?? are created where ?? is a number between 0x01-0xff. The number indicates the run number. All the data files can be concatenated (using WstCat.exe) to generate a single packing list from the multiple run data files. 5. Using WstCat: ----------------- o WstCat.exe can be used to concatenate multiple .WSP and .TMI files (multiple files will be named *.T00 - *.Tff and *.W00 - *.Wff) into a single file for analysis. This allows the user to run multiple scenarios, over time, and analyze them as a single run. WST will also create multiple files if a scenario exceeds its snapshots limit. Usage: WSTCAT moduleName "moduleName" is the name of the module to combine. The module is the name of the DLL or EXE which is being tuned and data has been collected for. NOTE:- The original TMI & WSP files will be renamed to TXX & WXX by WstCat. 6. Using WsTune: ----------------- o WsTune.exe is used to analyze the data created by WST. WsTune will also create the packing list, .PRF file, which is used by the linker to pack your DLL or EXE. o The DATA files created by wstune are: - .DT (Data Tuned) contains the analysis data/bitstrings for all api's called in the "packing list" order. - .DN (Data Not tuned) contains the api's called in the "unpacked" order. - .PRF contains the packing list to be used by the linker. - .WSR is a worker index file created for internal use. Usage: WSTUNE [/O] [/D] [/N] [?] moduleName1[.WSP] [...] /O Dump analysis data to file (*.DT tuned, *.DN not tuned) /N Analyze bitstring data only, create .WSR and .PRF files (turns off /O) /D Dump analysis data only; use existing .WSR file (turns off /N) /P Display a progress indicator /? Display this usage message where "moduleNameX" is the name(s) of the module file(s) to tune. Each module file will result in a separate set of tuning data based on the input switch settings. If neither /O nor /N are specified, tuning data (normally the contents of the .DT file) is dumped to the standard output device. 7. Using WstDump: ------------------ o WstDump.exe can be used to stop WST data collection and clear/dump WST collected data for all the applications being tuned, at any time. o The following options are available via WstDump.exe: - Stop : Stops WST data collection (applications continue to run). - Clear and Restart : Clears any existing WST data and restarts data collection. - Dump and Stop : Dumps any existing WST data and stops the data collection (applications continue to run). o Data is dumped to c:\wst using the names of the application and its DLLs with .TMI & .WSP extensions. No data is dumped for modules with no symbols. o If TMI & WSP files exist in the c:\wst directory, new data files with .T?? and .W?? are created, where ?? is a number between 0x01-0xff. The number indicates the run number. All the data files can be concatenated (using WstCat.exe) to generate a single packing list from the multiple run data files. 8. Linking with the Packing List: ---------------------------------- o Recompile your applications using the Microsoft C/C++ Compiler's "-Gy" option. (Remove the "-Gh" option to avoid extra overhead.) o Relink your applications using the link "-order:@filename" option where filename is the name of your packing list (i.e. .prf file generated by WsTune.exe). 9. Caveats: ------------ o If a symbol is not available in an EXE/DLL that is being tuned, ??? is displayed as the function name in data and analysis files (??? is not listed in the packing list, however.) o WST data is *NOT* collected for functions in assembler language programs/objects. o The linker currently will not reorder static functions. The Microsoft C/C++ Compiler automatically generates certain static functions (their names look like "_$Ennn", where nnn is a number the compiler assigns internally). ** END OF README **