Hermetic Word Frequency Counter Advanced Version
Multiple Input Files
Unlike the basic version, which acts only on one file at a time, the Advanced Version can act on multiple files in multiple folders. Here is a typical screenshot:
Files to be scanned can have any filename extension (the part of the filename following the last period), but the files must consist almost entirely of text characters (either 8-bit text or 16-bit Unicode text). For more detail see the section Scannable Files in the user manual for the basic version.
The software allows you to restrict which files are scanned in two ways: (a) to files having an extension in a specified list, and (b) to files not having a specified extension. This is useful because there may be, for example, .js and .css text files mixed in with HTML files, and you may wish to exclude them from a scan. (If you restrict the scan to one or more file extensions, there is no need to specify any extensions to be excluded.)
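The filtering rule described above can be illustrated with a short Python sketch. It is not the program's own code: the function names extension_of and should_scan are hypothetical, and treating extensions case-insensitively is an assumption. It only shows how an inclusion list makes an exclusion list unnecessary.

```python
import os


def extension_of(path):
    """Return the part of the filename after the last period, lowercased.

    Returns '' when the filename contains no period.
    Lowercasing is an assumption, not documented behaviour.
    """
    name = os.path.basename(path)
    _, dot, ext = name.rpartition(".")
    return ext.lower() if dot else ""


def should_scan(path, include_exts=(), exclude_exts=()):
    """Decide whether a file passes the extension filters (hypothetical helper)."""
    ext = extension_of(path)
    include = {e.lower().lstrip(".") for e in include_exts}
    exclude = {e.lower().lstrip(".") for e in exclude_exts}
    if include:
        # An inclusion list already rules out every other extension,
        # so the exclusion list is not consulted.
        return ext in include
    return ext not in exclude


files = ["index.html", "about.htm", "site.css", "menu.js", "README"]

# Restrict the scan to HTML files only.
print([f for f in files if should_scan(f, include_exts=["html", "htm"])])
# ['index.html', 'about.htm']

# Scan everything except scripts and stylesheets.
print([f for f in files if should_scan(f, exclude_exts=[".js", ".css"])])
# ['index.html', 'about.htm', 'README']
```

Note that in this sketch a file with no extension, such as README, passes an exclusion filter but fails any inclusion filter.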
The "List files to be scanned" operation should be run before doing a scan so that you know exactly which files will be processed. Clicking on the output textbox pauses and resumes the listing. This also works during a count words/phrases operation, while words are being displayed as they are found.
Various types of files are automatically excluded from a scan, in particular, any binary file. This includes Microsoft Word .doc files, whose file formats are not made public by Microsoft. Other files which are automatically excluded are files with the extensions .xls, .pdf and .sys, plus the common graphics and executable files.
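How the program decides that a file is binary is not documented here. As a rough illustration only, the following Python sketch shows one common heuristic for checking that data consists "almost entirely of text characters". The function name looks_like_text, the 95% threshold, the use of a byte-order mark to recognise 16-bit Unicode, and the rejection of NUL bytes in 8-bit text are all assumptions made for this example.

```python
# Byte values treated as text in 8-bit files: tab, line feed, carriage
# return, and everything from space upward except DEL (an assumed definition).
TEXT_BYTES = {9, 10, 13} | (set(range(32, 256)) - {127})


def looks_like_text(data, threshold=0.95):
    """Heuristically decide whether a bytes object is text (hypothetical helper)."""
    if not data:
        return True  # an empty file contains nothing to reject

    # 16-bit Unicode text is assumed to start with a byte-order mark.
    if data[:2] in (b"\xff\xfe", b"\xfe\xff"):
        try:
            data.decode("utf-16")
            return True
        except UnicodeDecodeError:
            return False

    # NUL bytes are rare in 8-bit text and common in binary formats.
    if b"\x00" in data:
        return False

    text_count = sum(1 for byte in data if byte in TEXT_BYTES)
    return text_count / len(data) >= threshold


def file_looks_like_text(path, sample_size=8192):
    """Apply the heuristic to the first sample_size bytes of a file."""
    with open(path, "rb") as handle:
        return looks_like_text(handle.read(sample_size))
```

Sampling only the start of a file keeps the check fast, at the cost of misjudging files whose first few kilobytes are unrepresentative. The sample size is kept even so that a 16-bit sample is not cut in the middle of a character.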