Using Hermetic Word Frequency Counter
with Large Files and
Importing the Output into Excel![]()
To confirm that Hermetic Word Frequency Counter works with files containing about 100,000 different words a set of about 100,000 random 'words' (strings of 3-15 random letters) was generated, and from this a file was created consisting of 'sentences' composed of these words. This file is 2025 KB in size; the ZIP file (1315 KB) containing it may be downloaded by clicking on this link.
The program was applied to this file (with all checkboxes at the 'Parameters' window cleared). At the point where it had extracted 91,720 words the program looked like this:
After 90 minutes the program had extracted a total of 99,877 words, at which point it began a processing period lasting four minutes.
Then the words found were displayed:
At this point the file 'output.txt' was renamed to 'output_freq.txt'. Then word order 'alphabetical' was selected, and the words were displayed alphabetically (with negligible wait time). A new 'output.txt' file was written, and this was renamed to 'output_alpha.txt'. Both files are contained in a ZIP file(1846 KB) which can be downloaded by clicking on this link.
Opening 'output_freq.txt' in Excel 2003 brings up the Text Import Wizard, which has three steps:
Excel then displays the output data in a spreadsheet: Make sure that "File origin" is set to "Windows (ANSI)".
Excel 2003 has a limit of 65,536 rows; for 100,000 rows use Excel 2007.
Hermetic Word Frequency Counter Hermetic Systems Home Page