Hermetic Word Frequency Counter
HTML, XML, PHP and C/C++ Files

Files of these types (with file extensions htm, html, shtm, shtml, xml, php, c and cpp) are here called code files. The input file need not consist simply of natural language text, but may be such a code file, which may mix natural language with tags such as "<table>".

When processing HTML files, HTML tags such as "<center>" are skipped. When processing XML files all text within "<" and ">" is skipped. PHP files are processed as HTML files in which C-style comments are possible (see Embedded Comments). When processing PHP files, text within "<?php" and "?>" is not skipped.

Introduction User Manual: Contents
Hermetic Systems Home Page