First, verify that your HTML conforms to the HTML 3.2
specification using one of many HTML "lint" or "validator"
applications available on the 'net. The W3C has links to a few
good ones that can be used on-line. If you use HTML that does
not conform to the HTML specification, then what you get out of
HTMLDOC may or may not be what you expect. Just because your
favorite browser displays the HTML as you like, don't assume
that your HTML is conformant!
Second, HTML is not (by definition) a WYSIWYG format. The
formatting varies depending on the software and output device.
All the HTML specification defines is the general behavior and
function of each HTML element (P, H1, B, TABLE, etc.) HTMLDOC
follows the W3C display recommendations in the HTML
specification. When no behavior is explicitly specified, it
chooses a behavior that most closely resembles
Netscape/Mozilla.
Third, PostScript and PDF are quite different than HTML.
Printed documents have fixed page sizes, while a web browser can
provide scrollbars for content that goes beyond the window
boundaries. Pagination often has formatting side effects that
are not an issue when viewing the HTML on-line. Scripting
languages offer dynamic content when viewed with a web browser,
but that content will be unavailable when converting to
PostScript or PDF for a variety of reasons. Some web pages use
custom fonts which may not be available when printing or viewing
PostScript and PDF files.
Finally, please read the Users Manual that comes with the
software! Chapter 6
provides a reference for the HTML elements that HTMLDOC supports
as well as the HTML comments you can insert to control and tune
the final output.