Free Software for DOS
HTML Utilities

21 Aug 2006

This page:



CONVERSION: txt < > html

CONVERSION: Special Formats


Browsers are listed in Communication & Internet - 2.

Also see HTML_Cal, which creates a monthly HTML calendar.


Also see:

The Connect IDE includes a good, integrated, text mode HTML viewer.

FTE – Programmer's editor with HTML editing mode. While this editor is tuned for programming chores, it also includes a straightforward HTML "mode" with color highlighting of HTML tags and, more uniquely, a pull-down menu with commands to insert common HTML tags.

Other text editors that offer some HTML syntax highlighting and/or (obvious) tagging capabilities: TDE (highlight), FED (highlight), Vim, Elvis (HTML mode display, with local hyperlink navigation), JED (highlight, html_mode command bar), ZED (highlight), SETEDIT (highlight, tag macros), and GNU Emacs (highlight, pulldown tag menu).

L_H — Customizable HTML and XML viewer.


[added 2000-06-21, updated 2004-06-23]

Quick remarks: A text mode HTML viewer for the power user. Uses an INI file for setting many options. Language is user-customizable (Dutch and English files currently included). Basic support for HTML tables, frame sets, and maps; can save bookmarks; Win9x LFN support; more...

L_H <FILESPEC>|<HTML-FILENAME>... [options] [/INI=<NAME> ...] [/LOG=<NAME>]

    /Convert     Convert to single output file L_H.OUT
    /Find        Find text in specified (*.HTM) files (prompt)
    /Index       Save the document selection file as Index.HTM
    /Table       Alternate table support (numbered entries)
    /TFrame-/+/* Force table frames/rulers; /TF+ =On /TF* =Line after row
    /Sound-/+    Sound effects; /S+ =On
    /Adobe-/+/*  Support for HTML generated by Adobe server; /A+ =On /A* =Auto
    /Nocolor     Suppress color usage
    /Generate-/+ Generate contents (at end of document); /G+ =On
    /Restore-/+  Go to last displayed page; /R+ =On
    /CLS-/+      Begin a TOC entry on a new screen; /TO+ =On
    /INI=<FNAME> Adds other, alternate INI-files
    /LOG=<FNAME> Specify alternate name for the logfile

Author: Jaap van Lelieveld, Netherlands (2002).

2002-05-04: v20020505.

Download (126K).

Get more info at the L_H page.

ViewHtml — View HTML documents offline (text-mode).

* * * * 1/2

[updated 2004-06-23]

ViewHtml is a well-designed, text-mode, offline HTML viewer and converter. Special text (e.g., hyperlinks) is displayed in different colors. Under Win9x only, image viewing is supported (through an external application of your choice) and you can launch a Web browser to view the same file.

Other features:

Author: Kevin Solway, Australia (1999).

1999-03-03: v2.5.

Download (75K).

Get more info and other downloads at Solway's Software Page.

HLIST (2) — Offline HTML viewer and converter (text-mode).


[added 1998-05-07, updated 2004-06-23]

HLIST is a relatively small (45K) program which functions as an offline text mode HTML viewer and an HTML-to-plain text filter. The viewer interface consists of two horizontal panes: the upper pane displays text and hyperlinks. Local hyperlink navigation is supported. The lower pane displays the page title, highlighted link information, current file name, and available memory. HLIST supports document loading from an integrated directory browser. Other features: Shell To DOS, Edit Current Document (using defined editor), History List, and Print Document. HLIST should run on just about any PC: 8086+, any video adapter, and DOS 3.3+. Run with no parameters to see a list of switches (color, sound, etc).

Limitations: HLIST supports only a subset of HTML 2.0 tags. Tables, frames, and proprietary Netscape / MSIE tags are not interpreted; no image handling. Doesn't support a mouse, but hyperlink navigation with keys is quick and easy.

Author: Martin Goebbel, Germany (1998).

1998-08-01: v0.81.

Download (32K).

More in these pages from Martin Goebbel.


Dir2Web — Converts a directory list to HTML with local or remote links.

* * * *

[updated 1998-05-29, updated 2005-04-28]

This small (2.5K) program is a great tool for web authors. From the documentation:

...reads a user-defined filespec (e.g. *.*) and creates an HTML index file. The index is fully customizable: you can include file times (24-hour), dates (2000 compliant YYYY-MM-DD), or sizes and even create links to each file while including a remote URL prefix.
DIR2WEB filespec outfile ["Title"] [/D] [/L[=prefix]][/ND|T|S]
/D include directories
/L make links [with a prefix]
/N no dates/times/sizes
/? this help screen
Example: dir2web \files\*.zip index.htm "Test" /d /l=http://localhost/ /ndts
Notes: /L does not require a prefix. /N can be followed by D, T, or S in any
combination. All [parameters] are optional, and may be used in any combination.
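
For readers curious how such an index is put together, here is a minimal Python sketch of the same idea. It is not Dir2Web's code: the function name `render_index`, the tuple layout, and the HTML layout are all assumptions made for illustration.

```python
import html
from urllib.parse import quote


def render_index(entries, title="Index", link_prefix=None):
    """Build an HTML index page from (file_name, size, iso_date) tuples.

    link_prefix=None writes plain names; a string (even "") turns each
    name into a link, with the string prepended to the file name.
    """
    items = []
    for name, size, date in entries:
        label = html.escape(name)
        if link_prefix is not None:
            # Percent-encode the file name so spaces survive in the URL.
            href = html.escape(link_prefix + quote(name), quote=True)
            label = f'<a href="{href}">{label}</a>'
        items.append(f"<li>{label} {date} {size}</li>")
    safe_title = html.escape(title)
    return (
        f"<html><head><title>{safe_title}</title></head>\n"
        f"<body><h1>{safe_title}</h1>\n<ul>\n"
        + "\n".join(items)
        + "\n</ul>\n</body></html>\n"
    )


if __name__ == "__main__":
    files = [("a.zip", 1024, "1998-05-18"), ("my file.zip", 5, "2000-01-01")]
    print(render_index(files, "Test", "http://localhost/"))
```

Passing an empty string as the prefix gives relative links, which roughly mirrors using /L without a prefix.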

Author: Jem E. Berkes / PC-Tools.Net, Canada (1998).

1998-05-18: v1.41. Part of the DOS Utilities Collection, copyrighted freeware.

Download (26K).

More in these pages from Jem E. Berkes.

BBS2HTML — Converts FILES.BBS list to HTML.

* * *

[added 1998-03-08]

BBS2HTML takes a FILES.BBS list and converts it to HTML. Primarily intended for BBS sysops who are transferring their file collections to the Internet. Others might find a use for this program if they are uploading personal file collections. This program has both good and bad points. On the negative side: it is completely interactive. BBS2HTML prompts for a few criteria (e.g., the name of your site, the remote path, etc.) and then generates the HTML file. Not much flexibility. On the plus side: sane HTML formatting. No tables or fancy HTML; your file will be readable in nearly any browser. You'll have to check the layout of your FILES.BBS file prior to conversion because the program won't convert properly if the FILES.BBS file contains a header. For a neater look, you should also remove IBM graphics characters that are frequently added to some archive description files.

Author: Scott Fletcher (1995).

1995-08-27: v1.0p.

Download (46K).

Also see: FiD.

GPAGE — Generates formatted web pages from plain text lists of sites.


[added 1998-09-12, updated 2000-04-30]

GPAGE is a tiny 5K command line program that produces a web page from a plain text file containing simple URLs, associated descriptions, and group labels. HTML tagging, page layout, and index generation are handled by the program.

Author: Horst Schaeffer, Germany (2000). Suggested by Robert Bull.

2000-05-02: v1.9.

Download (18K).

Go to Horst Schaeffer's Software Pages, in English or German, for more info and other software for DOS & Windows.

More in these pages from Horst Schaeffer.


Also see: A freeware Win3.1 alternative to the apps listed below is ListPics from Solway's Software Page. It can create multiple pages and provide links between them.

Image Tab (IMGTAB) — Catalogs images into single HTML page.

* * * * *

Outputs tables, in number of columns specified by user.

Author: Servile Software, UK (1997).

1997-06-28: v1.1.

Download (19K).

AUTOGIF — Catalogs images to HTML with extra file information.

* *

[added 1997-12-00, updated 2004-06-23]

Displays detailed file information and image dimensions. Image listing now sorted by name. Win95 LFN support.

Author: Branislov Stofko / Shareware, Freeware and Public domain programs, Slovakia (1998).

1998-03-10: v98.03. Version info online.

Download (20K).

GIFTAB — Catalogs images into HTML page(s) with "thumbnails".

* * *

[added 1997-12-00, updated 2004-06-23]

This program creates a tabled header of thumbnail images on the HTML page. No thumbnail files are actually generated; instead the program uses the image redimension capabilities of Netscape / MSIE which reduce the dimensions of a regular image; the full image is still loaded into memory. Thumbnails are anchored (linked) to the corresponding full size image lower in the page. If you set up a directory structure as a "gallery" (where different subdirectories = different gallery rooms w/ pics) and run GIFTAB from the gallery's parent directory, the program will create an html file for each room. The parent directory will contain an html file with links to each subdirectory/room. Template support allows customization of web page appearance. Win95 LFN support.
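
The "thumbnail without a thumbnail file" trick is easy to picture in code. The Python sketch below is only an illustration of the technique described above, not GIFTAB's actual output; the function name `thumbnail_page`, the `img0`, `img1`, ... anchor names, and the default sizes are assumptions.

```python
import html


def thumbnail_page(images, thumb_width=80, columns=4):
    """Return HTML with a table of scaled-down images linked to full-size copies.

    No thumbnail files are made: the same image is shown twice, once with a
    small WIDTH attribute (the browser does the scaling) and once full size.
    """
    cells = []
    for i, name in enumerate(images):
        src = html.escape(name, quote=True)
        cells.append(
            f'<td><a href="#img{i}"><img src="{src}" width="{thumb_width}"></a></td>'
        )
    # Break the cells into table rows of `columns` thumbnails each.
    rows = [
        "<tr>" + "".join(cells[i:i + columns]) + "</tr>"
        for i in range(0, len(cells), columns)
    ]
    # Full-size images follow the table, each behind a named anchor.
    full = [
        f'<p><a name="img{i}"></a><img src="{html.escape(name, quote=True)}">'
        f"<br>{html.escape(name)}</p>"
        for i, name in enumerate(images)
    ]
    return "<table>\n" + "\n".join(rows) + "\n</table>\n" + "\n".join(full) + "\n"


if __name__ == "__main__":
    print(thumbnail_page(["a.gif", "b.gif", "c.gif"], columns=2))
```

As noted above, the browser still downloads every full image, so this saves disk space and effort rather than bandwidth.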

Author: Branislov Stofko / Shareware, Freeware and Public domain programs, Slovakia (1998).

1998-07-27: v98.07. Version info online.

Download (84K).

Here's a more detailed feature comparison (revised 1998-03-21, 2004-07-08,23):

Multiple page output       "gallery"  no   no
Makes thumbnail files      no         no   no
Win95 LFN support          yes        no   yes
Show file name             optional   yes  yes
Sort by name               no         yes  yes
Show file size             optional   no   yes
Show file date/time        optional   no   yes
Show image dimensions      optional   no   yes


HTMSTRIP — Intelligent conversion of HTML to plain text.

* * * * *

[updated 2002-11-15]

Perhaps the finest HTML to text converter around – and it's free. Awarded a "Best" rating by PC Computing in their July 1996 issue. Many converters simply strip out codes – the resulting text may bear little resemblance to the original HTML document. HTMSTRIP is more intelligent: it centers text that is bracketed by <center> tags, and tables can be converted into ASCII or ANSI tables. This app can even reference links by number and place the actual links in an end-notes section. While both Netscape and MSIE have respectable "save as plain text" options, they lack the flexibility of HTMSTRIP. One reservation: HTMSTRIP does not distinguish ordered and unordered list items.
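
The numbered-links idea can be sketched in a few lines of Python using the standard `html.parser` module. This is not how HTMSTRIP itself works internally (its source is not examined here); the class name `LinkNoteStripper`, the `[n]` marker style, and the "Links:" heading are assumptions for illustration, and the sketch ignores centering and tables.

```python
from html.parser import HTMLParser


class LinkNoteStripper(HTMLParser):
    """Collect visible text and number each hyperlink as it is closed."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.text = []        # pieces of visible text
        self.links = []       # link targets, in order of appearance
        self.open_ref = None  # number of the link currently open, if any

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)
                self.open_ref = len(self.links)

    def handle_endtag(self, tag):
        if tag == "a" and self.open_ref is not None:
            self.text.append(f"[{self.open_ref}]")
            self.open_ref = None

    def handle_data(self, data):
        self.text.append(data)


def strip_with_endnotes(source):
    """Return plain text with [n] markers and a list of link targets at the end."""
    parser = LinkNoteStripper()
    parser.feed(source)
    parser.close()
    body = " ".join("".join(parser.text).split())  # collapse whitespace
    notes = [f"[{n}] {url}" for n, url in enumerate(parser.links, 1)]
    return body + ("\n\nLinks:\n" + "\n".join(notes) if notes else "")


if __name__ == "__main__":
    print(strip_with_endnotes('<p>See <a href="http://example.com/">the site</a>.</p>'))
```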

Author: Bruce Guthrie (2002).

2002-08-10: v0208.

Download HTMS0208.ZIP (126K).

WinEnt — Plain text to HTML converter.


[updated 2004-10-10]

Wraps text, adds a time-stamped footer, and translates supported special characters into their HTML equivalents. Works as a filter. WinEnt adds <P></P> tags around paragraph boundaries but inserts nothing at simple line breaks. Doesn't recognize indentations, text tables, right justified text, etc. as special. Source code included. Doc files in English and Portuguese.

USE: WINENT [-flags] [title] < infile.txt [ >]
	Flags are to be together in a single string preceded by a '-':
	All are disabled by default.
		T : no HTML tags at all
		e : no character replacement by HTML 3.2 entities
		d : no date / time stamp
		h : no HEAD section
		s : no HTML sections (HTML, HEAD, BODY)
		t : insert the title as a H3 document header
		i : paragraphs start with indents
		? : help. Displays this screen.
	Title: a string enclosed in quotes, e.g. "This is a title".
	WINENT -die "Why Mary had a lamb" < lamb.txt > lamb.htm
WinEnt v. 2.2 compiled Oct 12 1998 17:57:16
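
A rough Python sketch of the conversion described above: paragraphs split at blank lines, simple line breaks left alone, special characters replaced by entities. It is an illustration only, not a port of WinEnt; the function names are assumptions, and the entity table comes from Python's `html.entities` rather than from WinEnt's own list.

```python
import html
import re
from html.entities import codepoint2name


def to_entities(text):
    """Escape &, <, > and replace non-ASCII characters with HTML entities."""
    out = []
    for ch in html.escape(text, quote=False):
        code = ord(ch)
        if code > 127 and code in codepoint2name:
            out.append(f"&{codepoint2name[code]};")  # named entity, e.g. &atilde;
        elif code > 127:
            out.append(f"&#{code};")                 # numeric fallback
        else:
            out.append(ch)
    return "".join(out)


def text_to_html(text, title=""):
    """Wrap blank-line-separated paragraphs in <P></P>; keep single line breaks."""
    paragraphs = [p.strip("\n") for p in re.split(r"\n\s*\n", text) if p.strip()]
    body = "\n".join(f"<P>{to_entities(p)}</P>" for p in paragraphs)
    return (
        f"<HTML>\n<HEAD><TITLE>{to_entities(title)}</TITLE></HEAD>\n"
        f"<BODY>\n{body}\n</BODY>\n</HTML>\n"
    )


if __name__ == "__main__":
    print(text_to_html("Mary had\na lamb.\n\nJoão & co <ok>\n", "Why Mary had a lamb"))
```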

Author: João Magalhaes, Portugal (1998).

Download (17K).

TXT2HTM (1) — Converts plain text to HTML, with break and font options.


[added 2004-10-10]

This program writes page headers and footers; optionally translates single DOS carriage returns to <BR> tags, and line breaks to <P> tags. Under Win9x, supports drag & drop. Source code included.

USAGE: TXT2HTM source [object] [/switches]

C   Center all text
B   Disable line breaks after each line of text
F   Use fixed font (courier)
P   Disable paragraph tags

TXT2HTM test.txt /fp

Author: Robert Madewell (2001).

2001-01-28 release.

Download (12K).

TXT2HTM (2) — Converts plain text > HTML, replaces special characters with codes.


[added 2004-10-10]

This program writes <pre> and </pre> tags to a text file, and translates special characters. Source code (PAS) included.

Usage: txt2htm [-u -nXXX] <infile.txt >outfile.htm
(That means, we use stdin/stdout and THEN DOS redirectors.)
Command line args:
n[...] – Do not convert these, for example -n<>"&
u – Only Convert German "Umlaute"
This program performs only translation of special characters
(<,>,",&,ä,ö,ü,ß,#>127), independent of sense.

Author: Michael Bunk, Germany (1997).

1997-06-04 release.

Download (5K).

CONVERSION: Special Formats

Also see WP2LaTeX in Text Utils - 5 – converts WordPerfect to HTML and other formats.

For a comprehensive list of HTML converters for various platforms see:

Martha & Ishtar — RTF < > HTML conversion.


[added 1998-12-19, updated 2006-04-01]

Most word processors support writing of documents to the RTF format. While RTFtoHTML, from Logictran, was the premier command line RTF-to-HTML converter for DOS, it was pay-shareware (and is no longer marketed).

Martha is a less ambitious (and still-living) freeware effort and appears to perform adequately for casual use. While MARTHA (My Automatic RTF To HTML Alterator) is not a comprehensive or configurable tool it does appear to generate more faithful output than the few other existing DOS programs I've run across.

Martha supports conversion of heading / styles, tables, and footnotes (links to and from footnotes are auto-generated). It creates a table of contents with hyperlinks to headings in the body of the document. Font face, size, and attributes are usually but not always converted accurately. Embedded images are saved as external WMF files (links to GIF files are created – you must perform the image conversion by other means). Discarded markers are written to a file TRACE.DOC.

Ishtar (formerly HTM2RTF) converts HTML to RTF. Font data and tables are translated, comments are hidden, URL anchors are replaced by underlining, Java scripts are discarded, and more. Not all markers are processed – the focus is on getting a clean word processing format without clutter. As with Martha, discarded markers are written to a file TRACE.DOC.

Both programs are available in 16- and 32-bit compilations (32-bit DJGPP EXEs require 80386+ and DOS Protected Mode Interface, CWSDPMI or other).

All variants process files singly, from full command line with input and output filenames, or in interactive mode by starting with no input/output filenames.

In interactive mode, the four programs open by asking for an input filename, and write output to the same filename with a new extension. If the output file already exists, "Overwrite ? [Y/N]" comes up, followed by a request for a new name.

If <Enter> is pressed, without a filename at the first request, results vary:
16-bit – Both programs return "Input file not found. Abort ? [Y/N]"
32-bit – Both programs try conversion anyway. To exit, press Ctrl-Break, then <Enter>.

Author: Yves Sagnier, France (1997-2004).

More info on MARTHA, in English or French.

More info on ISHTAR, in English or French.

DBF2HTML — Command-line DBF to HTML converter.

* * * *

[updated 1999-10-10]

There are plenty of utilities floating around that convert Excel or Access tables to HTML tables – but these are usually large GUI apps and often only read the latest proprietary formats generated by the fertile minds at M$.

In contrast, DBF2HTML converts widely transportable DBF files into HTML tables. This small (27K) program includes a healthy list of options to customize the content and general appearance of the resulting HTML. Recognized field types are: character, numeric, logical and date (no support for memo fields, doesn't process index files). DBF2HTML can also be used as a cgi-bin program for Web servers. I have seen a shareware Win9x app that does much the same thing, but it's packed as a 3MB zip.

Distribution package includes docs & source. Issued under the GNU General Public License.

Syntax: DBF2HTML [switches] dbffile[.dbf] [htmlfile[.htm]]
/b width...Set the table border width to <width>
/c caption...Set the table caption to <caption>
/B......Put the caption (/c) below the table(default: above)
/d......Skip deleted records
/f fieldlist..Process only fields specified in the <fieldlist>
/h......Display this help screen
/l......Use the four digits format to display years
/s......Display the DBF structure
/t title...Set the HTML document title to <title>
The following macros can be used inside the title
or the caption:
$f The file name of the DBF; $q The '"' character;
$p The '%' character; $$ The '$' character
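
To show why DBF files convert so readily, here is a small Python sketch that reads the classic dBASE III layout (32-byte header, 32-byte field descriptors ending at a 0x0D byte, fixed-width records with a leading deletion flag) and writes an HTML table. It is not DBF2HTML's code; the function name, the cp437 decoding, and the bare table layout are assumptions, and memo fields and index files are ignored just as described above.

```python
import html
import struct


def dbf_to_html(data, skip_deleted=True):
    """Convert the raw bytes of a dBASE III style DBF file to an HTML table."""
    # Header: record count (4 bytes), header length and record length
    # (2 bytes each), little-endian, starting at offset 4.
    n_records, header_len, record_len = struct.unpack_from("<IHH", data, 4)

    # Field descriptors start at offset 32 and end at a 0x0D terminator.
    fields = []
    pos = 32
    while data[pos] != 0x0D:
        name = data[pos:pos + 11].split(b"\x00", 1)[0].decode("ascii")
        length = data[pos + 16]
        fields.append((name, length))
        pos += 32

    rows = ["<tr>" + "".join(f"<th>{html.escape(n)}</th>" for n, _ in fields) + "</tr>"]
    for i in range(n_records):
        start = header_len + i * record_len
        record = data[start:start + record_len]
        if skip_deleted and record[:1] == b"*":  # "*" marks a deleted record
            continue
        cells, offset = [], 1                     # byte 0 is the deletion flag
        for _, length in fields:
            value = record[offset:offset + length].decode("cp437").strip()
            cells.append(f"<td>{html.escape(value)}</td>")
            offset += length
        rows.append("<tr>" + "".join(cells) + "</tr>")
    return "<table>\n" + "\n".join(rows) + "\n</table>\n"
```

Typical use would be something like `dbf_to_html(open("data.dbf", "rb").read())`, where `data.dbf` is a placeholder file name.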

Author: Simone Govoni, Italy (1999).

1999-07-05: v1.01.

Download (46K).

TTH — TEX / LaTEX to HTML converter.


[added 1998-12-06, updated 2006-04-01]

TtH is a command line filter that does a good job of converting Plain TeX or LaTeX files into their HTML equivalents.

From a reader:
How well does the program work? Very well. Documents are converted to HTML 3.2. It supports all standard TeX and LaTeX constructs, including the LaTeX tabular environment. Headings and text convert smoothly (for example, text tagged as \bf converts to <B>). TtH can also generate an HTML index if the LaTeX file contains a \makeindex command. Limitations? TtH cannot handle custom macros and styles. Equations are converted to HTML, but the program uses the <FONT FACE="SYMBOL"> extension. Also, <HEAD> and <BODY> tags are not added to the file. This may cause problems when you want to use CSS.

Runs on Win32 command line. Free for non-commercial use only.

Author: Ian Hutchinson (2005). Suggested by Scott Nesbitt.

2005-12-27: v3.72. Get latest version number and date.

Download (369K).

Go to the TTH Page for more information, links to versions for other OSes, source (C), and the commercial TTHgold with Windows GUI and other added functionality.


HTMLCRunch — Compresses (X)HTML / XML pages.


[added 2003-04-15, updated 2005-04-28]

Compresses (X)HTML/XML pages by removing superfluous blanks, line feeds, and comments, which are typically ignored by browsers. 32-bit DJGPP EXE, requires 80386+ and a DOS Protected Mode Interface, CWSDPMI (included) or other.

USAGE    : htmlcr [-switch] file1 file2 ... [-switch] file ...
switches : -K     keep original files (default) --> output to *.hcr
           -O     overwrite mode
           -S     output to standard out
           -I     read from standard in, write to standard out
           -J     don't try to honor //-JavaScript comments
           -C     cut <!-- ... --> comments
example  : htmlcr -o urgent.htm start.htm -k index.htm
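
The basic idea can be sketched with two regular expressions in Python. This is a simplification and not HTMLCRunch's algorithm: the function name `crunch` is an assumption, and unlike a careful tool this sketch does not protect <pre> blocks, scripts, or JavaScript comments.

```python
import re

COMMENT = re.compile(r"<!--.*?-->", re.DOTALL)  # <!-- ... -->, possibly multi-line
WHITESPACE = re.compile(r"\s+")                 # any run of blanks and line feeds


def crunch(markup, cut_comments=True):
    """Drop comments (optional) and squeeze every whitespace run to one space."""
    if cut_comments:
        markup = COMMENT.sub("", markup)
    return WHITESPACE.sub(" ", markup).strip()


if __name__ == "__main__":
    page = "<p>\n  Hello   <!-- note -->\n world\n</p>\n"
    print(crunch(page))
```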

Author: Markus Stengel, Germany (2002).

2002-11-24: v0.8.1.

Download (89K). Includes EXEs, source, doc.

HTMLCRunch page.

HTML Tidy — Command line HTML validator and cleanup tool.

* * * * *

[added 1999-09-01, updated 2004-10-11]

This tool will generate a line by line error report and/or clean up your HTML. Customization is possible with a config file. 32-bit DJGPP program – requires 80386+, 4MB RAM, and a DOS Protected Mode Interface (DPMI), supplied either by CWSDPMI, or by Windows 3.x/9x.

From the docs:
Tidy is able to fix up a wide range of problems and to bring to your attention things that you need to work on yourself. Each item found is listed with the line number and column so that you can see where the problem lies in your markup. Tidy won't generate a cleaned up version when there are problems that it can't be sure of how to handle. These are logged as "errors" rather than "warnings".
Processing directives
-indent or -i to indent element content
-omit or -o to omit optional end tags
-wrap <column> to wrap text at the specified <column> (default is 68)
-upper or -u to force tags to upper case (default is lower case)
-clean or -c to replace FONT, NOBR and CENTER tags by CSS
-bare or -b to strip out smart quotes and em dashes, etc.
-numeric or -n to output numeric rather than named entities
-errors or -e to only show errors
-quiet or -q to suppress nonessential output
-xml to specify the input is well formed XML
-asxml to convert HTML to well formed XHTML
-asxhtml to convert HTML to well formed XHTML
-ashtml to force XHTML to well formed HTML
-slides to burst into slides on H2 elements

Character encodings
-raw to output values above 127 without conversion to entities
-ascii to use US-ASCII for output, ISO-8859-1 for input
-latin1 to use ISO-8859-1 for both input and output
-iso2022 to use ISO-2022 for both input and output
-utf8 to use UTF-8 for both input and output
-mac to use MacRoman for input, US-ASCII for output
-win1252 to use Windows-1252 for input, US-ASCII for output

File manipulation
-config <file> to set configuration options from the specified <file>
-f <file> to write errors to the specified <file>
-modify or -m to modify the original input files

-version or -v to show the version of Tidy
-help, -h or -? to list the command line options
-help-config to list all configuration options
-show-config to list the current configuration settings

Notes: This tool can be used by many freeware HTML editors (e.g., NoteTab Light by Eric Fookes and HTML-Kit [both Win9x]). Packages are also available for Win32 (GUI and console), Mac, Amiga, AIX, Linux, Unix, OS/2 and more.

Authors: Dave Raggett, UK. DOS port by Nick B (2001). Suggested by Bjørn Simonsen and others.

2001-12-25 release.

Download (167K).

For more info go to Dave Raggett's Clean up your Web pages with HTML TIDY and to Nick B's HTML-Tidy for DOS.

For versions for other OSes, go to the HTML Tidy Library Project at SourceForge.

Htmlpp — Powerful HTML preprocessor; requires Perl.


[added 1999-11-14]

Htmlpp is a Perl script (not a DOS program) that functions as an extremely powerful html preprocessor. Although it requires installation of the substantial Perl language package, you don't need to be a Perl guru to use htmlpp. Because htmlpp doesn't rely on any system-specific features it should be usable with any OS that has a recent Perl port.

Special requirements: Perl (v4x or 5x).

Author: Pieter Hintjens / iMatix Corp., Belgium; revised by Enrique Bengoechea, Spain (1998).

1998-10-22: v4.2a.

Download (128K).

PPWIZARD — Multi-platform HTML preprocessor; requires REXX interpreter.


[added 2000-08-21, updated 2006-04-01]

PPWIZARD, well known in OS/2 circles, is a comprehensive and well documented Rexx-driven HTML preprocessor that's also compatible with the multi-platform Regina Rexx interpreter. The PPWIZARD installer runs under any OS version of REXX/Regina: DOS, Win9x/ME/NT/2K/XP, Linux, FreeBSD, SunOS, Mac OS X, BeOS....

Notes: PPWIZARD now supports Regina v2.0 and is tested here with v2.0 for DOS/DJGPP. PPWIZARD's author maintains a helpful page of Regina-specific issues. Program download package includes compiled HTML (CHM) docs. In addition, separate HTML documentation with search indexes is available – there's even a section for former ORB users. An online manual and beginner's guide are also maintained.

Author: Dennis Bareis, Australia (2000).

2005-11-27: v05.331.

Program, docs (CHM)
Docs (HTML). Warning: Long File Names

Go to the author's site for more info on PPWIZARD and on using it with Regina, and addons. If your browser doesn't like frames, click these links to the PPWIZARD page, to the Regina info page, and to the Download page.

ORB — HTML preprocessor for efficient management of multiple web pages.


[added 1998-12-05, updated 2005-06-05]

Paraphrased from the docs:
Orb is a tool for Web developers which makes design, development, and maintenance of large sites easier...Orb source files consist of normal HTML source with additional Orb-specific markup; the latter is parsed and replaced by Orb to create a pure HTML output file.

32-bit DJGPP program, requires DOS 3.0+, 80386+ and a DOS Protected Mode Interface (CWSDPMI or other). Formerly feeware, Orb is now free under the GNU General Public License. The manual hasn't been updated to reflect this change, but the license is in the DOS package.

Author: Craig Berry (1998). Suggested by Scott Nesbitt, Dev Teelucksingh, others.


Includes C source
Includes sample work files *

* Sample work files can be opened with Orb for DOS. Thanks to Greg Head for providing the package.

Get more info at the Orb Page, and more files at the Project page at SourceForge.

htp — HTML preprocessor for efficient management of multiple web pages.


[added 1998-09-12]

This command line program could be a big timesaver (in the long run) for those who manage numerous web pages sharing a common layout. In brief, the program requires management of Content files (HTML files containing the content of your pages and time-saving macros) and a Template (design) file which holds instructions for the common layout of your pages.
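
The content-plus-template split is the whole trick, and it can be pictured with Python's standard `string.Template`. Note that this is not htp's macro syntax (see the htp docs for that); the placeholder names `$title`, `$content`, and `$footer` and both function names are made up for this sketch.

```python
from string import Template

# One shared layout; every page on the site is poured into it.
TEMPLATE = Template("""<html>
<head><title>$title</title></head>
<body>
<h1>$title</h1>
$content
<hr><p>$footer</p>
</body>
</html>
""")


def build_page(title, content, footer):
    """Fill the shared template with one page's title and content."""
    return TEMPLATE.substitute(title=title, content=content, footer=footer)


def build_site(pages, footer="Maintained with a shared template"):
    """pages maps an output file name to a (title, content) pair."""
    return {
        name: build_page(title, content, footer)
        for name, (title, content) in pages.items()
    }


if __name__ == "__main__":
    site = build_site({"index.htm": ("Home", "<p>Welcome</p>")})
    print(site["index.htm"])
```

Changing the layout then means editing the template once and regenerating, rather than touching every page by hand.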


Author: Jim Nelson (1996). Suggested by Robert Bull.

1996-09-24: v1.10.

Download (157K).

StrpHTML & KeepHTML — Cut out or keep sections of text in HTML document.

* * *

StrpHTML and the companion KeepHTML are programs which strip out (or keep) sections of text in an HTML document. The programs work by locating and cutting around user-inserted HTML comments. For example, StrpHTML cuts out text between <!--CUT --> and <!--ENDCUT --> comments. A possible application includes generating distinct, multiple HTML pages from a single commented template file.

StrpHTML [INFILE] [OUTFILE] [0 1 2 ... 9]
[INFILE]...The file to process.
[OUTFILE]..File that should be written to. If it exists StrpHTML
...will overwrite it.
[LEVEL]...Stripping level. Should be 0,1,2... or 9.
StrpHTML will copy every line from INFILE to OUTFILE until it finds a line that _starts_ with "<!--CUTn" (without the quotes), and if the level indicator n was specified on the commandline. Copying will resume after a line is found starting with "<!--ENDCUTn", and where n is a specified level. Also, every line starting with "<!--REMOVEn" or "nREMOVE-->" will not be copied to OUTFILE.

Limitations: The CUT/ENDCUT comments must be placed at column one of a line and can't be preceded by other text.

Tip: Strp/KeepHTML don't support wildcards: try LOCATE's marvelous /O option to quickly generate a batch file with multiple commands, e.g.,
(where &N holds a filename and &R a basename)
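
The cutting rule quoted from the docs is simple enough to sketch in Python. This is an interpretation of that description, not StrpHTML's source: the function name `strip_levels` is made up, and it is assumed here that the CUT/ENDCUT marker lines themselves are dropped and that any specified level may end a cut.

```python
def strip_levels(lines, levels):
    """Copy lines, skipping the regions marked for the given levels.

    Markers only count when they sit at the very start of a line.
    """
    out = []
    cutting = False
    for line in lines:
        if any(line.startswith(f"<!--CUT{n}") for n in levels):
            cutting = True       # start skipping (marker line is dropped)
            continue
        if any(line.startswith(f"<!--ENDCUT{n}") for n in levels):
            cutting = False      # resume copying after this line
            continue
        if any(line.startswith((f"<!--REMOVE{n}", f"{n}REMOVE-->")) for n in levels):
            continue             # drop the comment wrapper, keep what it encloses
        if not cutting:
            out.append(line)
    return out


if __name__ == "__main__":
    page = [
        "<h1>Title</h1>",
        "<!--CUT1 -->",
        "draft text",
        "<!--ENDCUT1 -->",
        "end",
    ]
    print(strip_levels(page, [1]))
```

Markers for levels that were not requested pass through untouched as ordinary HTML comments, which is what lets one commented file yield several different pages.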

Author: Arjen Venhuis, Netherlands (1997). Tip by Bjørn Simonsen.

1997-03-04: v1.3.

Download (19K), includes KeepHTML

©1994-2004, Richard L. Green.
©2004-2006, Short.Stop.