Up to full list of filters
There are various tools to convert SGML into HTML. Here is a
an annotated summary of some of them, and
here are a few more pointers.
Perl filters for several DTDs used in the humanities are available from Michigan and Virginia:
see " WWW-to-PAT Gateway;
exploiting an SGML-aware system through the Web ".
OCLC offers SGML translation services via Fred, which
includes tools to translate SGML-tagged text to other formats.
Currently, OCLC uses Fred to translate from SGML to HTML, TeX
(PostScript), and ASCII.
There are several versions of the HTML DTD: more information is
available here.
There is also a program dtd2html to convert SGML DTDs
themselves to HTML to help browse and understand them.
Linuxdoc-SGML is a text-formatting package based on SGML
that will produce HTML as well as other marked up documents.
For OSF DTD to HTML tools send mail to "dtd-comment@osf.org"
Conversion of HTML/SGML to other formats is also possible
Further information
is available on SGML resources and tools.
Emacs info
-
info2html
is a filter for converting Emacs info files (written directly, not
compiled from TeXinfo source) into HTML. Contact: guggis@iam.unibe.ch (Karl Guggisberg)
-
Here is another such script. Contact: ktl@wag.caltech.edu (Kian-Tat Lim)
-
...and another (the latest). Contact: lmdrsm@lmd.ericsson.se (Roar Smith)
Other marked-up text
-
mma2html is a perl program to convert Mathematica notebooks to HTML.
Contact: hedonley@grove.iup.edu (Ed Donley)
- The BETA and JAM
markup languages have converters to both LateX and HTML.
- htxp is a
macro preprocessor for writing HTML (and other
documents) allowing user-definable abbreviations and macros.
- HLPDK tools from HyperAct Inc allow
translation of help files from native HDF format to HTML, Windows
Help, OS/2 Help, DESQview/X, MM Viewer and several dos targets.
- hlptohtml converts Digital's DCL HELP files
(.HLP suffix) to html. Contact: advax@erich.triumf.ca (A.Daviel)
-
Here is a perl script that converts an AmigaGuide hypertext
document into a directory full of HTML pages. Contact: witbrock@cs.cmu.edu (Michael Witbrock)
-
WinHelp files can be translated using the HLPDK tools from HyperAct Inc
-
setext.pl is a
perl script for use as a Plexus filter to convert
setext markup to HTML. Contact: sanders@bsdi.com (Tony Sanders)
-
GMLHTML is a GML to HTML Translator for Waterloo Script/GML, based
on a set of GML macros. Contact: friendly@hotspur.psych.yorku.ca (Michael Friendly).
Useful collections of scripts
-
There is a large assortment of filters here.
Contact: oscar@iam.unibe.ch (Oscar Nierstrasz)
-
Another useful collection is the perlWWW tools dealing with SGML, HTML, Mosaic, Frame, etc.
It includes htmltoc to generate a Table of Contents for HTML documents
Contact: ehood@convex.com (Earl Hood)
Plain text etc.
-
text2html.sed is
a sed script to turn plain text into plain-looking valid HTML markup so that it
will be rendered just as it was.
-
txt2html.pl
is a
Perl script which attempts to do a better job than its
text2html.sed companion. In particular, it allows embedding of href's
into from plaintext. Contact: mengwong@seas.upenn.edu (Meng Weng Wong)
-
txt2html is yet another
Perl script which attempts to do an even better job. Contact: seth@cs.wustl.edu (Seth Golub)
-
WEBIT
is another perl script to turn text and simple identifiers into HTML.
Contact: attau791@futures.wharton.upenn.edu (Muhammad At-Tauhidi).
-
plain2 is a converter for plain text
to TeX and roff which has an HTML output option. Current version plain2-2.53 and patch
plain2-2.53-html.94feb15.patch. Contact: k-chinen@is.aist-nara.ac.jp (Kenichi Chinen)
-
HTML Markup for Mac convertes plain text to HTML
in a flexible way. Contact: sjked@rit.edu (
Scott J. Kleper)
-
Hyperize is a similar product for the Mac but with
several translation modes. Contact: jcj@extro.ucc.su.oz.au (Jason Johnston)
-
TextToHTML for the Mac from Logic n.v. handles plain text as well as rtf.
Contact: kris@logic.be (Kris Coppieters)
-
findrefs.pl Written by Ari Lemmke,
finds references http:... in plain text files and generates anchors out of them.
-
Text files can also be translated using the HLPDK tools from HyperAct Inc
-
man2html.c converts nroff output to html.
Microsoft Excel
-
XL2HTML.XLS is a Visual Basic Macro for Microsoft Excel 5.0
(Windows and Macintosh versions).
It allows users to specify a range of cells to be converted and then
generates a file with the data in HTML "Table" format.
Contact: jordan.evans@gsfc.nasa.gov (Jordan Evans).
-
Here is an
extended version of the above that produces more advanced tables and coloured bar graphs as well.
Contact: d.r.newman@qub.ac.uk (David R. Newman)
- excel-to-html
is an Excel 4.0 macro to convert part or all of an Excel spreadsheet to an
HTML 3.0 table. Contact: Scott_Blanksteen@nar.com (Scott Blanksteen)
-
Excel to HTML
is a Mac application that converts Excel 4.0 or 5.0 spreadsheets to HTML tables.
Contact: trenthem@rhodes.edu (Richie Trenthem)
Various formats
-
Lotus InterNotes Web Publisher
automates the process of publishing information to WWW by
converting Notes documents into HTML.
-
TILE
will convert Lotus Notes databases directly into HTML.
-
XTND HTML translator
is for use with Macintosh applications such as ClarisWorks that support
the XTND System. It uses translators, written independently of the application, to import and
export file information to and from the application
Contact: binky@eecs.umich.edu (Brian A. Sullivan), jrday@eecs.umich.edu (Jonathan Ryan Day).
-
HTML+ is another XTND translator.
Contact: leonardr@netcom.com (Leonard Rosenthol).
-
Jan Stelovsky <janst@uhics.ics.Hawaii.Edu> is working on a
prototype converter for SuperCard.
Suggested features and alpha testers are solicited.
-
ls2html.awk is an awk script which
will just take a list of names and generate a menu.
-
dir2html is a shell script which
generates a menu of pointers to files with particular suffixes in a set
of directories. It also includes a README file at the head of the
hypertext list if one exists.
-
Charconv allows the transformation of
one encoding of an extended character set (e.g., ISO Latin-1) to
another (e.g., MS DOS, Macintosh). It can handle HTML.
-
BBS2HTML for DOS/WIN/OS2 converts
FILES.BBS listings (used on most BBSes) to HTML. Once the files are converted
and placed on the HTTP server, users can browse the descriptions and click and
download the files. The funtionality is very similar to a BBS.
Contact: scott@oce.com (Scott Fletcher).
-
Usenet-Web
is Perl software for the low maintenance archiving and WWW
presentation of Usenet newsgroups.
-
Here is a script
that assists in reading news with links in it using a WWW browser.
-
Here are a couple of simple scripts to convert a .newsrc file
into html.
-
Spry2htm and Winh2htm are for
Windows SPRY Airmosaic Hotsheets and NCSA Mosaic hotlists.
-
Here is some information and further scripts
for converting Mosaic hotlists to html.
-
htn2html.c See the Hytelnet gateway
for the program to convert hytelnet data into HTML.
-
Here is a gawk script for converting a
calendar file.
-
Here is a small perl script to make an HTML page
out of all the URLs extracted from a set of files.
-
rfcidx2html
will generate an html version of an rfc-index file.
Contact: erdal@knidos.cc.metu.edu.tr (Erdal Taner)
-
blast2html is
a filter written in Perl for adding hyperlinks to the output of BLAST,
a program used by molecular biologists to compare DNA and protein sequences.
Contact: krobison@nucleus.harvard.edu (Keith Robison)
-
ged2html
converts GEDCOM genealogical databases: for Unix, MS-DOS and MS-Windows.
Contact: stark@cs.sunysb.edu (Eugene W. Stark)
-
cert2html is a small script to convert the
CERT advisories to HTML.
-
ncftp2html converts ncftp v2.0.0+ hosts files to HTML.
Available here as ncftp2html.c
Contact: smthor01@msuacad.morehead-st.edu (Michael Thornberry)
-
See references to converting foils
with Persuasion and Powerpoint.
-
Russell McOrmond
__________________________________________________________________
MS,
CERN
12 December 1995