Up to full list of filters
There are various tools to convert SGML into HTML. Here is a
an annotated summary of some of them, and
here are a few more pointers.
Perl filters for several DTDs used in the humanities are available from Michigan and Virginia:
see " WWW-to-PAT Gateway;
exploiting an SGML-aware system through the Web ".
OCLC offers SGML translation services via Fred, which
includes tools to translate SGML-tagged text to other formats.
Currently, OCLC uses Fred to translate from SGML to HTML, TeX
(PostScript), and ASCII.
There are several versions of the HTML DTD: more information is available
from the W3 Consortium.
There is also a program dtd2html to convert SGML DTDs
themselves to HTML to help browse and understand them.
Linuxdoc-SGML is a text-formatting package based on SGML
that will produce HTML as well as other marked up documents.
For OSF DTD to HTML tools send mail to "dtd-comment@osf.org"
Conversion of HTML/SGML to other formats is also possible
Further information
is available on SGML resources and tools.
HTML preprocessors etc.
Some of these have project / site management features.
A good set of references with explanation and commentary is given by
Jahn Rentmeister. Here are a few not covered there:
-
HTE
is a preprocessor that aims to achieve the benefits of
HTML++ without it's shortcomings.
It provides an inheritance mechanism for documents and parts of
documents, variable manipulation and regular expression handling.
ANSI-C source for Version 0.95 is available free.
Contact: rentmei@uni-muenster.de (Jahn Rentmeister)
-
htmlpp is a preprocessor for HTML along the lines
of the c preprocessor. Contact: pahint@eunet.be (Pieter Hintjens)
- htxp is a
macro preprocessor for writing HTML (and other
documents) allowing user-definable abbreviations and macros.
Contact: kwong@mcs.anl.gov (Man Kam Kwong)
- hsc is a preprocessor mainly intended to create
and maintain larger html projects. Written for the Amiga, but ANSI-C
source available.
- ePerl allows Perl 5 statements
to be embedded in HTML documents and evaluated to produce standard HTML.
Contact: rse@engelschall.com (Ralf S. Engelschall)
-
Meta-HTML
provides the most commonly wanted Web functionality as built-in
primitives. Web pages are authored using HTML and Meta-HTML statements
intermixed. The Meta-HTML engine dynamically processes any Meta-HTML
statements to produce a final HTML page which is delivered to the browser.
- The BETA, JAM
and YODL markup languages have converters to both LateX and HTML (and other formats).
Windows Help
- helpsite from Wextech
is an HTML authoring utility that allows Windows online Help
authors to convert their projects into WWW documents automatically.
-
WinHelp files can be translated using the HLPDK tools from HyperAct Inc.
-
The Hypertext Suite from Olson software
allows authoring WinHelp and HTML files from a single source file, as well as changing between WYSIWYG WinHelp and HTML.
-
htmlp20d is a converter between HTML and Windows 3.1x Help (HLP) files, also for Win 95.
Available from the SimTel archives.
-
Help-to-HTML Kit from
Blue Sky Software.
Emacs info
-
info2html
is a perl CGI script for converting Emacs info files into HTML. Contact: guggis@iam.unibe.ch (Karl Guggisberg)
-
info2www is another such script. Contact: lmdrsm@lmd.ericsson.se (Roar Smith)
-
Another info2www is a stand-alone perl script to convert
Emacs info files (written directly, not compiled from TeXinfo source).
Contact: ktl@wag.caltech.edu (Kian-Tat Lim)
Other marked-up text
- HLPDK tools from HyperAct Inc allow
translation of help files from native HDF format to HTML, Windows
Help, OS/2 Help, DESQview/X, MM Viewer and several dos targets.
-
mma2html is a perl program to convert Mathematica notebooks to HTML.
Contact: hedonley@grove.iup.edu (Ed Donley)
- hlptohtml converts Digital's DCL HELP files
(.HLP suffix) to html. Contact: advax@erich.triumf.ca (A.Daviel)
-
Here is a perl script that converts an AmigaGuide hypertext
document into a directory full of HTML pages. Contact: witbrock@cs.cmu.edu (Michael Witbrock)
-
setext.pl is a
perl script for use as a Plexus filter to convert
setext markup to HTML. Contact: sanders@bsdi.com (Tony Sanders)
- The MTX system uses a Perl script to allow easy generation of HTML from
"simple markup" files. An older Hypercard version is also available.
Contact: rrathe@dean.med.ufl.edu (Richard Rathe)
-
GMLHTML is a GML to HTML Translator for Waterloo Script/GML, based
on a set of GML macros. Contact: friendly@hotspur.psych.yorku.ca (Michael Friendly).
-
A converter is under development to convert tagged ascii to HTML
(tagged ascii being the report format of database programs such as dBASE and Foxpro).
Currently for DOS, Windows version soon.
-
The wwwcreat system allows a general
approach to converting database contents to HTML, using a customisable SGML DTD
and HTML templates. Contact: Ingo.Macherius@tu-clausthal.de
(Ingo Macherius)
Useful collections of scripts
-
There is a large assortment of filters here.
Contact: oscar@iam.unibe.ch (Oscar Nierstrasz)
-
Another useful collection is the perlWWW tools dealing with SGML, HTML, Mosaic, Frame, etc.
It includes htmltoc to generate a Table of Contents for HTML documents
Contact: ehood@convex.com (Earl Hood)
Plain text etc.
- Q-HTML for Windows
is a tool to convert text which has been entered with linebreaks into HTML.
It has various automatic conversion features. Contact: craig@scot.demon.co.uk (preferred), craig@acm.org
(Craig Cockburn)
- Text2HTML is a configurable freeware converter for
Windows. Contact: flagg@oberberg-online.de (Erik Möller)
- Quick Page is another converter for
Windows with many features. Contact: cwagner@connect.ab.ca (Chris Wagner)
-
text2html.sed is
a sed script to turn plain text into plain-looking valid HTML markup so that it
will be rendered just as it was.
-
txt2html.pl
is a
Perl script which attempts to do a better job than its
text2html.sed companion. In particular, it allows embedding of href's
into from plaintext. Contact: mengwong@seas.upenn.edu (Meng Weng Wong)
-
txt2html is yet another
Perl script which attempts to do an even better job. Contact: seth@cs.wustl.edu (Seth Golub)
-
WEBIT
is another perl script to turn text and simple identifiers into HTML.
Contact: attau791@futures.wharton.upenn.edu (Muhammad At-Tauhidi).
-
AscToHTM
is a shareware program for Windows and VMS that converts plain text to HTML.
It tries to recognise section headings and indent levels, and lets you
specify header and footer files, background colour etc.
Contact: jaf@yrl.co.uk (John A Fotheringham)
-
plain2 is a converter for plain text
to TeX and roff which has an HTML output option. Current version plain2-2.53 and patch
plain2-2.53-html.94feb15.patch. Contact: k-chinen@is.aist-nara.ac.jp (Kenichi Chinen)
-
HTML Markup for Mac convertes plain text to HTML
in a flexible way. Contact: klephacks@kagi.com (
Scott J. Kleper)
-
Hyperize is a similar product for the Mac but with
several translation modes. Contact: jcj@extro.ucc.su.oz.au (Jason Johnston)
-
TextToHTML for the Mac from Logic n.v. handles plain text as well as rtf.
Contact: kris@logic.be (Kris Coppieters)
-
findrefs.pl Written by Ari Lemmke,
finds references http:... in plain text files and generates anchors out of them.
-
Text files can also be translated using the HLPDK tools from HyperAct Inc
-
man2html.c converts nroff output to html.
-
and see tables for Windows; TableCloth and TableTool for Mac, which convert tab-delimited
text to HTML tables.
Spreadsheets, tables, etc.
-
For several spreadsheet formats see DataViz Web Publisher
-
Free Excel to Web Table Converter (professional version real soon now)
from Baarns Publishing will create an HTML table
from selected Excel cells. Options help build an entire Web Page using Excel.
Contact: baarnspublishing@baarns.com
-
Internet Assistant for Excel. An add-in wizard from Microsoft
assists users in converting spreadsheet data to HTML.
-
XL2HTML.XLS is a Visual Basic Macro for Microsoft Excel 5.0
(Windows and Macintosh versions).
It allows users to specify a range of cells to be converted and then
generates a file with the data in HTML "Table" format.
Contact: jordan.evans@gsfc.nasa.gov (Jordan Evans).
-
Here is an
extended version of the above that produces more advanced tables and coloured bar graphs as well.
Contact: d.r.newman@qub.ac.uk (David R. Newman)
- excel-to-html
is an Excel 4.0 macro to convert part or all of an Excel spreadsheet to an
HTML 3.0 table. Contact: Scott_Blanksteen@nar.com (Scott Blanksteen)
-
XTML
is an Add-in module for Microsoft Excel v5.0 or later (Macintosh or Windows versions).
The formatting from the Excel worksheet is preserved.
Contact: Sayward@kagi.com (Ken Sayward)
-
RangeToHTM is a VB macro to convert
an Excel Range to a HTML document.
-
Instant Tables from Two Clicks is a free forms-based service to
convert tab-separated data into HTML tables. Supports a range of
browsers are supported.
-
WinBrowse from Q&D Software Development is a utility
that among other things allows conversion of many spreadsheet and database formats to HTML.
-
KatalogMaster (also in German - auch auf Deutsch) for Windows has facilities for
importing Excel, Access or dBase databases.
-
Table Master is a shareware program that
creates HTML tables from data stored in spreadsheet and database files.
Contact: bobray@softville.com (Bob Ray)
-
tables
is a freeware Windows utility that turns a semicolon or comma separated file
into an html table. Supports colspan and rowspan. Contact: Finn@Ekberg.com (Finn Ekberg Christiansen)
-
A converter for ascii comma or tab-delimited files for DOS is available
from Ed Light
-
TableCloth is a small AppleScript applet which converts
tab-delimited text to HTML tables. It can convert files output from spreadsheet and database
applications such as Microsoft Excel, ClarisWorks, Lotus' 1-2-3, and FileMaker Pro.
-
TableTool is a small HyperCard stack to convert data from
spreadsheets or databases saved as standard tab-delimited text files to HTML files.
-
Note: the Excel to HTML
Mac application of trenthem@rhodes.edu (Richie Trenthem) is no longer available.
-
Web-It
XTND HTML translator. For use with Macintosh applications such as ClarisWorks that support
the XTND System. It uses translators, written independently of the application, to import and
export file information to and from the application.
(Now included in ClarisWorks 4.0).
Contact: binky@eecs.umich.edu (Brian A. Sullivan), jrday@eecs.umich.edu (Jonathan Ryan Day).
-
HexWeb XTND from HexMac
is an import/export filter for the Claris XTND system.
(Your browser needs frames to access their site).
-
HTML+ is another XTND translator.
Contact: leonardr@netcom.com (Leonard Rosenthol).
Various formats
-
Adobe Acrobat Access
will convert the text portion of PDF documents to html. Its motivation is to assist the
visually handicapped. It offers conversion by proxy, form, or email.
-
Lotus InterNotes Web Publisher
automates the process of publishing information to WWW by
converting Notes documents into HTML.
-
TILE
will convert Lotus Notes databases directly into HTML.
- Teletext:
- Some teletext-related software can be found
here.
Contact: luigi@iet.unipi.it (Luigi Rizzo)
-
Some info on the teletext format is available here.
-
Jan Stelovsky <janst@uhics.ics.Hawaii.Edu> is working on a
prototype converter for SuperCard.
Suggested features and alpha testers are solicited.
-
ls2html.awk is an awk script which
will just take a list of names and generate a menu.
-
dir2html is a shell script which
generates a menu of pointers to files with particular suffixes in a set
of directories. It also includes a README file at the head of the
hypertext list if one exists.
- gopher:
- goph2html
perl script converts a gopher tree to a rough approximation in HTML.
(Uses lynx). Contact: riddle@rice.edu (Prentiss Riddle)
- cap2html is a
small shell script to create HTML indexes from gopher directories. It recurses
on directories and gets file descriptions from files found in .cap directory.
Contact: vparada@inf.utfsm.cl (Victor Parada)
-
Charconv allows the transformation of
one encoding of an extended character set (e.g., ISO Latin-1) to
another (e.g., MS DOS, Macintosh). It can handle HTML.
-
BBS2HTML for DOS/WIN/OS2 converts
FILES.BBS listings (used on most BBSes) to HTML. Once the files are converted
and placed on the HTTP server, users can browse the descriptions and click and
download the files. The funtionality is very similar to a BBS.
Contact: scott@oce.com (Scott Fletcher).
-
Usenet-Web
is Perl software for the low maintenance archiving and WWW
presentation of Usenet newsgroups.
-
Here is a script
that assists in reading news with links in it using a WWW browser.
-
Here are a couple of simple scripts to convert a .newsrc file
into html.
-
Spry2htm and Winh2htm are for
Windows SPRY Airmosaic Hotsheets and NCSA Mosaic hotlists.
-
Here is some information and further scripts
for converting Mosaic hotlists to html.
-
htn2html.c See the Hytelnet gateway
for the program to convert hytelnet data into HTML.
-
Here is a gawk script for converting a
calendar file.
-
Here is a small perl script to make an HTML page
out of all the URLs extracted from a set of files.
-
rfcidx2html
will generate an html version of an rfc-index file.
Contact: erdal@knidos.cc.metu.edu.tr (Erdal Taner)
-
blast2html is
a filter written in Perl for adding hyperlinks to the output of BLAST,
a program used by molecular biologists to compare DNA and protein sequences.
Contact: krobison@nucleus.harvard.edu (Keith Robison)
-
ged2html
converts GEDCOM genealogical databases: for Unix, MS-DOS and MS-Windows.
Also available from the GENDEX site.
Contact: stark@cs.sunysb.edu (Eugene W. Stark), ged2html@gendex.com
-
cert2html is a small script to convert the
CERT advisories to HTML.
-
ncftp2html converts ncftp v2.0.0+ hosts files to HTML.
Available here as ncftp2html.c
Contact: smthor01@msuacad.morehead-st.edu (Michael Thornberry)
-
See references to converting foils
with Persuasion and Powerpoint.
-
Software tools
to convert Freeport-style menus used by a number of
Freenet's and other community networks. Contact:
Russell McOrmond
-
DAMOCLES 1.0
is a tool for editing and browsing OSI Managed Object specifications
according to the Guidelines for the Definition of Managed Objects (GDMO)
The tool supports the conversion of GDMO/ASN.1 definitions to HTML, MIF,
and RTF files. Contact: ikv@ikv.b.eunet.de (Eric Mordoh)
-
The Leaky Tree Compiler converts
QDL to HTML.
Contact: stevew@eskimo.com (Steve Walton)
__________________________________________________________________
MS,
CERN
5 July 1998