Erik Oliver's Home Page
[Header-201204]
Search [ ] [Search]
Main menu
Skip to primary content
Skip to secondary content
• Home
• Site Features
□ Fillable Forms in Microsoft Word
□ Secret of Evermore Walkthrough
□ Setext
• Patent Polishing
Setext
Basic Information About Setext
Setext is a simple text formatting language that was originally
designed for use in simple email and USENET exchanges. It has
gradually fallen by the wayside to HTML; however, it remains an
extremely simple and useful way of presenting text in a
structured fashion.
Programs
• setext2html.pl – original script from BSDi.com for
setext2html conversion
• setext2latex.pl – my own perl setext to latex converter
(Revised 16 Sep 2007; v 1.13)
• setext2html.pl – my own perl setext to html converter
(Revised 9 Sep 2007; v 1.9)
Tag Reference
Name Setext Example Displayed As /
Pattern Comments
Displayed literally
w/ minimal number of
headers.
Mail/USENET headers.
These bits of
information
primarily relate to
mail/news. Can
accomplish a third
Subject: Re: SGML vs. Setext level of hierarchy.
(a) Subject: … From:
[email protected] Note, my
subject-tt [From: … & Date: 10 Apr 2001 setext2latex parser
Date: … ] requires that the
headers be at the
start of the line
and only recognizes
“Subject”, “Date”,
and “From”
translating them to
the \title, \author,
and \date, of the
LaTeX title page
respectively.
Displayed in a
(user) selected
style for titles.A
distinct title
identified by the
text, maximum one
per setext. Must
start at the
beginning of the
line.First title-tt,
subhead-tt, or
subject-tt found
This is a long title scanning the file
(b) title-tt “Title ==================== from top to bottom
=====” will become the
LaTeX \title.
Therefore a
subject-tt should be
before a title-tt
which should be
before an
subhead-tt. No error
is raised by the
parser on multiple
title-tt’s or
multiple
subject-tt’s after
the first one.
Displayed in a
(user) selected
style for
subheadings. A
Subheading One distinct subheading
(c) “Subhead——-“ -------------- identified by the
subhead-tt text, zero or more
per text. Must start
at beginning of
line. See note in
title-tt about
handling.
Lines undented and
unfolded (longer
66-char First paragraph.. lines are generally
(d) lines more of paragraph. tolerated by most
indent-tt indented by parsers). This is
2 spaces Next paragraph... primary body text,
generally plain
undented in emails,
etc. currently.
Display in a (user)
selected style,
(e) bold-tt **[multi] This is **very important**... preferrably bold.One
word** or more bold words,
generally *word* or
**word** in emails
Display in a (user)
selected style,
preferrably
italics.Single
italicized word;
(f) ~word~ This is an ~italic~ word. unclear why
italic-tt multi-word form not
available.Multi-word
form of
~first~second~third~
supported by
setext2latex.
Display in a (user)
selected style,
preferrably with
(g) [_multi] This is _underlined_text_. underlining–except
underline-tt _word_ in browsers where
underlining
corresponds to hot
links.One or more
underlined words
Used in conjunction
with href-tt to make
footnotes or
hyperlinks. In
setext.pl provided
at home page it
makes the hot-tt a
[multi_] This is a hot_word_. hyperlink to the
(h) hot-tt word_ corresponding
href-tt. In my LaTeX
converter
the href-tt becomes
a footnote
associated with just
after the
hot-tt.Hypertextual
1+ word
Displayed in a user
selected style,
> This is quoted text... preferrably
(i) >[space] > ...more... monospaced with the
include-tt [text] leading “>”Normal
text quoting style
of news/mail user
agents.
Displayed in bullet
or list format.
Ambiguity as to
whether “*” has to
be in leftmost
non-space
position or absolute
first position,
(j) *[space] * Item 1 that is... actual practice
bullet-tt [text] ...really long* Item 2 suggests first
non-space position.
Also, handling of
run-on lines with
bullets is
ambiguous, I resolve
by allowing lines to
be wrapped according
to 2-char indent
rule.
Displayed literally,
e.g as if containing
typotag was not
therein. Probably
could omit “`” marks
in display.
Mostly useful for
presenting stuff
`[typo tags about setext in
(k) quote-tt from (a)- `here's some _underlined_text_ to show literally` setext.
(p)]` Implementation is
ambiguous,
setext2latex handles
“ as a literal `.
Otherwise handles
everything from the
first ` to the next
` as a literal
string even across
multiple lines.
Not directly
realized except with
hot-tt. URL could
also be some text
^.. ^.. _hot_word
http://www.this url for a footnote.
(l) href-tt _hot_word Modified in version
URL 1.11 to allow wrap
around onto multiple
lines that start
with “^.. “.
[hypertext link def]
Generate an error.
Unclear why this is
even a typotag;
^.. unclear why/how to
(m) note-tt _hot_word ^.. _hot_word Note:("Here's an error") use. One possibility
Note:(“*”) would be to use for
footnotes while
href-tt is used for
hyperlinks.
[hypertext note def]
[start parsing a new
setext within this
file]
Used to mark the end
of the first (or
(n) $$[at end of This is the end of this setext. $$ only) setext in a
twobuck-tt line] file. Generally
appears at the end
of the file since
most files include
only a single
setext.
(o) ^..[space] .. This won't show up. [not shown] not
supress-tt [not dot] presently used in
email/etc. generally
(p) ^..[alone on .. [noted; not shown]
twodot-tt line] logical end of text
Comments
The authoritative reference is no longer available at BSDI;
however, the existing table lacks concrete examples. My
examples attempt to correspond to the usage in the setext.pl
script from BSDI, usage by others, e.g. Tidbits, and my own
usage.The specification is somewhat inadequate for describing
behavior, e.g. title-tt and subhead-tt have to be at the start
of the line, etc.Similarly, from an implementation standpoint
the existing parsers for setext that I have encountered do not
properly handle multiple layers of include-tt or even typo-tags
within include-tt.
Note: only one instance of the element (c) (or, in its absence,
(b)) is absolutely required for a text to be considered a valid
setext.
All the elements but (c) are in effect optional, not necessary
for a setext to be declared as such. Element (a) deals with
setexts that arrive via email and end up being parsed
(processed) as unedited mailbox files; fully employed the (a),
(b) and (c) make it possible to distribute “multisetexts”, i.e.
setexts with one additional level of logical structure (= more
than one setext per message; more than one message in a
mailbox). If such file is viewed as a multisetext it will
result in 3-level-outline structure: mail-subjects become
top-level chapters, setext titles denote subchapters (topics)
and the subheads yet finer threads within these (still a notch
ABOVE mere “paragraphs of text”).
Blogroll
• Facebook profile
• Flickr
• Flickr
• Google+ profile
• LinkedIn profile
Recent Posts
• Picasso Puzzle
• Out of Africa
• New Puzzle – Steve Skelton – Farm Table
• Still alive
• Cups and Chirp and Circus Day
Archives
• February 2017
• April 2016
• March 2016
• January 2015
• November 2014
• December 2013
• March 2013
• September 2012
• July 2012
• June 2012
• May 2012
• April 2012
Tags
• 2007
• admin
• akismet
• aquarium
• australia
• brazil
• california
• carnival
• chile
• cruise
• disney
• disneyland
• family
• Foz do Iguaçu
• france
• germany
• golden gate bridge
• holocaust
• investing
• jigsaw
• lava
• macchu picchu
• monterey
• paraty
• paris
• patent
• patent polisher
• peru
• platinum puzzles
• puzzle
• retirement
• rio
• san francisco
• spam
• stave
• teaser
• thingamajigsaw
• torres del paine
• travel
• trick
• Uluru
• volcano
Meta
• Log in
• Entries RSS
• Comments RSS
• WordPress.org
(C) 1995-2012 Erik Oliver
Proudly powered by WordPress