Network Working Group                                          E. Thomas
Request for Comments: 1429                    Swedish University Network
                                                          February 1993


                     Listserv Distribute Protocol

Status of this Memo

  This memo provides information for the Internet community.  It does
  not specify an Internet standard.  Distribution of this memo is
  unlimited.

Abstract

  This memo specifies a subset of the distribution protocol used by the
  BITNET LISTSERV to deliver mail messages to large amounts of
  recipients.  This protocol, known as DISTRIBUTE, optimizes the
  distribution by sending a single copy of the message over heavily
  loaded links, insofar as topological information is available to
  guide such decisions, and reduces the average turnaround time for
  large mailing lists to 5-15 minutes on the average. This memo
  describes a simple interface allowing non-BITNET mailing list
  exploders (or other bulk-delivery scripts) to take advantage of this
  service by letting the BITNET distribution network take care of the
  delivery.

Introduction

  Running a mailing list of 1,000 subscribers or more with plain
  "sendmail" while keeping turnaround time to a reasonable level is no
  easy task. Due mostly to its limited bandwidth in the mid-80's,
  BITNET has developed an efficient bulk delivery protocol for its
  mailing lists. Originally introduced in 1986, this protocol was
  refined little by little and now carries 2-6 million mail messages a
  day. In fact, this distribution mechanism implements a general-
  purpose delivery service which can be used by any user of BITNET or
  the Internet. Thus, a simple solution to the "sendmail" turnaround
  problem is to wrap the message and recipient list in a DISTRIBUTE
  envelope and pass it to a BITNET server for delivery.  This may not
  be the best possible solution, but it has the advantage of being easy
  to implement.

  In this document we will use the term "production" to refer to the
  normal operation of the mailing list (or bulk delivery application)
  you want to pipe through the DISTRIBUTE service. That is, the
  "production" options are those you should specify once everything is
  tested and you are confident that the setup is working to your



Thomas                                                          [Page 1]

RFC 1429              Listserv Distribute Protocol         February 1993


  satisfaction. In contrast, "test" and "debug" options can be used to
  experiment with the protocol but should not be used for normal
  operation because of the additional bandwidth and CPU time required
  to generate the various informational reports.

  Finally, it should be noted that the DISTRIBUTE protocol was
  developed to address a number of issues, some of them relevant only
  to BITNET, and has evolved since 1986 while keeping a compatible
  syntax. For the sake of brevity, this RFC describes only a small
  subset of the available options and syntax. This is why the syntax
  may appear unnecessarily complicated or even illogical.

1. Selecting an entry point into the DISTRIBUTE backbone

  The first thing you have to do is to find a suitable site to submit
  your distributions to. For testing, and for testing ONLY, you can
  use:

                        [email protected]

  For production use, however, you should select a DISTRIBUTE site in
  your topological vicinity: it would make no sense to pass your
  distributions from California to a server in Sweden if most of your
  recipients are in the US. If your organization is connected to BITNET
  and your BITNET system is part of the DISTRIBUTE backbone, this ought
  to be your best bet. Otherwise you will want to contact someone
  knowledgeable about BITNET (or the author of this RFC if you have no
  BITNET users). Make sure to run through the following checklist
  before sending any production traffic to the site in question:


  a. Do you have good connectivity to the host in question? Does the
     host, in general, have decent BITNET connectivity? There are still
     a few sites that insist on using 9.6k leased lines for BITNET in
     spite of having T1 IP access. You will want to avoid them.

  b. Send mail to the server with "show version" in the message body
     (not in the subject field, which is ignored). Is the server running
     version 1.7f or higher? If so, it should not have given you the
     following warning,

       >>> This server is configured to use PUNCH format for mail <<<

     which means that messages with lines longer than 80 characters
     cannot be handled properly. If the software version is less than
     1.7f, the warning will not be present; instead, check the first
     (bottom) "Received:" field. If it does not say "LMail", do not use
     this server as it probably cannot handle messages with long lines.



Thomas                                                          [Page 2]

RFC 1429              Listserv Distribute Protocol         February 1993


     Finally, make sure that the "Master nodes file" is not older
     than 2 months: there are a handful of sites which never update
     their tables due to staffing problems. They cannot be prevented
     from running LISTSERV, but you will certainly want to avoid them.

  c. How big is your workload? If you are planning to use the service
     for more than 10,000 daily recipients, you should get permission
     from the LISTSERV administrator, both as a matter of courtesy and
     to hear about any restrictions or regularly scheduled downtime they
     might have. For instance, some universities might not allow large
     distributions during prime time, or they may have several
     DISTRIBUTE machines and will want to make sure you use the "right"
     one.  Send mail to "owner-listserv" at the host in question and
     give an estimate of the amount of daily messages and recipients you
     would like to submit. If your message bounces back with "No such
     local user" or the like, it means the server did not pass the above
     test (b) and you don't want to use it anyway.

  An index of sites/hosts which have the required configuration, good
  connectivity, keep their tables up to date and have generally agreed
  to provide this service to anyone in their topological area will be
  published separately in the future.

2. Physical delivery of the DISTRIBUTE request

  The distribution request is delivered via SMTP to the e-mail address
  obtained in step 1 (for instance, [email protected]). In fact,
  as long as you can somehow get mail to the server's host, you can use
  the service; SMTP is just the most convenient way of doing so.

2.1. Contents of MAIL FROM: field

  You should set the MAIL FROM: field to the address of the person who
  maintains your mailing list or, generally speaking, to the address of
  a human being who can take action in case the message fails to reach
  the DISTRIBUTE server's host. This is a very rare occurrence.

2.2. Contents of RCPT TO: field

  The RCPT TO: field points to the server's address (for instance,
  [email protected]).

2.3. Contents of the RFC822 header

  After the DATA instruction, you must supply a valid RFC822 header
  with a "From:" field pointing to the mailbox that should receive
  notification of delivery problems, bounced mail, and so on. This can
  be the same as the MAIL FROM: field, an address of the type "owner-



Thomas                                                          [Page 3]

RFC 1429              Listserv Distribute Protocol         February 1993


  xxxx@yourhost", etc.  DO NOT PUT THE LIST SUBMISSION ADDRESS THERE,
  or you will get mailing loops.

  For testing, the "From:" field should point to your own mailbox, so
  that you get the responses from the server.

  As long as RFC822 syntax is respected, the only field that matters is
  the "From:" field (or "Sender:", "Resent-From:", etc.). In practice
  this means you can just pipe the distribution request into "mail
  listserv@whatever" and let your mail program build all the headers.

3. Format of the DISTRIBUTE request

  The body of the message delivered to LISTSERV defines the recipients
  of the distribution and the text (header + body) of the RFC822
  message you want to have delivered. The request starts with a "job
  card", followed by a DISTRIBUTE command, a list of recipients, and
  finally the message header and body.

3.1. Syntax of the JOB card

  The purpose of the JOB card is to make sure that any spurious text
  inserted by mail gateways or the like is flushed and not erroneously
  interpreted as a command. It can optionally be used to associate a
  "job name" with the request, in case you want to use tools to assist
  you in processing the notifications you get from the DISTRIBUTE
  servers when running in test mode. The syntax is as follows:

  //jobname JOB ECHO=NO

  "jobname" can be anything as long as it does not contain blanks, and
  can be omitted. LISTSERV generally ignores case when parsing
  commands, so you can use "job" or "Job" if you prefer. The ECHO=NO
  keyword is required for production use, to suppress the "resource
  usage summary" you would otherwise get upon completion of your
  delivery. You may want to omit it when testing.

3.2. Syntax of the DISTRIBUTE command

  Below the JOB card, you must supply the following line:

  DISTRIBUTE MAIL

  For production mode, do not specify anything else on that line. When
  testing, you should add ACK=MAIL in order to get an acknowledgement
  confirming the delivery. There are two other useful options:
  DEBUG=YES, which instructs the server to produce a report showing how
  the various recipients will be routed, but without actually



Thomas                                                          [Page 4]

RFC 1429              Listserv Distribute Protocol         February 1993


  delivering the message; and TRACE=YES, which does the same but does
  deliver the message. Before making a "live" test with your actual
  recipients list, you should tack the DEBUG=YES option once to make
  sure you got all the parameters and syntax right, and get a rough
  idea of the efficiency of the distribution (see the section on
  performance).

3.3. Giving the list of recipients

  The list of recipients follows the DISTRIBUTE line and is specified
  as follows:

  //To DD *
  user1@host1 BSMTP
  user2@host2 BSMTP
  /*

  The two lines starting with a "/" have to be copied as-is. Each of
  the lines in between contains the address of one of the recipients,
  followed by a blank and by the word "BSMTP", which indicates that you
  do not want the header rewritten. There are four restrictions:

  a. The address must be a plain "local-part@hostname" - no name string,
     no angle bracket, no source route, etc. Bear in mind that the
     DISTRIBUTE server is not in the same domain as you: all the
     addresses should be fully qualified.

  b. If the local-part is quoted, it must be quoted from the first word
     on.  Technically, RFC822 allows: Joe."Now@Home"[email protected], but
     for performance reasons this form is not supported. Just quote the
     first word to tell LISTSERV to run the address through the full
     parser: you would write "Joe"."Now@Home"[email protected] instead.

  c. The local-part of the address may not start with an (unquoted)
     asterisk.  You can bypass this restriction by quoting the local
     part and using a %-hack through the server's host:
     "***JACK***%jack-ws.xyz.edu"@server-host.

  d. Blanks are not allowed anywhere in the address.

  You can use the pseudo-domain ".BITNET" for BITNET recipients: it is
  always supported within DISTRIBUTE requests.

3.4. Specifying the message text

  After the last recipient and the closing "/*", add the following
  line,




Thomas                                                          [Page 5]

RFC 1429              Listserv Distribute Protocol         February 1993


  //Data DD *,EOF

  followed by the RFC822 message (header + body) that you want
  delivered.  The EOF option indicates that the message header and body
  will extend until the end of the message you are sending to the
  DISTRIBUTE server.  If you are worried about extraneous data being
  appended by a gateway, remove the EOF option, add a closing "/*" line
  after the end of the message, followed by a "// EOJ" card to flush
  any remaining text. This, however, will fail if the message itself
  contains a "/*" line; you would have to insert a space before any
  such line.

4. Examples

  Here is an (intentionally short) example to clarify the syntax:

  ----- cut here -----
  //Test JOB
  Distribute mail Ack=mail Debug=yes
  //To DD *
  [email protected] BSMTP
  [email protected] BSMTP
  [email protected] BSMTP
  [email protected] BSMTP
  [email protected] BSMTP
  /*
  //Data DD *,EOF
  Date:         Tue, 19 Jan 1993 10:57:29 -0500
  From:         Robert H. Smith <[email protected]>
  Subject:      Re: Problem with V5.41
  To:           [email protected]

  I agree with Jack, V5.41 is not a stable release. I had to fall back
  to V5.40 within 5 minutes of installation...

                                          Bob Smith
  ----- cut here -----

  Note: some of the hostnames are genuine, but the usernames are all
  fictitious.

  You would get the following reply:

  --------------------------------------------------------------------
  Job "Test" started on 20 Feb 1993 01:09:40

  > Distribute mail ack=mail debug=yes
  Debug trace information:



Thomas                                                          [Page 6]

RFC 1429              Listserv Distribute Protocol         February 1993


  ABC.COM                   goes to SEARN    (213) - single recipient
  ALPHA.CC.BUFFALO.EDU      goes to UBVM     (027) - single recipient
  LIBRARY.RICE.EDU          goes to RICEVM1  (022) - single recipient
  TAMVM1                    goes to TAIVM1   (247) - single recipient
  WS-4.XYZ.EDU              goes to SEARN    (213) - single recipient

  Path information:

   TAIVM1  : UGA      RICEVM1  TAIVM1
   UBVM    : UGA      UBVM
   RICEVM1 : UGA      RICEVM1

  (Debug) Mail forwarded to LISTSERV@UGA      for   3 recipients.
  (Debug) Mail posted via BSMTP to [email protected].
  (Debug) Mail posted via BSMTP to [email protected].

  Job "Test" ended   on 20 Feb 1993 01:09:40

  Summary of resource utilization
  -------------------------------
   CPU time:        0.086 sec                Device I/O:     6
   Overhead CPU:    0.045 sec                Paging I/O:     5
   CPU model:        9221                    DASD model:  3380
  --------------------------------------------------------------------

  To actually perform the distribution and get an acknowledgement, you
  would change the first two lines as follows:

  ----- cut here -----
  //Test JOB Echo=NO
  Distribute mail Ack=mail
  --------------------

  And you would get the following reply:

  --------------------------------------------------------------------
  Mail forwarded to LISTSERV@UGA      for   3 recipients.
  Mail posted via BSMTP to [email protected].
  Mail posted via BSMTP to [email protected].
  --------------------------------------------------------------------

  Finally, by removing the "Ack=mail" keyword you would perform a
  "silent" distribution without any acknowledgement, suitable for
  production mode.







Thomas                                                          [Page 7]

RFC 1429              Listserv Distribute Protocol         February 1993


5. Performance

  The efficiency of the distribution depends mostly on the quality and
  accuracy of the topological information available to the DISTRIBUTE
  server (and, in some extreme cases, on system load). For BITNET
  recipients, the typical turnaround time for reasonably well connected
  systems is 5-15 minutes. Internet recipients fall in two categories:
  those which can be routed to a machine within or close to the
  recipient's organization (average turnaround time 5-20 minutes), and
  those for which no topological information is available at all. In
  that case the delivery can take much longer, but usually remains
  faster than with a vanilla sendmail setup. At the time being,
  topological information is available for most top-level domains
  outside the US and for many sub-domains of EDU and GOV.

  You can measure the efficiency of the distribution using the
  DEBUG=YES option as explained above. Recipients which get forwarded
  to another server usually get delivered within 5-20 minutes (except
  to poorly connected sites or countries, for which not much can be
  done). Recipients which are handled locally are passed to a local
  SMTP agent whose efficiency depends very much on the amount of
  "burst" queries the local name server can handle in quick succession.

  A number of projects are currently underway to investigate the
  feasibility of improving the quality of the topological information
  available to the DISTRIBUTE servers for the Internet.

Security Considerations

  Security issues are not discussed in this memo.

Author's Address

  Eric Thomas
  Swedish University Network
  Dr.Kristinas vaeg 37B
  100 44 Stockholm, Sweden

  E-mail: [email protected]












Thomas                                                          [Page 8]