Network Working Group                                         S. Crocker
Request for Comments: 33                                            UCLA
                                                                S. Carr
                                                     University of Utah
                                                                V. Cerf
                                                                   UCLA
                                                       12 February 1970


                        New HOST-HOST Protocol

  Attached is a copy of the paper to be presented at the SJCC on the
  HOST-HOST Protocol.  It indicates many changes from the old protocol
  in NWG/RFC 11; these changes resulted from the network meeting on
  December 8, 1969.  The attached document does not contain enough
  information to write a NCP, and I will send out another memo or so
  shortly.  Responses to this memo are solicited, either as NWG/RFC's
  or personal notes to me.


                    HOST-HOST Communication Protocol
                          in the ARPA Network*

  by C. Stephen Carr
  University of Utah
  Salt Lake City, Utah

  and

  by Stephen D. Crocker
  University of California
  Los Angeles, California

  and

  by Vinton G. Cerf
  University of California
  Los Angeles, California

  *This research was sponsored by the Advanced Research Projects
  Agency, Department of Defense, under contracts AF30(602)-4277 and
  DAHC15-69-C-0825.

INTRODUCTION

  The Advanced Research Projects Agency (ARPA) Computer Network
  (hereafter referred to as the "ARPA network") is one of the most
  ambitious computer networks attempted to date.  [1]  The types of



Crocker, et. al.                                                [Page 1]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  machines and operating systems involved in the network vary widely.
  For example, the computers at the first four sites are an XDS 940
  (Stanford Research Institute), an IBM 360/75 (University of
  California, Santa Barbara), an XDS SIGMA-7 (University of California,
  Los Angeles), and a DEC PDP-10 (University of Utah).  The only
  commonality among the network membership is the use of highly
  interactive time-sharing systems; but, of course, these are all
  different in external appearance and implementation.  Furthermore, no
  one node is in control of the network.  This has insured reliability
  but complicates the software.

  Of the networks which have reached the operational phase and been
  reported in the literature, none have involved the variety of
  computers and operating systems found in the ARPA network.  For
  example, the Carnegie-Mellon, Princeton, IBM network consists of
  360/67's with identical software. [2]  Load sharing among identical
  batch machines was commonplace at North American Rockwell Corporation
  in the early 1960's.  Therefore, the implementers of the present
  network have been only slightly influenced by earlier network
  attempts.

  However, early time-sharing studies at the University of California
  at Berkeley, MIT, Lincoln Laboratory, and System Development
  Corporation (all ARPAA sponsored) have had considerable influence on
  the design of the network.  In some sense, the ARPA network of time-
  shared computers is a natural extension of earlier time-sharing
  concepts.

  The network is seen as a set of data entry and exit points into which
  individual computers insert messages destined for another (or the
  same) computer, and from which such messages emerge.  The format of
  such messages and the operation of the network was specified by the
  network contractor (BB&N) and it became the responsibility of
  representatives of the various computer sites to impose such
  additional constraints and provide such protocol as necessary for
  users at one site to use resources at foreign sites.  This paper
  details the decisions that have been made and the considerations
  behind these decisions.

  Several people deserve acknowledgement in this effort.  J. Rulifson
  and W. Duvall of SRI participated in the early design effort of the
  protocol and in the discussions of NIL.  G. Deloche of Thompson-CSF
  participated in the design effort while he was at UCLA and provided
  considerable documentation.  J. Curry of Utah and P. Rovner of
  Lincoln Laboratory reviewed the early design and NIL.  W. Crowther of
  Bolt, Beranek and Newman, contributed the idea of a virtual net.  The
  BB&N staff provided substantial assistance and guidance while
  delivering the network.



Crocker, et. al.                                                [Page 2]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  We have found that, in the process of connecting machines and
  operating systems together, a great deal of rapport has been
  established between personnel at the various network node sites.  The
  resulting mixture of ideas, discussions, disagreements, and
  resolutions has been highly refreshing and beneficial to all
  involved, and we regard the human interaction as a valuable by-
  product of the main effect.

THE NETWORK AS SEEN BY THE HOSTS

  Before going on to discuss operating system communication protocol,
  some definitions are needed.

     A HOST is a computer system which is a part of the network,

     An IMP (Interface Message Processor) is a Honeywell DDP-516
     computer which interfaces with up to four HOSTs at a particular
     site, and allows HOSTs access into the network.  The configuration
     of the initial four-HOST network is given in figure 1.  The IMPs
     from a store-and-forward communications network.  A companion
     paper in these proceedings covers the IMPs in some detail. [3]

  A message is a bit stream less than 8096 bits long which is given to
  an IMP by a HOST for transmission to another HOST.  The first 32 bits
  of the message are the leader.  The leader contains the following
  information:

     (a) HOST
     (b) Message Type
     (c) Flags
     (d) Link Number

  When a message is transmitted from a HOST to its IMP, the HOST field
  of the leader names the receiving HOST.  When the message arrives at
  the receiving HOST, the HOST field names the sending HOST.

  Only two message types are of concern in this paper.  Regular
  messages are generated by a HOST and sent to its IMP for transmission
  to a foreign HOST.  The other message type of interest is a RFNM
  (Request-for-Next-Message).  RFNM's are explained in conjunction with
  links.

  The flag field of the leader controls special cases not of concern
  here.







Crocker, et. al.                                                [Page 3]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  The link number identifies over which of 256 logical paths (links)
  between the sending HOST and the receiving HOST the message will be
  sent.  Each link is unidirectional and is controlled by the network
  so that no more than one message at a time may be sent over it.  This
  control is implemented using RFNM messages.  After a sending HOST has
  sent a message to a receiving HOST over a particular link, the
  sending HOST is prohibited from sending another message over that
  same link until the sending HOST receives a RFMN.  The RFNM is
  generated by the IMP connected to the receiving HOST, and the RFNM is
  sent back to the sending HOST after the message has entered the
  receiving HOST.  It is important to remember that there are 356 links
  in each direction and that no relationship among these is imposed by
  the network.

  The purpose of the link and RFMN mechanism is to prohibit individual
  users from overloading an IMP or a HOST.  Implicit in this purpose is
  the assumption that a user does not use multiple links to achieve a
  wide band, and to a large extent the HOST-HOST protocol cooperates
  with this assumption.  An even more basic assumption, of course, is
  that the network's load comes from some users transmitting sequences
  of messages rather than many users transmitting single messages
  coincidently.

  In order to delimit the length of the message, and to make it easier
  for HOSTs of differing word lengths to communicate, the following
  formatting procedure is used.  When a HOST prepares a message for
  output, it creates a 32-bit leader.  Following the leader is a binary
  string, called marking, consisting of an arbitrary number of zeros,
  followed by one.  Marking makes is possible for the sending HOST to
  synchronize the beginning of the text message with its word
  boundaries.  When the last bit of a message has entered an IMP, the
  hardware interface between the IMP and HOST appends a one followed by
  enough zeros to make the message length a multiple of 16 bits.  These
  appended bits are called padding.  Except for the marking and
  padding, no limitations are placed on the text of a message.  Figure
  2 shows a typical message sent by a 24-bit machine.

DESIGN CONCEPTS

  The computers participating in the network are alike in two important
  respects: each supports research independent of the network, and each
  is under the discipline of a time-sharing system.  These facts
  contributed to the following design philosophy.

  First, because the computers in the network have independent purposes
  it is necessary to preserve decentralized administrative control of
  the various computers.  Since all of the time-sharing supervisors
  possess elaborate and definite accounting and resource allocation



Crocker, et. al.                                                [Page 4]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  mechanisms, we arranged matters so that these mechanisms would
  control the load due to the network in the same way that they control
  locally generated load.

  Second, because the computers are all operated under time-sharing
  disciplines, it seemed desirable to facilitate basic interactive
  mechanisms.

  Third, because this network is used by experienced programmers it was
  imperative to provide the widest latitude in using the network.
  Restrictions concerning character sets, programming languages, etc.
  would not be tolerated and we avoided such restrictions.

  Fourth, again because the network is used by experienced programmers,
  it was felt necessary to leave the design open-ended.  We expect that
  conventions will arise from time to time as experience is gained, but
  we felt constrained not to impose them arbitrarily.

  Fifth, in order to make network participation comfortable, or in some
  cases, feasible, the software interface to the network should require
  minimal surgery on the HOST operating system.

  Finally, we except the assumption stated above that network use
  consists of prolonged conversations instead of one-shot requests.

  These considerations led to the notions of connections, a Network
  Control Program, a control link, control commands, sockets, and
  virtual nets.

  A connection is an extension of a link.  A connection connects two
  processes so that output from one process is input to the other.
  Connections are simplex, so two connections are needed if two
  processes are to converse in both directions.

  Processes within a HOST communicate with the network through a
  Network Control Program (NCP).  In most HOSTs, the NCP will be a part
  of the executive, so that processes will use system calls to
  communicate with it.  The primary function of the NCP is to establish
  connections, break connections, switch connections, and control flow.

  In order to accomplish its tasks, a NCP in one HOST must communicate
  with a NCP in another HOST.  To this end, a particular link between
  each pair of HOSTs has been designated as the control link.  Messages
  received over the control link are always interpreted by the NCP as a
  sequence of one or more control commands.  As an example, one of the
  kinds of control commands is used to assign a link and initiate a





Crocker, et. al.                                                [Page 5]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  connection, while another kind carries notification that a connection
  has been terminated.  A partial sketch of the syntax and semantics of
  control commands is given in the next section.

  A major issue is how to refer to processes in a foreign HOST.  Each
  HOST has some internal naming scheme, but these various schemes often
  are incompatible.  Since it is not practical to impose a common
  internal process naming scheme, an intermediate name space was
  created with a separate portion of the name space given to each HOST.
  It is left to each HOST to map internal process identifiers into its
  name space.

  The elements of the name space are called sockets.  A socket forms
  one end of a connection, and a connection is fully specified by a
  pair of sockets.  A socket is specified by the concatenation of three
  numbers:

     (a) a user number (24 bits)
     (b) a HOST number (8 bits)
     (c) AEN (8 bits)

  A typical socket is illustrated in Figure 3.

  Each HOST is assigned all sockets in the name space which have field
  (b) equal to the HOST's own identification.

  A socket is either a receive socket or a send socket, and is so
  marked by the lower-order bit of the AEN (0 = receive, 1 = send).
  The other seven bits of the AEN simply provide a sizable population
  of sockets for each used number at each HOST.  (AEN stands for
  "another eight-bit number")

  Each user is assigned a 24-bit user number which uniquely identifies
  him throughout the network.  Generally this will be the 8-bit HOST
  number of his home HOST, followed by 16 bits which uniquely identify
  him at that HOST.  Provision can also be made for a user to have a
  user number not keyed to a particular HOST, an arrangement desirable
  for mobile users who might have no home HOST or more than one home
  HOST.  This 24-bit user number is then used in the following manner.
  When a user signs onto a HOST, his user number is looked up.
  Thereafter, each process the user creates is tagged with his user
  number.  When the user signs onto a foreign HOST via the network, his
  same user number is used to tag processes he creates in that HOST.
  The foreign HOST obtains the user number either by consulting a table
  at login time, as the home HOST does, or by noticing the
  identification of the caller.  The effect of propagating the user's
  number is that each user creates his own virtual net consisting of
  processes he has created.  This virtual net may span an arbitrary



Crocker, et. al.                                                [Page 6]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  number of HOSTs.  It will thus be easy for a user to connect his
  processes in arbitrary ways, while still permitting him to connect
  his processes with those in other virtual nets.

  The relationship between sockets and processes is now describable
  (see Figure 4).  For each user number at each HOST, there are 128
  send sockets and 128 receive sockets.  A process may request from the
  local NCP the use of any one of the sockets with the same user
  number; the request is granted if the socket is not otherwise in use.
  The key observation here is that a socket requested by a process
  cannot already be in use unless it is by some other process within
  the same virtual net, and such a process is controlled by the same
  user.

  An unusual aspect of the HOST-HOST protocol is that a process may
  switch its end of a connection from one socket to another.  The new
  socket may be in any virtual net and at any HOST, and the process may
  initiate a switch either at the time the connection is being
  established, or later.  The most general forms of switching entail
  quite complex implementation, and are not germane to the rest of this
  paper, so only a limited form will be explained.  This limited form
  of switching provides only that a process may substitute one socket
  for another while establishing a connection.  The new socket must
  have the same user number and HOST number, and the connection is
  still established to the same process.  This form of switching is
  thus only a way of relabelling a socket, for no charge in the routing
  of messages takes place.  In the next section we document the system
  calls and control commands; in the section after next, we consider
  how login might be implemented.

SYSTEM CALLS AND CONTROL COMMANDS

  Here we sketch the mechanisms of establishing, switching and breaking
  a connection.  As noted above, the NCP interacts with user processes
  via system calls and with other NCPs via control commands.  We
  therefore begin with a partial description of system calls and
  control commands.

  System calls will vary from one operating system to another, so the
  following description is only suggestive.  We assume here that a
  process has several input-output paths which we will call ports.
  Each port may be connected to a sequential I/O device, and while
  connected, transmits information in only one direction.  We further
  assume that the process is blocked (dismissed, slept) while
  transmission proceeds.  The following is the list of system calls:






Crocker, et. al.                                                [Page 7]

RFC 33                   New HOST-HOST Protocol         12 February 1970


           Init      <port>, <AEN 1>, <AEN 2>, <foreign socket>

     where <port> is part of the process issuing the Init
                    _
           <AEN 1>   |
     and             +- are 8-bit AEN's (see Figure 2)
           <AEN 2>   |
                    _|

           The first AEN is used to initiate the connection; the second
           is used while the connection exists.

           <foreign socket> is the 40-bit socket name of the distant
           end of the connection.

           The lower-order bits of <AEN 1> and <AEN 2> must agree, and
           these must be the complement of the lower-order bit of
           <foreign socket>.

           The NCP concatenates <AEN 1> and <AEN 2> each with the user
           number of the process and the HOST number to form 40-bit
           sockets.  It then sends a Request for Connection (RFC)
           control command to the distant NCP.  When the distant NCP
           responds positively, the connection is established and the
           process is unblocked.  If the distant NCP responds
           negatively, the local NCP unblocks the requesting process,
           but informs it that the system call has failed.

           Listen <port>, <AEN 1>

     where <port> and <AEN 1> are as above.  The NCP retains the ports
           and <AEN 1> and blocks the process.  When an RFC control
           command arrives naming the local socket, the process is
           unblocked and notified that a foreign process is calling.

           Accept <AEN 2>

           After a Listen has been satisfied, the process may either
           refuse the call or accept it and switch it to another
           socket.  To accept the call, the process issues the Accept
           system call.  The NCP then sends back an RFC control
           command.

           Close <port>

           After establishing a connection, a process issues a Close to
           break the connection.  The Close is also issued after a
           Listen to refuse a call.



Crocker, et. al.                                                [Page 8]

RFC 33                   New HOST-HOST Protocol         12 February 1970


           Transmit <port>, <addr>

           If <port> is attached to a send socket, <addr> points to a
           message to be sent.  This message is preceded by its length
           in bits.

           If <port> is attached to a receive socket, a message is
           stored at <addr>.  The length of the message is stored
           first.

Control Commands

  A vocabulary of control commands has been defined for communication
  between Network Control Programs.  Each control command consists of
  an 8-bit operation code to indicate its function, followed by some
  parameters.  The number and format of parameters is fixed for each
  operation code.  A sequence of control commands destined for a
  particular HOST can be packed into a single control message.

     RFC   <my socket 1>, <my socket 2>.

           <your socket>, (<link>)

  This command is sent because a process has executed either an Init
  system call or an Accept system call.  A link is assigned by the
  prospective receiver, so it is omitted if <my socket 1> is a send
  socket.

  There is distinct advantage in using the same commands both to
  initiate a connection (Init) and to accept a call (Accept).  If the
  responding command were different from the initiating command, then
  two processes could call each other and become blocked waiting for
  each other to respond.  With this scheme, no deadlock occurs and it
  provides a more compact way to connect a set of processes.

     CLS      <my socket>, <your socket>

  The specified connection is terminated

     CEASE    <link>

  When the receiving process does not consume its input as fast as it
  arrives, the buffer space in the receiving HOST is used to queue the
  waiting messages.  Since only limited space is generally available,
  the receiving HOST may need to inhibit the sending HOST from sending
  any more messages over the offending connection.  When the sending
  HOST receives this command, it may block the process generating the
  messages.



Crocker, et. al.                                                [Page 9]

RFC 33                   New HOST-HOST Protocol         12 February 1970


     RESUME   <link>

  This command is also sent from the receiving HOST to the sending HOST
  and negates a previous CEASE.

LOGGING IN

  We assume that within each HOST there is always a process in
  execution which listens to login requests.  We call this process the
  logger, and it is part of a special virtual net whose user number is
  zero.  The logger is programmed to listen to calls on socket number
  0.  Upon receiving a call, the logger switches it to a higher (even)
  numbered sockets, and returns a call to the socket numbered one less
  than the send socket originally calling.  In this fashion, the logger
  can initiate 127 conversations.

  To illustrate, assume a user whose identification is X'010005' (user
  number 5 at UCLA) signs into UCLA, starts up one of his programs, and
  this program wants to start a process at SRI.  No process except the
  logger is currently willing to listen to our user, so he executes

        Init, <port> = 1, <AEN 1> = 7, <AEN 2> = 7,

              <foreign socket> = 0

  His process is blocked, and the NCP at UCLA sends

        RFC   <my socket 1> = X'0100050107',

              <my socket 2> = X'0100050107',

              <your socket> = X'000000200'

  The logger at SRI is notified when this message is received, because
  it has previously executed

        Listen   <port> = 9, <AEN 1> = 0.

  The logger then executes

        Accept   <AEN 2> = 88.










Crocker, et. al.                                               [Page 10]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  In response to the Accept, the SRI NCP sends

        RFC   <my socket 1> = X'0000000200'

              <my socket 2> = X'0000000258'

              <your socket> = X'0100050107'

              <link> = 37

  where the link has been chosen from the set of available links.  The
  SRI logger than executes

        Init     <port> = 10

              <AEN 1> = 89, <AEN 2> = 89,

              <foreign socket> = X'0100050106'

  which causes the NCP to send

        RFC   <my socket 1> = X'0000000259'

              <my socket 2> = x'0000000259'

              <your socket> = X'0100050106'

  The process at UCLA is unblocked and notified of the successful Init.
  Because SRI logger always initiates a connection to the AEN one less
  than it has just been connected to, the UCLA process then executes

        Listen   <port> = 11

              <AEN 1> = 6

  and when unblocked

        Accept   <AEN 2> = 6

  When these transactions are complete, the UCLA process is doubly
  connected to the logger at SRI.  The logger will then interrogate the
  UCLA process, and if satisfied, create a new process at SRI.  This
  new process will be tagged with user number X'010005', and both
  connections wil be switched to the new process.  In this case,
  switching the connections to the new process corresponds to "passing
  the console down" in many time-sharing systems.





Crocker, et. al.                                               [Page 11]

RFC 33                   New HOST-HOST Protocol         12 February 1970


USER LEVEL SOFTWARE

  At the user level, subroutines which manage data buffer and format
  input designed for other HOSTs are provided.  It is not mandatory
  that the user use such subroutines, since the user has access to the
  network system calls in his monitor.

  In addition to user programming access, it is desirable to have a
  subsystem program at each HOST which makes the network immediately
  accessible from a teletype-like device without special programming.
  Subsystems are commonly used system components such as text editors,
  compilers and interpreters.  An example of a network-related
  subsystem is TELNET, which will allow users at the University of Utah
  to connect to Stanford Research Institute and appear as regular
  terminal users.  It is expected that more sophisticated subsystems
  will be developed in time, but this basic one will render the early
  network immediately useful.

  A user at the University of Utah (UTAH) is sitting at a teletype
  dialed into the University's PDP-10/50 time-sharing system.  He
  wishes to operate the Conversational Algebraic Language (CAL)
  subsystem on the XDS-940 at Stanford Research Institute (SRI) in
  Menlo Park, California.  A typical TELNET dialog is illustrated in
  Figure 5.  The meaning of each line of dialogue is discussed here.

     (i)      The user signs in at UTAH

     (ii)     The PDP-10 run command starts up the TELNET subsystem at
              the user's HOST.

     (111)    The user identifies a break character which causes any
              message following the break to be interpreted locally
              rather than being sent on the foreign HOST.

     (iv)     The TELNET subsystem will make the appropriate system
              calls to establish a pair of connections to the SRI
              logger.  The connections will be established only if SRI
              accepts another foreign user.

  The UTAH user is now in the pre-logged-in state at SRI.  This is
  analogous to the standard teletype user's state after dialing into a
  computer and making a connection but before typing anything.

     (v)      The user signs in to SRI with a standard login command.
              Characters typed on the user's teletype are transmitted
              unaltered through the PDP-10 (user HOST) and on to the
              940 (serving HOST).  The PDP-10 TELNET will have
              automatically switched to full-duplex, character-by-



Crocker, et. al.                                               [Page 12]

RFC 33                   New HOST-HOST Protocol         12 February 1970


              character transmission, since this is required by SRI's
              940.  Full duplex operation is allowed for by the PDP-10,
              though not used by most Digital Equipment Corporations
              subsystems.

     (vi) and (vii) The 940 subsystem, CAL, is started.

  At this point, the user wishes to load a local CAL file into the 940
  CAL subsystem, from the file system on his local PDP-10.

     (viii)   CAL is instructed to establish a connection to UTAH in
              order to receive this file.  "NETWRK" is a predefined 940
              name similar in nature to "PAPER TYPE" or "TELETYPE".

     (ix)     Finally, the user types the break character (#) followed
              by a command to his PDP-10 TELNET program, which sends
              the desired file to SRI from Utah on the connection just
              established for this purpose.  The user's next statement
              is in CAL again.

  The TELNET subsystem coding should be minimal for it is essentially a
  shell program built over the network system calls.  It effectively
  established a shunt in the user HOST between the remote user and a
  distant serving HOST.

  Given the basic system primitives, the TELNET subsystem at the user
  HOST and a manual for the serving HOST, the network can be profitably
  employed by remote users today.

HIGHER LEVEL PROTOCOL

  The network poses special problems where a high degree of interaction
  is required between the user and a particular subsystem in a foreign
  HOST.  These problems arise due to heterogeneous consoles, local
  operating systems overhead, and network transmission delays.  Unless
  we use special strategies it may be difficult or even impossible for
  a distant user to make use of the more sophisticated subsystems
  offered.  While these difficulties are especially severe in the area
  of graphics, problems may arise even for teletype interaction.  For
  example, suppose that a foreign subsystem is designed for teletype
  consoles connected by telephone, and then this subsystem becomes
  available to network users.  This subsystem might have the following
  characteristics.

     1. Except for echoing and correction of mistyping, no action is
        taken until a carriage return is typed.





Crocker, et. al.                                               [Page 13]

RFC 33                   New HOST-HOST Protocol         12 February 1970


     2. All characters except "^", and "<-" and carriage returns are
        echoed as the character is typed.

     3. <- causes deletion of the immediately preceding character, and
        is echoed as that character.

     4. ^ causes all previously typed characters to be ignored.  A
        carriage return and line feed are echoed.

     5. A carriage return is echoed as a carriage return followed by a
        line feed.

  If each character typed is sent in its own message, then the
  characters

     H E L L O <- <- P c.r.

  cause nine messages in each direction.  Furthermore, each character
  is handled by a user level program in the local HOST before being
  sent to the foreign HOST.

  Now it is clear that if this particular example were important, we
  would quickly implement rules 1 to 5 in a local HOST program and send
  only complete lines to the foreign HOST.  If the foreign HOST program
  could not be modified so as to not generate echoes, then the local
  program could not only echo properly, it could also throw away the
  later echoes from the foreign HOST.  However, the problem is not any
  particular interaction scheme; the problem is that we expect many of
  these kinds of schemes to occur.  We have not found any general
  solutions to these problems, but some observations and conjectures
  may lead the way.

  With respect to heterogeneous consoles, we note that although
  consoles are rarely compatible, many are equivalent.  It is probably
  reasonable to treat a model 37 teletype as the equivalent of an IBM
  2741.  Similarly, most storage scopes will form an equivalence class,
  and most refresh display scopes will form another.  Furthermore, a
  hierarchy might emerge with members of one class usable in place of
  those in another, but not vice versa.  We can imagine that any scope
  might be an adequate substitute for a teletype, but hardly the
  reverse.  This observation leads us to wonder if a network-wide
  language for consoles might be possible.  Such a language would
  provide for distinct treatment of different classes of consoles, with
  semantics appropriate to each class.  Each site could then write
  interface programs for its consoles to make them look like network
  standard devices.





Crocker, et. al.                                               [Page 14]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  Another observation is that a user evaluates an interactive system by
  comparing the speed of the system's responses with his own
  expectations.  Sometimes a user feels that he has made only a minor
  request, so the response should be immediate; at other times he feels
  he has made a substantial request, and is therefore willing to wait
  for the response.  Some interactive subsystems are especially
  pleasant to use because a great deal of work has gone into tailoring
  the responses to the user's expectations.  In the network, however, a
  local user level process intervenes between a local console and a
  foreign subsystem, and we may expect the response time for minor
  requests to degrade.  Now it may happen that all of this tailoring of
  the interaction is fairly independent of the portion of the subsystem
  which does the heavy computing or I/O.  In such a case, it may be
  possible to separate a subsystem into two sections.  One section
  would be a "front end" which formats output to the user, accepts his
  input, and controls computationally simple responses such as echoes.
  In the example above, the program to accumulate a line and generate
  echoes would be the front end of some subsystem.  We now take notice
  of the fact that the local HOSTs have substantial computational
  power, but our current designs make use of the local HOST only as a
  data concentrator.  This is somewhat ironic, for the local HOST is
  not only poorly utilized as a data concentrator, it also degrades
  performance because of the delays it introduces.

  These arguments have led us to consider the possibility of a Network
  Interface Language (NIL) which would be a network-wide language for
  writing the front end of interactive subsystems.  This language would
  have the feature that subprograms communicate through network-like
  connections.  The strategy is then to transport the source code for
  the front end of a subsystem to the local HOST, where it would be
  compiled and executed.

  During preliminary discussions we have agreed that NIL should have at
  least the following semantic properties not generally found in other
  languages.

     1. Concurrency.  Because messages arrive asynchronously on
        different connections, and because user input is not
        synchronized with subsystem output, NIL must include semantics
        to accurately model the possible concurrencies.

     2. Program Concatenation.  It is very useful to be able to insert
        a program in between two other programs.  To achieve this, the
        interconnection of programs would be specified at run time and
        would not be implicit in the source code.






Crocker, et. al.                                               [Page 15]

RFC 33                   New HOST-HOST Protocol         12 February 1970


     3. Device substitutability.  It is usual to define languages so
        that one device may be substituted for another.  The
        requirement here is that any device can be modeled by a NIL
        program.  For example, if a network standard display controller
        manipulates tree-structures according to messages sent to it
        then these structures must be easily implementable in NIL.

  NIL has not been fully specified, and reservations have been
  expressed about its usefulness.  These reservations hinge upon our
  conjecture that it is possible to divide an interactive system into a
  transportable front end which satisfies a user's expectations at low
  cost and a more substantial stay-at-home section.  If our conjecture
  is false, then NIL will not be useful; otherwise it seems worth
  pursuing.  Testing of this conjecture and further development of NIL
  will take priority after low level HOST-HOST protocol has stabilized.

HOST/IMP INTERFACING

  The hardware and software interfaces between HOST and IMP is an area
  of particular concern for the HOST organizations.  Considering the
  diversity of HOST computers to which a standard IMP must connect, the
  hardware interface was made bit serial and full-duplex.  Each HOST
  organization implements its half of this very simple interface.

  The software interface is equally simple and consists of messages
  passed back and forth between the IMP and HOST programs.  Special
  error and signal messages are defined as well as messages containing
  normal data.  Messages waiting in queues in either machine are sent
  at the pleasure of the machine in which they reside with no concern
  for the needs of the other computer.

  The effect of the present software interface is the needless
  rebuffering of all messages in the HOST in addition to the buffering
  in the IMP.  The messages have no particular order other than arrival
  times at the IMP.  The Network Control Program at one HOST (e.g.,
  UTAH) needs waiting RFNM's before all other messages.  At another
  site (e.g., SRI), the NCP could benefit by receiving messages for the
  user who is next to be run.

  What is needed is coding representing the specific needs of the HOST
  on both sides of the interface to make intelligent decisions about
  what to transmit next over the channel.  With the present software
  interface, the channel in one direction once committed to a
  particular message is then locked up for up to 80 milliseconds!  This
  approaches one teletype character time and needlessly limits full-
  duplex, character by character, interactions over the net.  At the
  very least, the IMP/HOST protocol should be expended to permit each
  side to assist the other in scheduling messages over the channels.



Crocker, et. al.                                               [Page 16]

RFC 33                   New HOST-HOST Protocol         12 February 1970


CONCLUSIONS

  At this time (February 1970) the initial network of four sites is
  just beginning to be utilized.  The communications system of four
  IMPs and wide band telephone lines have been operational for two
  months.  Programmers at UCLA have signed in as users of the SRI 940.
  More significantly, one of the authors (S. Carr) living in Palo Alto
  uses the Salt Lake PDP-10 on a daily basis by first connecting to
  SRI.  We thus have first hand experience that remote interaction is
  possible and is highly effective.

  Work on the ARPA network has generated new areas of interest.  NIL is
  one example, and interprocess communication is another.  Interprocess
  communication over the network is a subcase of general interprocess
  communication in a multiprogrammed environment.  The mechanism of
  connections seems to be new, and we wonder whether this mechanism is
  useful even when the processes are within the same computer.

REFERENCES

  1     L. ROBERTS
        "The ARPA network"
        Invitational Workshop on Networks of Computers Proceedings
        National Security Agency 1968 p 115 ff

  2.    R M RUTLEDGE et al
        "An interactive network of time-sharing computers"
        Proceedings of the 24th National Conference
        Association for Computing Machinery 1969 p 431 ff

  3.    F E HEART  R E KAHN  S M ORNSTEIN  W R CROWTHER
        D C WALDEN
        "The interface message processors for the ARPA network"
        These Proceedings

LIST OF FIGURES

  Figure 1  Initial network configuration

  Figure 2  A typical message from a 24-bit machine

  Figure 3  A typical socket

  Figure 4  The relationship between sockets and processes

  Figure 5  A typical TELNET dialog.

            Underlined characters are those types by the user.



Crocker, et. al.                                               [Page 17]

RFC 33                   New HOST-HOST Protocol         12 February 1970


                                SRI
                               _____
                              /     \
                             |  XDS  |
                             |  940  |
                              \_____/
                                 |
                           +----------+
                           |    IMP   |
                           +----------+
                            /   |    \
                           /    |     \
                          /     |      \  +----+    _____
                         /      |       \ | I  |   /     \
      ______     +----+ /       |        \| M  |--|  DEC  |
     /      \    | I  |/        |         | P  |  | PDP-10|
    |   IBM  |---| M  |         |         +----+   \_____/
    | 360/75 |   | P  |\        |
     \______/    +----+ \       |                    UTAH
                         \      |
       UCSB               \     |
                         +----------+
                         |    IMP   |
                         +----------+
                             |
                          ___|___
                         /       \
                        |   XDS   |
                        |(sigma)-7|
                         \_______/

                           UCLA

  Figure 1 Initial network configuration

















Crocker, et. al.                                               [Page 18]

RFC 33                   New HOST-HOST Protocol         12 February 1970


  |<------------ 24bits ----------->|
  |                                 |
  +---------------------------------+
  |                                 |
  |        Leader (32 bits)         |
  |               __________________|
  |              | 100 ---    ----0 |<----16 bits of marking
  +--------------+------------------+
  |                                 |
  |                                 |
  |   Text of messages (96 bits)    |
  |                                 |
  +------------------------+--------+
  | 100-----          ----0|
  +-------^----------------+
          |
          |______16 bits of padding added
                 by the interface

  Figure 2  A typical message from a 24-bit machine



         24                    8          8
  +----------------------+-----------+----------+
  |  User Number         |           |          |
  +----------------------+-----------+----------+
                               |          |___AEN
                               |
                               |___HOST number
  Figure 3 A typical socket



             |<--- connection --->|
  +---------+                      +---------+
  |         |        link          |         |
  | process |--(|--------------|)--| process |
  |         |   ^              ^   |         |
  +---------+   |              |   +---------+
                |              |
            send socket    receive socket

  Figure 4 The relationship between sockets and processes

        [ This RFC was put into machine readable form for entry ]
         [ into the online RFC archives by Lorrie Shiota 08/00]




Crocker, et. al.                                               [Page 19]