Network Working Group                                          K. Nichols
Request for Comments: 2638                                    V. Jacobson
Category: Informational                                             Cisco
                                                                L. Zhang
                                                                    UCLA
                                                               July 1999


   A Two-bit Differentiated Services Architecture for the Internet

Status of this Memo

  This memo provides information for the Internet community.  It does
  not specify an Internet standard of any kind.  Distribution of this
  memo is unlimited.

Copyright Notice

  Copyright (C) The Internet Society (1999).  All Rights Reserved.

Abstract

  This document was originally submitted as an internet draft in
  November of 1997. As one of the documents predating the formation of
  the IETF's Differentiated Services Working Group, many of the ideas
  presented here, in concert with Dave Clark's subsequent presentation
  to the December 1997 meeting of the IETF Integrated Services Working
  Group, were key to the work which led to RFCs 2474 and 2475 and the
  section on allocation remains a timely proposal. For this reason, and
  to provide a reference, it is being submitted in its original form.
  The forwarding path portion of this document is intended as a record
  of where we were at in late 1997 and not as an indication of future
  direction.

  The postscript version of this document includes Clark's slides as an
  appendix. The postscript version of this document also includes many
  figures that aid greatly in its readability.

1. Introduction

  This document presents a differentiated services architecture for the
  internet. Dave Clark and Van Jacobson each presented work on
  differentiated services at the Munich IETF meeting [2,3]. Each
  explained how to use one bit of the IP header to deliver a new kind
  of service to packets in the internet. These were two very different
  kinds of service with quite different policy assumptions. Ensuing
  discussion has convinced us that both service types have merit and
  that both service types can be implemented with a set of very similar



Nichols, et al.              Informational                      [Page 1]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  mechanisms. We propose an architectural framework that permits the
  use of both of these service types and exploits their similarities in
  forwarding path mechanisms. The major goals of this architecture are
  each shared with one or both of those two proposals: keep the
  forwarding path simple, push complexity to the edges of the network
  to the extent possible, provide a service that avoids assumptions
  about the type of traffic using it, employ an allocation policy that
  will be compatible with both long-term and short-term provisioning,
  make it possible for the dominant Internet traffic model to remain
  best-effort.

  The major contributions of this document are to present two distinct
  service types, a set of general mechanisms for the forwarding path
  that can be used to implement a range of differentiated services and
  to propose a flexible framework for provisioning a differentiated
  services network. It is precisely this kind of architecture that is
  needed for expedient deployment of differentiated services: we need a
  framework and set of primitives that can be implemented in the
  short-term and provide interoperable services, yet can provide a
  "sandbox" for experimentation and elaboration that can lead in time
  to more levels of differentiation within each service as needed.

  At the risk of belaboring an analogy, we are motivated to provide
  services tiers in somewhat the same fashion as the airlines do with
  first class, business class and coach class. The latter also has
  tiering built in due to the various restrictions put on the purchase.
  A part of the analogy we want to stress is that best effort traffic,
  like coach class seats on an airplane, is still expected to make up
  the bulk of internet traffic. Business and first class carry a small
  number of passengers, but are quite important to the economics of the
  airline industry. The various economic forces and realities combine
  to dictate the relative allocation of the seats and to try to fill
  the airplane. We don't expect that differentiated services will
  comprise all the traffic on the internet, but we do expect that new
  services will lead to a healthy economic and service environment.

  This document is organized into sections describing service
  architecture, mechanisms, the bandwidth allocation architecture, how
  this architecture might interoperate with RSVP/int-serv work, and
  gives recommendations for deployment.











Nichols, et al.              Informational                      [Page 2]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


2. Architecture

2.1 Background

  The current internet delivers one type of service, best-effort, to
  all traffic. A number of proposals have been made concerning the
  addition of enhanced services to the Internet. We focus on two
  particular methods of adding a differentiated level of service to IP,
  each designated by one bit [1,2,3]. These services represent a
  radical departure from the Internet's traditional service, but they
  are also a radical departure from traditional "quality of service"
  architectures which rely on circuit-based models. Both these
  proposals seek to define a single common mechanism that is used by
  interior network routers, pushing most of the complexity and state of
  differentiated services to the network edges. Both use bandwidth as
  the resource that is being requested and allocated. Clark and
  Wroclawski defined an "Assured" service that follows "expected
  capacity" usage profiles that are statistically provisioned [3]. The
  assurance that the user of such a service receives is that such
  traffic is unlikely to be dropped as long as it stays within the
  expected capacity profile. The exact meaning of "unlikely" depends on
  how well provisioned the service is. An Assured service traffic flow
  may exceed its Profile, but the excess traffic is not given the same
  assurance level. Jacobson defined a "Premium" service that is
  provisioned according to peak capacity Profiles that are strictly not
  oversubscribed and that is given its own high-priority queue in
  routers [2]. A Premium service traffic flow is shaped and hard-
  limited to its provisioned peak rate and shaped so that bursts are
  not injected into the network. Premium service presents a "virtual
  wire" where a flow's bursts may queue at the shaper at the edge of
  the network, but thereafter only in proportion to the indegree of
  each router. Despite their many similarities, these two approaches
  result in fundamentally different services. The former uses buffer
  management to provide a "better effort" service while the latter
  creates a service with little jitter and queueing delay and no need
  for queue management on the Premium packets's queue.

  An Assured service was introduced in [3] by Clark and Wroclawski,
  though we have made some alterations in its specification for our
  architecture. Further refinements and an "Expected Capacity"
  framework are given in Clark and Fang [10].  This framework is
  focused on "providing different levels of best-effort service at
  times of network congestion" but also mentions that it is possible to
  have a separate router queue to implement a "guaranteed" level of
  assurance.  We believe this framework and our Two-bit architecture
  are compatible but this needs further exploration.  As Premium
  service has not been documented elsewhere, we describe it next and
  follow this with a description of the two-bit architecture.



Nichols, et al.              Informational                      [Page 3]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


2.2 Premium service

  In [2], a Premium service was presented that is fundamentally
  different from the Internet's current best effort service. This
  service is not meant to replace best effort but primarily to meet an
  emerging demand for a commercial service that can share the network
  with best effort traffic. This is desirable economically, since the
  same network can be used for both kinds of traffic. It is expected
  that Premium traffic would be allocated a small percentage of the
  total network capacity, but that it would be priced much higher. One
  use of such a service might be to create "virtual leased lines",
  saving the cost of building and maintaining a separate network.
  Premium service, not unlike a standard telephone line, is a capacity
  which the customer expects to be there when the receiver is lifted,
  although it may, depending on the household, be idle a good deal of
  the time.  Provisioning Premium traffic in this way reduces the
  capacity of the best effort internet by the amount of Premium
  allocated, in the worst case, thus it would have to be priced
  accordingly. On the other hand, whenever that capacity is not being
  used it is available to best effort traffic. In contrast to normal
  best effort traffic which is bursty and requires queue management to
  deal fairly with congestive episodes, this Premium service by design
  creates very regular traffic patterns and small or nonexistent
  queues.

  Premium service levels are specified as a desired peak bit-rate for a
  specific flow (or aggregation of flows). The user contract with the
  network is not to exceed the peak rate. The network contract is that
  the contracted bandwidth will be available when traffic is sent.
  First-hop routers (or other edge devices) filter the packets entering
  the network, set the Premium bit of those that match a Premium
  service specification, and perform traffic shaping on the flow that
  smooths all traffic bursts before they enter the network. This
  approach requires no changes in hosts. A compliant router along the
  path needs two levels of priority queueing, sending all packets with
  the Premium bit set first. Best-effort traffic is unmarked and queued
  and sent at the lower priority. This results in two "virtual
  networks": one which is identical to today's Internet with buffers
  designed to absorb traffic bursts; and one where traffic is limited
  and shaped to a contracted peak-rate, but packets move through a
  network of queues where they experience almost no queueing delay.

  In this architecture, forwarding path decisions are made separately
  and more simply than the setting up of the service agreements and
  traffic profiles. With the exception of policing and shaping at
  administrative or "trust" boundaries, the only actions that need to
  be handled in the forwarding path are to classify a packet into one
  of two queues on a single bit and to service the two queues using



Nichols, et al.              Informational                      [Page 4]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  simple priority. Shaping must include both rate and burst parameters;
  the latter is expected to be small, in the one or two packet range.
  Policing at boundaries enforces rate compliance, and may be
  implemented by a simple token bucket. The admission and set-up
  procedures are expected to evolve, in time, to be dynamically
  configurable and fairly complex while the mechanisms in the
  forwarding path remain simple.

  A Premium service built on this architecture can be deployed in a
  useful way once the forwarding path mechanisms are in place by making
  static allocations. Traffic flows can be designated for special
  treatment through network management configuration. Traffic flows
  should be designated by the source, the destination, or any
  combination of fields in the packet header. First-hop (of leaf)
  routers will filter flows on all or part of the header tuple
  consisting of the source IP address, destination IP address, protocol
  identifier, source port number, and destination port number. Based on
  this classification, a first-hop router performs traffic shaping and
  sets the designated Premium bit of the precedence field. End-hosts
  are thus not required to be "differentiated services aware", though
  if and when end-systems become universally "aware", they might do
  their own shaping and first-hop routers merely police.

  Adherence to the subscribed rate and burst size must be enforced at
  the entry to the network, either by the end-system or by the first-
  hop router. Within an intranet, administrative domain, or "trust
  region" the packets can then be classified and serviced solely on the
  Premium bit. Where packets cross a boundary, the policing function is
  critical. The entered region will check the prioritized packet flow
  for conformance to a rate the two regions have agreed upon,
  discarding packets that exceed the rate. It is thus in the best
  interests of a region to ensure conformance to the agreed-upon rate
  at the egress. This requirement means that Premium traffic is burst-
  free and, together with the no oversubscription rule, leads directly
  to the observation that Premium queues can easily be sized to prevent
  the need to drop packets and thus the need for a queue management
  policy. At each router, the largest queue size is related to the in-
  degree of other routers and is thus quite small, on the order of ten
  packets.

  Premium bandwidth allocations must not be oversubscribed as they
  represent a commitment by the network and should be priced
  accordingly. Note that, in this architecture, Premium traffic will
  also experience considerably less delay variation than either best
  effort traffic or the Assured data traffic of [3]. Premium rates
  might be configured on a subscription basis in the near-term, or on-
  demand when dynamic set-up or signaling is available.




Nichols, et al.              Informational                      [Page 5]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  Figure 1 shows how a Premium packet flow is established within a
  particular administrative domain, Company A, and sent across the
  access link to Company A's ISP. Assume that the host's first-hop
  router has been configured to match a flow from the host's IP address
  to a destination IP address that is reached through ISP. A Premium
  flow is configured from a host with a rate which is both smaller than
  the total Premium allocation Company A has from the ISP, r bytes per
  second, and smaller than the amount of that allocation has been
  assigned to other hosts in Company A. Packets are not marked in any
  special way when they leave the host. The first-hop router clears the
  Premium bit on all arriving packets, sets the Premium bit on all
  packets in the designated flow, shapes packets in the Premium flow to
  a configured rate and burst size, queues best-effort unmarked packets
  in the low priority queue and shaped Premium packets in the high
  priority queue, and sends packets from those two queues at simple
  priority. Intermediate routers internal to Company A enqueue packets
  in one of two output queues based on the Premium bit and service the
  queues with simple priority. Border routers perform quite different
  tasks, depending on whether they are processing an egress flow or an
  ingress flow. An egress border router may perform some reshaping on
  the aggregate Premium traffic to conform to rate r, depending on the
  number of Premium flows aggregated. Ingress border routers only need
  to perform a simple policing function that can be implemented with a
  token bucket. In the example, the ISP accepts all Premium packets
  from A as long as the flow does not exceed r bytes per second.

  Figure 1. Premium traffic flow from end-host to organization's ISP

2.3 Two-bit differentiated services architecture

  Clark's and Jacobson's proposals are markedly similar in the location
  and type of functional blocks that are needed to implement them.
  Furthermore, they implement quite different services which are not
  incompatible in a network. The Premium service implements a
  guaranteed peak bandwidth service with negligible queueing delay that
  cannot starve best effort traffic and can be allocated in a fairly
  straightforward fashion. This service would seem to have a strong
  appeal for commercial applications, video broadcasts, voice-over-IP,
  and VPNs. On the other hand, this service may prove both too
  restrictive (in its hard limits) and overdesigned (no overallocation)
  for some applications. The Assured service implements a service that
  has the same delay characteristics as (undropped) best effort packets
  and the firmness of its guarantee depends on how well individual
  links are provisioned for bursts of Assured packets. On the other
  hand, it permits traffic flows to use any additional available
  capacity without penalty and occasional dropped packets for short
  congestive periods may be acceptable to many users. This service
  might be what an ISP would provide to individual customers who are



Nichols, et al.              Informational                      [Page 6]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  willing to pay a bit more for internet service that seems unaffected
  by congestive periods. Both services are only as good as their
  admission control schemes, though this can be more difficult for
  traffic which is not peak-rate allocated.

  There may be some additional benefits of deploying both services. To
  the extent that Premium service is a conservative allocation of
  resources, unused bandwidth that had been allocated to Premium might
  provide some "headroom" for underallocated or burst periods of
  Assured traffic or for best effort. Network elements that deploy both
  services will be performing RED queue management on all non-Premium
  traffic, as suggested in [4], and the effects of mixing the Premium
  streams with best effort might serve to reduce burstiness in the
  latter. A strength of the Assured service is that it allows bursts to
  happen in their natural fashion, but this also makes the
  provisioning, admission control and allocation problem more difficult
  so it may take more time and experimentation before this admission
  policy for this service is completely defined. A Premium service
  could be deployed that employs static allocations on peak rates with
  no statistical sharing.

  As there appear to be a number of advantages to an architecture that
  permits these two types of service and because, as we shall see, they
  can be made to share many of the same mechanisms, we propose
  designating two bit-patterns from the IP header precedence field. We
  leave the explicit designation of these bit-patterns to the standards
  process thus we use the shorthand notation of denoting each pattern
  by a bit, one we will call the Premium or P-bit, the other we call
  the assurance or A-bit. It is possible for a network to implement
  only one of these services and to have network elements that only
  look at the one applicable bit, but we focus on the two service
  architecture. Further, we assume the case where no changes are made
  in the hosts, appropriate packet marking all being done in the
  network, at the first-hop, or leaf, router. We describe the
  forwarding path architecture in this section, assuming that the
  service has been allocated through mechanisms we will discuss in
  section 4.

  In a more general sense, Premium service denotes packets that are
  enqueued at a higher priority than the ordinary best-effort queue.
  Similarly, Assured service denotes packets that are treated
  preferentially with respect to the dropping probability within the
  "normal" queue. There are a number of ways to add more service levels
  within each of these service types [7], but this document takes the
  position of specifying the base-level services of Premium and
  Assured.





Nichols, et al.              Informational                      [Page 7]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  The forwarding path mechanisms can be broken down into those that
  happen at the input interface, before packet forwarding, and those
  that happen at the output interface, after packet forwarding.
  Intermediate routers only need to implement the post packet
  forwarding functions, while leaf and border routers must perform
  functions on arriving packets before forwarding. We describe the
  mechanisms this way for illustration; other ways of composing their
  functions are possible.

  Leaf routers are configured with a traffic profile for a particular
  flow based on its packet header. This functionality has been defined
  by the RSVP Working Group in RFC 2205. Figure 2 shows what happens to
  a packet that arrives at the leaf router, before it is passed to the
  forwarding engine. All arriving packets must have both the A-bit and
  the P-bit cleared after which packets are classified on their header.
  If the header does not match any configured values, it is immediately
  forwarded. Matched flows pass through individual Markers that have
  been configured from the usage profile for that flow: service class
  (Premium or Assured), rate (peak for Premium, "expected" for
  Assured), and permissible burst size (may be optional for Premium).
  Assured flow packets emerge from the Marker with their A-bits set
  when the flow is in conformance to its Profile, but the flow is
  otherwise unchanged. For a Premium flow, the Marker will hold packets
  when necessary to enforce their configured rate. Thus Premium flow
  packets emerge from the Marker in a shaped flow with their P-bits
  set. (It is possible for Premium flow packets to be dropped inside of
  the Marker as we describe below.) Packets are passed to the
  forwarding engine when they emerge from Markers. Packets that have
  either their P or A bits set we will refer to as Marked packets.

  Figure 2. Block diagram of leaf router input functionality

  Figure 3 shows the inner workings of the Marker. For both Assured and
  Premium packets, a token bucket "fills" at the flow rate that was
  specified in the usage profile. For Assured service, the token bucket
  depth is set by the Profile's burst size. For Premium service, the
  token bucket depth must be limited to the equivalent of only one or
  two packets. (We suggest a depth of one packet in early deployments.)
  When a token is present, Assured flow packets have their A-bit set to
  one, otherwise the packet is passed to the forwarding engine. For
  Premium-configured Marker, arriving packets that see a token present
  have their P-bits set and are forwarded, but when no token is
  present, Premium flow packets are held until a token arrives. If a
  Premium flow bursts enough to overflow the holding queue, its packets
  will be dropped. Though the flow set up data can be used to configure
  a size limit for the holding queue (this would be the meaning of a
  "burst" in Premium service), it is not necessary. Unconfigured
  holding queues should be capable of holding at least two bandwidth-



Nichols, et al.              Informational                      [Page 8]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  delay products, adequate for TCP connections. A smaller value might
  be used to suit delay requirements of a specific application.

  Figure 3. Markers to implement the two different services

  In practice, the token bucket should be implemented in bytes and a
  token is considered to be present if the number of bytes in the
  bucket is equal or larger to the size of the packet. For Premium, the
  bucket can only be allowed to fill to the maximum packet size; while
  Assured may fill to the configured burst parameter. Premium traffic
  is held until a sufficient byte credit has accumulated and this
  holding buffer provides the only real queue the flow sees in the
  network. For Assured, traffic, we just test if the bytes in the
  bucket are sufficient for the packet size and set A if so. If not,
  the only difference is that A is not set. Assured traffic goes into a
  queue following this step and potentially sees a queue at every hop
  along its path.

  Each output interface of a router must have two queues and must
  implement a test on the P-bit to select a packet's output queue. The
  two queues must be serviced by simple priority, Premium packets
  first. Each output interface must implement the RED-based RIO
  mechanism described in [3] on the lower priority queue. RIO uses two
  thresholds for when to begin dropping packets, a lower one based on
  total queue occupancy for ordinary best effort traffic and one based
  on the number of packets enqueued that have their A-bit set. This
  means that any action preferential to Assured service traffic will
  only be taken when the queue's capacity exceeds the threshold value
  for ordinary best effort service. In this case, only unmarked packets
  will be dropped (using the RED algorithm) unless the threshold value
  for Assured service is also reached. Keeping an accurate count of the
  number of A-bit packets currently in a queue requires either testing
  the A-bit at both entry and exit of the queue or some additional
  state in the router. Figure 4 is a block diagram of the output
  interface for all routers.

  Figure 4. Router output interface for two-bit architecture

  The packet output of a leaf router is thus a shaped stream of packets
  with P-bits set mingled with an unshaped best effort stream of
  packets, some of which may have A-bits set. Premium service clearly
  cannot starve best effort traffic because it is both burst and
  bandwidth controlled. Assured service might rely only on a
  conservative allocation to prevent starvation of unmarked traffic,
  but bursts of Assured traffic might then close out best-effort
  traffic at bottleneck queues during congestive periods.





Nichols, et al.              Informational                      [Page 9]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  After [3], we designate the forwarding path objects that test flows
  against their usage profiles "Profile Meters". Border routers will
  require Profile Meters at their input interfaces. The bilateral
  agreement between adjacent administrative domains must specify a peak
  rate on all P traffic and a rate and burst for A traffic (and
  possibly a start time and duration). A Profile Meter is required at
  the ingress of a trust region to ensure that differentiated service
  packet flows are in compliance with their agreed-upon rates. Non-
  compliant packets of Premium flows are discarded while non-compliant
  packets of Assured flows have their A-bits reset. For example, in
  figure 1, if the ISP has agreed to supply Company A with r bytes/sec
  of Premium service, P-bit marked packets that enter the ISP through
  the link from Company A will be dropped if they exceed r. If instead,
  the service in figure 1 was Assured service, the packets would simply
  be unmarked, forwarded as best effort.

  The simplest border router input interface is a Profile Meter
  constructed from a token bucket configured with the contracted rate
  across that ingress link (see figure 5). Each type, Premium or
  Assured, and each interface must have its own profile meter
  corresponding to a particular class across a particular boundary.
  (This is in contrast to models where every flow that crosses the
  boundary must be separately policed and/or shaped.) The exact
  mechanisms required at a border router input interface depend on the
  allocation policy deployed; a more complex approach is presented in
  section 4.

  Figure 5. Border router input interface Profile Meters

3. Mechanisms

3.1 Forwarding Path Primitives

  Section 2.3 introduced the forwarding path objects of Markers and
  Profile Meters. In this section we specify the primitive building
  blocks required to compose them. The primitives are: general
  classifier, bit-pattern classifier, bit setter, priority queues,
  policing token bucket and shaping token bucket. These primitives can
  compose a Marker (either a policing or a shaping token bucket plus a
  bit setter) and a Profile Meter (a policing token bucket plus a
  dropper or bit setter).

  General Classifier: Leaf or first-hop routers must perform a
  transport-level signature matching based on a tuple in the packet
  header, a functionality which is part of any RSVP-capable router.  As
  described above, packets whose tuples match one of the configured
  flows are conformance tested and have the appropriate service bit
  set.  This function is memory- and processing-intensive, but is kept



Nichols, et al.              Informational                     [Page 10]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  at the edges of the network where there are fewer flows.

  Bit-pattern classifier: This primitive comprises a simple two-way
  decision based on whether a particular bit-pattern in the IP header
  is set or not. As in figure 4, the P-bit is tested when a packet
  arrives at a non-leaf router to determine whether to enqueue it in
  the high priority output queue or the low priority packet queue. The
  A-bit of packets bound for the low priority queue is tested to 1)
  increment the count of Assured packets in the queue if set and 2)
  determine which drop probability will be used for that packet.
  Packets exiting the low priority queue must also have the A-bit
  tested so that the count of enqueued Assured packets can be
  decremented if necessary.

  Bit setter: The A-bits and P-bits must be set or cleared in several
  places. A functional block that sets the appropriate bits of the IP
  header to a configured bit-pattern would be the most general.

  Priority queues: Every network element must include (at least) two
  levels of simple priority queueing. The high priority queue is for
  the Premium traffic and the service rule is to send packets in that
  queue first and to exhaustion. Recall that Premium traffic must never
  be oversubscribed, thus Premium traffic should see little or no
  queue.

  Shaping token bucket:This is the token bucket required at the leaf
  router for Premium traffic and shown in figure 3. As we shall see,
  shaping is also useful at egress points of a trust region. An
  arriving packet is immediately forwarded if there is a token present
  in the bucket, otherwise the packet is enqueued until the bucket
  contains tokens sufficient to send it. Shaping requires clocking
  mechanisms, packet memory, and some state block for each flow and is
  thus a memory and computation-intensive process.

  Policing token bucket: This is the token bucket required for Profile
  Meters and shown in figure 5. Policing token buckets never hold
  arriving packets, but check on arrival to see if a token is available
  for the packet's service class. If so, the packet is forwarded
  immediately. If not, the policing action is taken, dropping for
  Premium and reclassifying or unmarking for Assured.

3.2 Passing configuration information

   Clearly, mechanisms are required to communicate the information
  about the request to the leaf router. This configuration information
  is the rate, burst, and whether it is a Premium or Assured type.
  There may also need to be a specific field to set or clear this
  configuration. This information can be passed in a number of ways,



Nichols, et al.              Informational                     [Page 11]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  including using the semantics of RSVP, SNMP, or directly set by a
  network administrator in some other way. There must be some
  mechanisms for authenticating the sender of this information. We
  expect configuration to be done in a variety of ways in early
  deployments and a protocol and mechanism for this to be a topic for
  future standards work.

3.3 Discussion

  The requirements of shapers motivate their placement at the edges of
  the network where the state per router can be smaller than in the
  middle of a network. The greatest burden of flow matching and shaping
  will be at leaf routers where the speeds and buffering required
  should be less than those that might be required deeper in the
  network. This functionality is not required at every network element
  on the path. Routers that are internal to a trust region will not
  need to shape traffic. Border routers may need or desire to shape the
  aggregate flow of Marked packets at their egress in order to ensure
  that they will not burst into non-compliance with the policing
  mechanism at the ingress to the other domain (though this may not be
  necessary if the in-degree of the router is low). Further, the
  shaping would be applied to an aggregation of all the Premium flows
  that exit the domain via that path, not to each flow individually.

  These mechanisms are within reach of today's technology and it seems
  plausible to us that Premium and Assured services are all that is
  needed in the Internet. If, in time, these services are found
  insufficient, this architecture provides a migration path for
  delivering other kinds of service levels to traffic. The A- and P-
  bits would continue to be used to identify traffic that gets Marked
  service, but further filter matching could be done on packet headers
  to differentiate service levels further. Using the bits this way
  reduces the number of packets that have to have further matching done
  on them rather than filtering every incoming packet. More queue
  levels and more complex scheduling could be added for P-bit traffic
  and more levels of drop priority could be added for A-bit traffic if
  experience shows them to be necessary and processing speeds are
  sufficient. We propose that the services described here be considered
  as "at least" services. Thus, a network element should at least be
  capable of mapping all P-bit traffic to Premium service and of
  mapping all A-bit traffic to be treated with one level of priority in
  the "best effort" queue (it appears that the single level of A-bit
  traffic should map to a priority that is equivalent to the best level
  in a multi-level element that is also in the path).

  On the other hand, what is the downside of deploying an architecture
  for both classes of service if later experience convinces us that
  only one of them is needed? The functional blocks of both service



Nichols, et al.              Informational                     [Page 12]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  classes are similar and can be provided by the same mechanism,
  parameterized differently. If Assured service is not used, very
  little is lost. A RED-managed best effort queue has been strongly
  recommended in [4] and, to the extent that the deployment of this
  architecture pushes the deployment of RED-managed best effort queues,
  it is clearly a positive. If Premium service goes unused, the two-
  queues with simple priority service is not required and the shaping
  function of the Marker may be unused, thus these would impose an
  unnecessary implementation cost.

4. The Architectural Framework for Marked Traffic Allocation

  Thus far we have focused on the service definitions and the
  forwarding path mechanisms. We now turn to the problem of allocating
  the level of Marked traffic throughout the Internet. We observe that
  most organizations have fixed portions of their budgets, including
  data communications, that are determined on an annual or quarterly
  basis. Some additional monies might be attached to specific projects
  for discretionary costs that arise in the shorter term. In turn,
  service providers (ISPs and NSPs) must do their planning on annual
  and quarterly bases and thus cannot be expected to provide
  differentiated services purely "on call". Provisioning sets up static
  levels of Marked traffic while call set-up creates an allocation of
  Marked traffic for a single flow's duration. Static levels can be
  provisioned with time-of-day specifications, but cannot be changed in
  response to a dynamic message. We expect both kinds of bandwidth
  allocation to be important. The purchasers of Marked services can
  generally be expected to work on longer-term budget cycles where
  these services will be accounted for similarly to many information
  services today. A mail-order house may wish to purchase a fixed
  allocation of bandwidth in and out of its web-server to give
  potential customers a "fast" feel when browsing their site. This
  allocation might be based on hit rates of the previous quarter or
  some sort of industry-based averages. In addition, there needs to be
  a dynamic allocation capability to respond to particular events, such
  as a demonstration, a network broadcast by a company's CEO, or a
  particular network test. Furthermore, a dynamic capability may be
  needed in order to meet a precommitted service level when the
  particular source or destination is allowed to be "anywhere on the
  Internet". "Dynamic" covers the range from a telephoned or e-mailed
  request to a signalling type model. A strictly statically allocated
  scenario is expected to be useful in initial deployment of
  differentiated services and to make up a major portion of the Marked
  traffic for the forseeable future.

  Without a "per call" dynamic set up, the preconfiguring of usage
  profiles can always be construed as "paying for bits you don't use"
  whether the type of service is Premium or Assured. We prefer to think



Nichols, et al.              Informational                     [Page 13]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  of this as paying for the level of service that one expects to have
  available at any time, for example paying for a telephone line. A
  customer might pay an additional flat fee to have the privilege of
  calling a wide local area for no additional charge or might pay by
  the call. Although a customer might pay on a "per call" basis for
  every call made anywhere, it generally turns out not to be the most
  economical option for most customers. It's possible similar pricing
  structures might arise in the internet.

  We use Allocation to refer to the process of making Marked traffic
  commitments anywhere along this continuum from strictly preallocated
  to dynamic call set-up and we require an Allocation architecture
  capable of encompassing this entire spectrum in any mix. We further
  observe that Allocation must follow organizational hierarchies, that
  is each organization must have complete responsibility for the
  Allocation of the Marked traffic resource within its domain. Finally,
  we observe that the only chance of success for incremental deployment
  lies in an Allocation architecture that is made up of bilateral
  agreements, as multilateral agreements are much too complex to
  administer. Thus, the Allocation architecture is made up of
  agreements across boundaries as to the amount of Marked traffic that
  will be allowed to pass. This is similar to "settlement" models used
  today.

4.1 Bandwidth Brokers: Allocating and Controlling Bandwidth Shares

  The goal of differentiated services is controlled sharing of some
  organization's Internet bandwidth. The control can be done
  independently by individuals, i.e., users set bit(s) in their packets
  to distinguish their most important traffic, or it can be done by
  agents that have some knowledge of the organization's priorities and
  policies and allocate bandwidth with respect to those policies.
  Independent labeling by individuals is simple to implement but
  unlikely to be sufficient since it's unreasonable to expect all
  individuals to know all their organization's priorities and current
  network use and always mark their traffic accordingly.  Thus this
  architecture is designed with agents called bandwidth brokers (BB)
  [2], that can be configured with organizational policies, keep track
  of the current allocation of marked traffic, and interpret new
  requests to mark traffic in light of the policies and current
  allocation.

  We note that such agents are inherent in any but the most trivial
  notions of sharing.  Neither individuals nor the routers their
  packets transit have the information necessary to decide which
  packets are most important to the organization.  Since these agents
  must exist, they can be used to allocate bandwidth for end-to-end
  connections with far less state and simpler trust relationships than



Nichols, et al.              Informational                     [Page 14]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  deploying per flow or per filter guarantees in all network elements
  on an end-to-end path. BBs make it possible for bandwidth allocation
  to follow organizational hierarchies and, in concert with the
  forwarding path mechanisms discussed in section 3, reduce the state
  required to set up and maintain a flow over architectures that
  require checking the full flow header at every network element.
  Organizationally, the BB architecture is motivated by the observation
  that multilateral agreements rarely work and this architecture allows
  end-to-end services to be constructed out of purely bilateral
  agreements. BBs only need to establish relationships of limited trust
  with their peers in adjacent domains, unlike schemes that require the
  setting of flow specifications in routers throughout an end-to-end
  path. In practical technical terms, the BB architecture makes it
  possible to keep state on an administrative domain basis, rather than
  at every router and the service definitions of Premium and Assured
  service make it possible to confine per flow state to just the leaf
  routers.

  BBs have two responsibilities. Their primary one is to parcel out
  their region's Marked traffic allocations and set up the leaf routers
  within the local domain. The other is to manage the messages that are
  sent across boundaries to adjacent regions' BBs. A BB is associated
  with a particular trust region, one per domain. A BB has a policy
  database that keeps the information on who can do what when and a
  method of using that database to authenticate requesters. Only a BB
  can configure the leaf routers to deliver a particular service to
  flows, crucial for deploying a secure system. If the deployment of
  Differentiated Services has advanced to the stage where dynamically
  allocated, marked flows are possible between two adjacent domains,
  BBs also provide the hook needed to implement this. Each domain's BB
  establishes a secure association with its peer in the adjacent domain
  to negotiate or configure a rate and a service class (Premium or
  Assured) across the shared boundary and through the peer's domain. As
  we shall see, it is possible for some types of service and
  particularly in early implementations, that this "secure association"
  is not automatic but accomplished through human negotiation and
  subsequent manual configuration of the adjacent BBs according to the
  negotiated agreement. This negotiated rate is a capability that a BB
  controls for all hosts in its region.

  When an allocation is desired for a particular flow, a request is
  sent to the BB. Requests include a service type, a target rate, a
  maximum burst, and the time period when service is required. The
  request can be made manually by a network administrator or a user or
  it might come from another region's BB. A BB first authenticates the
  credentials of the requester, then verifies there exists unallocated
  bandwidth sufficient to meet the request. If a request passes these
  tests, the available bandwidth is reduced by the requested amount and



Nichols, et al.              Informational                     [Page 15]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  the flow specification is recorded. In the case where the flow has a
  destination outside this trust region, the request must fall within
  the class allocation through the "next hop" trust region that was
  established through a bilateral agreement of the two trust regions.
  The requester's BB informs the adjacent region's BB that it will be
  using some of this rate allocation. The BB configures the appropriate
  leaf router with the information about the packet flow to be given a
  service at the time that the service is to commence. This
  configuration is "soft state" that the BB will periodically refresh.
  The BB in the adjacent region is responsible for configuring the
  border router to permit the allocated packet flow to pass and for any
  additional configurations and negotiations within and across its
  borders that will allow the flow to reach its final destination.

  At DMZs, there must be an unambiguous way to determine the local
  source of a packet. An interface's source could be determined from
  its MAC address which would then be used to classify packets as
  coming across a logical link directly from the source domain
  corresponding to that MAC address. Thus with this understanding we
  can continue to use figures illustrating a single pipe between two
  different domains.

  In this way, all agreements and negotiations are performed between
  two adjacent domains. An initial request might cause communication
  between BBs on several domains along a path, but each communication
  is only between two adjacent BBs. Initially, these agreements will be
  prenegotiated and fairly static. Some may become more dynamic as the
  service evolves.

4.2 Examples

  This section gives examples of BB transactions in a non-trivial,
  multi-transit-domain Internet. The BB framework allows operating
  points across a spectrum from "no signalling across boundaries" to
  "each flow set up dynamically". We might expect to move across this
  spectrum over time, as the necessary mechanisms are ubiquitously
  deployed and BBs become more sophisticated, but the statically
  allocated portions of the spectrum should always have uses. We
  believe the ability to support this wide spectrum of choices
  simultaneously will be important both in incremental deployment and
  in allowing ISPs to make a wide range of offerings and pricings to
  users. The examples of this section roughly follow the spectrum of
  increasing sophistication. Note that we assume that domains contract
  for some amount of Marked traffic which can be requested as either
  Assured or Premium in each individual flow setup transaction. The
  examples say "Marked" although actual transactions would have to
  specify either Assured or Premium.




Nichols, et al.              Informational                     [Page 16]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  A statically configured example with no BB messages exchanged: Here
  all allocations are statically preallocated through purely bilateral
  agreements between users (individual TCPs, individual hosts, campus
  networks, or whole ISPs) [6]. The allocations are in the form of
  usage profiles of rate, burst, and a time during which that profile
  is to be active. Users and providers negotiate these Profiles which
  are then installed in the user domain BB and in the provider domain
  BB. No BB messages cross the boundary; we assume this negotiation is
  done by human representatives of each domain. In this case, BBs only
  have to perform one of their two functions, that of allocating this
  Profile within their local domain. It is even possible to set all of
  this suballocations up in advance and then the BB only needs to set
  up and tear down the Profile at the proper time and to refresh the
  soft state in the leaf routers. From the user domain BB, the Profile
  is sent as soft state to the first hop router of the flow during the
  specified time. These Profiles might be set using RSVP, a variant of
  RSVP, SNMP, or some vendor-specific mechanism. Although this static
  approach can work for all Marked traffic, due to the strictly not
  oversubscribed requirement, it is only appropriate for Premium
  traffic as long as it is kept to a small percentage of the bottleneck
  path through a domain or is otherwise constrained to a well-known
  behavior. Similar restrictions might hold for Assured depending on
  the expectation associated with the service.

  In figure 6, we show an example of setting a Profile in a leaf
  router. A usage profile has been negotiated with the ISP for the
  entire domain and the BB parcels it out among individual flows as
  requested. The leaf router mechanism is that shown in figure 3, with
  the token bucket set to the parameters from the usage profile. The
  ISP's BB would configure its own Profile Meter at the ingress router
  from that customer to ensure the Profile was maintained. This
  mechanism was shown in figure 5. We assume that the time duration and
  start times for any Profile to be active are maintained in the BB.
  The Profile is sent to the ingress device or cleared from the ingress
  device by messages sent from the BB. In this example, we assume that
  van@lbl wants to talk to ddc@mit. The LBL-BB is sent a request from
  Van asking that premium service be assigned to a flow that is
  designated as having source address "V:4" and going to destination
  address "D:8". This flow should be configured for a rate of 128kb/sec
  and allocated from 1pm to 3pm. The request must be "signed" in a
  secure, verifiable manner. The request might be sent as data to the
  LBL-BB, an e-mail message to a network administrator, or in a phone
  call to a network administrator. The LBL-BB receives this message,
  verifies that there is 128kb/sec of unused Premium service for the
  domain from 1-3pm, then sends a message to Leaf1 that sets up an
  appropriate Profile Meter. The message to Leaf1 might be an RSVP
  message, or SNMP, or some proprietary method. All the domains passed
  must have sufficient reserve capacity to meet this request.



Nichols, et al.              Informational                     [Page 17]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  Figure 6. Bandwidth Broker setting Profiles in leaf routers

  A statically configured example with BB messages exchanged: Next we
  present an example where all allocations are statically preallocated
  but BB messages are exchanged for greater flexibility. Figure 7 shows
  an end-to-end example for Marked traffic in a statically allocated
  internet. The numbers at the trust region boundaries indicate the
  total statically allocated Marked packet rates that will be accepted
  across those boundaries. For example, 100kbps of Marked traffic can
  be sent from LBL to ESNet; a Profile Meter at the ESNet egress
  boundary would have a token bucket set to rate 100kbps. (There MAY be
  a shaper set at LBL's egress to ensure that the Marked traffic
  conforms to the aggregate Profile.) The tables inside the transit
  network "bubbles" show their policy databases and reflect the values
  after the transaction is complete. In Figure 7, V wants to transmit a
  flow from LBL to D at MIT at 10 Kbps. As in figure 6, a request for
  this profile is made of LBL's BB. LBL's BB authenticates the request
  and checks to see if there is 10kbps left in its Marked allocation
  going in that direction. There is, so the LBL-BB passes a message to
  the ESNet-BB saying that it would like to use 10kbps of its Marked
  allocation for this flow. ESNet authenticates the message, checks its
  database and sees that it has a 10kbps Marked allocation to NEARNet
  (the next region in that direction) that is being unused. The policy
  is that ESNet-BB must always inform ("ask") NEARNet-BB when it is
  about to use part of its allocation. NEARNET-BB authenticates the
  message, checks its database and discovers that 20kbps of the
  allocation to MIT is unused and the policy at that boundary is to not
  inform MIT when part of the allocation is about to be used ("<50 ok"
  where the total allocation is 50). The dotted lines indicate the
  "implied" transaction, that is the transaction that would have
  happened if the policy hadn't said "don't ask me". Now each BB can
  pass an "ok" message to this request across its boundary. This allows
  V to send to D, but not vice versa. It would also be possible for the
  request to originate from D.

  Figure 7. End-to-end example with static allocation.

  Consider the same example where the ESNet-BB finds all of its Marked
  allocation to NEARNet, 10 kbps, in use. With static allocations,
  ESNet must transmit a "no" to this request back to the LBL-BB.
  Presumably, the LBL-BB would record this information to complain to
  ESNet about the overbooking at the end of the month! One solution to
  this sort of "busy signal" is for ESNet to get better at anticipating
  its customers needs or require long advance bookings for every flow,
  but it's also possible for bandwidth brokerage decisions to become
  dynamic.





Nichols, et al.              Informational                     [Page 18]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  Figure 8. End-to-end static allocation example with no remaining
  allocation

  Dynamic Allocation and additional mechanism: As we shall see, dynamic
  allocation requires more complex BBs as well as more complex border
  policing, including the necessity to keep more state. However, it
  enables an important service with a small increase in state.

  The next set of figures (starting with figure 9) show what happens in
  the case of dynamic allocation. As before, V requests 10kbps to talk
  to D at MIT. Since the allocation is dynamic, the border policers do
  not have a preset value, instead being set to reflect the current
  peak value of Marked traffic permitted to cross that boundary. The
  request is sent to the LBL-BB.

  Figure 9. First step in end-to-end dynamic allocation example.

  In figure 10, note that ESNet has no allocation set up to NEARNet.
  This system is capable of dynamic allocations in addition to static,
  so it asks NEARNet if it can "add 10" to its allocation from ESNet.
  As in the figure 7 example, MIT's policy is set to "don't ask" for
  this case, so the dotted lines represent "implicit transactions"
  where no messages were exchanged. However, NEARNet does update its
  table to indicate that it is now using 20kbps of the Marked
  allocation to MIT.

  Figure 10. Second step in end-to-end dynamic allocation example

  In figure 11, we see the third step where MIT's "virtual ok" allows
  the NEARNet-BB to tell its border router to increase the Marked
  allocation across the ESNet-NEARNet boundary by 10 kbps.

  Figure 11. Third step in end-to-end dynamic allocation example

  Figure 11 shows NEARNet-BB's "ok" for that request transmitted back
  to ESNet-BB. This causes ESNet-BB to send its border router a message
  to create a 10 kbps subclass for the flow "V->D". This is required in
  order to ensure that the 10kpbs that has just been dynamically
  allocated gets used only for that connection. Note that this does
  require that the per flow state be passed from LBL-BB to ESNet-BB,
  but this is the only boundary that needs that level of flow
  information and this further classification will only need to be done
  at that one boundary router and only on packets coming from LBL. Thus
  dynamic allocation requires more complex Profile Metering than that
  shown in figure 5.

  Figure 12. Fourth step in end-to-end dynamic allocation example.




Nichols, et al.              Informational                     [Page 19]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  In figure 12, the ESNet border router gives the "ok" that a subclass
  has been created, causing the ESNet-BB to send an "ok" to the LBL-BB
  which lets V know the request has been approved.

  Figure 13. Final step in end-to-end dynamic allocation example

  For dynamic allocation, a basic version of a CBQ scheduler [5] would
  have all the required functionality to set up the subclasses. RSVP
  currently provides a way to move the TSpec for the flow.

  For multicast flows, we assume that packets that are bound for at
  least one egress can be carried through a domain at that level of
  service to all egress points. If a particular multicast branch has
  been subscribed to at best-effort when upstream branches are Marked,
  it will have its bit settings cleared before it crosses the boundary.
  The information required for this flow identification is used to
  augment the existing state that is already kept on this flow because
  it is a multicast flow. We note that we are already "catching" this
  flow, but now we must potentially clear the bit-pattern.

5. RSVP/int-serv and this architecture

  Much work has been done in recent years on the definition of related
  integrated services for the internet and the specification of the
  RSVP signalling protocol. The two-bit architecture proposed in this
  work can easily interoperate with those specifications. In this
  section we first discuss how the forwarding mechanisms described in
  section 3 can be used to support integrated services. Second, we
  discuss how RSVP could interoperate with the administrative structure
  of the BBs to provide better scaling.

5.1 Providing Controlled-Load and Guaranteed Service

  We believe that the forwarding path mechanisms described in section 3
  are general enough that they can also be used to provide the
  Controlled-Load service [8] and a version of the Guaranteed Quality
  of Service [9], as developed by the int-serv WG. First note that
  Premium service can be thought of as a constrained case of
  Controlled-Load service where the burst size is limited to one packet
  and where non-conforming packets are dropped. A network element that
  has implemented the mechanisms to support premium service can easily
  support the more general controlled-load service by making one or
  more minor parameter adjustments, e.g. by lifting the constraint on
  the token bucket size, or configuring the Premium service rate with
  the peak traffic rate parameter in the Controlled-Load specification,
  and by changing the policing action on out-of-profile packets from
  dropping to sending the packets to the Best-effort queue.




Nichols, et al.              Informational                     [Page 20]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  It is also possible to implement Guaranteed Quality of Service using
  the mechanisms of Premium service. From RFC 2212 [9]: "The definition
  of guaranteed service relies on the result that the fluid delay of a
  flow obeying a token bucket (r, b) and being served by a line with
  bandwidth R is bounded by b/R as long as R is no less than r.
  Guaranteed service with a service rate R, where now R is a share of
  bandwidth rather than the bandwidth of a dedicated line approximates
  this behavior." The service model of Premium clearly fits this model.
  RFC 2212 states that "Non-conforming datagrams SHOULD be treated as
  best-effort datagrams." Thus, a policing Profile Meter that drops
  non-conforming datagrams would be acceptable, but it's also possible
  to change the action for non-compliant packets from a drop to sending
  to the best-effort queue.

5.2 RSVP and BBs

  In this section we discuss how RSVP signaling can be used in
  conjunction with the BBs described in section 4 to deliver a more
  scalable end-to-end resource set up for Integrated Services. First we
  note that the BB architecture has three major differences with the
  original RSVP resource set up model:

  1. There exist apriori bilateral business relations between BBs of
  adjacent trust regions before one can set up end-to-end resource
  allocation; real-time signaling is used only to activate/confirm the
  availability of pre-negotiated Marked bandwidth, and to dynamically
  readjust the allocation amount when necessary. We note that this
  real-time signaling across domains is not required, but depends on
  the nature of the bilateral agreement (e.g., the agreement might
  state "I'll tell you whenever I'm going to use some of my allocation"
  or not).

  2. A few bits in the packet header, i.e. the P-bit and A-bit, are
  used to mark the service class of each packet, therefore a full
  packet classification (by checking all relevant fields in the header)
  need be done only once at the leaf router; after that packets will be
  served according to their class bit settings.

  3. RSVP resource set up assumes that resources will be reserved hop-
  by-hop at each router along the entire end-to-end path.

  RSVP messages sent to leaf routers by hosts can be intercepted and
  sent to the local domain's BB. The BB processes the message and, if
  the request is approved, forwards a message to the leaf router that
  sets up appropriate per-flow packet classification. A message should
  also be sent to the egress border router to add to the aggregate
  Marked traffic allocation for packet shaping by the Profile Meter on
  outbound traffic. (Its possible that this is always set to the full



Nichols, et al.              Informational                     [Page 21]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  allocation.) An RSVP message must be sent across the boundary to
  adjacent ISP's border router, either from the local domain's border
  router or from the local domain's BB. If the ISP is also implementing
  the RSVP with a BB and diff-serv framework, its border router
  forwards the message to the ISP's local BB. A similar process (to
  what happened in the first domain) can be carried out in the ISP
  domain, then an RSVP message gets forwarded to the next ISP along the
  path. Inside a domain, packets are served solely according to the
  Marked bits. The local BB knows exactly how much Premium traffic is
  permitted to enter at each border router and from which border router
  packets exit.

6. Recommendations

  This document has presented a reference architecture for
  differentiated services. Several variations can be envisioned,
  particularly for early and partial deployments, but we do not
  enumerate all of these variations here. There has been a great market
  demand for differentiated services lately. As one of the many efforts
  to meet that demand this memo sketches out the framework of a
  flexible architecture for offering differential services, and in
  particular defines a simple set of packet forwarding path mechanisms
  to support two basic types of differential services. Although there
  remain a number of issues and parameters that need further
  exploration and refinement, we believe it is both possible and
  feasible at this time to start deployment of differentiated services
  incrementally. First, given that the basic mechanisms required in the
  packet forwarding path are clearly understood, both Assured and
  Premium services can be implemented today with manually configured
  BBs and static resource allocation. Initially we recommend
  conservative choices on the amount of Marked traffic that is admitted
  into the network. Second, we plan to continue the effort started with
  this memo and the experimental work of the authors to define and
  deploy increasingly sophisticated BBs. We hope to turn the experience
  gained from in-progress trial implementations on ESNet and CAIRN into
  future proposals to the IETF.

  Future revisions of this memo will present the receiver-based and
  multicast flow allocations in detail.    After this step is finished,
  we believe the basic picture of an scalable, robust, secure resource
  management and allocation system will be completed. In this memo, we
  described how the proposed architecture supports two services that
  seem to us to provide at least a good starting point for trial
  deployment of differentiated services. Our main intent is to define
  an architecture with three services, Premium, Assured, and Best
  effort, that can be determined by specific bit- patterns, but not to
  preclude additional levels of differentiation within each service. It
  seems that more experimentation and experience is required before we



Nichols, et al.              Informational                     [Page 22]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  could standardize more than one level per service class. Our base-
  level approach says that everyone has to provide "at least" Premium
  service and Assured service as documented. We feel rather strongly
  about both 1) that we should not try to define, at this time,
  something beyond the minimalist two service approach and 2) that the
  architecture we define must be open-ended so that more levels of
  differentiation might be standardized in the future. We believe this
  architecture is completely compatible with approaches that would
  define more levels of differentiation within a particular service, if
  the benefits of doing so become well understood.

7. Acknowledgments

  The authors have benefited from many discussions, both in person and
  electronically and wish to particularly thank Dave Clark who has been
  responsible for the genesis of many of the ideas presented here,
  though he does not agree with all of the content this document. We
  also thank Sally Floyd for comments on an earlier draft. A comment
  from Jon Crowcroft was partially responsible for our including
  section 5. Comments from Fred Baker made us try to make it clearer
  that we are defining two base-level services, irrespective of the bit
  patterns used to encode them.

8. Security Considerations

  There are no security considerations associated with this document.

9. References

  [1] D. Clark, "Adding Service Discrimination to the Internet",
      Proceedings of the 23rd Annual Telecommunications Policy Research
      Conference (TPRC), Solomons, MD, October 1995.

  [2] V. Jacobson, "Differentiated Services Architecture", talk in the
      Int-Serv WG at the Munich IETF, August, 1997.

  [3] Clark, D. and J. Wroclawski, "An Approach to Service Allocation
      in the Internet", Work in Progress, also talk by D. Clark in the
      Int-Serv WG at the Munich IETF, August, 1997.

  [4] Braden, et al., "Recommendations on Queue Management and
      Congestion Avoidance in the Internet", RFC 2309, April 1998.

  [4] Braden, R., Zhang, L., Berson, S., Herzog, S. and S. Jamin,
      "Resource Reservation Protocol (RSVP) - Version 1 Functional
      Specification", RFC 2205, September 1997.





Nichols, et al.              Informational                     [Page 23]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


  [5] S. Floyd and V. Jacobson, "Link-sharing and Resource Management
      Models for Packet Networks", IEEE/ACM Transactions on Networking,
      pp 365-386, August 1995.

  [6] D. Clark, private communication, October 26, 1997.

  [7] "Advanced QoS Services for the Intelligent Internet", Cisco
      Systems White Paper, 1997.

  [8] Wroclawski, J., "Specification of the Controlled-Load Network
      Element Service", RFC 2211, September 1997.

  [9] Shenker, S., Partirdge, C. and R. Guerin, "Specification of
      Guaranteed Quality of Service", RFC 2212, September 1997.

  [10] D. Clark and W. Fang, "Explicit Allocation of Best Effort packet
      Delivery Service", IEEE/ACM Transactions on Networking, August,
      1998, Vol6, No 4, pp. 362-373. also at: http://
      diffserv.lcs.mit.edu/Papers/exp-alloc-ddc-wf.pdf

Authors' Addresses

  Kathleen Nichols
  Cisco Systems, Inc.
  170 West Tasman Drive
  San Jose, CA 95134-1706

  Phone: 408-525-4857
  EMail:   [email protected]


  Van Jacobson
  Cisco Systems, Inc.
  170 West Tasman Drive
  San Jose, CA 95134-1706

  EMail: [email protected]


  Lixia Zhang
  UCLA
  4531G Boelter Hall
  Los Angeles, CA  90095

  Phone: 310-825-2695
  EMail: [email protected]





Nichols, et al.              Informational                     [Page 24]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


Appendix: A Combined Approach to Differential Service in the Internet by
         David D. Clark

  After the draft-nichols-diff-svc-00 was submitted, the co-authors had
  a discussion with Dave Clark and John Wroclawski which resulted in
  Clark's using the presentation slot for the draft at the December
  1997 IETF Integrated Services Working Group meeting. A reading of the
  slides shows that it was Clark's proposal on "mechanisms",
  "services", and "rules" and how to proceed in the standards process
  that has guided much of the process in the subsequently formed IETF
  Differentiated Services Working Group. We believe Dave Clark's talk
  gave us a solid approach for bringing quality of service to the
  Internet in a manner that is compatible with its strengths.

  The slides presented at the December 1997 IETF Integrated Services
  Working Group are included with the Postscript version.



































Nichols, et al.              Informational                     [Page 25]

RFC 2638      Two-bit Differentiated Services Architecture     July 1999


Full Copyright Statement

  Copyright (C) The Internet Society (1999).  All Rights Reserved.

  This document and translations of it may be copied and furnished to
  others, and derivative works that comment on or otherwise explain it
  or assist in its implementation may be prepared, copied, published
  and distributed, in whole or in part, without restriction of any
  kind, provided that the above copyright notice and this paragraph are
  included on all such copies and derivative works.  However, this
  document itself may not be modified in any way, such as by removing
  the copyright notice or references to the Internet Society or other
  Internet organizations, except as needed for the purpose of
  developing Internet standards in which case the procedures for
  copyrights defined in the Internet Standards process must be
  followed, or as required to translate it into languages other than
  English.

  The limited permissions granted above are perpetual and will not be
  revoked by the Internet Society or its successors or assigns.

  This document and the information contained herein is provided on an
  "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING
  TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING
  BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION
  HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF
  MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Acknowledgement

  Funding for the RFC Editor function is currently provided by the
  Internet Society.



















Nichols, et al.              Informational                     [Page 26]