Network Working Group                                        S. Dasgupta
Request for Comments: 5468                                J. de Oliveira
Category: Informational                                Drexel University
                                                            JP. Vasseur
                                                          Cisco Systems
                                                             April 2009


 Performance Analysis of Inter-Domain Path Computation Methodologies

Status of This Memo

  This memo provides information for the Internet community.  It does
  not specify an Internet standard of any kind.  Distribution of this
  memo is unlimited.

Copyright Notice

  Copyright (c) 2009 IETF Trust and the persons identified as the
  document authors.  All rights reserved.

  This document is subject to BCP 78 and the IETF Trust's Legal
  Provisions Relating to IETF Documents in effect on the date of
  publication of this document (http://trustee.ietf.org/license-info).
  Please review these documents carefully, as they describe your rights
  and restrictions with respect to this document.

Abstract

  This document presents a performance comparison between the per-
  domain path computation method and the Path Computation Element (PCE)
  Architecture-based Backward Recursive Path Computation (BRPC)
  procedure.  Metrics to capture the significant performance aspects
  are identified, and detailed simulations are carried out on realistic
  scenarios.  A performance analysis for each of the path computation
  methods is then undertaken.















Dasgupta, et al.             Informational                      [Page 1]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


Table of Contents

  1. Introduction ....................................................2
  2. Terminology .....................................................3
  3. Evaluation Metrics ..............................................4
  4. Simulation Setup ................................................5
  5. Results and Analysis ............................................6
     5.1. Path Cost ..................................................7
     5.2. Crankback/Setup Delay ......................................7
     5.3. Signaling Failures .........................................8
     5.4. Failed TE-LSPs/Bandwidth on Link Failures ..................8
     5.5. TE LSP/Bandwidth Setup Capacity ............................8
  6. Security Considerations .........................................9
  7. Acknowledgment ..................................................9
  8. Informative References ..........................................9

1.  Introduction

  The IETF has specified two approaches for the computation of inter-
  domain (Generalized) Multi-Protocol Label Switching ((G)MPLS) Traffic
  Engineering (TE) Label Switched Paths (LSP): the per-domain path
  computation approach defined in [RFC5152] and the PCE-based approach
  specified in [RFC4655].  More specifically, we study the PCE-based
  path computation model that makes use of the BRPC method outlined in
  [RFC5441].  In the rest of this document, we will call PD and PCE the
  per-domain path computation approach and the PCE path computation
  approach, respectively.

  In the per-domain path computation approach, each path segment within
  a domain is computed during the signaling process by each entry node
  of the domain up to the next-hop exit node of that same domain.

  In contrast, the PCE-based approach and, in particular, the BRPC
  method defined in [RFC5441] rely on the collaboration between a set
  of PCEs to find to shortest inter-domain path after the computation
  of which the corresponding TE LSP is signaled: path computation is
  undertaken using multiple PCEs in a backward recursive fashion from
  the destination domain to the source domain.  The notion of a Virtual
  Shortest Path Tree (VSPT) is introduced.  Each link of a VSPT
  represents the shortest path satisfying the set of required
  constraints between the border nodes of a domain and the destination
  LSR.  The VSPT of each domain is returned by the corresponding PCE to
  create a new VSPT by PCEs present in other domains.  [RFC5441]
  discusses the BRPC procedure in complete detail.







Dasgupta, et al.             Informational                      [Page 2]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


  This document presents some simulation results and analysis to
  compare the performance of the above two inter-domain path
  computation approaches.  Two realistic topologies with accompanying
  traffic matrices are used to undertake the simulations.

  Note that although the simulation results discussed in this document
  have used inter-area networks, they also apply to Inter-AS cases.

  Disclaimer: although simulations have been made on different and
  realistic topologies showing consistent results, the metrics shown
  below may vary with the network topology.

  Note that this document refers to multiple figures that are only
  available in the PDF version.

2.  Terminology

  Terminology used in this document:

  TE LSP: Traffic Engineered Label Switched Path.

  CSPF: Constrained Shortest Path First.

  PCE: Path Computation Element.

  BRPC: Backward Recursive PCE-based Computation.

  AS: Autonomous System.

  ABR: Routers used to connect two IGP areas (areas in OSPF or levels
  in IS-IS).

  ASBR: Routers used to connect together ASes of a different or the
  same Service Provider via one or more Inter-AS links.

  Border LSR: A border LSR is either an ABR in the context of inter-
  area TE or an ASBR in the context of Inter-AS TE.

  VSPT: Virtual Shortest Path Tree.

  LSA: Link State Advertisement.

  LSR: Label Switching Router.

  IGP: Interior Gateway Protocol.

  TED: Traffic Engineering Database.




Dasgupta, et al.             Informational                      [Page 3]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


  PD: Per-Domain

3.  Evaluation Metrics

  This section discusses the metrics that are used to quantify and
  compare the performance of the two approaches.

  o  Path Cost.  The maximum and average path costs are observed for
     each TE LSP.  The distributions for the maximum and average path
     costs are then compared for the two path computation approaches.

  o  Signaling Failures.  Signaling failures may occur in various
     circumstances.  With PD, the head-end LSR chooses the downstream
     border router (ABR, ASBR) according to some selection criteria
     (IGP shortest path, ....) based on the information in its TED.
     This ABR then selects the next ABR using its TED, continuing the
     process till the destination is reached.  At each step, the TED
     information could be out of date, potentially resulting in a
     signaling failure during setup.  In the BRPC procedure, the PCEs
     are the ABRs that cooperate to form the VSPT based on the
     information in their respective TEDs.  As in the case of the PD
     approach, information in the TED could be out of date, potentially
     resulting in signaling failures during setup.  Also, only with the
     PD approach, another situation that leads to a signaling failure
     is when the selected exit ABR does not have any path obeying the
     set of constraints toward a downstream exit node or the TE LSP
     destination.  This situation does not occur with the BRPC.  The
     signaling failure metric captures the total number of signaling
     failures that occur during initial setup and re-route (on link
     failure) of a TE LSP.  The distribution of the number of signaling
     failures encountered for all TE LSPs is then compared for the PD
     and BRPC methods.

  o  Crankback Signaling.  In this document, we made the assumption
     that in the case of PD, when an entry border node fails to find a
     route in the corresponding domain, boundary re-routing crankback
     [RFC4920] signaling was used.  A crankback signaling message
     propagates to the entry border node of the domain and a new exit
     border node is chosen.  After this, path computation takes place
     to find a path segment to a new entry border node of the next
     domain.  This causes an additional delay in setup time.  This
     metric captures the distribution of the number of crankback
     signals and the corresponding delay in setup time for a TE LSP
     when using PD.  The total delay arising from the crankback
     signaling is proportional to the costs of the links over which the
     signal travels, i.e., the path that is setup from the entry border
     node of a domain to its exit border node (the assumption was made




Dasgupta, et al.             Informational                      [Page 4]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


     that link metrics reflect propagation delays).  Similar to the
     above metrics, the distribution of total crankback signaling and
     corresponding proportional delay across all TE LSPs is compared.

  o  TE LSPs/Bandwidth Setup Capacity.  Due to the different path
     computation techniques, there is a significant difference in the
     amount of TE LSPs/bandwidth that can be set up.  This metric
     captures the difference in the number of TE LSPs and corresponding
     bandwidth that can be set up using the two path computation
     techniques.  The traffic matrix is continuously scaled and stopped
     when the first TE LSP cannot be set up for both the methods.  The
     difference in the scaling factor gives the extra bandwidth that
     can be set up using the corresponding path computation technique.

  o  Failed TE LSPs/Bandwidth on Link Failure.  Link failures are
     induced in the network during the course of the simulations
     conducted.  This metric captures the number of TE LSPs and the
     corresponding bandwidth that failed to find a route when one or
     more links lying on its path failed.

4.  Simulation Setup

  A very detailed simulator has been developed to replicate a real-life
  network scenario accurately.  Following is the set of entities used
  in the simulation with a brief description of their behavior.

  +------------+-------+-------+--------+--------+---------+----------+
  |   Domain   |  # of |  # of |  OC48  |  OC192 |  [0,20) | [20,100] |
  |    Name    | nodes | links |  links |  links |   Mbps  |   Mbps   |
  +------------+-------+-------+--------+--------+---------+----------+
  |     D1     |   17  |   24  |   18   |    6   |   125   |    368   |
  |     D2     |   14  |   17  |   12   |    5   |    76   |    186   |
  |     D3     |   19  |   26  |   20   |    6   |    14   |    20    |
  |     D4     |   9   |   12  |    9   |    3   |    7    |    18    |
  |  MESH-CORE |   83  |  167  |   132  |   35   |    0    |     0    |
  | (backbone) |       |       |        |        |         |          |
  |  SYM-CORE  |   29  |  377  |   26   |   11   |    0    |     0    |
  | (backbone) |       |       |        |        |         |          |
  +------------+-------+-------+--------+--------+---------+----------+

          Table 1.  Domain Details and TE LSP Size Distribution

  o  Topology Description.  To obtain meaningful results applicable to
     present-day Service Provider topologies, simulations have been run
     on two representative topologies.  They consists of a large
     backbone area to which four smaller areas are connected.  For the
     first topology named MESH-CORE, a densely connected backbone was
     obtained from RocketFuel [ROCKETFUEL].  The second topology has a



Dasgupta, et al.             Informational                      [Page 5]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


     symmetrical backbone and is called SYM-CORE.  The four connected
     smaller areas are obtained from [DEF-DES].  Details of the
     topologies are shown in Table 1 along with their layout in Figure
     1.  All TE LSPs set up on this network have their source and
     destinations in different areas and all of them need to traverse
     the backbone network.  Table 1 also shows the number of TE LSPs
     that have their sources in the corresponding areas along with
     their size distribution.

  o  Node Behavior.  Every node in the topology represents a router
     that maintains states for all the TE LSPs passing through it.
     Each node in a domain is a source for TE LSPs to all the other
     nodes in the other domains.  As in a real-life scenario, where
     routers boot up at random points in time, the nodes in the
     topologies also start sending traffic on the TE LSPs originating
     from them at a random start time (to take into account the
     different boot-up times).  All nodes are up within an hour of the
     start of simulation.  All nodes maintain a TED that is updated
     using LSA updates as outlined in [RFC3630].  The flooding scope of
     the Traffic Engineering IGP updates are restricted only to the
     domain in which they originate in compliance with [RFC3630] and
     [RFC5305].

  o  TE LSP Setup.  When a node boots up, it sets up all TE LSPs that
     originate from it in descending order of size.  The network is
     dimensioned such that all TE LSPs can find a path.  Once set up,
     all TE LSPs stay in the network for the complete duration of the
     simulation unless they fail due to a link failure.  Even though
     the TE LSPs are set up in descending order of size from a head-end
     router, from the network perspective, TE LSPs are set up in random
     fashion as the routers boot up at random times.

  o  Inducing Failures.  For thorough performance analysis and
     comparison, link failures are induced in all the areas.  Each link
     in a domain can fail independently with a mean failure time of 24
     hours and be restored with a mean restore time of 15 minutes.
     Both inter-failure and inter-restore times are uniformly
     distributed.  No attempt to re-optimize the path of a TE LSP is
     made when a link is restored.  The links that join two domains
     never fail.  This step has been taken to concentrate only on how
     link failures within domains affect the performance.

5.  Results and Analysis

  Simulations were carried out on the two topologies previously
  described.  The results are presented and discussed in this section.
  All figures are from the PDF version of this document.  In the
  figures, "PD-Setup" and "PCE-Setup" represent results corresponding



Dasgupta, et al.             Informational                      [Page 6]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


  to the initial setting up of TE LSPs on an empty network using the
  per-domain and the PCE approach, respectively.  Similarly, "PD-
  Failure" and "PCE-Failure" denote the results under the link failure
  scenario.  A period of one week was simulated and results were
  collected after the transient period.  Figure 2 and Figure 3
  illustrate the behavior of the metrics for topologies MESH-CORE and
  SYM-CORE, respectively.

5.1.  Path Cost

  Figures 2a and 3a show the distribution of the average path cost of
  the TE LSPs for MESH-CORE and SYM-CORE, respectively.  During the
  initial setup, roughly 40% of TE LSPs for MESH-CORE and 70% of TE
  LSPs for SYM-CORE have path costs greater with PD (PD-Setup) than
  with the PCE approach (PCE-Setup).  This is due to the ability of the
  BRPC procedure to select the inter-domain shortest constrained paths
  that satisfy the constraints.  Since the per-domain approach to path
  computation is undertaken in stages where every entry border router
  to a domain computes the path in the corresponding domain, the most
  optimal (shortest constrained inter-domain) route is not always
  found.  When failures start to take place in the network, TE LSPs are
  re-routed over different paths resulting in path costs that are
  different from the initial costs.  PD-Failure and PCE-Failure in
  Figures 2a and 3a show the distribution of the average path costs
  that the TE LSPs have over the duration of the simulation with link
  failures occurring.  Similarly, the average path costs with the PD
  approach are much higher than the PCE approach when link failures
  occur.  Figures 2b and 3b show similar trends and present the maximum
  path costs for a TE LSP for the two topologies, respectively.  It can
  be seen that with per-domain path computation, the maximum path costs
  are larger for 30% and 100% of the TE LSPs for MESH-CORE and SYM-
  CORE, respectively.

5.2.  Crankback/Setup Delay

  Due to crankbacks that take place in the per-domain approach of path
  computation, TE LSP setup time is significantly increased.  This
  could lead to Quality-of-Service (QoS) requirements not being met,
  especially during failures when re-routing needs to be quick in order
  to keep traffic disruption to a minimum (for example in the absence
  of local repair mechanisms such as defined in [RFC4090]).  Since
  crankbacks do not take place during path computation with a PCE,
  setup delays are significantly reduced.  Figures 2c and 3c show the
  distributions of the number of crankbacks that took place during the
  setup of the corresponding TE LSPs for MESH-CORE and SYM-CORE,
  respectively.  It can be seen that all crankbacks occurred when
  failures were taking place in the networks.  Figures 2d and 3d
  illustrate the "proportional" setup delays experienced by the TE LSPs



Dasgupta, et al.             Informational                      [Page 7]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


  due to crankbacks for the two topologies.  It can be observed that
  for a large proportion of the TE LSPs, the setup delays arising out
  of crankbacks are very large, possibly proving to be very detrimental
  to QoS requirements.  The large delays arise out of the crankback
  signaling that needs to propagate back and forth from the exit border
  router of a domain to its entry border router.  More crankbacks occur
  for SYM-CORE as compared to MESH-CORE as it is a very "restricted"
  and "constrained" network in terms of connectivity.  This causes a
  lack of routes and often several cycles of crankback signaling are
  required to find a constrained path.

5.3.  Signaling Failures

  As discussed in the previous sections, signaling failures occur
  either due to an outdated TED or when a path cannot be found from the
  selected entry border router.  Figures 2e and 3e show the
  distribution of the total number of signaling failures experienced by
  the TE LSPs during setup.  About 38% and 55% of TE LSPs for MESH-CORE
  and SYM-CORE, respectively, experience a signaling failures with per-
  domain path computation when link failures take place in the network.
  In contrast, only about 3% of the TE LSPs experience signaling
  failures with the PCE method.  It should be noted that the signaling
  failures experienced with the PCE correspond only to the TEDs being
  out of date.

5.4.  Failed TE-LSPs/Bandwidth on Link Failures

  Figures 2f and 3f show the number of TE LSPs and the associated
  required bandwidth that fail to find a route when link failures are
  taking place in the topologies.  For MESH-CORE, with the per-domain
  approach, 395 TE LSPs failed to find a path corresponding to 1612
  Mbps of bandwidth.  For PCE, this number is lesser at 374
  corresponding to 1546 Mbps of bandwidth.  For SYM-CORE, with the per-
  domain approach, 434 TE LSPs fail to find a route corresponding to
  1893 Mbps of bandwidth.  With the PCE approach, only 192 TE LSPs fail
  to find a route, corresponding to 895 Mbps of bandwidth.  It is
  clearly visible that the PCE allows more TE LSPs to find a route thus
  leading to better performance during link failures.

5.5.  TE LSP/Bandwidth Setup Capacity

  Since PCE and the per-domain path computation approach differ in how
  path computation takes place, more bandwidth can be set up with PCE.
  This is primarily due to the way in which BRPC functions.  To observe
  the extra bandwidth that can fit into the network, the traffic matrix
  was scaled.  Scaling was stopped when the first TE LSP failed to set
  up with PCE.  This metric, like all the others discussed above, is
  topology dependent (therefore, the choice of two topologies for this



Dasgupta, et al.             Informational                      [Page 8]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


  study).  This metric highlights the ability of PCE to fit more
  bandwidth in the network.  For MESH-CORE, on scaling, 1556 Mbps more
  could be set up with PCE.  In comparison, for SYM-CORE, this value is
  986 Mbps.  The amount of extra bandwidth that can be set up on SYM-
  CORE is lesser due to its restricted nature and limited capacity.

6.  Security Considerations

  This document does not raise any security issues.

7.  Acknowledgment

  The authors would like to acknowledge Dimitri Papadimitriou for his
  helpful comments to clarify the text.

8.  Informative References

  [DEF-DES]    J. Guichard, F. Le Faucheur, and J.-P. Vasseur,
               "Definitive MPLS Network Designs", Cisco Press, 2005.

  [RFC5152]    Vasseur, JP., Ed., Ayyangar, A., Ed., and R. Zhang, "A
               Per-Domain Path Computation Method for Establishing
               Inter-Domain Traffic Engineering (TE) Label Switched
               Paths (LSPs)", RFC 5152, February 2008.

  [RFC5441]    Vasseur, JP., Zhang, R., Bitar, N., and JL. Le Roux, "A
               Backward Recursive PCE-Based Computation (BRPC)
               Procedure to Compute  Shortest Constrained Inter-Domain
               Traffic Engineering Label Switched Paths", RFC 5441,
               April 2009.

  [RFC3630]    Katz, D., Kompella, K., and D. Yeung, "Traffic
               Engineering (TE) Extensions to OSPF Version 2", RFC
               3630, September 2003.

  [RFC5305]    Li, T. and H. Smit, "IS-IS Extensions for Traffic
               Engineering", RFC 5305, October 2008.

  [RFC4090]    Pan, P., Ed., Swallow, G., Ed., and A. Atlas, Ed., "Fast
               Reroute Extensions to RSVP-TE for LSP Tunnels", RFC
               4090, May 2005.

  [RFC4655]    Farrel, A., Vasseur, J.-P., and J. Ash, "A Path
               Computation Element (PCE)-Based Architecture", RFC 4655,
               August 2006.






Dasgupta, et al.             Informational                      [Page 9]

RFC 5468       Analysis of Inter-Domain Path Computation      April 2009


  [RFC4920]    Farrel, A., Ed., Satyanarayana, A., Iwata, A., Fujita,
               N., and G. Ash, "Crankback Signaling Extensions for MPLS
               and GMPLS RSVP-TE", RFC 4920, July 2007.

  [ROCKETFUEL] N. Spring, R. Mahajan, and D. Wehterall, "Measuring ISP
               Topologies with Rocketfuel", Proceedings of ACM SIGCOMM,
               2002.

Authors' Addresses

  Sukrit Dasgupta
  Drexel University
  Dept of ECE, 3141 Chestnut Street
  Philadelphia, PA  19104
  USA

  Phone: 215-895-1862
  EMail: [email protected]
  URI:   www.pages.drexel.edu/~sd88


  Jaudelice C. de Oliveira
  Drexel University
  Dept. of ECE, 3141 Chestnut Street
  Philadelphia, PA  19104
  USA

  Phone: 215-895-2248
  EMail: [email protected]
  URI:   www.ece.drexel.edu/faculty/deoliveira

  JP Vasseur
  Cisco Systems
  1414 Massachussetts Avenue
  Boxborough, MA  01719
  USA

  EMail: [email protected]













Dasgupta, et al.             Informational                     [Page 10]