Network Working Group                                     M. Chadalapaka
Request for Comments: 5047                                            HP
Category: Informational                                       J. Hufferd
                                                           Brocade Inc.
                                                              J. Satran
                                                                    IBM
                                                                H. Shah
                                                   Broadcom Corporation
                                                           October 2007


                   DA: Datamover Architecture for
        the Internet Small Computer System Interface (iSCSI)

Status of This Memo

  This memo provides information for the Internet community.  It does
  not specify an Internet standard of any kind.  Distribution of this
  memo is unlimited.

Abstract

  The Internet Small Computer System Interface (iSCSI) is a SCSI
  transport protocol that maps the SCSI family of application protocols
  onto TCP/IP.  Datamover Architecture for iSCSI (DA) defines an
  abstract model in which the movement of data between iSCSI end nodes
  is logically separated from the rest of the iSCSI protocol in order
  to allow iSCSI to adapt to innovations available in new IP
  transports.  While DA defines the architectural functions required of
  the class of Datamover protocols, it does not define any specific
  Datamover protocols.  Each such Datamover protocol, defined in a
  separate document, provides a reliable transport for all iSCSI PDUs,
  but actually moves the data required for certain iSCSI PDUs without
  involving the remote iSCSI layer itself.  This document begins with
  an introduction of a few new abstractions, defines a layered
  architecture for iSCSI and Datamover protocols, and then models the
  interactions within an iSCSI end node between the iSCSI layer and the
  Datamover layer that happen in order to transparently perform remote
  data movement within an IP fabric.  It is intended that this
  definition will help map iSCSI to generic Remote Direct Memory Access
  (RDMA)-capable IP fabrics in the future comprising TCP, the Stream
  Control Transmission Protocol (SCTP), and possibly other underlying
  network transport layers, such as InfiniBand.








Chadalapaka, et al.          Informational                      [Page 1]

RFC 5047                           DA                       October 2007


Table of Contents

  1. Motivation ......................................................4
     1.1. Intent .....................................................4
     1.2. Interpretation of Requirements .............................5
  2. Definitions and Acronyms ........................................5
     2.1. Definitions ................................................5
     2.2. Acronyms ...................................................6
  3. Architectural Layering of iSCSI and Datamover Layers ............7
  4. Design Overview .................................................9
  5. Architectural Concepts .........................................10
     5.1. iSCSI PDU Types ...........................................10
          5.1.1. iSCSI Data-Type PDUs ...............................10
          5.1.2. iSCSI Control-Type PDUs ............................11
     5.2. Data_Descriptor ...........................................11
     5.3. Connection_Handle .........................................11
     5.4. Operational Primitive .....................................12
     5.5. Transport Connection ......................................13
  6. Datamover Layer and Datamover Protocol .........................13
  7. Functional Overview ............................................14
     7.1. Startup ...................................................14
     7.2. Full Feature Phase ........................................15
     7.3. Wrap-up ...................................................15
  8. Operational Primitives Provided by the Datamover Layer .........16
     8.1. Send_Control ..............................................16
     8.2. Put_Data ..................................................17
     8.3. Get_Data ..................................................17
     8.4. Allocate_Connection_Resources .............................18
     8.5. Deallocate_Connection_Resources ...........................19
     8.6. Enable_Datamover ..........................................19
     8.7. Connection_Terminate ......................................20
     8.8. Notice_Key_Values .........................................20
     8.9. Deallocate_Task_Resources .................................20
  9. Operational Primitives Provided by the iSCSI Layer .............21
     9.1. Control_Notify ............................................21
     9.2. Connection_Terminate_Notify ...............................22
     9.3. Data_Completion_Notify ....................................22
     9.4. Data_ACK_Notify ...........................................23
  10. Datamover Interface (DI) ......................................23
     10.1. Overview .................................................23
     10.2. Interactions for Handling Asynchronous Notifications .....24
          10.2.1. Connection Termination ............................24
          10.2.2. Data Transfer Completion ..........................24
          10.2.3. Data Acknowledgement ..............................25
     10.3. Interactions for Sending an iSCSI PDU ....................25
          10.3.1. SCSI Command ......................................26
          10.3.2. SCSI Response .....................................26
          10.3.3. Task Management Function Request ..................26



Chadalapaka, et al.          Informational                      [Page 2]

RFC 5047                           DA                       October 2007


          10.3.4. Task Management Function Response .................27
          10.3.5. SCSI Data-Out and SCSI Data-In ....................27
          10.3.6. Ready To Transfer (R2T) ...........................28
          10.3.7. Asynchronous Message ..............................28
          10.3.8. Text Request ......................................28
          10.3.9. Text Response .....................................28
          10.3.10. Login Request ....................................29
          10.3.11. Login Response ...................................29
          10.3.12. Logout Command ...................................29
          10.3.13. Logout Response ..................................30
          10.3.14. SNACK Request ....................................30
          10.3.15. Reject ...........................................30
          10.3.16. NOP-Out ..........................................30
          10.3.17. NOP-In ...........................................30
     10.4. Interactions for Receiving an iSCSI PDU ..................31
          10.4.1. General Control-Type PDU Notification .............31
          10.4.2. SCSI Data Transfer PDUs ...........................31
          10.4.3. Login Request .....................................32
          10.4.4. Login Response ....................................32
  11. Security Considerations .......................................33
     11.1. Architectural Considerations .............................33
     11.2. Wire Protocol Considerations .............................33
  12. References ....................................................34
     12.1. Normative References .....................................34
     12.2. Informative References ...................................34
  Appendix A. Design Considerations and Examples ....................35
     A.1. Design Considerations for a Datamover Protocol ............35
     A.2. Examples of Datamover Interactions ........................35
  Acknowledgements ..................................................44

Table of Figures

  Figure 1. Datamover Architecture Diagram, with the RDMAP Example ...8
  Figure 2. A Successful iSCSI Login on Initiator ...................37
  Figure 3. A Successful iSCSI Login on Target ......................37
  Figure 4. A Failed iSCSI Login on Initiator .......................38
  Figure 5. A Failed iSCSI Login on Target ..........................38
  Figure 6. iSCSI Does Not Enable the Datamover .....................39
  Figure 7. A Normal iSCSI Connection Termination ...................40
  Figure 8. An Abnormal iSCSI Connection Termination ................40
  Figure 9. A SCSI Write Data Transfer ..............................41
  Figure 10. A SCSI Read Data Transfer ..............................42
  Figure 11. A SCSI Read Data Acknowledgement .......................43
  Figure 12. Task Resource Cleanup on Abort .........................44







Chadalapaka, et al.          Informational                      [Page 3]

RFC 5047                           DA                       October 2007


1.  Motivation

1.1.  Intent

  There are relatively new standard protocols that enable Remote Direct
  Memory Access (RDMA) and Remote Direct Data Placement (RDDP)
  technologies to work over IP fabrics.  The principal value
  proposition of these technologies is that they enable one end node to
  place data in the final intended buffer on the remote end node, thus
  eliminating the need for a receive path data copy that moves the data
  to its final location.  The data copy avoidance in turn eliminates
  unnecessary memory bandwidth consumption, substantially decreases the
  reassembly buffer size requirements, and preserves CPU cycles that
  would otherwise be spent in copying.

  The iSCSI specification [RFC3720] defines a very detailed data
  transfer model that employs SCSI Data-In PDUs, SCSI Data-Out PDUs,
  and R2T PDUs, in addition to the SCSI Command and SCSI Response PDUs
  that respectively create and conclude the task context for the data
  transfer.  In the traditional iSCSI model, the iSCSI protocol layer
  plays the central role in pacing the data transfer and carrying out
  the ensuing data transfer itself.  An alternative architecture would
  be for iSCSI to delegate a large part of this data transfer role to a
  separate protocol layer exclusively designed to move data, which in
  turn is possibly aided by a data movement and placement technology
  such as RDMA.

  If iSCSI were operating in such RDMA environments, iSCSI would be
  shielded from the low-level data transfer mechanics but would only be
  privy to the conclusion of the requested data transfer.  Thus, there
  would be an effective "off-loading" of the work that an iSCSI
  protocol layer is expected to perform, compared to today's iSCSI end
  nodes.  For such RDMA environments, it is highly desirable that there
  be a standard architecture to separate the data movement part of the
  iSCSI protocol definition from the rest of the iSCSI functionality.
  This architecture precisely defines what a Datamover layer is and
  also describes the model of interactions between the iSCSI layer and
  the Datamover layer (Section 6).  In order to satisfy this need, this
  document presents a Datamover Architecture for iSCSI (DA) and
  summarizes a reasonable model for interactions between the iSCSI
  layer and the Datamover layer for each of the iSCSI PDUs that are
  defined in [RFC3720].  Note that while DA is motivated by the advent
  of RDMA over TCP/IP technology, the architecture is not dependent on
  RDMA in its design.  DA is intended to be a generic architectural
  framework for allowing different types of Datamovers based on
  different types of RDMA and transport protocols.  Adoption of this
  model will help iSCSI proliferate into more environments.




Chadalapaka, et al.          Informational                      [Page 4]

RFC 5047                           DA                       October 2007


1.2.  Interpretation of Requirements

  This document introduces certain architectural abstractions and
  builds an abstract functional interface model between iSCSI and
  Datamover protocol layers based on those abstractions.  This
  architectural style is motivated by the following desires:

     a) Provide guidance to Datamover protocol designers with respect
        to the functional boundary between iSCSI and the Datamover
        protocols.  This guidance is critical since a significant part
        of the [RFC3720] protocol definition is left unchanged by DA
        architecture and the iSCSI notions from [RFC3720] (e.g., tasks,
        ITTs) are leveraged by the Datamover protocol.

     b) Aid existing iSCSI implementations to rapidly adapt to DA
        architecture, largely by leveraging the architectural
        abstractions into implementation constructs -- e.g., functions,
        APIs, modules.

  However, note that DA architecture does not intend to impose any
  implementation specifics per se.  When a DA architectural concept
  (e.g., Operational Primitive) is described as mandatory ("MUST") or
  recommended ("SHOULD") of a layer (iSCSI or Datamover) in this
  document, the intent is that an implementation respectively MUST or
  SHOULD produce the same protocol action as what the model describes.
  Specifically, no implementation compliance in terms of names, modules
  or API arguments etc. is implied by this Architecture by such use of
  [RFC2119] terms, only a functional compliance is sought.

2.  Definitions and Acronyms

2.1.  Definitions

  I/O Buffer - A buffer that is used in a SCSI Read or Write operation
     so that SCSI data may be sent from or received by the buffer.

  Datamover protocol  - A Datamover protocol is a data transfer wire
     protocol for iSCSI that meets the requirements stated in Section
     6.

  Datamover layer - A Datamover layer is a protocol layer within an end
     node that implements the Datamover protocol.

  Datamover-assisted - An iSCSI connection is said to be "Datamover-
     assisted" when a Datamover layer is enabled for moving control and
     data information on that iSCSI connection.





Chadalapaka, et al.          Informational                      [Page 5]

RFC 5047                           DA                       October 2007


  The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
  "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
  document are to be interpreted as described in [RFC2119].

2.2.  Acronyms

  Acronym        Definition
  -------------------------------------------------------------

  DA             Datamover Architecture for iSCSI

  DDP            Direct Data Placement Protocol

  DI             Datamover Interface

  IANA           Internet Assigned Numbers Authority

  IETF           Internet Engineering Task Force

  I/O            Input - Output

  IP             Internet Protocol

  iSCSI          Internet SCSI

  iSER           iSCSI Extensions for RDMA

  ITT            Initiator Task Tag

  LO             Leading Only

  MPA            Marker PDU Aligned Framing for TCP

  PDU            Protocol Data Unit

  RDDP           Remote Direct Data Placement

  RDMA           Remote Direct Memory Access

  R2T            Ready To Transfer

  R2TSN          Ready To Transfer Sequence Number

  RDMA           Remote Direct Memory Access

  RDMAP          Remote Direct Memory Access Protocol

  RFC            Request For Comments



Chadalapaka, et al.          Informational                      [Page 6]

RFC 5047                           DA                       October 2007


  SAM            SCSI Architecture Model

  SCSI           Small Computer Systems Interface

  SN             Sequence Number

  SNACK          Selective Negative Acknowledgment - also
                 Sequence Number Acknowledgement for Data

  TCP            Transmission Control Protocol

  TTT            Target Transfer Tag

3.  Architectural Layering of iSCSI and Datamover Layers

  Figure 1 illustrates an example of the architectural layering of
  iSCSI and Datamover layers, in conjunction with a TCP/IP
  implementation of RDMAP/DDP ([DDP]) layers in an iSCSI end node.
  Note that RDMAP/DDP/MPA and TCP protocol layers are shown here only
  as an example, and in reality, DA is completely oblivious to protocol
  layers below the Datamover layer.  The RDMAP/DDP/MPA protocol stack
  provides a generic transport service with direct data placement.
  There is no need to tailor the implementation of this protocol stack
  to the specific ULP to benefit from these services.



























Chadalapaka, et al.          Informational                      [Page 7]

RFC 5047                           DA                       October 2007


         Initiator stack                            Target stack

     +----------------+     SCSI application   +----------------+
     | SCSI Layer     |     protocols          | SCSI Layer     |
     +----------------+                        +----------------+
            ^                                          ^
            |                                          |
            v                                          v
     +----------------+     iSCSI protocol     +----------------+
     | iSCSI Layer    |    (excluding data     | iSCSI Layer    |
     +----------------+       movement)        +----------------+
            ^                                          ^
     --  ---+--  ---- DI (Datamover Interface)---  ----+---  ----
            v                                          v
     +----------------+      a Datamover       +----------------+
     | Datamover Layer|       protocol         | Datamover Layer|
     +----------------+                        +----------------+
            ^                                          ^
    +-------+----------+                     +---------+-----------+
    |       v          |                     |         v           |
    |+---------------+ |                     | +-----------------+ |
    || RDMAP/DDP/MPA | |    RDMAP/DDP/MPA    | | RDMAP/DDP/MPA   | |
    || Layers        | |    protocols        | | Layers          | |
    |+---------------+ |                     | +-----------------+ |
    |       ^          |                     |         ^           |
    |       | network  |                     |         | network   |
    |       | transport|                     |         | transport |
    |       v          |                     |         v           |
    |+---------------+ |                     | +----------------+  |
    || TCP Layer     | |    TCP protocol     | | TCP Layer      |  |
    |+---------------+ |                     | +----------------+  |
    |       ^          |                     |         ^           |
    +-------+----------+                     +---------+-----------+
            +------------------------------------------+

             Figure 1.  Datamover Architecture Diagram,
                       with the RDMAP Example

  The scope of this document is limited to:

     1. Defining the notion of a Datamover layer and a Datamover
        protocol (Section 6).

     2. Defining the functionality distribution between the iSCSI layer
        and the Datamover layer, along with the communication model
        between the two (Operational Primitives).





Chadalapaka, et al.          Informational                      [Page 8]

RFC 5047                           DA                       October 2007


     3. Modeling the interactions between the blocks labeled as "iSCSI
        Layer" and "Datamover Layer" in Figure 1 -- i.e., defining the
        interface labeled "DI" in the figure -- for each defined iSCSI
        PDU, based on the Operational Primitives.

4.  Design Overview

  This document discusses and defines a model for interactions between
  the iSCSI layer and a "Datamover layer" (see Section 6) operating
  within an iSCSI end node, presumably communicating with one or more
  iSCSI end nodes with similar layering.  The model for interactions
  for handling different iSCSI operations is called the "Datamover
  Interface" (DI, Section 10), while the architecture itself is called
  the "Datamover Architecture for iSCSI" (DA).  It is likely that the
  architecture will have implications on the Datamover wire protocols
  as DA places certain requirements and functionality expectations on
  the Datamover layer.  However, this document itself neither defines
  any new wire protocol for the Datamover layer, nor any potential
  modifications to the iSCSI wire protocol to employ the Datamover
  layer.  The scope of this document is strictly limited to specifying
  the architectural framework and the minimally required interactions
  that happen within an iSCSI end node to leverage the Datamover layer.

  The design ideas behind DA can be summarized as follows:

     1) DA defines an abstract functional interface model of the iSCSI
        layer's interactions with a Datamover layer below -- i.e., DA
        models the interactions between the logical "bottom" interface
        of iSCSI and the logical "top" interface of a Datamover.

     2) DA guides the wire protocol for a Datamover layer by defining
        the iSCSI knowledge that the Datamover layer may utilize in its
        protocol definition (as an example, this document completely
        limits the notion of "iSCSI session" to the iSCSI layer).

     3) DA is designed to allow implementation of the Datamover layer
        either in hardware or in software.

     4) DA is not a wire protocol spec, but an architecture that also
        models the interactions between iSCSI and Datamover layers
        operating within an iSCSI end node.

     5) DA by design seeks to model the iSCSI-Datamover interactions in
        a way that the modeling is independent of the specifics of
        either a particular iSCSI revision or an instantiation of a
        Datamover layer.





Chadalapaka, et al.          Informational                      [Page 9]

RFC 5047                           DA                       October 2007


     6) DA introduces and relies on the notion of a defined set of
        Operational Primitives (could be seen as entry point
        definitions in implementation terms) provided by each layer to
        the other to carry out the request-response interactions.

     7) DA is intended to allow Datamover protocol definitions with
        minimal changes to existing iSCSI implementations.

     8) DA is designed to allow the iSCSI layer to completely rely on
        the Datamover layer for all data transport needs.

     9) DA models the architecturally required minimal interactions
        between an operational iSCSI layer and a Datamover layer to
        realize the iSCSI-transparent data movement.  There may be
        several other interactions in a typical implementation in order
        to bootstrap a Datamover layer (or an iSCSI layer) into
        operation, but they are outside the scope of this document.

  Note that in summary, DA is architected to support many different
  Datamover protocols operating under the iSCSI layer.  One such
  example of a Datamover protocol is iSER [iSER].

5.  Architectural Concepts

5.1.  iSCSI PDU Types

  This section defines the iSCSI PDU classification terminology, as
  defined and used in this document.  Out of the set of legal iSCSI
  PDUs defined in [RFC3720], as we will see in Section 5.1.1, the iSCSI
  layer does not request a SCSI Data-Out PDU carrying solicited data
  for transmission across the Datamover Interface per this
  architecture.  For this reason, the SCSI Data-Out PDU carrying
  solicited data is excluded in the iSCSI PDU classification we
  introduce in this section (for SCSI Data-Out PDUs for unsolicited
  Data, see Section 5.1.2).  The rest of the legal iSCSI PDUs that may
  be exchanged across the Datamover Interface are defined to consist of
  two classes:

     1) iSCSI data-type PDUs

     2) iSCSI control-type PDUs

5.1.1.  iSCSI Data-Type PDUs

  An iSCSI data-type PDU is defined as an iSCSI PDU that causes data
  transfer, transparent to the remote iSCSI layer, to take place
  between the peer iSCSI nodes on a Full Feature Phase iSCSI
  connection.  A data-type PDU, when requested for transmission by the



Chadalapaka, et al.          Informational                     [Page 10]

RFC 5047                           DA                       October 2007


  sender iSCSI layer, results in the associated data transfer without
  the participation of the remote iSCSI layer, i.e., the PDU itself is
  not delivered as-is to the remote iSCSI layer.  The following iSCSI
  PDUs constitute the set of iSCSI data-type PDUs:

     1) SCSI Data-In PDU

     2) R2T PDU

  In an iSCSI end node structured as an iSCSI layer and a Datamover
  layer as defined in this document, the solicitation for Data-Out
  (i.e., R2T PDU) is not delivered to the initiator iSCSI layer, per
  the definition of an iSCSI data-type PDU.  The data transfer is
  instead performed via the mechanisms known to the Datamover layer
  (e.g., RDMA Read).  This in turn implies that a SCSI Data-Out PDU for
  solicited data is never requested for transmission across the
  Datamover Interface at the initiator.

5.1.2.  iSCSI Control-Type PDUs

  Any iSCSI PDU that is not an iSCSI data-type PDU and also not a
  solicited SCSI Data-Out PDU is defined as an iSCSI control-type PDU.
  Specifically, note that SCSI Data-Out PDUs for unsolicited Data are
  defined as iSCSI control-type PDUs.

5.2.  Data_Descriptor

  A Data_Descriptor is an information element that describes an
  iSCSI/SCSI data buffer, provided by the iSCSI layer to its local
  Datamover layer or provided by the Datamover layer to its local iSCSI
  layer for identifying the data associated respectively with the
  requested or completed operation.

  In implementation terms, a Data_Descriptor may be a scatter-gather
  list describing a local buffer, the exact structure of which is
  subject to the constraints imposed by the operating environment on
  the local iSCSI node.

5.3.  Connection_Handle

  A Connection_Handle is an information element that identifies the
  particular iSCSI connection for which an inbound or outbound iSCSI
  PDU is intended.  A connection handle is unique for a given pair of
  an iSCSI layer instance and a Datamover layer instance.  The
  Connection_Handle qualifier is used in all invocations of any
  Operational Primitive for connection identification.





Chadalapaka, et al.          Informational                     [Page 11]

RFC 5047                           DA                       October 2007


  Note that the Connection_Handle is conceptually different from the
  Connection Identifier (CID) defined by the iSCSI specification.
  While the CID is a unique identifier of an iSCSI connection within an
  iSCSI session, the uniqueness of the Connection_Handle extends to the
  entire iSCSI layer instance coupled with the Datamover layer
  instance, across possibly multiple iSCSI sessions.

  In implementation terms, a Connection_Handle could be an opaque
  identifier exchanged between the iSCSI layer and the Datamover layer
  at the connection login time.  One may also consider it to be similar
  in scope of uniqueness to a socket identifier.  The exact structure
  and modalities of exchange of a Connection_Handle between the two
  layers is implementation-specific.

5.4.  Operational Primitive

  An Operational Primitive, in this document, is an abstract functional
  interface procedure that requests another layer perform a specific
  action on the requestor's behalf or notifies the other layer of some
  event.  The Datamover Interface between an iSCSI layer instance and a
  Datamover layer instance within an iSCSI end node uses a set of
  Operational Primitives to define the functional interface between the
  two layers.  Note that not every invocation of an Operational
  Primitive may elicit a response from the requested layer.  This
  document describes the types of Operational Primitives that are
  implicitly required and provided by the iSCSI protocol layer as
  defined in [RFC3720], and the semantics of these Primitives.

  Note that ownership of buffers and data structures is likely to be
  exchanged between the iSCSI layer and its local Datamover layer in
  invoking the Operational Primitives defined in this architecture.
  The buffer management details, including how buffers are allocated
  and released, are implementation-specific and thus are outside the
  scope of this document.

  Each Operational Primitive invocation needs a certain "information
  context" (e.g., Connection_Handle) for performing the specific action
  being requested.  The required information context is described in
  this document by a listing of "qualifiers" on each invocation, in the
  style of function call arguments.  There is no specific
  implementation implied in this notation.  The "qualifiers" of any
  Operational Primitive invocation specified in this document thus
  represent the mandatory information context that the Operational
  Primitive invocation MUST consider in performing the action.  While
  the qualifiers are required, the method of realizing the qualifiers
  (passed synchronously with invocation, or retrieved from task
  context, or retrieved from shared memory etc.) is really up to the
  implementations.



Chadalapaka, et al.          Informational                     [Page 12]

RFC 5047                           DA                       October 2007


  When an Operational Primitive implementation is described as
  mandatory ("MUST") or recommended ("SHOULD") of a layer (iSCSI or
  Datamover) in this document, the intent is that an implementation
  respectively MUST or SHOULD produce the same protocol action as what
  the model describes.

5.5.  Transport Connection

  The term "Transport Connection" is used in this document as a generic
  term to represent the end-to-end logical connection as defined by the
  underlying reliable transport protocol.  For this document, all
  instances of Transport Connection refer to a TCP connection.

6.  Datamover Layer and Datamover Protocol

  This section introduces the notion of a "Datamover layer" and
  "Datamover protocol" as meant in this document, and defines the
  requirements on a Datamover protocol.

  A Datamover layer is the implementation component that realizes a
  Datamover protocol functionality in an iSCSI-capable end node in
  communicating with other iSCSI end nodes with similar capabilities.
  More specifically, a "Datamover layer" MUST provide the following
  functionality and the "Datamover protocol" MUST consist of the wire
  protocol required to realize the following functionality:

     1) guarantee that all the necessary data transfers take place when
        the local iSCSI layer requests transmitting a command (in order
        to complete a SCSI command, for an initiator), or
        sending/receiving an iSCSI data sequence (in order to complete
        part of a SCSI command for a target).

     2) transport an iSCSI control-type PDU as-is to the peer Datamover
        layer when requested to do so by the local iSCSI layer.

     3) provide notification and delivery to the iSCSI layer upon
        arrival of an iSCSI control-type PDU.

     4) provide an initiator-to-target data acknowledgement of SCSI
        read data back to the target iSCSI layer, when requested.

     5) provide an asynchronous notification upon completion of a
        requested data transfer operation that moved data without
        involving the iSCSI layer.

     6) place the SCSI data into the I/O buffers or pick up the SCSI
        data for transmission out of the data buffers that the iSCSI
        layer had requested to be used for a SCSI I/O.



Chadalapaka, et al.          Informational                     [Page 13]

RFC 5047                           DA                       October 2007


     7) provide an error-free (i.e., must have at least the same level
        of assurance of data integrity as the CRC32C iSCSI data
        digest), reliable, in-order delivery transport mechanism over
        IP networks in performing the data transfer, and asynchronously
        notify the iSCSI layer upon iSCSI connection termination.

  Note that this architecture expects that each compliant Datamover
  protocol will define the precise means of satisfying the requirements
  specified in this section.

  In order to meet the functional requirements listed in this section,
  certain Datamover protocols may require pre-posted buffers from the
  local iSCSI protocol layer via mechanisms outside the scope of this
  document.  In some implementations, the absence of such buffers may
  result in a connection failure.  Datamover protocols may also realize
  these functional requirements via methods not explicitly listed in
  this document.

7.  Functional Overview

  This section presents an overview of the functional interactions
  between the iSCSI layer and the Datamover layer as intended by this
  Architecture.

7.1.  Startup

  The iSCSI Login Phase on an iSCSI connection occurs as defined in
  [RFC3720].  The Architecture assumes that at the end of the Login
  Phase, both the initiator and target, if they had so decided,
  transition the connection to being Datamover-assisted.  The precise
  means of how an iSCSI initiator and an iSCSI target agree on having
  the connection Datamover-assisted is defined by the Datamover
  protocol.  The only architectural requirement is that all iSCSI
  interactions in the iSCSI Full Feature Phase MUST be Datamover-
  assisted subject to the prior agreement, meaning that the Datamover
  protocol is in the iSCSI-to-iSCSI communication path below the iSCSI
  layer on either side as shown in Figure 1.  DA defines the
  Enable_Datamover Operational Primitive (Section 8.6) to bring about
  this transition to a Datamover-assisted connection.

  The Architecture also assumes that the Datamover layer may require a
  certain number of opaque local resources for making a connection
  Datamover-assisted.  DA thus defines the
  Allocate_Connection_Resources Operational Primitive (Section 8.4) to
  model this interaction.  This Primitive is intended to be invoked on
  each side once the two sides decide (as previously noted) to have the
  connection be Datamover-assisted.  The expected sequence of Primitive
  invocations is depicted in Figures 2 and 3 in Section 13.2.  Figures



Chadalapaka, et al.          Informational                     [Page 14]

RFC 5047                           DA                       October 2007


  4, 5, and 6 illustrate how the Primitives may be employed to deal
  with various legal login outcomes.

7.2.  Full Feature Phase

  All iSCSI peer communication in the Full Feature Phase happens
  through the Datamover layers if the iSCSI connection is Datamover-
  assisted.  The Architecture assumes that a Datamover layer may
  require a certain number of opaque local resources for each new iSCSI
  task.  In the normal course of execution, these task-level resources
  in the Datamover layer are assumed to be transparently allocated on
  each task initiation and deallocated on the conclusion of each task
  as appropriate.  In exception scenarios however -- scenarios that do
  not yield a SCSI Response for each task such as ABORT TASK operation
  -- the Architecture assumes that the Datamover layer needs to be
  notified of the individual task terminations to aid its task-level
  resource management.  DA thus defines the Deallocate_Task_Resources
  Operational Primitive (Section 8.9) to model this task-resource
  management.  In specifying the ITT qualifier for the
  Deallocate_Task_Resources Primitive, the Architecture further assumes
  that the Datamover layer tracks its opaque task-level local resources
  by the iSCSI ITT.  DA also defines Send_Control (Section 8.1),
  Put_Data (Section 8.2), Get_Data (Section 8.3),
  Data_Completion_Notify (Section 9.3), Data_ACK_Notify (Section 9.4),
  and Control_Notify (Section 9.1) Operational Primitives to model the
  various Full Feature Phase interactions.

  Figures 9, 10, and 11 in Section 13.2 show some Full Feature Phase
  interactions -- SCSI Write task, SCSI Read task, and a SCSI Read Data
  acknowledgement, respectively.  Figure 12 in Section 13.2 illustrates
  how an ABORT TASK operation can be modeled leading to deterministic
  resource cleanup on the Datamover layer.

7.3.  Wrap-up

  Once an iSCSI connection becomes Datamover-assisted, the connection
  continues in that state until the end of the Full Feature Phase,
  i.e., the termination of the connection.  The Architecture assumes
  that when a connection is normally logged out, the Datamover layer
  needs to be notified so that its connection-level opaque resources
  (see Section 7.1) may be freed up.  DA thus defines a
  Connection_Terminate Operational Primitive (Section 8.7) to model
  this interaction.  The Architecture further assumes that when a
  connection termination happens without iSCSI layer's involvement
  (e.g., TCP RST), the Datamover layer is capable of locally cleaning
  up its task-level and connection-level resources before notifying the
  iSCSI layer of the fact.  DA thus defines the




Chadalapaka, et al.          Informational                     [Page 15]

RFC 5047                           DA                       October 2007


  Connection_Terminate_Notify Operational Primitive (Section 9.2) to
  model this interaction.

  Figures 7 and 8 in Section 13.2 illustrate the interactions between
  the iSCSI and Datamover layers in normal and unexpected connection
  termination scenarios.

8.  Operational Primitives Provided by the Datamover Layer

  While the iSCSI specification itself does not have a notion of
  Operational Primitives, any iSCSI layer implementing the iSCSI
  specification functionally requires the following Operational
  Primitives from its Datamover layer.  Thus, any Datamover protocol
  compliant with this architecture MUST implement the Operational
  Primitives described in this section.  These Operational Primitives
  are invoked by the iSCSI layer as appropriate.  Unless otherwise
  stated, all the following Operational Primitives may be used both on
  the initiator side and the target side.  In general programming
  terminology, this set of Operational Primitives may be construed as
  "down calls".

     1) Send_Control

     2) Put_Data

     3) Get_Data

     4) Allocate_Connection_Resources

     5) Deallocate_Connection_Resources

     6) Enable_Datamover

     7) Connection_Terminate

     8) Notice_Key_Values

     9) Deallocate_Task_Resources

8.1.  Send_Control

  Input qualifiers: Connection_Handle, iSCSI PDU-specific qualifiers

  Return Results: Not specified.

  An iSCSI layer requests that its local Datamover layer transmit an
  iSCSI control-type PDU to the peer iSCSI layer operating in the
  remote iSCSI node by this Operational Primitive.  The Datamover layer



Chadalapaka, et al.          Informational                     [Page 16]

RFC 5047                           DA                       October 2007


  performs the requested operation, and may add its own protocol
  headers in doing so.  The iSCSI layer MUST NOT invoke the
  Send_Control Operational Primitive on an iSCSI connection that is not
  yet Datamover-assisted.

  An initiator iSCSI layer requesting the transfer of a SCSI Command
  PDU or a target iSCSI layer requesting the transfer of a SCSI
  response PDU are examples of invoking the Send_Control Operational
  Primitive.  As Section 10.3.1 illustrates later on, the iSCSI PDU-
  specific qualifiers in this example are: BHS and AHS,
  DataDescriptorOut, DataDescriptorIn, ImmediateDataSize, and
  UnsolicitedDataSize.

8.2.  Put_Data

  Input qualifiers: Connection_Handle, contents of a SCSI Data-In PDU
  header, Data_Descriptor, Notify_Enable

  Return Results: Not specified.

  An iSCSI layer requests that its local Datamover layer transmit the
  data identified by the Data_Descriptor for the SCSI Data-In PDU to
  the peer iSCSI layer on the remote iSCSI node by this Operational
  Primitive.  The Datamover layer performs the operation by using its
  own protocol means, completely transparent to the remote iSCSI layer.
  The iSCSI layer MUST NOT invoke the Put_Data Operational Primitive on
  an iSCSI connection that is not yet Datamover-assisted.

  The Notify_Enable qualifier is used to request the local Datamover
  layer to generate or not generate the eventual local completion
  notification to the iSCSI layer for this Put_Data invocation.  For
  detailed semantics of this qualifier, see Section 9.3.

  A Put_Data Primitive may only be invoked by an iSCSI layer on the
  target to its local Datamover layer.

  A target iSCSI layer requesting the transfer of an iSCSI read data
  sequence (also known as a read burst) is an example of invoking the
  Put_Data Operational Primitive.

8.3.  Get_Data

  Input qualifiers: Connection_Handle, contents of an R2T PDU,
  Data_Descriptor, Notify_Enable

  Return Results: Not specified.





Chadalapaka, et al.          Informational                     [Page 17]

RFC 5047                           DA                       October 2007


  An iSCSI layer requests that its local Datamover layer retrieve
  certain data identified by the R2T PDU from the peer iSCSI layer on
  the remote iSCSI node and place it into the buffer identified by the
  Data_Descriptor by invoking this Operational Primitive.  The
  Datamover layer performs the operation by using its own protocol
  means, completely transparent to the remote iSCSI layer.  The iSCSI
  layer MUST NOT invoke the Get_Data Operational Primitive on an iSCSI
  connection that is not yet Datamover-assisted.

  The Notify_Enable qualifier is used to request that the local
  Datamover layer generate or not generate the eventual local
  completion notification to the iSCSI layer for this Get_Data
  invocation.  For detailed semantics of this qualifier, see Section
  9.3.

  A Get_Data Primitive may only be invoked by an iSCSI layer on the
  target to its local Datamover layer.

  A target iSCSI layer requesting the transfer of an iSCSI write data
  sequence (also known as a write burst) is an example of invoking the
  Get_Data Operational Primitive.

8.4.  Allocate_Connection_Resources

  Input qualifiers: Connection_Handle[, Resource_Descriptor ]

  Return Results: Status.

  By invoking this Operational Primitive, an iSCSI layer requests that
  its local Datamover layer perform all the Datamover-specific resource
  allocations required for the Full Feature Phase of an iSCSI
  connection.  The Connection_Handle identifies the connection for
  which the iSCSI layer is requesting resources to be allocated.
  Allocation of these resources is a step towards eventually
  transitioning the connection to become a Datamover-assisted iSCSI
  connection.  Note that the Datamover layer however does not allocate
  any Datamover-specific task-level resources upon invocation of this
  Primitive.

  An iSCSI layer, in addition, optionally specifies the
  implementation-specific resource requirements for the iSCSI
  connection to the Datamover layer by passing an input qualifier
  called Resource_Descriptor.  The exact structure of a
  Resource_Descriptor is implementation-dependent, and hence
  structurally opaque to DA.

  A return result of Status=success means that the
  Allocate_Connection_Resources invocation corresponding to that



Chadalapaka, et al.          Informational                     [Page 18]

RFC 5047                           DA                       October 2007


  Connection_Handle succeeded.  If an Allocate_Connection_Resources
  invocation is made for a Connection_Handle for which an earlier
  invocation succeeded, the return Status must be success and the
  request will be ignored by the Datamover layer.  A return result of
  Status=failure means that the Allocate_Connection_Resources
  invocation corresponding to that Connection_Handle failed.  There
  MUST NOT be more than one Allocate_Connection_Resources Primitive
  invocation outstanding for a given Connection_Handle at any time.

  The iSCSI layer must invoke the Allocate_Connection_Resources
  Primitive before the invocation of the Enable_Datamover Primitive.

8.5.  Deallocate_Connection_Resources

  Input qualifiers: Connection_Handle

  Return Results: Not specified.

  By invoking this Operational Primitive, an iSCSI layer requests that
  its local Datamover layer deallocate all the Datamover-specific
  resources that may have been allocated earlier for the Transport
  Connection identified by the Connection_Handle.  The iSCSI layer may
  invoke this Operational Primitive when the Datamover-specific
  resources associated with the Connection_Handle are no longer
  necessary (such as the Login failure of the corresponding iSCSI
  connection).

8.6.  Enable_Datamover

  Input qualifiers: Connection_Handle, Transport_Connection_Descriptor
  [, Final_Login_Response_PDU]

  Return Results: Not specified.

  By invoking this Operational Primitive, an iSCSI layer requests that
  its local Datamover layer assist all further iSCSI exchanges on the
  iSCSI connection (i.e., to make the connection Datamover-assisted)
  identified by the Connection_Handle, for which the Datamover-specific
  resource allocation was earlier made.  The iSCSI layer MUST NOT
  invoke the Enable_Datamover Operational Primitive for an iSCSI
  connection unless there is a corresponding prior resource allocation.

  The Final_Login_Response_PDU input qualifier is applicable only for a
  target, and contains the final Login Response that concludes the
  iSCSI Login Phase and which must be sent as a byte stream as expected
  by the initiator iSCSI layer.  When this qualifier is used, the
  target-Datamover layer MUST transmit this final Login Response before
  Datamover assistance is enabled for the Transport Connection.



Chadalapaka, et al.          Informational                     [Page 19]

RFC 5047                           DA                       October 2007


  The iSCSI layer identifies the specific Transport Connection
  associated with the Connection_Handle to the Datamover layer by
  specifying the Transport_Connection_Descriptor.  The exact structure
  of this Descriptor is implementation-dependent.

8.7.  Connection_Terminate

  Input qualifiers: Connection_Handle

  Return Results: Not specified.

  By invoking this Operational Primitive, an iSCSI layer requests that
  its local Datamover layer terminate the Transport Connection and
  deallocate all the connection and task resources associated with the
  Connection_Handle.  When this Operational Primitive invocation
  returns to the iSCSI layer, the iSCSI layer may assume the full
  ownership of all the iSCSI-level resources, e.g., I/O Buffers,
  associated with the connection.  This Operational Primitive may be
  invoked only with a valid Connection_Handle, and the Transport
  Connection associated with the Connection_Handle must already be
  Datamover-assisted.

8.8.  Notice_Key_Values

  Input qualifiers: Connection_Handle, Number of keys, a list of Key-
  Value pairs.

  Return Results: Not specified.

  By invoking this Operational Primitive, an iSCSI layer requests that
  its local Datamover layer take note of the negotiated values of the
  listed keys for the Transport Connection.  This Operational Primitive
  may be invoked only with a valid Connection_Handle, and the Key-Value
  pairs MUST be the current values that were successfully agreed upon
  by the iSCSI peers for the connection.  The Datamover layer may use
  the values of the keys to aid the Datamover operation as it deems
  appropriate.  The specific keys to be passed as input qualifiers and
  the point(s) in time this Operational Primitive is invoked are
  implementation-dependent.

8.9.  Deallocate_Task_Resources

  Input qualifiers: Connection_Handle, ITT

  Return Results: Not specified.

  By invoking this Operational Primitive, an iSCSI layer requests that
  its local Datamover layer deallocate all Datamover-specific resources



Chadalapaka, et al.          Informational                     [Page 20]

RFC 5047                           DA                       October 2007


  that earlier may have been allocated for the task identified by the
  ITT qualifier.  The iSCSI layer uses this Operational Primitive
  during exception processing when one or more active tasks are to be
  terminated without corresponding SCSI Response PDUs.  This Primitive
  MUST be invoked for each active task terminated without a SCSI
  Response PDU.  This Primitive MUST NOT be invoked by the iSCSI layer
  when a SCSI Response PDU normally concludes a task.  When a SCSI
  Response PDU normally concludes a task (even if the SCSI Status was
  not a success), the Datamover layer is assumed to have automatically
  deallocated all Datamover-specific task resources for that task.
  Refer to Section 7.2 for a related discussion on the Architectural
  assumptions on the task-level Datamover resource management,
  especially with respect to when the resources are assumed to be
  allocated.

9.  Operational Primitives Provided by the iSCSI Layer

  While the iSCSI specification itself does not have a notion of
  Operational Primitives, any iSCSI layer implementing the iSCSI
  specification would have to provide the following Operational
  Primitives to its local Datamover layer.  Thus, any iSCSI protocol
  implementation compliant with this architecture MUST implement the
  Operational Primitives described in this section.  These Operational
  Primitives are invoked by the Datamover layer as appropriate and when
  the iSCSI connection is Datamover-assisted.  Unless otherwise stated,
  all the following Operational Primitives may be used both on the
  initiator side and the target side.  In general programming
  terminology, this set of Operational Primitives may be construed as
  "up calls".

     1) Control_Notify

     2) Connection_Terminate_Notify

     3) Data_Completion_Notify

     4) Data_ACK_Notify

9.1.  Control_Notify

  Input qualifiers: Connection_Handle, an iSCSI control-type PDU.

  Return Results: Not specified.

  A Datamover layer notifies its local iSCSI layer, via this
  Operational Primitive, of the arrival of an iSCSI control-type PDU
  from the peer Datamover layer on the remote iSCSI node.  The iSCSI
  layer processes the control-type PDU as defined in [RFC3720].



Chadalapaka, et al.          Informational                     [Page 21]

RFC 5047                           DA                       October 2007


  A target iSCSI layer being notified of the arrival of a SCSI command
  is an example of invoking the Control_Notify Operational Primitive.

  Note that implementations may choose to describe the "iSCSI control-
  type PDU" qualifier in this notification using a Data_Descriptor
  (Section 5.2) and not necessarily one contiguous buffer.

9.2.  Connection_Terminate_Notify

  Input qualifiers: Connection_Handle

  Return Results: Not specified.

  A Datamover layer notifies its local iSCSI layer on an unsolicited
  termination or failure of an iSCSI connection providing the
  Connection_Handle associated with the iSCSI Connection.  The iSCSI
  layer MUST consider the Connection_Handle to be invalid upon being so
  notified.  The iSCSI layer processes the connection termination as
  defined in [RFC3720].  The Datamover layer MUST deallocate the
  connection and task resources associated with the terminated
  connection before notifying the iSCSI layer of the termination via
  this Operational Primitive.

  A target iSCSI layer is notified of an ungraceful connection
  termination by the Datamover layer when the underlying Transport
  Connection is torn down.  Such a Connection_Terminate_Notify
  Operational Primitive may be triggered, for example, by a TCP RESET
  in cases where the underlying Transport Connection uses TCP.

9.3.  Data_Completion_Notify

  Input qualifiers: Connection_Handle, ITT, SN

  Return Results: Not specified.

  A Datamover layer notifies its local iSCSI layer on completing the
  retrieval of the data or upon sending the data, as requested in a
  prior iSCSI data-type PDU, from/to the peer Datamover layer on the
  remote iSCSI node via this Operational Primitive.  The iSCSI layer
  processes the operation as defined in [RFC3720].

  SN may be either the DataSN associated with the SCSI Data-In PDU or
  R2TSN associated with the R2T PDU depending on the SCSI operation.
  Note that, for targets, a TTT (see [RFC3720]) could have been
  specified instead of an SN.  However, the considered choice was to
  leave the SN to be the qualifier for two reasons -- a) it is generic
  and applicable to initiators and targets as well as Data-In and
  Data-Out, and b) having both SN and TTT qualifiers for the



Chadalapaka, et al.          Informational                     [Page 22]

RFC 5047                           DA                       October 2007


  notification is considered onerous on the Datamover layer, in terms
  of state maintenance for each completion notification.  The
  implication of this choice is that iSCSI target implementations will
  have to adapt to using the ITT-SN tuple in associating the solicited
  data to the appropriate task, rather than the ITT-TTT tuple for doing
  the same.

  If Notify_Enable is set in either a Put_Data or a Get_Data
  invocation, the Datamover layer MUST invoke the
  Data_Completion_Notify Operational Primitive upon completing that
  requested data transfer.  If the Notify_Enable was cleared in either
  a Put_Data or a Get_Data invocation, the Datamover layer MUST NOT
  invoke the Data_Completion_Notify Operational Primitive upon
  completing that requested data transfer.

  A Data_Completion_Notify invocation serves to notify the iSCSI layer
  of the Put_Data or Get_Data completion, respectively.  As earlier
  noted in Sections 8.2 and 8.3, specific Datamover protocol
  definitions may restrict the usage scope of Put_Data and Get_Data,
  and thus implicitly the usage scope of Data_Completion_Notify.

  A target iSCSI layer being notified of the retrieval of a write data
  sequence is an example of invoking the Data_Completion_Notify
  Operational Primitive.

9.4.  Data_ACK_Notify

  Input qualifiers: Connection_Handle, ITT, DataSN

  Return Results: Not specified.

  A target Datamover layer notifies its local iSCSI layer of the
  arrival of a previously requested data acknowledgement from the peer
  Datamover layer on the remote (initiator) iSCSI node via this
  Operational Primitive.  The iSCSI layer processes the data
  acknowledgement notification as defined in [RFC3720].

  A target iSCSI layer being notified of the arrival of a data
  acknowledgement for a certain SCSI Read data PDU is the only example
  of invoking the Data_ACK_Notify Operational Primitive.

10.  Datamover Interface (DI)

10.1.  Overview

  This section describes the model of interactions between iSCSI and
  Datamover layers when the iSCSI connection is Datamover-assisted so
  the iSCSI layer may carry out the following:



Chadalapaka, et al.          Informational                     [Page 23]

RFC 5047                           DA                       October 2007


     -  send iSCSI data-type PDUs and exchange iSCSI control-type PDUs,
        and

     -  handle asynchronous notifications such as completion of data
        sequence transfer and connection failure.

  This chapter relies on the notion of Operational Primitives (Section
  5.4) to define DI.

10.2.  Interactions for Handling Asynchronous Notifications

10.2.1.  Connection Termination

  As stated in Section 9.2, the Datamover layer notifies the iSCSI
  layer of a failed or terminated connection via the
  Connection_Terminate_Notify Operational Primitive.  The iSCSI layer
  MUST consider the connection unusable upon the invocation of this
  Primitive and handle the connection termination as specified in
  [RFC3720].

10.2.2.  Data Transfer Completion

  As stated in Section 9.3, the Datamover layer notifies the iSCSI
  layer of a completed data transfer operation via the
  Data_Completion_Notify Operational Primitive.  The iSCSI layer
  processes the transfer completion as specified in [RFC3720].

10.2.2.1.  Completion of a Requested SCSI Data Transfer

  To notify the iSCSI layer of the completion of a requested iSCSI
  data-type PDU transfer, the Datamover layer uses the
  Data_Completion_Notify Operational Primitive with the following input
  qualifiers.

     a) Connection_Handle.

     b) ITT: Initiator Task Tag semantics as defined in [RFC3720].

     c) SN: DataSN for a SCSI Data-in/Data-out PDU, and R2TSN for an
        iSCSI R2T PDU.  The semantics for both types of sequence
        numbers are as defined in [RFC3720].

  The rationale for choosing SN is explained in Section 9.3.

  Every invocation of the Data_Completion_Notify Operational Primitive
  MUST be preceded by an invocation of the Put_Data or Get_Data
  Operational Primitive with the Notify_Enable qualifier set by the
  iSCSI layer at an earlier point in time.



Chadalapaka, et al.          Informational                     [Page 24]

RFC 5047                           DA                       October 2007


10.2.3.  Data Acknowledgement

  [RFC3720] allows the iSCSI targets to optionally solicit data
  acknowledgement from the initiator for one or more Data-In PDUs, via
  setting of the A-bit on a Data-In PDU.  The Data_ACK_Notify
  Operational Primitive with the following input qualifiers is used by
  the target Datamover layer to notify the local iSCSI layer of the
  arrival of data acknowledgement of a previously solicited iSCSI read
  data acknowledgement.  This Operational Primitive thus is applicable
  only to iSCSI targets.

     a) Connection_Handle.

     b) ITT: Initiator Task Tag semantics as defined in [RFC3720].

     c) DataSN: of the next SCSI Data-In PDU, which immediately follows
        the SCSI Data-In PDU with the A-bit set to which this
        notification corresponds, with semantics as defined in
        [RFC3720].

  Every invocation of the Data_ACK_Notify Operational Primitive MUST be
  preceded by an invocation of the Put_Data Operational Primitive by
  the iSCSI target layer with the A-bit set to 1 at an earlier point in
  time.

10.3.  Interactions for Sending an iSCSI PDU

  This section discusses the model of interactions for sending each of
  the iSCSI PDUs defined in [RFC3720].  A Connection_Handle (see
  Section 5.3) is assumed to qualify each of these interactions so that
  the Datamover layer can route it to the appropriate Transport
  Connection.  The qualifying Connection_Handle is not explicitly
  listed in the subsequent sections.

  Note that the defined list of input qualifiers represents the
  semantically required set for the Datamover layer to consider in
  implementing the Primitive in each interaction described in this
  section (see Section 5.4 for an elaboration).  Implementations may
  choose to deduce the qualifiers in ways that are optimized for the
  implementation specifics.  Two examples of this are:

     1. For SCSI command (Section 10.3.1), deducing the
        ImmediateDataSize input qualifier from the DataSegmentLength
        field of the SCSI Command PDU.

     2. For SCSI Data-Out (Section 10.3.5.1), deducing the
        DataDescriptorOut input qualifier from the associated SCSI
        command invocation qualifiers (assuming such state is



Chadalapaka, et al.          Informational                     [Page 25]

RFC 5047                           DA                       October 2007


        maintained) in conjunction with BHS fields of the SCSI Data-Out
        PDU.

10.3.1.  SCSI Command

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a SCSI Command
  PDU.

     a) BHS and AHS, if any, of the SCSI Command PDU as defined in
        [RFC3720].

     b) DataDescriptorOut: that defines the I/O Buffer meant for Data-
        Out for the entire command, in the case of a write or
        bidirectional command.

     c) DataDescriptorIn: that defines the I/O Buffer meant for Data-In
        for the entire command, in the case of a read or bidirectional
        command.

     d) ImmediateDataSize: that defines the number of octets of
        immediate unsolicited data for a write/bidirectional command.

     e) UnsolicitedDataSize: that defines the number of octets of
        immediate and non-immediate unsolicited data for a
        write/bidirectional command.

10.3.2.  SCSI Response

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a SCSI Response
  PDU.

     a) BHS of the SCSI Response PDU as defined in [RFC3720].

     b) DataDescriptorStatus: that defines the iSCSI buffer that
        contains the sense and response information for the command.

10.3.3.  Task Management Function Request

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a Task
  Management Function Request PDU.

     a) BHS of the Task Management Function Request PDU as defined in
        [RFC3720].





Chadalapaka, et al.          Informational                     [Page 26]

RFC 5047                           DA                       October 2007


     b) DataDescriptorOut: that defines the I/O Buffer meant for Data-
        Out for the entire command, in the case of a write or
        bidirectional command.  (Only valid if Function="TASK REASSIGN"
        - [RFC3720].)

     c) DataDescriptorIn: that defines the I/O Buffer meant for Data-In
        for the entire command, in the case of a read or bidirectional
        command.  (Only valid if Function="TASK REASSIGN" - [RFC3720].)

10.3.4.  Task Management Function Response

  The Send_Control Operational Primitive with the following input
  qualifier is used for requesting the transmission of a Task
  Management Function Response PDU.

     a) BHS of the Task Management Function Response PDU as defined in
        [RFC3720].

10.3.5.  SCSI Data-Out and SCSI Data-In

10.3.5.1.  SCSI Data-Out

  The Send_Control Operational Primitive with the following input
  qualifiers is used by the initiator iSCSI layer for requesting the
  transmission of a SCSI Data-Out PDU carrying the non-immediate
  unsolicited data.

     a) BHS of the SCSI Data-Out PDU as defined in [RFC3720].

     b) DataDescriptorOut: that defines the I/O Buffer with the Data-
        Out to be carried in the iSCSI data segment of the PDU.

10.3.5.2.  SCSI Data-In

  The Put_Data Operational Primitive with the following input
  qualifiers is used by the target iSCSI layer for requesting the
  transmission of the data carried by a SCSI Data-In PDU.

     a) BHS of the SCSI Data-In PDU as defined in [RFC3720].

     b) DataDescriptorIn: that defines the I/O Buffer with the Data-In
        being requested for transmission.









Chadalapaka, et al.          Informational                     [Page 27]

RFC 5047                           DA                       October 2007


10.3.6.  Ready To Transfer (R2T)

  The Get_Data Operational Primitive with the following input
  qualifiers is used by the target iSCSI layer for requesting the
  retrieval of the data as specified by the semantic content of an R2T
  PDU.

     a) BHS of the Ready To Transfer PDU as defined in [RFC3720].

     b) DataDescriptorOut: that defines the I/O Buffer for the Data-Out
        being requested for retrieval.

10.3.7.  Asynchronous Message

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of an Asynchronous
  Message PDU.

     a) BHS of the Asynchronous Message PDU as defined in [RFC3720].

     b) DataDescriptorSense: that defines an iSCSI buffer that contains
        the sense and iSCSI Event information.

10.3.8.  Text Request

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a Text Request
  PDU.

     a) BHS of the Text Request PDU as defined in [RFC3720].

     b) DataDescriptorTextOut: that defines the iSCSI Text Request
        buffer.

10.3.9.  Text Response

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a Text Response
  PDU.

     a) BHS of the Text Response PDU as defined in [RFC3720].

     b) DataDescriptorTextIn: that defines the iSCSI Text Response
        buffer.







Chadalapaka, et al.          Informational                     [Page 28]

RFC 5047                           DA                       October 2007


10.3.10.  Login Request

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a Login Request
  PDU.

     a) BHS of the Login Request PDU as defined in [RFC3720].

     b) DataDescriptorLoginRequest: that defines the iSCSI Login
        Request buffer.

  Note that specific Datamover protocols may choose to disallow the
  standard DA Primitives from being used for the iSCSI Login Phase.
  When used in conjunction with such Datamover protocols, an attempt to
  send a Login Request via the Send_Control Operational Primitive
  invocation is clearly an error scenario, as the Login Request PDU is
  being sent while the connection is in the iSCSI Full Feature Phase.
  It is outside the scope of this document to specify the resulting
  implementation behavior in this case -- [RFC3720] already defines the
  error handling for this error scenario.

10.3.11.  Login Response

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a Login
  Response PDU.

     a) BHS of the Login Response PDU as defined in [RFC3720].

     b) DataDescriptorLoginResponse: that defines the iSCSI Login
        Response buffer.

  Note that specific Datamover protocols may choose to disallow the
  standard DA Primitives from being used for the iSCSI Login Phase.
  When used in conjunction with such Datamover protocols, an attempt to
  send a Login Response via the Send_Control Operational Primitive
  invocation is clearly an error scenario, as the Login Response PDU is
  being sent while in the iSCSI Full Feature Phase.  It is outside the
  scope of this document to specify the resulting implementation
  behavior in this case -- [RFC3720] already defines the error handling
  for this error scenario.

10.3.12.  Logout Command

  The Send_Control Operational Primitive with the following input
  qualifier is used for requesting the transmission of a Logout Command
  PDU.




Chadalapaka, et al.          Informational                     [Page 29]

RFC 5047                           DA                       October 2007


     a) BHS of the Logout Command PDU as defined in [RFC3720].

10.3.13.  Logout Response

  The Send_Control Operational Primitive with the following input
  qualifier is used for requesting the transmission of a Logout
  Response PDU.

     a) BHS of the Logout Response PDU as defined in [RFC3720].

10.3.14.  SNACK Request

  The Send_Control Operational Primitive with the following input
  qualifier is used for requesting the transmission of a SNACK Request
  PDU.

     a) BHS of the SNACK Request PDU as defined in [RFC3720].

10.3.15.  Reject

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a Reject PDU.

     a) BHS of the Reject PDU as defined in [RFC3720].

     b) DataDescriptorReject: that defines the iSCSI Reject buffer.

10.3.16.  NOP-Out

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a NOP-Out PDU.

     a) BHS of the NOP-Out PDU as defined in [RFC3720].

     b) DataDescriptorNOPOut: that defines the iSCSI Ping data buffer.

10.3.17.  NOP-In

  The Send_Control Operational Primitive with the following input
  qualifiers is used for requesting the transmission of a NOP-In PDU.

     a) BHS of the NOP-In PDU as defined in [RFC3720].

     b) DataDescriptorNOPIn: that defines the iSCSI Return Ping data
        buffer.






Chadalapaka, et al.          Informational                     [Page 30]

RFC 5047                           DA                       October 2007


10.4.  Interactions for Receiving an iSCSI PDU

  The only PDUs that are received by an iSCSI layer operating on a
  Datamover layer are the iSCSI control-type PDUs.  The Datamover layer
  delivers the iSCSI control-type PDUs as they arrive, qualifying each
  with the Connection_Handle (see Section 5.3) that identifies the
  iSCSI connection for which the PDU is meant.  The subsequent
  processing of the iSCSI control-type PDUs proceeds as defined in
  [RFC3720].

10.4.1.  General Control-Type PDU Notification

  This sub-section describes the general mechanics applicable to
  several control-type PDUs.  The following sub-sections note
  additional considerations for control-type PDUs that are not covered
  in this sub-section.

  The Control_Notify Operational Primitive is used to notify the iSCSI
  layer of the arrival of the following iSCSI control-type PDUs: SCSI
  Command, SCSI Response, Task Management Function Request, Task
  Management Function Response, Asynchronous Message, Text Request,
  Text Response, Logout Command, Logout Response, SNACK, Reject, NOP-
  Out, NOP-In.

10.4.2.  SCSI Data Transfer PDUs

10.4.2.1.  SCSI Data-Out

  The Control_Notify Operational Primitive is used to notify the iSCSI
  layer of the arrival of a SCSI Data-Out PDU carrying the non-
  immediate unsolicited data.  Note however that the solicited SCSI
  Data-Out arriving on the target does not cause a notification to the
  iSCSI layer using the Control_Notify Primitive because the solicited
  SCSI Data-Out was not sent by the initiator iSCSI layer as control-
  type PDUs.

10.4.2.2.  SCSI Data-In

  The Datamover layer does not notify the iSCSI layer of the arrival of
  the SCSI Data-in at the initiator, because SCSI Data-in is an iSCSI
  data-type PDU (see section 5.1).  The iSCSI layer at the initiator
  however may infer the arrival of the SCSI Data-In when it receives a
  subsequent notification of the SCSI Response PDU via a Control_Notify
  invocation.

  While this document does not contemplate the possibility of a Data-In
  PDU being received at the initiator iSCSI layer, specific Datamover
  protocols may define how to deal with an unexpected inbound SCSI



Chadalapaka, et al.          Informational                     [Page 31]

RFC 5047                           DA                       October 2007


  Data-In PDU that may result in the initiator iSCSI layer receiving
  the Data-In PDU.  This document leaves the details of handling this
  error scenario to the specific Datamover protocols, so each may
  define the appropriate error handling specific to the Datamover
  environment.

10.4.2.3.  Ready To Transfer (R2T)

  Because an R2T PDU is an iSCSI data-type PDU (see Section 5.1) that
  is not delivered as-is to the initiator iSCSI layer, the Datamover
  layer does not notify the iSCSI layer of the arrival of an R2T PDU.
  When an iSCSI node sends an R2T PDU to its local Datamover layer, the
  local and remote Datamover layers transparently bring about the data
  transfer requested by the R2T PDU.

  While this document does not contemplate the possibility of an R2T
  PDU being received at the initiator iSCSI layer, specific Datamover
  protocols may define how to deal with an unexpected inbound R2T PDU
  that may result in the initiator iSCSI layer receiving the R2T PDU.
  This document leaves the details of handling this error scenario to
  the specific Datamover protocols, so each may define the appropriate
  error handling specific to the Datamover environment.

10.4.3.  Login Request

  The Control_Notify Operational Primitive is used for notifying the
  target iSCSI layer of the arrival of a Login Request PDU.  Note that
  specific Datamover protocols may choose to disallow the standard DA
  Primitives from being used for the iSCSI Login Phase.  When used in
  conjunction with such Datamover protocols, the arrival of a Login
  Request necessitating the Control_Notify Operational Primitive
  invocation is clearly an error scenario, as the Login Request PDU is
  arriving in the iSCSI Full Feature Phase.  It is outside the scope of
  this document to specify the resulting implementation behavior in
  this case -- [RFC3720] already defines the error handling in this
  error scenario.

10.4.4.  Login Response

  The Control_Notify Operational Primitive is used to notify the
  initiator iSCSI layer of the arrival of a Login Response PDU.  Note
  that specific Datamover protocols may choose to disallow the standard
  DA Primitives from being used for the iSCSI Login Phase.  When used
  in conjunction with such Datamover protocols, the arrival of a Login
  Response necessitating the Control_Notify Operational Primitive
  invocation is clearly an error scenario, as the Login Response PDU is
  arriving in the iSCSI Full Feature Phase.  It is outside the scope of
  this document to specify the resulting implementation behavior in



Chadalapaka, et al.          Informational                     [Page 32]

RFC 5047                           DA                       October 2007


  this case -- [RFC3720] already defines the error handling in this
  error scenario.

11.  Security Considerations

11.1.  Architectural Considerations

  DA enables compliant iSCSI implementations to realize a control and
  data separation in the way they interact with their Datamover
  protocols.  Note however that this separation does not imply a
  separation in transport mediums between control traffic and data
  traffic -- the basic iSCSI architecture with respect to tasks and PDU
  relationships to tasks remains unchanged.  [RFC3720] defines several
  MUST requirements on ordering relationships across control and data
  for a given task besides a mandatory deterministic task allegiance
  model -- DA does not change this basic architecture (DA has a
  normative reference to [RFC3720]) for allow any additional
  flexibility in compliance in this area.  To summarize, sending bulk
  data transfers (prompted by Put_Data and Get_Data Primitive
  invocations) on a different transport medium would be as ill-advised
  as sending just the Data-Out/Data-In PDUs on a different TCP
  connection in RFC 3720-based iSCSI implementations.  Consequently,
  all the iSCSI-related security text in [RFC3723] is directly
  applicable to a DA-enabled iSCSI implementation.

  Another area with security implications is the Datamover connection
  resource management model, which DA defines -- particularly the
  Allocate_Connection_Resources Primitive.  An inadvertent realization
  of this model could leave an iSCSI implementation exposed to denial-
  of-service attacks.  As Figures 2 and 3 in Section 13.2 illustrate,
  the most effective countermeasure to this potential attack consists
  of performing the Datamover resource allocation when the iSCSI layer
  is sufficiently far along in the iSCSI Login Phase that it is
  reasonably certain that the peer side is not an attacker.  In
  particular, if the Login Phase includes a SecurityNegotiation stage,
  an iSCSI end node MUST defer the Datamover connection resource
  allocation (i.e., invoking the Allocate_Connection_Resources
  Primitive) to the LoginOperationalNegotiation stage [RFC3720] so that
  the resource allocation happens post-authentication.  This
  considerably minimizes the potential for a denial-of service attack.

11.2.  Wire Protocol Considerations

  In view of the fact that the DA architecture itself does not define
  any new wire protocol or propose modifications to the existing
  protocols, there are no additional wire protocol security
  considerations in employing DA itself.  However, a DA-compliant iSCSI
  implementation MUST comply with all the iSCSI-related requirements



Chadalapaka, et al.          Informational                     [Page 33]

RFC 5047                           DA                       October 2007


  stipulated in [RFC3723] and [RFC3720].  Note further that in
  realizing DA, each Datamover protocol must define and elaborate as
  appropriate on any additional security considerations resulting from
  the use of that Datamover protocol.

  All Datamover protocol designers are strongly recommended to refer to
  [RDDPSEC] for the types of security issues to consider.  While
  [RDDPSEC] elaborates on the security considerations applicable to an
  RDDP-based Datamover [iSER], the document is representative of the
  type of analysis of resource exhaustion and the application of
  countermeasures that need to be done for any Datamover protocol.

12.  References

12.1.  Normative References

  [RFC3720] Satran, J., Meth, K., Sapuntzakis, C., Chadalapaka, M., and
            E. Zeidner, "Internet Small Computer Systems Interface
            (iSCSI)", RFC 3720, April 2004.

  [RFC3723] Aboba, B., Tseng, J., Walker, J., Rangan, V., and F.
            Travostino, "Securing Block Storage Protocols over IP", RFC
            3723, April 2004.

  [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
            Requirement Levels", BCP 14, RFC 2119, March 1997.

12.2.  Informative References

  [DDP]     Shah, H., Pinkerton, J., Recio, R., and P. Culley, "Direct
            Data Placement over Reliable Transports", RFC 5041, October
            2007.

  [iSER]    Ko, M., Chadalapaka, M., Hufferd, J., Elzur, U., Shah, H.,
            and P. Thaler, "Internet Small Computer System Interface
            (iSCSI) Extensions for Remote Direct Memory Access (RDMA)",
            RFC 5046, October 2007.

  [RDDPSEC] Pinkerton, J. and E. Deleganes, "Direct Data Placement
            Protocol (DDP) / Remote Direct Memory Access Protocol
            (RDMAP) Security", RFC 5042, October 2007.










Chadalapaka, et al.          Informational                     [Page 34]

RFC 5047                           DA                       October 2007


Appendix A.  Design Considerations and Examples

A.1.  Design Considerations for a Datamover Protocol

  This section discusses the specific considerations for RDMA-based and
  RDDP-based Datamover protocols.

     a) Note that the modeling of interactions for SCSI Data-Out
        (Section 10.3.5.1) is only used for unsolicited data transfer.

     b) The modeling of interactions for SNACK (Sections 10.3.14 and
        10.4.1) is not expected to be used given that one of the design
        requirements on the Datamover is that it "guarantees an error-
        free, reliable, in-order transport mechanism" (Section 6).  The
        interactions for sending and receiving a SNACK are nevertheless
        modeled in this document because the receiving iSCSI layer can
        deterministically deal with an inadvertent SNACK.  This also
        shows the DA designers' intent that DI is not meant to filter
        certain types of PDUs.

     c) The onus is on a reliable Datamover (per requirements stated in
        Section 6) to realize end-to-end data acknowledgements via
        Datamover-specific means.  In view of this, even use of data-
        ACK-type SNACKs are unnecessary.  Consequently, an initiator
        may never request sending a SNACK Request in this model
        assuming that the proactive (timeout-driven) SNACK
        functionality is turned off in the legacy iSCSI code.

     d) Note that the current DA model for bootstrapping a
        Connection_Handle into service -- i.e., associating a new iSCSI
        connection with a Connection_Handle -- clearly implies that the
        iSCSI connection must already be in Full Feature Phase when the
        Datamover layer comes into the stack.  This further implies
        that the iSCSI Login Phase must be carried out in the
        traditional "Byte streaming mode" with no assistance or
        involvement from the Datamover layer.

A.2.  Examples of Datamover Interactions

  The figures described in this section provide some examples of the
  usage of Operational Primitives in interactions between the iSCSI
  layer and the Datamover layer.  The following abbreviations are used
  in this section.

  Avail - Available

  Abted - Aborted




Chadalapaka, et al.          Informational                     [Page 35]

RFC 5047                           DA                       October 2007


  Buf - I/O Buffer

  Cmd - Command

  Compl - Complete

  Conn - Connection

  Ctrl_Ntfy - Control_Notify

  Dal_Tk_Res - Deallocate_Task_Resources

  Data_Cmp_Nfy - Data_Completion_Notify

  Data_ACK_Nfy - Data_ACK_Notify

  DM - Datamover

  Imm - Immediate

  Snd_Ctrl - Send_Control

  Msg - Message

  Resp - Response

  Sol - Solicited

  TMF Req - Task Management Function Request

  TMF Res - Task Management Function Response

  Trans - Transfer

  Unsol - Unsolicited
















Chadalapaka, et al.          Informational                     [Page 36]

RFC 5047                           DA                       October 2007


        |   | Allocate_Connection_Resources  | D |    ^
        |   |------------------------------->| a |    |
        |   |    Connection resources are    | t |    |
        | i |     successfully allocated     | a |    |   iSCSI
        | S |                                | m |    |   Login
        | C |                                | o |    |   Phase
        | S |                                | v |    |
        | I |                                | e |    |
        |   |                                | r |    | Login Phase
        | L | Final Login Response (success)          v succeeds
        | a |<----------------------------------------^
        | y |                                | L |    |   iSCSI
        | e |       Enable_Datamover         | a |    |   Full
        | r |------------------------------->| y |    |   Feature
        |   |     Datamover is enabled       | e |    |   Phase
        |   |                                | r |    |
        |   |   Full Feature Phase           |   |    |
        |   |   control and data Transfer    |   |    v

          Figure 2.  A Successful iSCSI Login on Initiator


        |   | Notice_Key_Values              |   |      |
        |   |------------------------------->|   |      |
        |   |  Datamover layer is notified   |   |      |
        |   |  of the negotiated key values  |   |      |
        |   |                                |   |      |
        |   | Allocate_Connection_Resources  |   |      |
        |   |------------------------------->| D |      |
        |   |    Connection resources are    | a |      |
        | i |     successfully allocated     | t |      |   iSCSI
        | S |                                | a |      |   Login
        | C |                                | m |Final |   Phase
        | S |                                | o |Login |
        | I |Enable_Datamover(Login Response)| v |Resp  |
        |   |------------------------------->| e |---->vLogin Phase
        | L |     Datamover is enabled       | r |      ^ succeeds
        | a |                                |   |      |
        | y |                                | L |      |   iSCSI
        | e |                                | a |      |   Full
        | r |                                | y |      |   Feature
        |   |                                | e |      |   Phase
        |   |      Full Feature Phase        | r |      |
        |   |   control and data Transfer    |   |      |
        |   |                                |   |      v

          Figure 3.  A Successful iSCSI Login on Target




Chadalapaka, et al.          Informational                     [Page 37]

RFC 5047                           DA                       October 2007


        |   | Allocate_Connection_Resources  | D |      ^
        |   |------------------------------->| a |      |
        |   |    Connection resources are    | t |      |
        | i |     successfully allocated     | a |      |   iSCSI
        | S |                                | m |      |   Login
        | C |                                | o |      |   Phase
        | S |                                | v |      |
        | I |                                | e |      |
        |   |                                | r |      | Login
        |   |                                |   |      | Phase
        | L | Final Login Response (failure)            v fails
        | a |<------------------------------------------
        | y |                                | L |
        | e | Deallocate_Connection_Resources| a |
        | r |------------------------------->| y |
        |   |     Datamover-specific         | e |
        |   |     connection resources freed | r |
        |   |                                |   |
        |   |
        |   | Connection terminated by standard means
        |   |--------------------------------------------->

           Figure 4.  A Failed iSCSI Login on Initiator


        |   | Allocate_Connection_Resources  | D |      ^
        |   |------------------------------->| a |      |
        |   |    Connection resources are    | t |      |
        | i |     successfully allocated     | a |      |   iSCSI
        | S |                                | m |      |   Login
        | C |                                | o |      |   Phase
        | S |                                | v |      |
        | I |                                | e |      |
        |   |                                | r |      | Login
        |   |                                |   |      | Phase
        | L | Final Login Response (failure)            v fails
        | a |---------------------------------------------->
        | y |                                | L |
        | e | Deallocate_Connection_Resources| a |
        | r |------------------------------->| y |
        |   |     Datamover-specific         | e |
        |   |     connection resources freed | r |
        |   |                                |   |
        |   |
        |   | Connection terminated by standard means
        |   |-------------------------------------------->

            Figure 5.  A Failed iSCSI Login on Target



Chadalapaka, et al.          Informational                     [Page 38]

RFC 5047                           DA                       October 2007



        |   | Allocate_Connection_Resources  | D |     ^
        |   |------------------------------->| a |     |
        |   |    Connection resources are    | t |     |
        | i |     successfully allocated     | a |     |   iSCSI
        | S |                                | m |     |   Login
        | C |                                | o |     |   Phase
        | S |                                | v |     |
        | I |                                | e |     |
        |   |                                | r |     |
        | L | Login non-Final Request/Response         |
        | a |<-----------------------------------------|
        | y |    iSCSI layer decides not to  | L |     |
        | e |    enable Datamover for this   | a |     |
        | r |    connection                  | y |     |
        |   |                                | e |     |
        |   | Deallocate_Connection_Resources| r |     |
        |   |------------------------------->|   |     |
        |   |     All Datamover-specific     |   |     |
        |   |     resources deallocated      |   |     |
        |   |                                |   |     | Login
        |   |                                |   |     | Phase
        |   |                                          | continues
        |   | Regular Login negotiation continues      |
        |   |<---------------------------------------->|
        |   |                                          .
        |   |                                          .
        |   |                                          .

         Figure 6.  iSCSI Does Not Enable the Datamover





















Chadalapaka, et al.          Informational                     [Page 39]

RFC 5047                           DA                       October 2007


        |   |                                |   |   ^
        |   |  Full Feature Phase Control &  |   |   |
        |   |    Data Transfer Using DM      | D |   | iSCSI
        |   |                                | a |   | Full Feature
        | i |                                | t |   | Phase
        | S |                                | a |   | (DM Enabled)
        | C |                                | m |   |
        | S |    Successful iSCSI Logout     | o |   |
        | I |                                | v |   v
        |   |     Connection_Terminate       | e |
        | L |------------------------------->| r |
        | a |   Connection is terminated     |   |
        | y |   Datamover-specific resources | L | Transport
        | e |   deallocated, both connection | a | Connection
        | r |   level & task level           | y | is terminated
        |   |                                | e |
        |   |                                | r |
        |   |                                |   |
        |   |                                |   |

          Figure 7.  A Normal iSCSI Connection Termination


        |   |                                |   |   ^
        |   |  Full Feature Phase Control &  | D |   | iSCSI
        |   |    Data Transfer Using DM      | a |   | Full Feature
        | i |                                | t |   | Phase
        | S |                                | a |   | (DM Enabled)
        | C |                                | m |   v
        | S |                                | o |<--Transport
        | I |   Datamover-specific resources | v | Connection
        |   |   deallocated, both connection | e | Terminated (e.g.
        | L |   level & task level           | r | unexpected
        | a |                                |   | FIN/RESET)
        | y |                                | L |
        | e |   Connection_Terminate_Notify  | a |
        | r |<-------------------------------| y |
        |   |                                | e |
        |   |                                | r |
        |   |                                |   |

          Figure 8.  An Abnormal iSCSI Connection Termination









Chadalapaka, et al.          Informational                     [Page 40]

RFC 5047                           DA                       October 2007


       <-----Initiator----->                <-------Target------->

       |  |          |  | DM Msg holding |  |            |  |
  SCSI |  |          |  | SCSI Cmd PDU & |  |            |  |SCSI
  Cmd  |  | Snd_Ctrl |  |Unsol Imm Data  |  |Ctrl_Notify |  |Cmd
  ---->|  |--------->|  |--------------->|  |----------->|  |--->
       |  |          |  |                |  |            |  |
       |  |          |  | DM Msg holding |  |            |  |
       |  | Snd_Ctrl |  |SCSI Dataout PDU|  |Ctrl_Notify |  |
       |  |--------->|  |--------------->|  |----------->|  |
       |  |    .     |  |        .       |  |     .      |  |Unsol
       |  |    .     | D|        .       | D|     .      |  |Data
       |  |    .     | a| DM Msg holding | a|     .      |  |Trans
       | i| Snd_Ctrl | t|SCSI Dataout PDU| t|Ctrl_Notify | i|
       | S|--------->| a|--------------->| a|----------->| S|
       | C|          | m|                | m|            | C|Buf
       | S|          | o|                | o|            | S|Avail
       | I|          | v|                | v|  Get_Data  | I|(R2T)
       |  |          | e|----------------| e|<-----------|  |<----
       | L|          | r||Solicited Data | r|            | L|  .
       | a|          |  ||  Transfer     |  |            | a|  .
       | y|          | L|--------------->| L|      .     | y|Buf
       | e|          | a|        .       | a|      .     | e|Avail
       | r|          | y|        .       | y|  Get_Data  | r|(R2T)
       |  |          | e|----------------| e|<-----------|  |<----
       |  |          | r||Solicited Data | r|            |  |
       |  |          |  ||   Transfer    |  |            |  |
       |  |          |  |--------------->|  |Data_Cmp_Nfy|  |Data
       |  |          |  |                |  |----------->|  |Trans
       |  |          |  |                |  |            |  |Compl
       |  |          |  | DM Msg holding |  |            |  |
  SCSI |  |          |  |SCSI Resp PDU & |  |            |  |SCSI
  Resp |  |Ctrl_Ntfy |  |  Sense Data    |  |  Snd_Ctrl  |  |Resp
  <----|  |<---------|  |<---------------|  |<-----------|  |<----
       |  |          |  |                |  |            |  |

                Figure 9.  A SCSI Write Data Transfer














Chadalapaka, et al.          Informational                     [Page 41]

RFC 5047                           DA                       October 2007


       <-----Initiator----->                <-------Target------->

       |  |          |  |                |  |            |  |
  SCSI |  |          |  | DM Msg holding |  |            |  |SCSI
  Cmd  |  | Snd_Ctrl |  |  SCSI Cmd PDU  |  |Ctrl_Notify |  |Cmd
  ---->|  |--------->|  |--------------->|  |----------->|  |--->
       |  |          |  |                |  |            |  |
       |  |          | D|    SCSI Read   | D|            |  |Buf
       |  |          | a|  Data Transfer | a|  Put_Data  |  |Avail
       | i|          | t|<---------------| t|<-----------| i|<----
       | S|          | a|        .       | a|     .      | S|  .
       | C|          | m|        .       | m|     .      | C|  .
       | S|          | o|        .       | o|     .      | S|  .
       | I|          | v|    SCSI Read   | v|     .      | I|Buf
       |  |          | e|  Data Transfer | e|  Put_Data  |  |Avail
       | L|          | r|<---------------| r|<-----------| L|<----
       | a|          |  |                |  |            | a|
       | y|          | L|                | L|            | y|
       | e|          | a|                | a|Data_Cmp_Nfy| e|Data
       | r|          | y|                | y|----------->| r|Trans
       |  |          | e|                | e|            |  |Compl
       |  |          | r| DM Msg holding | r|            |  |
  SCSI |  |          |  |SCSI Resp PDU & |  |            |  |SCSI
  Resp |  |Ctrl_Ntfy |  |  Sense Data    |  |  Snd_Ctrl  |  |Resp
  <----|  |<---------|  |<---------------|  |<-----------|  |<----
       |  |          |  |                |  |            |  |

                Figure 10.  A SCSI Read Data Transfer























Chadalapaka, et al.          Informational                     [Page 42]

RFC 5047                           DA                       October 2007


       <-----Initiator----->                <-------Target------->

       |  |          |  |                |  |            |  |
  SCSI |  |          |  | DM Msg holding |  |            |  |SCSI
  Cmd  |  | Snd_Ctrl |  |  SCSI Cmd PDU  |  |Ctrl_Notify |  |Cmd
  ---->|  |--------->|  |--------------->|  |----------->|  |---->
       |  |          |  |                |  |            |  |
       |  |          | D|    SCSI Read   | D|  Put_Data  |  |Buf
       |  |          | a|  Data Transfer | a|Data_in.A=1 |  |Avail
       | i|          | t|<---------------| t|<-----------| i|<----
       | S|          | a|        .       | a|     .      | S|  .
       | C|          | m|        .       | m|Data_ACK_Nfy| C|  .
       | S|          | o|                | o|----------->| S|  .
       | I|          | v|                | v|     .      | I|
       |  |          | e|                | e|     .      |  |
       | L|          | r|                | r|            | L|
       | a|          |  |                |  |            | a|
       | y|          | L|                | L|            | y|
       | e|          | a|                | a|            | e|Data
       | r|          | y|                | y|            | r|Trans
       |  |          | e|                | e|            |  |Compl
       |  |          | r| DM Msg holding | r|            |  |
  SCSI |  |          |  |SCSI Resp PDU & |  |            |  |SCSI
  Resp |  |Ctrl_Ntfy |  |  Sense Data    |  |  Snd_Ctrl  |  |Resp
  <----|  |<---------|  |<---------------|  |<-----------|  |<----
       |  |          |  |                |  |            |  |

          Figure 11.  A SCSI Read Data Acknowledgement























Chadalapaka, et al.          Informational                     [Page 43]

RFC 5047                           DA                       October 2007


       <-----Initiator----->                <-------Target------->

       |  |          |  |                |  |            |  |
  SCSI |  |          |  | DM Msg holding |  |            |  |SCSI
  Cmd  |  | Snd_Ctrl |  |  SCSI Cmd PDU  |  |Ctrl_Notify |  |Cmd
  ---->|  |--------->|  |--------------->|  |----------->|  |---->
       |  |          |  |                |  |            |  |
       |  |          | D|    SCSI Read   | D|            |  |Buf
       |  |          | a|  Data Transfer | a|  Put_Data  |  |Avail
       | i|          | t|<---------------| t|<-----------| i|<----
       | S|          | a|        .       | a|     .      | S|  .
  Abort| C|          | m| DM Msg holding | m|     .      | C|Abort
  Task | S| Snd_Ctrl | o|  Abort TMF Req | o|Ctrl_Notify | S|Task
  ---->| I|--------->| v|--------------->| v|----------->| I|---->
       |  |          | e|       .        | e|     .      |  |
  Abort| L|          | r|  DM Msg holding| r|            | L| .
  Done | a|Ctrl_Ntfy |  |   Abort TMF Res|  | Snd_Ctrl   |  |Abted
  <----| y|<---------| L|<---------------| L|<-----------| y|<----
       | e|          | a|                | a|            | e|
       | r|          | y|                | y|            | r|
       |  |          | e|                | e|            |  |
       |  |          | r|                | r|            |  |
       |  |          |  |                |  |            |  |
       |  |Dal_Tk_Res|  |                |  |Dal_Tk_Res  |  |
       |  |--------->|  |                |  |<-----------|  |
       |  |          |  |                |  |            |  |

           Figure 12.  Task Resource Cleanup on Abort

Acknowledgements

  The IP Storage (IPS) Working Group in the Transport Area of
  IETF has been responsible for defining the iSCSI protocol
  (apart from a host of other relevant IP Storage protocols).
  The authors are grateful to the entire working group, whose
  work allowed this document to build on the concepts and
  details of the iSCSI protocol.

  In addition, the following individuals reviewed and
  contributed to the improvement of this document.  The authors
  are grateful for their contribution.

  John Carrier
  Adaptec, Inc.
  691 S. Milpitas Blvd., Milpitas, CA 95035, USA
  Phone: +1 (360) 378-8526
  EMail: [email protected]




Chadalapaka, et al.          Informational                     [Page 44]

RFC 5047                           DA                       October 2007


  Hari Ghadia
  Adaptec, Inc.
  691 S. Milpitas Blvd., Milpitas, CA 95035, USA
  Phone: +1 (408) 957-5608
  EMail: [email protected]

  Hari Mudaliar
  Adaptec, Inc.
  691 S. Milpitas Blvd., Milpitas, CA 95035, USA
  Phone: +1 (408) 957-6012
  EMail: [email protected]

  Patricia Thaler
  Agilent Technologies, Inc.
  1101 Creekside Ridge Drive, #100, M/S-RG10,
  Roseville, CA 95678, USA
  Phone: +1-916-788-5662
  EMail: [email protected]

  Uri Elzur
  Broadcom Corporation
  16215 Alton Parkway, Irvine, CA 92619-7013, USA
  Phone: +1 (949) 585-6432
  EMail: [email protected]

  Mike Penna
  Broadcom Corporation
  16215 Alton Parkway,Irvine, CA 92619-7013, USA
  Phone: +1 (949) 926-7149
  EMail: [email protected]

  David Black
  EMC Corporation
  176 South St., Hopkinton, MA 01748, USA
  Phone: +1 (508) 293-7953
  EMail: [email protected]

  Ted Compton
  EMC Corporation
  Research Triangle Park, NC 27709, USA
  Phone: +1-919-248-6075
  EMail: [email protected]

  Dwight Barron
  Hewlett-Packard Company
  20555 SH 249, Houston, TX 77070-2698, USA
  Phone: +1 (281) 514-2769
  EMail: [email protected]



Chadalapaka, et al.          Informational                     [Page 45]

RFC 5047                           DA                       October 2007


  Paul R. Culley
  Hewlett-Packard Company
  20555 SH 249, Houston, TX 77070-2698, USA
  Phone: +1 (281) 514-5543
  EMail: [email protected]

  Dave Garcia
  Hewlett-Packard Company
  19333 Vallco Parkway, Cupertino, CA 95014, USA
  Phone: +1 (408) 285-6116
  EMail: [email protected]

  Randy Haagens
  Hewlett-Packard Company
  8000 Foothills Blvd, MS 5668, Roseville CA, USA
  Phone: +1-916-785-4578
  EMail: [email protected]

  Jeff Hilland
  Hewlett-Packard Company
  20555 SH 249, Houston, TX 77070-2698, USA
  Phone: +1 (281) 514-9489
  EMail: [email protected]

  Mike Krause
  Hewlett-Packard Company, 43LN
  19410 Homestead Road, Cupertino, CA 95014, USA
  Phone: +1 (408) 447-3191
  EMail: [email protected]

  Jim Wendt
  Hewlett-Packard Company
  8000 Foothills Blvd, MS 5668, Roseville CA, USA
  Phone: +1-916-785-5198
  EMail: [email protected]

  Mike Ko
  IBM
  650 Harry Rd, San Jose, CA 95120, USA
  Phone: +1 (408) 927-2085
  EMail: [email protected]

  Renato Recio
  IBM Corporation
  11501 Burnett Road, Austin, TX 78758, USA
  Phone: +1 (512) 838-1365
  EMail: [email protected]




Chadalapaka, et al.          Informational                     [Page 46]

RFC 5047                           DA                       October 2007


  Howard C. Herbert
  Intel Corporation
  MS CH7-404,5000 West Chandler Blvd., Chandler, AZ 85226, USA
  Phone: +1 (480) 554-3116
  EMail: [email protected]

  Dave Minturn
  Intel Corporation
  MS JF1-210, 5200 North East Elam Young Parkway
  Hillsboro, OR 97124, USA
  Phone: +1 (503) 712-4106
  EMail: [email protected]

  James Pinkerton
  Microsoft Corporation
  One Microsoft Way, Redmond, WA 98052, USA
  Phone: +1 (425) 705-5442
  EMail: [email protected]

  Tom Talpey
  Network Appliance
  375 Totten Pond Road, Waltham, MA 02451, USA
  Phone: +1 (781) 768-5329
  EMail: [email protected]



























Chadalapaka, et al.          Informational                     [Page 47]

RFC 5047                           DA                       October 2007


Authors' Addresses

  Mallikarjun Chadalapaka
  Hewlett-Packard Company
  8000 Foothills Blvd.
  Roseville, CA 95747-5668, USA

  Phone: +1-916-785-5621
  EMail: [email protected]


  John L. Hufferd
  Brocade, Inc.
  1745 Technology Drive
  San Jose, CA 95110, USA

  Phone: +1-408-333-5244
  EMail: [email protected]


  Julian Satran
  IBM, Haifa Research Lab
  Haifa University Campus - Mount Carmel
  Haifa 31905, Israel

  Phone +972-4-829-6264
  EMail: [email protected]


  Hemal Shah
  Broadcom Corporation
  5300 California Avenue
  Irvine, California 92617, USA

  Phone: +1-949-926-6941
  EMail: [email protected]

  Comments may be sent to Mallikarjun Chadalapaka.













Chadalapaka, et al.          Informational                     [Page 48]

RFC 5047                           DA                       October 2007


Full Copyright Statement

  Copyright (C) The IETF Trust (2007).

  This document is subject to the rights, licenses and restrictions
  contained in BCP 78, and except as set forth therein, the authors
  retain all their rights.

  This document and the information contained herein are provided on an
  "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
  OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY, THE IETF TRUST AND
  THE INTERNET ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS
  OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF
  THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
  WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Intellectual Property

  The IETF takes no position regarding the validity or scope of any
  Intellectual Property Rights or other rights that might be claimed to
  pertain to the implementation or use of the technology described in
  this document or the extent to which any license under such rights
  might or might not be available; nor does it represent that it has
  made any independent effort to identify any such rights.  Information
  on the procedures with respect to rights in RFC documents can be
  found in BCP 78 and BCP 79.

  Copies of IPR disclosures made to the IETF Secretariat and any
  assurances of licenses to be made available, or the result of an
  attempt made to obtain a general license or permission for the use of
  such proprietary rights by implementers or users of this
  specification can be obtained from the IETF on-line IPR repository at
  http://www.ietf.org/ipr.

  The IETF invites any interested party to bring to its attention any
  copyrights, patents or patent applications, or other proprietary
  rights that may cover technology that may be required to implement
  this standard.  Please address the information to the IETF at
  [email protected].












Chadalapaka, et al.          Informational                     [Page 49]