Independent Submission                                       J. Wackerow
Request for Comments: 9517                                  DDI Alliance
Category: Informational                                     January 2024
ISSN: 2070-1721


     A URN Namespace for the Data Documentation Initiative (DDI)

Abstract

  This document describes the Namespace Identifier (NID) "ddi" for
  Uniform Resource Names (URNs) used to identify resources that conform
  to the standards published by the Data Documentation Initiative (DDI)
  Alliance.

  The DDI Alliance is not affiliated with the Internet Engineering Task
  Force (IETF) or Internet Society (ISOC).  This Independent Submission
  is not a standard nor does it have IETF community consensus.

Status of This Memo

  This document is not an Internet Standards Track specification; it is
  published for informational purposes.

  This is a contribution to the RFC Series, independently of any other
  RFC stream.  The RFC Editor has chosen to publish this document at
  its discretion and makes no statement about its value for
  implementation or deployment.  Documents approved for publication by
  the RFC Editor are not candidates for any level of Internet Standard;
  see Section 2 of RFC 7841.

  Information about the current status of this document, any errata,
  and how to provide feedback on it may be obtained at
  https://www.rfc-editor.org/info/rfc9517.

Copyright Notice

  Copyright (c) 2024 IETF Trust and the persons identified as the
  document authors.  All rights reserved.

  This document is subject to BCP 78 and the IETF Trust's Legal
  Provisions Relating to IETF Documents
  (https://trustee.ietf.org/license-info) in effect on the date of
  publication of this document.  Please review these documents
  carefully, as they describe your rights and restrictions with respect
  to this document.

Table of Contents

  1.  Introduction
  2.  Conventions Used in This Document
  3.  Specification
    3.1.  Declaration of Syntactic Structure
      3.1.1.  Description
      3.1.2.  ABNF Grammar
      3.1.3.  Regular Expression
      3.1.4.  Examples of DDI URNs
    3.2.  Relevant Ancillary Documentation
    3.3.  Identifier Uniqueness Considerations
    3.4.  Identifier Persistence Considerations
    3.5.  Process of Identifier Assignment
    3.6.  Process for Identifier Resolution
    3.7.  Rules for Lexical Equivalence
    3.8.  Conformance with URN Syntax
    3.9.  Validation Mechanism
    3.10. Scope
  4.  Namespace Considerations
    4.1.  URN Assignment Procedures
    4.2.  URN Resolution/Delegation
    4.3.  Type of Resources To Be Identified
    4.4.  Type of Services
  5.  Community Considerations
    5.1.  Open Assignment and Use of Identifiers
    5.2.  Open Operation of Resolution Servers
    5.3.  Creation of Software for Service Discovery
  6.  IANA Considerations
  7.  Security Considerations
  8.  References
    8.1.  Normative References
    8.2.  Informative References
  Appendix A.  Example DNS Records
    A.1.  Delegation of the URN Namespace "ddi"
    A.2.  Delegation of DDI Agencies
    A.3.  DDI Services
  Appendix B.  Algorithm for DDI Service Discovery
    B.1.  Application Unique String
    B.2.  First Well Known Rule
    B.3.  Valid Databases
    B.4.  Expected Output
  Acknowledgments
  Author's Address

1.  Introduction

  This document registers a formal Namespace Identifier (NID) for URNs
  associated with DDI resources in accordance with the process defined
  in [RFC8141].

  The DDI Alliance is an international collaboration dedicated to
  establishing metadata standards and semantic products for describing
  social science data, data covering human activity, and other data
  based on observational methods.  DDI specifications are free
  standards that document and manage different stages in the research
  data lifecycle, such as conceptualization, collection, processing,
  distribution, discovery, and archiving.  Documenting data with DDI
  facilitates understanding, interpretation, and use -- by people,
  software systems, and computer networks.

  The specifications DDI Codebook [DDI-C] and DDI Lifecycle [DDI-L] are
  expressed in XML Schema; DDI Extended Knowledge Organization System
  (XKOS) [DDI-XKOS] in OWL/RDF; Structured Data Transformation Language
  (SDTL) [DDI-SDTL] in JSON Schema; and the upcoming DDI Cross Domain
  Integration (DDI-CDI) in UML.  DDI is aligned with other metadata
  standards like Dublin Core Metadata Initiative [DUBLINC]; Statistical
  Data and Metadata Exchange [SDMX] for exchanging aggregate data; ISO/
  IEC 11179 [IS11179] for building metadata registries, such as
  question, variable, and concept banks; and ISO 19115 [ISO.19115.2003]
  for supporting geographic information systems.

  DDI URNs support reusability of DDI resources inside a single DDI
  instance and in a distributed network of DDI instances.

  The DDI specification is developed and maintained by the DDI Alliance
  [DDI-ALL].  The DDI Alliance is a self-sustaining membership
  organization whose over 40-member institutions have a voice in the
  development of the DDI specifications.  This memo describing the ddi
  URN is an informational specification.  It is not a standard and is
  not the product of the IETF.

2.  Conventions Used in This Document

  The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
  "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
  "OPTIONAL" in this document are to be interpreted as described in
  BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all
  capitals, as shown here.

  In this document, these words will appear with that interpretation
  only when in ALL CAPS.  Lowercase uses of these words are not to be
  interpreted as carrying [RFC2119] significance.

  DDI:  Data Documentation Initiative.  The single term is often used
     as a synonym for the DDI specification.

  DDI agency:  An organization that maintains DDI resources.

3.  Specification

  This section provides the information required to register a formal
  namespace according to the registration procedure defined in
  [RFC8141].  The URNs conform to the syntax defined in [RFC8141].

3.1.  Declaration of Syntactic Structure

3.1.1.  Description

  The Namespace Specific String (NSS) of all URNs using the "ddi" NID
  is a globally unique identifier consisting of the DDI agency-
  identifier (registration authority identifier), the identifier of the
  DDI resource (data identifier), and the version of the resource
  (version-identifier) [DDI-ID].  This structure is according to the
  International Registration Data Identifier (IRDI) defined in
  "Information technology - Metadata registries (MDR) - Part 6:
  Registration", Annex A [IS11179].

  A description of the DDI resource identification is available in the
  Identification section of the "DDI Lifecycle 3.3 Technical Guide"
  [DDI-ID].

  The DDI NSS has the following structure:

     <agency-identifier>:<resource-identifier>:<version-identifier>

  agency-identifier is the identifier of a DDI agency that maintains
  DDI resources.  This identifier basically follows the rules of
  reversed domain names and is case insensitive.  This way, the DNS
  resolution of DDI agency-identifiers is supported.  The hierarchy of
  domains descends from the left to the right label in the name; each
  label to the right specifies a subdivision, or subdomain, of the
  domain to the left.  The left-most label of agency-identifier conveys
  the top-level domain.  It SHALL be a country code corresponding to
  ISO 3166 alpa-2 codes [ISO3166] or another top-level domain
  maintained by IANA [TLD].  All two-letter top-level domains are
  reserved for current and future ISO 3166 codes.  Assignment of
  identifiers for DDI agencies in the requested namespace is managed by
  the DDI Alliance (see Section 3.5 on "Process of Identifier
  Assignment").  The next subdomain identifies the agency within that
  top-level domain.  Further optional subdomains can follow.  The top-
  level domain and possible subdomains are separated by the full stop
  character.  The full stop character is not allowed within top-level
  domain names or subdomain names.  The top-level domain and subdomains
  are composed from the limited set of characters for the preferred
  form of a DNS label ([RFC1035], Section 2.3.1).  The length of the
  label and the full name are restricted by DNS rules ([RFC2181],
  Section 11).  The agency identifier is case insensitive ([RFC4343],
  Section 2).

  resource-identifier is the identifier of a DDI resource of a DDI
  agency.  The value MUST be unique in the scope of this DDI agency.
  The resource-identifier is case sensitive.

  version-identifier is the version of a DDI resource of a DDI agency.
  The value MUST be unique in the scope of this resource.  The resource
  version is case sensitive.

3.1.2.  ABNF Grammar

  The following syntax specification for the complete URN uses the
  Augmented Backus-Naur form (ABNF) as described in [RFC5234].

  ; Rules are case sensitive, if not stated otherwise.
  ddi-urn              = urn separator ddi separator ddi-irdi
  ; urn is case insensitive, see [RFC8141].
  urn                  = "urn"
  ; ddi is the URN namespace identifier.
  ; ddi is case insensitive, see [RFC8141], Section 2.1.
  ddi                  = "ddi"
  ; ddi-irdi is the namespace specific string (NSS).
  ; ddi-irdi - international registration data identifier,
  ; see [IS11179] Annex A.2.
  ddi-irdi             = agency-identifier separator
                         resource-identifier separator
                         version-identifier
  ; agency-identifier is case insensitive, see [RFC4343], Section 2.
  ; For allowed characters, see [RFC1035], Section 2.3.1.
  ; For length restrictions, see [RFC2181], Section 11.
  agency-identifier    = top-level-domain
                           sub-separator ddi-authority-id
                           *(sub-separator ddi-sub-authority-id)
                         ; length limit is 255 characters
                         ; see Section 11 of [RFC2181]
  top-level-domain     = dns-label
  ddi-authority-id     = dns-label
  ddi-sub-authority-id = dns-label
  dns-label            = (ALPHA / DIGIT)
                           [ *(ALPHA / DIGIT / "-")
                             (ALPHA / DIGIT) ]
                         ; length limit is 63 characters
                         ; see Section 11 of [RFC2181]
  resource-identifier  = restricted-string
                         *("/" restricted-string)
  version-identifier   = restricted-string
                         *("/" restricted-string)
  restricted-string    = 1*(unreserved / sub-delims / "@")
  ; Definitions for unreserved and sub-delims from
  ; [RFC3986], Section 2.2.
  unreserved           = ALPHA / DIGIT / "-" / "." / "_" / "~"
  sub-delims           = "!" / "$" / "&" / "'" / "(" / ")" /
                         "*" / "+" / "," / ";" / "="
  separator            = ":"
  sub-separator        = "."
  ; ALPHA and DIGIT are actually defined in the ABNF
  ; specification. They are declared here for convenience
  ; purposes.
  ALPHA                = %x41-5A /  ; uppercase letters
                         %x61-7A    ; lowercase letters
  DIGIT                = %x30-39    ; digits

                          Figure 1: ABNF Grammar

3.1.3.  Regular Expression

  The used syntax is the XML Schema flavor, which can be easily used in
  other flavors.  These regular expressions implicitly anchor at the
  head and tail.  The following regular expression syntax uses
  components (component names indicated by angle brackets, i.e.
  <component>) and is written in free-spacing mode for easier reading
  (the XML Schema flavor does not support that).  Please note that use
  of multiple quantifiers in regular expressions can result in false
  outcomes due to so-called greediness.  Therefore, there are separate
  regular expressions for the length restriction and other purposes for
  the components agency-identifier and dns-label.

  ddi-urn              := [Uu][Rr][Nn] : [Dd][Dd][Ii] :
                            <agency-identifier> :
                            <resource-identifier> :
                            <version-identifier>
  agency-identifier    := <top-level-domain> \.
                            <ddi-authority-id>
                            (\. <ddi-sub-authority-id>)*
  agency-identifier    := .{1,255}

  top-level-domain     := <dns-label>
  ddi-authority-id     := <dns-label>

  ddi-sub-authority-id := <dns-label>
  dns-label            := [A-Za-z0-9]([-A-Za-z0-9]*[A-Za-z0-9])?

  dns-label            := .{1,63}

  resource-identifier  := <restricted-string>
                          (/ <restricted-string>)*

  version-identifier   := <restricted-string>
                          (/ <restricted-string>)*

  restricted-string    := [A-Za-z0-9-._~!$&'()*+,;=@]+

3.1.4.  Examples of DDI URNs

  The examples are taken from the DDI Lifecycle 3.3 documentation
  [DDI-ID].  Please note that the resource-identifiers are simplified.
  In real applications, they are much longer for unique identification
  purposes.  They don't relate to DDI types like the examples might
  suggest.

     urn:ddi:us.ddia1:R-V1:1

                 Figure 2: URN of a Represented Variable

  The DDI represented variable identified by "R-V1" with the version
  "1" of the DDI agency "ddia1" located in the domain "us" [DDI-EXRV].

     urn:ddi:us.ddia1:PISA-QS.QI-2:1

                     Figure 3: URN of a Question Item

  The DDI question item identified by "PISA-QS.QI-2" with the version
  "1" of the DDI agency "ddia1" located in the domain "us" [DDI-EXQU].

     urn:ddi:int.ddi.cv:AggregationMethod:1.0

          Figure 4: URN as Reference to a Controlled Vocabulary

  The DDI controlled vocabulary identified by "AggregationMethod" with
  the version "1.0" in the scope of the DDI agency "ddi" and sub-agency
  "cv" in the domain "int" [DDI-CVAG].

3.2.  Relevant Ancillary Documentation

  An introductory article on DDI can be found at [DDI-INTR].

  Information on the DDI specifications (DDI-C, DDI-L, XKOS, Controlled
  Vocabularies, and SDTL) can be found in the standards section of the
  DDI Alliance website [DDI-ALL].

  Information on domain names can be found in the relevant RFCs.

  *  For an overview, see [RFC1034].

  *  Regarding case insensitivity, see Section 2.3.3 of [RFC1035].

  *  Regarding syntax, see the "Lexical grammar" in the "Grammatical
     Host Table Specification" section of [RFC0952] and Section 2.1 of
     [RFC1123].

  *  Regarding size limits, see Section 2.1 of [RFC1123] and
     Section 2.3.4 of [RFC1035].

3.3.  Identifier Uniqueness Considerations

  Assignment of identifiers for DDI agencies in the requested namespace
  will be managed by the DDI Alliance, which will ensure that the
  assigned DDI agency-identifiers are consistent with the directives
  for unique identification of DDI agencies.

  Assignment of URNs for resources of a DDI agency in the requested
  namespace will be managed by the respective DDI agency, which ensures
  that the assigned URNs are unique for the scope of the agency.

3.4.  Identifier Persistence Considerations

  Persistence of identifiers is dependent upon the suitable delegation
  of resolution at the level of the DDI agencies and the persistence of
  DDI agency assignment.  The persistence of the referenced resource is
  also the responsibility of the DDI agency.

3.5.  Process of Identifier Assignment

  Assignment of identifiers for DDI agencies in the requested namespace
  is managed by the DDI Alliance.  A registry for DDI agency
  identifiers ensures through an approval process that the syntax of
  agency-identifiers complies with the associated rules [DDI-REGI].

  Assignment of URNs for resources of a DDI agency and sub-agencies of
  a DDI agency in the requested namespace will be managed by the
  respective DDI agency.

3.6.  Process for Identifier Resolution

  The DDI Alliance promotes a service discovery system for identifying
  available services connected to DDI agencies using the Domain Name
  System (DNS).  A DNS request for a DDI agency within the domain
  ddi.urn.arpa is delegated by the DNS servers of the DDI Alliance to
  the DNS servers of the relevant DDI agency.  The response is a list
  of available DDI services for the agency identifier under which the
  agency has assigned URNs.  The approach is based on the Dynamic
  Delegation Discovery System (DDDS) [RFC3401] and especially the
  straightforward URI-enabled NAPTR (U-NAPTR) [RFC4848].

  The DDI Alliance is responsible for operating or delegating
  resolution requests to the resolution servers of the relevant DDI
  agencies.  DDI agencies are responsible for operating or delegating
  resolution servers for the agency-identifier under which they have
  assigned URNs.

  Client  NS for    NS for           NS for        DDI services
          urn.arpa  ddialliance.org  example1.edu  for us.ddia1
    |       |            |               |              |
  1 |------>|            |               |              |
  2 |       |----------->|               |              |
  3 |                    |-------------->|              |
  4 |<-----------------------------------|              |
  5 |-------------------------------------------------->|
  6 |<--------------------------------------------------|

      Figure 5: Sample Sequence Diagram for Receiving a List of DDI
               Services from the Example DDI agency "ddia1"

  1.  The name server (NS) of IANA for the domain "urn.arpa." is
      reached with the request "ddia1.us.ddi.urn.arpa." for the DDI
      agency "us.ddia1".

  2.  The request is delegated to the name server for
      "ddialliance.org".

  3.  The request is delegated to the name server for "example1.edu"
      (domain of the DDI agency "us.ddia1").

  4.  The server responds with a list of NAPTR records [RFC3403]
      pointing to available DDI services for the DDI agency "us.ddia1".

  5.  The client selects an appropriate DDI service and sends a request
      for a DDI URN to this service.

  6.  The DDI service responds, for example, with a DDI object
      identified by the requested DDI URN.

  See Appendix A for examples of name server records.

3.7.  Rules for Lexical Equivalence

  The DDI agency-identifier basically follows the rules of domain
  names.  Domain names are case insensitive.  Thus, the following
  portion of the URN is case insensitive for matches:

     urn:ddi:<agency-id>:

  The remainder of the identifier MUST be considered case sensitive.

3.8.  Conformance with URN Syntax

  The NSS conforms to the related section in [RFC8141].  It is composed
  from the limited set of characters for a URN NSS [RFC8141].  Percent-
  encoding is not used.

3.9.  Validation Mechanism

  The DDI Alliance will promote development of software for validation
  purposes.

3.10.  Scope

  The scope is global.

4.  Namespace Considerations

  There is no available namespace that will allow one to uniquely
  identify and access DDI resources.

4.1.  URN Assignment Procedures

  See Section 3.5, "Process of Identifier Assignment".

4.2.  URN Resolution/Delegation

  See Section 3.6, "Process for Identifier Resolution".

  It is RECOMMENDED that sub-agencies for flexible administration be
  used.  For example, delegation of URNs of a sub-agency to different
  servers would be easily possible.

4.3.  Type of Resources To Be Identified

  The DDI specifications define resources at a granular level, many of
  which can be identified by a DDI URN.

4.4.  Type of Services

  Examples of potential services are listed below.  The services and
  appropriate service tags need to be defined in the future.  The
  mentioned service tags are from [RFC2483].

  *  DDI repository

     I2R (URI to Resource):  given a DDI URN return, one instance of
        the resource identified by that URN.

  *  DDI registry

     I2C (URI to URC, Uniform Resource Characteristics are
     descriptions of resources):  given a DDI URN return, a description
        or a summary of that resource.

  *  DDI URN resolution

     I2L (URI to URL):  given a DDI URN return, one URL that identifies
        a location where the identified DDI resource can be found.

     I2Ls (URI to URLs):  given a DDI URN return, one or more URLs that
        identify multiple locations of the identified DDI resource.

5.  Community Considerations

5.1.  Open Assignment and Use of Identifiers

  DDI agency-identifiers can be registered at the DDI Alliance.  The
  DDI Alliance maintains a registry of the assigned values for the DDI
  agency-identifier used in the NSS.  Information may be obtained from
  the following address: [email protected].

  DDI agencies assign URNs and potential sub-agencies within the scope
  of the assigned DDI agency-identifiers.

  See also Section 3.3 on "Identifier Uniqueness Considerations".

5.2.  Open Operation of Resolution Servers

  The DDI Alliance operates publicly accessible name servers for the
  delegation of DNS requests within the domain ddi.urn.arpa to DNS
  servers of DDI agencies.

5.3.  Creation of Software for Service Discovery

  The DDI Alliance promotes software for service discovery for
  identifying available services connected to DDI agencies using the
  Domain Name System (DNS).  See also Section 3.6 on "Process for
  Identifier Resolution".  A basic resolver library is available
  [DDI-RESO].

6.  IANA Considerations

  IANA has updated the "ddi" entry in the "Formal URN Namespaces"
  registry to reference this specification.

  The following NAPTR record for the key "ddi" has been registered in
  the urn.arpa zone:

     ddi IN NAPTR 100 10 "" "" "" registry.ddialliance.org.

  Requests for the domain ddi.urn.arpa are delegated to the name
  servers of the DDI Alliance.

7.  Security Considerations

  URN:DDI identifiers are assigned to resources that are public
  information; therefore, resolving these identifiers has low security
  profile.

  Registration of DDI agencies is approved by the DDI Alliance.
  Assignment and resolution of URN:DDI identifiers are controlled by
  the DDI Alliance and approved DDI agencies.  The DDI Alliance SHALL
  have in place control mechanisms in order to make sure that DDI
  Agency applications from malicious third parties will not be
  accepted.  URN:DDI resolvers will be protected against eavesdropping
  and attacks with appropriate tools.

  This document introduces no additional technical security
  considerations beyond those associated with the use and resolution of
  URNs in general.

  The security of the DNS-based resolution of DDI agency-identifiers is
  only as good as the security of DNS queries in general.  A full
  discussion of the security threats pertaining to DNS and possible
  solutions can be found in [RFC3833].  Further information on security
  considerations regarding U-NAPTR can be found in [RFC4848],
  Section 6.  "DNS Queries over HTTPS (DoH)" [RFC8484] could be used to
  increase security by preventing eavesdropping and manipulation of DNS
  data by machine-in-the-middle attacks.  The HTTPS protocol encrypts
  the data between the DoH client and the DoH-based DNS resolver.

8.  References

8.1.  Normative References

  [DDI-C]    DDI Alliance, "DDI-Codebook 2.5", 2014,
             <https://ddialliance.org/Specification/DDI-Codebook/2.5/>.

  [DDI-ID]   DDI Alliance, "Identification", DDI Lifecycle (3.3)
             Technical Guide: General Structures, <https://ddi-
             lifecycle-technical-
             guide.readthedocs.io/en/latest/General%20Structures/
             Identification.html>.

  [DDI-L]    DDI Alliance, "DDI-Lifecycle",
             <https://ddialliance.org/Specification/DDI-Lifecycle/>.

  [DDI-SDTL] DDI Alliance, "SDTL - Structured Data Transformation
             Language - Version 1.0", December 2020,
             <https://ddialliance.org/products/sdtl/1.0/>.

  [DDI-XKOS] DDI Alliance, "XKOS - Extended Knowledge Organization
             System", <https://ddialliance.org/Specification/RDF/XKOS>.

  [RFC0952]  Harrenstien, K., Stahl, M., and E. Feinler, "DoD Internet
             host table specification", RFC 952, DOI 10.17487/RFC0952,
             October 1985, <https://www.rfc-editor.org/info/rfc952>.

  [RFC1034]  Mockapetris, P., "Domain names - concepts and facilities",
             STD 13, RFC 1034, DOI 10.17487/RFC1034, November 1987,
             <https://www.rfc-editor.org/info/rfc1034>.

  [RFC1035]  Mockapetris, P., "Domain names - implementation and
             specification", STD 13, RFC 1035, DOI 10.17487/RFC1035,
             November 1987, <https://www.rfc-editor.org/info/rfc1035>.

  [RFC1123]  Braden, R., Ed., "Requirements for Internet Hosts -
             Application and Support", STD 3, RFC 1123,
             DOI 10.17487/RFC1123, October 1989,
             <https://www.rfc-editor.org/info/rfc1123>.

  [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
             Requirement Levels", BCP 14, RFC 2119,
             DOI 10.17487/RFC2119, March 1997,
             <https://www.rfc-editor.org/info/rfc2119>.

  [RFC2181]  Elz, R. and R. Bush, "Clarifications to the DNS
             Specification", RFC 2181, DOI 10.17487/RFC2181, July 1997,
             <https://www.rfc-editor.org/info/rfc2181>.

  [RFC3986]  Berners-Lee, T., Fielding, R., and L. Masinter, "Uniform
             Resource Identifier (URI): Generic Syntax", STD 66,
             RFC 3986, DOI 10.17487/RFC3986, January 2005,
             <https://www.rfc-editor.org/info/rfc3986>.

  [RFC4343]  Eastlake 3rd, D., "Domain Name System (DNS) Case
             Insensitivity Clarification", RFC 4343,
             DOI 10.17487/RFC4343, January 2006,
             <https://www.rfc-editor.org/info/rfc4343>.

  [RFC5234]  Crocker, D., Ed. and P. Overell, "Augmented BNF for Syntax
             Specifications: ABNF", STD 68, RFC 5234,
             DOI 10.17487/RFC5234, January 2008,
             <https://www.rfc-editor.org/info/rfc5234>.

  [RFC8141]  Saint-Andre, P. and J. Klensin, "Uniform Resource Names
             (URNs)", RFC 8141, DOI 10.17487/RFC8141, April 2017,
             <https://www.rfc-editor.org/info/rfc8141>.

  [RFC8174]  Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC
             2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174,
             May 2017, <https://www.rfc-editor.org/info/rfc8174>.

  [TLD]      IANA, "Root Zone Database",
             <https://www.iana.org/domains/root/db>.

8.2.  Informative References

  [ABNF2RS]  "ABNF to REGEX: Regular Expression Generator", October
             2019, <https://www.msweet.org/abnf/>.

  [ABNFGEN]  Degener, J., "abnfgen", <http://www.quut.com/abnfgen/>.

  [ABNFPFE]  IETF, "IETF Author Tools - ABNF Tools",
             <https://author-tools.ietf.org/abnf>.

  [DDI-ALL]  DDI Alliance, "Document, Discover and Interoperate",
             <https://ddialliance.org/>.

  [DDI-CVAG] DDI Alliance, "DDI Controlled Vocabulary for Aggregation
             Method", <https://ddialliance.org/Specification/DDI-CV/
             AggregationMethod_1.0.html>.

  [DDI-EXQU] DDI Alliance, "Questions", DDI Lifecycle 3.3 Technical
             Guide: Examples, <https://ddi-lifecycle-technical-
             guide.readthedocs.io/en/latest/Examples/Questions.html>.

  [DDI-EXRV] DDI Alliance, "Represented Variable", DDI Lifecycle 3.3
             Technical Guide: Examples, <https://ddi-lifecycle-
             technical-guide.readthedocs.io/en/latest/Examples/
             RepresentedVariable.html>.

  [DDI-INTR] Vardigan, M., Heus, P., and W. Thomas, "Data Documentation
             Initiative: Toward a Standard for the Social Sciences",
             The International Journal of Digital Curation, Issue 1,
             Volume 3, DOI 10.2218/ijdc.v3i1.45, December 2008,
             <http://www.ijdc.net/article/view/66>.

  [DDI-REGI] DDI Alliance, "Welcome to the DDI Registry",
             <https://registry.ddialliance.org/>.

  [DDI-RESO] DDI Alliance, "Tools",
             <https://registry.ddialliance.org/Home/Tools>.

  [DUBLINC]  Dublin Core Metadata Initiative, "Dublin Core",
             <https://www.dublincore.org/>.

  [IS11179]  ISO, "Information technology - Metadata registries (MDR) -
             Part 6: Registration", ISO/IEC 11179-6:2023, January 2023,
             <https://www.iso.org/standard/78916.html>.

  [ISO.19115.2003]
             ISO, "Geographic information - Metadata", ISO 19115:2003,
             <https://www.iso.org/standard/26020.html>.

  [ISO3166]  ISO, "ISO 3166 Country Codes",
             <https://www.iso.org/iso-3166-country-codes.html>.

  [RFC2483]  Mealling, M. and R. Daniel, "URI Resolution Services
             Necessary for URN Resolution", RFC 2483,
             DOI 10.17487/RFC2483, January 1999,
             <https://www.rfc-editor.org/info/rfc2483>.

  [RFC2782]  Gulbrandsen, A., Vixie, P., and L. Esibov, "A DNS RR for
             specifying the location of services (DNS SRV)", RFC 2782,
             DOI 10.17487/RFC2782, February 2000,
             <https://www.rfc-editor.org/info/rfc2782>.

  [RFC3401]  Mealling, M., "Dynamic Delegation Discovery System (DDDS)
             Part One: The Comprehensive DDDS", RFC 3401,
             DOI 10.17487/RFC3401, October 2002,
             <https://www.rfc-editor.org/info/rfc3401>.

  [RFC3402]  Mealling, M., "Dynamic Delegation Discovery System (DDDS)
             Part Two: The Algorithm", RFC 3402, DOI 10.17487/RFC3402,
             October 2002, <https://www.rfc-editor.org/info/rfc3402>.

  [RFC3403]  Mealling, M., "Dynamic Delegation Discovery System (DDDS)
             Part Three: The Domain Name System (DNS) Database",
             RFC 3403, DOI 10.17487/RFC3403, October 2002,
             <https://www.rfc-editor.org/info/rfc3403>.

  [RFC3833]  Atkins, D. and R. Austein, "Threat Analysis of the Domain
             Name System (DNS)", RFC 3833, DOI 10.17487/RFC3833, August
             2004, <https://www.rfc-editor.org/info/rfc3833>.

  [RFC3958]  Daigle, L. and A. Newton, "Domain-Based Application
             Service Location Using SRV RRs and the Dynamic Delegation
             Discovery Service (DDDS)", RFC 3958, DOI 10.17487/RFC3958,
             January 2005, <https://www.rfc-editor.org/info/rfc3958>.

  [RFC4848]  Daigle, L., "Domain-Based Application Service Location
             Using URIs and the Dynamic Delegation Discovery Service
             (DDDS)", RFC 4848, DOI 10.17487/RFC4848, April 2007,
             <https://www.rfc-editor.org/info/rfc4848>.

  [RFC8484]  Hoffman, P. and P. McManus, "DNS Queries over HTTPS
             (DoH)", RFC 8484, DOI 10.17487/RFC8484, October 2018,
             <https://www.rfc-editor.org/info/rfc8484>.

  [SDMX]     Statistical Data and Metadata eXchange, "SDMX",
             <https://sdmx.org/>.

Appendix A.  Example DNS Records

  The examples use NAPTR [RFC3403] and SRV [RFC2782] [RFC3958] records.
  The values for the services and flags fields of the NAPTR records
  will be determined by the DDI application ([RFC3403], Section 9).

  For a description of the packet format of NAPTR, see [RFC3403],
  Section 4.1.

A.1.  Delegation of the URN Namespace "ddi"

  Example records below are defined at a.iana-servers.net and other
  authoritative name servers for the domain urn.arpa.

  The empty flag indicates that the lookup is not terminal and the next
  probe to DNS is for more NAPTR records where the new domain is
  "dns.ddialliance.org".

  ; Delegation to name servers of ddialliance.org
  ;           order pref  flag  service  regexp replacement
  ddi.urn.arpa.
     IN NAPTR 100   10    ""    ""       ""     dns.ddialliance.org.

A.2.  Delegation of DDI Agencies

  Example records below are defined at dns.ddialliance.org for
  ddi.urn.arpa.

  The empty flag indicates that the lookup is not terminal and the next
  probe to DNS is for more NAPTR records where the new domain is the
  DNS server of the relevant DDI agency.

  ; Delegation to name servers of subdomains in ddi.urn.arpa, i.e.
  ; DDI agencies.
  ;           order pref  flag  service  regexp replacement
  ddia1.us.ddi.urn.arpa.
     IN NAPTR 100   10    ""    ""       ""     dns.example1.edu.
  ddia2.de.ddi.urn.arpa.
     IN NAPTR 100   10    ""    ""       ""     dns.example2.org.
  ddia3.gb.ddi.urn.arpa.
     IN NAPTR 100   10    ""    ""       ""     dns.example3.ac.uk.

A.3.  DDI Services

  Example records below are defined at dns.example2.org for
  ddi.urn.arpa.

  The "u" flag states that the rule is terminal and that the output is
  a URI that contains the information needed to contact that DDI
  service.  The "s" flag states that the rule is terminal and that the
  output of the rewrite will be a domain name for which an SRV record
  SHOULD be queried.  See also [RFC4848], Section 4.4.

  The service I2R returns one instance of the resource identified by
  the given URN.  That service is a repository of DDI resources
  available at http://repos.example2.org/I2R/; possibly a REST-based
  service.  The service I2C returns a description of the resource
  identified by the given URN.  That service is a registry of DDI
  resources available at registry-udp.example2.org port 10060.

  U-NAPTR permits regular expressions of a form that does a complete
  replacement of the matched string with a URI, expressed as a constant
  string.  With this limited form of regular expression ([RFC4848],
  Section 2.2), applications using NAPTR need not implement full
  regular expression parsers.

  ddia2.de.ddi.urn.arpa.
  ;         order pref flag
    IN NAPTR 100  10  "u"  "I2R+http"                   ( ; service
                        "!.*!http://repos.example2.org/I2R/!"; regex
                           .                           ; replacement
                                                        )
    IN NAPTR 100  10  "s"  "I2C+udp"                    ( ; service
                           ""                             ; regex
                           registry._udp.example2.org. ; replacement
                                                        )
  ; all subdomains in ddia2.de.ddi.urn.arpa.
  *.ddia2.de.ddi.urn.arpa.
  ddia2.de.ddi.urn.arpa.
  ;         order pref flag
    IN NAPTR 100  10  "u"  "I2R+http"                   ( ; service
                        "!.*!http://repos.example2.org/I2R/!"; regex
                           .                           ; replacement
                                                        )
    IN NAPTR 100  10  "s"  "I2C+udp"                    ( ; service
                           ""                             ; regex
                           registry._udp.example2.org.; replacement
                                                        )
  ;_service._protocol.name
  ; TTL    class  SRV  priority weight port  targetreplac
  _registry._udp.example2.org
    14400  IN     SRV  0        0   10060 registry-udp.example2.org.

Appendix B.  Algorithm for DDI Service Discovery

  The description is based on the Dynamic Delegation Discovery System
  (DDDS) algorithm [RFC3402].

  The application selects the appropriate service from the output
  described below and contacts the service for the given URN.

  The process can be optimized by an application cache for the NAPTR
  records of already requested DDI agencies.

B.1.  Application Unique String

  The Application Unique String is a DDI URN.

B.2.  First Well Known Rule

  1.  Extracting the characters between the second and third colon (the
      agency-identifier).

  2.  Normalizing case of that string.

  3.  Reversing the order of the substrings separated by dots.

  4.  Appending the string ".ddi.urn.arpa" to the end to get a domain
      name.

B.3.  Valid Databases

  The DNS is specified as a DDDS Database for this application, which
  uses the NAPTR DNS resource records to contain the rewrite rules for
  service discovery.

  The DNS is queried for NAPTR records for the domain name, which is
  the output of the First Well Known Rule.

B.4.  Expected Output

  The expected output is the information necessary to connect to one or
  more authoritative servers (host, port, protocol, or URL) for an
  application service within a given DDI agency.  The result is a list
  of terminal NAPTR records pointing to services available for the
  relevant DDI agency.

Acknowledgments

  Many thanks to Arofan Gregory, Dan Smith, and Wendy Thomas from the
  DDI Alliance Technical Committee and Peter Koch from DENIC (German
  Network Information Center) for the discussion and input that led to
  this document.

  The following software tools have been helpful in evaluating the ABNF
  grammar and the regular expressions: an ABNF parser [ABNFPFE], a tool
  that creates regular expressions from an ABNF grammar [ABNF2RS], and
  a tool that generates random strings that match an ABNF grammar
  [ABNFGEN].

Author's Address

  Joachim Wackerow
  c/o The Data Documentation Initiative Alliance (DDI Alliance)
  ICPSR, University of Michigan
  PO Box 1248
  Ann Arbor, MI 48106-1248
  United States of America
  Email: [email protected], [email protected]
  URI:   ddialliance.org