Internet Engineering Task Force (IETF) T. Hardie

Internet Engineering Task Force (IETF) T. Hardie
Request for Comments: 8165 May 2017
Category: Informational
ISSN: 2070-1721

Design Considerations for Metadata Insertion

Abstract

The IAB published RFC 7624 in response to several revelations of
pervasive attacks on Internet communications. This document
considers the implications of protocol designs that associate
metadata with encrypted flows. In particular, it asserts that
designs that share metadata only by explicit actions at the host are
preferable to designs in which middleboxes insert metadata.

Status of This Memo

This document is not an Internet Standards Track specification; it is
published for informational purposes.

This document is a product of the Internet Engineering Task Force
(IETF). It represents the consensus of the IETF community. It has
received public review and has been approved for publication by the
Internet Engineering Steering Group (IESG). Not all documents
approved by the IESG are a candidate for any level of Internet
Standard; see Section 2 of RFC 7841.

Information about the current status of this document, any errata,
and how to provide feedback on it may be obtained at
http://www.rfc-editor.org/info/rfc8165.

Copyright Notice

Copyright (c) 2017 IETF Trust and the persons identified as the
document authors. All rights reserved.

This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.

Hardie Informational [Page 1]

RFC 8165 Design Considerations for Metadata Insertion May 2017

Table of Contents

1. Introduction ....................................................2
2. Terminology .....................................................2
3. Design Pattern ..................................................2
4. Advice ..........................................................3
5. Deployment Considerations .......................................4
6. IANA Considerations .............................................5
7. Security Considerations .........................................5
8. References ......................................................6
8.1. Normative References .......................................6
8.2. Informative References .....................................6
Acknowledgements ...................................................7
Author's Address ...................................................7

1. Introduction

To minimize the risks associated with pervasive surveillance, it is
necessary for the Internet technical community to address the
vulnerabilities exploited in the attacks documented in [RFC7258] and
the threats described in [RFC7624]. The goal of this document is to
address a common design pattern that emerges from the increase in
encryption: explicit association of metadata that would previously
have been inferred from the plaintext protocol.

2. Terminology

This document makes extensive use of standard security and privacy
terminology; see [RFC4949] and [RFC6973]. Readers should be familiar
with the terms defined in [RFC6973], including "Eavesdropper",
"Observer", "Initiator", "Intermediary", "Recipient", "Attack" (in a
privacy context), "Correlation", "Fingerprint", "Traffic Analysis",
and "Identifiability" (and related terms). Readers should also be
familiar with terms that are specific to the attacks discussed in
[RFC7624], including "Pervasive Attack", "Passive Pervasive Attack",
"Active Pervasive Attack", "Observation", "Inference", and
"Collaborator".

3. Design Pattern

One of the core mitigations for the loss of confidentiality in the
presence of pervasive surveillance is data minimization, which limits
the amount of data disclosed to those elements absolutely required to
complete the relevant protocol exchange. When data minimization is
in effect, some information that was previously available may be
removed from specific protocol exchanges. The information may be
removed explicitly (for example, by a browser suppressing cookies

Hardie Informational [Page 2]

RFC 8165 Design Considerations for Metadata Insertion May 2017

during private modes) or by other means. As noted in [RFC7624], some
topologies that aggregate or alter the network path also act to
reduce the ease with which metadata is available to eavesdroppers.

In some cases, other actors within a protocol context will continue
to have access to the information that has been thus withdrawn from
specific protocol exchanges. If those actors attach the information
as metadata to those protocol exchanges, the confidentiality effect
of data minimization is lost.

Restoring information is particularly tempting at systems not
primarily deployed to increase confidentiality. A proxy providing
compression, for example, may wish to restore the identity of the
requesting party; similarly, a VPN system used to provide channel
security may believe that the origin IP should be restored. Actors
considering restoring metadata may believe that they understand the
relevant privacy considerations or believe that, because the primary
purpose of the service was not privacy-related, none exist. Examples
of this design pattern include [RFC7239] and [RFC7871].

4. Advice

Avoid inserting metadata to restore information that would otherwise
be unavailable to later participants in a protocol exchange. It
contributes to the overall loss of confidentiality for the Internet
and trust in the Internet as a medium. Do not add metadata to flows
at intermediary devices unless a positive affirmation of approval for
restoration has been received from the actor whose data will be
added.

Instead, design the protocol so that the actor can add such metadata
themselves so that it flows end to end, rather than requiring the
action of other parties. In addition to improving privacy, this
approach ensures consistent availability between the communicating
parties, no matter what path is taken. (Note that this document does
not attempt to describe how an actor sets policies on providing this
metadata, as the range of systems that might be implied is very
broad).

As an example, RFC 7871 describes a method that had already been
deployed and notes that it is unlikely that a clean-slate design
would result in this mechanism. If a clean-slate design were built
to follow the advice in this document, that design would likely not
use a core element of RFC 7871: rather than adding metadata at a
proxy, it would provide facilities for end systems to add it to their
initial queries. In the case of RFC 7871, the relevant metadata is
relatively easy for an end system to derive, as Session Traversal
Utilities for NAT (STUN) [RFC5389] provides a method for learning the

Hardie Informational [Page 3]

RFC 8165 Design Considerations for Metadata Insertion May 2017

reflexive transport address from which a client subnet could be
derived. This would allow clients to populate this data themselves,
thus affirming their consent and providing data at a granularity with
which they were comfortable. As in RFC 7871, the addition of this
data would require confirmation that the upstream DNS resolver
understands what to do with it, but the same negotiation mechanism,
an Extension Mechanisms for DNS (EDNS(0)) option [RFC6891], could be
used. Because of this negotiation, there would be a new variability
in responses that would change the caching behavior for data supplied
by participating servers. This is not a major change from the
current design, however, as the same considerations set out in
Sections 7.3.2 and 7.5 of RFC 7871 would apply to client-supplied
subnets as well as to proxy-supplied subnets.

From a protocol perspective, in other words, this approach would be a
minor change from RFC 7871, would be as fully featured, and would
provide better privacy properties than the on-path update mechanism
RFC 7871 provides. The next section examines why, despite this,
deployment considerations have sometimes trumped cleaner designs.

5. Deployment Considerations

There are a few common tensions associated with the deployment of
systems that restore metadata. The first is the trade-off in speed
of deployment for different actors. The Forwarded HTTP Extension in
[RFC7239] provides an example of this. When used with a proxy, it
restores information related to the original requesting party, thus
allowing a responding server to tailor responses according to the
original party's region, network, or other characteristics associated
with the identity. It would, of course, be possible for the
originating client to add this data itself, after using STUN
[RFC5389] or a similar mechanism to first determine the information
to declare. This would require, however, full specification and
adoption of this mechanism by the end systems. It would not be
available at all during this period and would thereafter be limited
to systems that have been upgraded to include it. The long tail of
browser deployments indicates that many systems might go without
upgrades for a significant period of time. The proxy infrastructure,
in contrast, is commonly under more active management and represents
a much smaller number of elements; this impacts both the general
deployment difficulty and the number of systems that the origin
server must trust.

The second common tension is between metadata minimization and the
desire to tailor content responses. For origin servers whose content
is common across users, the loss of metadata may have limited impact
on the system's functioning. For other systems, which commonly
tailor content by region or network, the loss of metadata may imply a

Hardie Informational [Page 4]

RFC 8165 Design Considerations for Metadata Insertion May 2017

loss of functionality. Where the user desires this functionality,
restoration can commonly be achieved by the use of other identifiers
or login procedures. Where the user does not desire this
functionality, but it is a preference of the server or a third party,
adjustment is more difficult. At the extreme, content blocking by
network origin may be a regulatory requirement. Trusting a network
intermediary to provide accurate data is, of course, fragile in this
case, but it may be a part of the regulatory framework.

There are also tensions with latency of operation. For example,
where the end system does not initially know the information that
would be added by on-path devices, it must engage the protocol
mechanisms to determine it. Determining a public IP address to
include in a locally supplied header might require a STUN exchange,
and the additional latency of this exchange discourages deployment of
host-based solutions. To minimize this latency, engaging those
mechanisms may need to be done in parallel with or in advance of the
core protocol exchanges with which this metadata would be supplied.

These tensions do not change the basic recommendation, but they
suggest that the parties who are introducing encryption and data
minimization for existing protocols consider carefully whether the
work also implies introducing mechanisms for the end-to-end
provisioning of metadata when a user has actively consented to
provide it.

6. IANA Considerations

This document makes no request of IANA.

7. Security Considerations

This memorandum describes a design pattern emerging from responses to
the attacks described in [RFC7258]. Continued use of this design
pattern, which uses mid-flow devices to restore metadata, lowers the
impact of mitigations to that attack.

Note that some emergency service recipients, notably PSAPs (Public
Safety Answering Points) may prefer data provided by a network to
data provided by an end system, because an end system could use false
data to attack others or consume resources. While this has the
consequence that the data available to the PSAP is often more coarse
than that available to the end system, the risk of false data being
provided involves a risk to the lives of those targeted.

Hardie Informational [Page 5]

RFC 8165 Design Considerations for Metadata Insertion May 2017

8. References

8.1. Normative References

[RFC4949] Shirey, R., "Internet Security Glossary, Version 2",
FYI 36, RFC 4949, DOI 10.17487/RFC4949, August 2007,
<http://www.rfc-editor.org/info/rfc4949>.

[RFC6973] Cooper, A., Tschofenig, H., Aboba, B., Peterson, J.,
Morris, J., Hansen, M., and R. Smith, "Privacy
Considerations for Internet Protocols", RFC 6973,
DOI 10.17487/RFC6973, July 2013,
<http://www.rfc-editor.org/info/rfc6973>.

[RFC7258] Farrell, S. and H. Tschofenig, "Pervasive Monitoring Is an
Attack", BCP 188, RFC 7258, DOI 10.17487/RFC7258, May
2014, <http://www.rfc-editor.org/info/rfc7258>.

[RFC7624] Barnes, R., Schneier, B., Jennings, C., Hardie, T.,
Trammell, B., Huitema, C., and D. Borkmann,
"Confidentiality in the Face of Pervasive Surveillance: A
Threat Model and Problem Statement", RFC 7624,
DOI 10.17487/RFC7624, August 2015,
<http://www.rfc-editor.org/info/rfc7624>.

8.2. Informative References

[RFC5389] Rosenberg, J., Mahy, R., Matthews, P., and D. Wing,
"Session Traversal Utilities for NAT (STUN)", RFC 5389,
DOI 10.17487/RFC5389, October 2008,
<http://www.rfc-editor.org/info/rfc5389>.

[RFC6891] Damas, J., Graff, M., and P. Vixie, "Extension Mechanisms
for DNS (EDNS(0))", STD 75, RFC 6891,
DOI 10.17487/RFC6891, April 2013,
<http://www.rfc-editor.org/info/rfc6891>.

[RFC7239] Petersson, A. and M. Nilsson, "Forwarded HTTP Extension",
RFC 7239, DOI 10.17487/RFC7239, June 2014,
<http://www.rfc-editor.org/info/rfc7239>.

[RFC7687] Farrell, S., Wenning, R., Bos, B., Blanchet, M., and H.
Tschofenig, "Report from the Strengthening the Internet
(STRINT) Workshop", RFC 7687, DOI 10.17487/RFC7687,
December 2015, <http://www.rfc-editor.org/info/rfc7687>.

Hardie Informational [Page 6]

RFC 8165 Design Considerations for Metadata Insertion May 2017

[RFC7871] Contavalli, C., van der Gaast, W., Lawrence, D., and W.
Kumari, "Client Subnet in DNS Queries", RFC 7871,
DOI 10.17487/RFC7871, May 2016,
<http://www.rfc-editor.org/info/rfc7871>.

Acknowledgements

This document is derived in part from the work initially done on the
perpass mailing list and at the STRINT workshop [RFC7687]. The text
was originally developed by the IAB's Privacy and Security Program
before submission to the IETF. The document also benefited from an
extensive review by Mohamed Boucadair.

Author's Address

Ted Hardie

Email: [email protected]

Hardie Informational [Page 7]