Overview#Uniform Resource Identifier (URI) is a string of characters used to identify a name of a resource.
Schemes specifying a concrete syntax and associated protocols define each URI. The most common form of URI is the Uniform Resource Locator (URL), frequently referred to informally as a web address. More rarely seen in usage is the Uniform Resource Name or (URN), which was designed to complement URLs by providing a mechanism for the identification of resources in particular namespaces.
Syntax#The Uniform Resource Identifier syntax consists of a Uniform Resource Identifier scheme name (such as "http", "ftp", "mailto", "crid" or "file") followed by a colon character, and then by a scheme-specific part.
The specifications that govern the schemes determine the syntax and semantics of the scheme-specific part.
However, URI syntax does require all schemes to adhere to a general syntax that (among other things) reserves certain characters for special purposes (without always identifying those purposes).
The URI syntax also enforces restrictions on the scheme-specific part in order to (for example) provide for a degree of consistency when the part has a hierarchical structure.
Every Uniform Resource Identifier is defined as consisting of several parts, as follows:
hierarchical part ┌───────────────────┴─────────────────────┐ authority path ┌───────────────┴───────────────┐┌───┴────┐ abc://username:firstname.lastname@example.org:123/path/data?key=value&key2=value2#fragid1 └┬┘ └───────┬───────┘ └────┬────┘ └┬┘ └─────────┬─────────┘ └──┬──┘ scheme user information host port query fragment urn:example:mammal:monotreme:echidna └┬┘ └──────────────┬───────────────┘ scheme path
A typical Uniform Resource Identifier might look like:
URI = <scheme name> : <hierarchical part> [ ? <query> ] [ # <fragment> ]
http-URI = "http:" "//" authority path-abempty [ "?" query ] [ "#" fragment ]
Uniform Resource Identifier Components#
- URI Scheme
- Hierarchical - The hierarchical part of the URI is intended to hold identification information hierarchical in nature. If this part begins with a double forward slash ("//"), it is followed by an authority part and a path. If the hierarchical part doesn't begin with ("//") it contains only a path.
- URI Authorities
- URI Path
- URI Query
- URI Fragment Identifiers
Several existing specifications can aid in this.
- RFC 5988 specifies relation types for Web Linking. By providing a framework for linking on the Web, where every link has a relation type, context and target, it allows applications to define a link's semantics and connectivity.
- RFC 6570 provides a standard syntax for URI Templates that can be used to dynamically insert application-specific variables into a URI to enable such applications while avoiding impinging upon URI owners' control of them.
- RFC 5785 allows specific paths to be 'reserved' for standard use on URI Schemes that opt into that mechanism ('http' and 'https' by default). Note, however, that this is not a general "escape valve" for applications that need structured URIs; see that specification for more information.
Specifying more elaborate structures in an attempt to avoid collisions is not an acceptable solution, and does not address the issues in RFC 7320 Section 1. For example, prefixing query parameters with "myapp_" does not help, because the prefix itself is subject to the risk of collision (since it is not "reserved").URIs, URLs, and URNs?
Uniform Resource Identifier URI Opacity (Best Practices)#In general, URIs SHOULD NOT attempt to infer properties of the referenced resource or include metadata.
On the other hand, the URI "mailto:email@example.com" indicates that the URI refers to a mailbox. The "mailto" URI Scheme specification authorizes agents to infer that URIs of this form identify Internet mailboxes.
More Information#There might be more information for this subject on one of the following:
- Extensible Resource Identifier
- Hypertext Transfer Protocol
- Internationalized Resource Identifiers
- Link-based Resource Descriptor Documents
- RFC 2079
- RFC 3986
- RFC 5341
- RFC 5785
- RFC 6116
- RFC 6874
- URI Schemes
- URI Template
- URIs, URLs, and URNs
- Uniform Resource Name
- Unique Identifier
- [#1] - Should metadata (e.g., versioning information) be encoded in URIs? - based on information obtained 2018-01-24-