The Handle System Corporation for National Research Initiatives
       
Global Handle Registry
Proxy Server System
Proxy Server System
 

CNRI runs a proxy server system at http://hdl.handle.net/. It's a collection of web servers that understands the handle protocol and knows how to talk to the Handle System. Many implementations of the Handle System intended to help manage web content use handles embedded in URLs on web pages, and for the convenience of their customers, use the proxy server for resolution. CNRI's proxy server uses the proxy servlet code that is available for download.)

For any HTTP request that combines the proxy's domain name with a handle, for example

http://hdl.handle.net/4263537/5555

one of the proxy servers will query the Handle System for the handle, take the URL in the handle record (or if there are multiple URLs in the handle record it will select one, and that selection is in no particular order) and send an HTTP redirect to that URL to the user's web browser.

The proxy server will return a specific URL if the query includes its unique index value. For example,

http://hdl.handle.net/4263537/5555?index=3

will redirect the user's web browser to the URL with index value 3.

The Digital Object Identifier System is an implementation of the Handle System that uses handles that are called DOIs. The International DOI Foundation runs a proxy server system at http://dx.doi.org.

Handles may include values other than URLs. These added values are intended for use by more advanced applications which have the ability to take advantage of multiple pieces of data, e.g., the location of enhanced metadata or related objects. The proxy server, which is assumed to be talking to a plain web browser, ignores these values.

The proxy server is configured to display a "Handle Not Found" error page when queried for a handle that it cannot find.

The handles 4263537/5555 and 4263537/5555/ are both valid handles, but it is unusual for a handle to have trailing slash. If a resolution request for a handle with a trailing slash is received by the proxy server and that handle is not found, the proxy server will return an error report that includes a warning that the requested handle contained a trailing slash, and a link to click to resolve the same string without the slash.

The Handle System uses UTF-8, a Unicode implementation, and has no character set constraints. But the proxy server is a web server that sends redirects to web browsers using HTTP syntax, so characters in handles that may not be interpreted correctly by web browsers, for example '?', should be avoided or encoded. A non-ASCII character in a handle should be converted to UTF-8, and each UTF-8 byte that isn't ASCII should be %encoded.

The "# " is another example. Only if you send the proxy server the handle

4263537/5555#resolve

encoded as

http://hdl.handle.net/4263537/5555%23resolve

will the proxy correctly resolve the handle. If it is not encoded, #test will be treated as a fragment and removed by the web browser before it gets to the proxy, which will then attempt to resolve 4263537/5555 instead.

To speed resolution, the proxy servers cache handle values, with the TTL set to 24 hours. This means that if a handle value is changed, it can take up to 24 hours before the new value is returned.

 
Updated 9 July 2007

Send inquiries to hdladmin@cnri.reston.va.us