A distributed Hash table
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2006. === Includes bibliographical references (p. 123-132) and index. === DHash is a new system that harnesses the storage and network resources of computers distributed across the...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | English |
Published: |
Massachusetts Institute of Technology
2008
|
Subjects: | |
Online Access: | http://dspace.mit.edu/handle/1721.1/35525 http://hdl.handle.net/1721.1/35525 |
id |
ndltd-MIT-oai-dspace.mit.edu-1721.1-35525 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-MIT-oai-dspace.mit.edu-1721.1-355252019-05-02T15:53:24Z A distributed Hash table DHash table Dabek, Frank (Frank Edward), 1977- Robert T. Morris. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2006. Includes bibliographical references (p. 123-132) and index. DHash is a new system that harnesses the storage and network resources of computers distributed across the Internet by providing a wide-area storage service, DHash. DHash frees applications from re-implementing mechanisms common to any system that stores data on a collection of machines: it maintains a mapping of objects to servers, replicates data for durability, and balances load across participating servers. Applications access data stored in DHash through a familiar hash-table interface: put stores data in the system under a key; get retrieves the data. DHash has proven useful to a number of application builders and has been used to build a content-distribution system [31], a Usenet replacement [115], and new Internet naming architectures [130, 129]. These applications demand low-latency, high-throughput access to durable data. Meeting this demand is challenging in the wide-area environment. The geographic distribution of nodes means that latencies between nodes are likely to be high: to provide a low-latency get operation the system must locate a nearby copy of the data without traversing high-latency links. (cont.) Also, wide-area network links are likely to be less reliable and have lower capacities than local-area network links: to provide durability efficiently the system must minimize the number of copies of data items it sends over these limited capacity links in response to node failure. This thesis describes the design and implementation of the DHash distributed hash table and presents algorithms and techniques that address these challenges. DHash provides low-latency operations by using a synthetic network coordinate system (Vivaldi) to find nearby copies of data without sending messages over high-latency links. A network transport (STP), designed for applications that contact a large number of nodes, lets DHash provide high throughput by striping a download across many servers without causing high packet loss or exhausting local resources. Sostenuto, a data maintenance algorithm, lets DHash maintain data durability while minimizing the number of copies of data that the system sends over limited-capacity links. by Frank Dabek. Ph.D. 2008-01-10T17:26:40Z 2008-01-10T17:26:40Z 2005 2006 Thesis http://dspace.mit.edu/handle/1721.1/35525 http://hdl.handle.net/1721.1/35525 72671778 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/35525 http://dspace.mit.edu/handle/1721.1/7582 134 p. application/pdf Massachusetts Institute of Technology |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Electrical Engineering and Computer Science. |
spellingShingle |
Electrical Engineering and Computer Science. Dabek, Frank (Frank Edward), 1977- A distributed Hash table |
description |
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2006. === Includes bibliographical references (p. 123-132) and index. === DHash is a new system that harnesses the storage and network resources of computers distributed across the Internet by providing a wide-area storage service, DHash. DHash frees applications from re-implementing mechanisms common to any system that stores data on a collection of machines: it maintains a mapping of objects to servers, replicates data for durability, and balances load across participating servers. Applications access data stored in DHash through a familiar hash-table interface: put stores data in the system under a key; get retrieves the data. DHash has proven useful to a number of application builders and has been used to build a content-distribution system [31], a Usenet replacement [115], and new Internet naming architectures [130, 129]. These applications demand low-latency, high-throughput access to durable data. Meeting this demand is challenging in the wide-area environment. The geographic distribution of nodes means that latencies between nodes are likely to be high: to provide a low-latency get operation the system must locate a nearby copy of the data without traversing high-latency links. === (cont.) Also, wide-area network links are likely to be less reliable and have lower capacities than local-area network links: to provide durability efficiently the system must minimize the number of copies of data items it sends over these limited capacity links in response to node failure. This thesis describes the design and implementation of the DHash distributed hash table and presents algorithms and techniques that address these challenges. DHash provides low-latency operations by using a synthetic network coordinate system (Vivaldi) to find nearby copies of data without sending messages over high-latency links. A network transport (STP), designed for applications that contact a large number of nodes, lets DHash provide high throughput by striping a download across many servers without causing high packet loss or exhausting local resources. Sostenuto, a data maintenance algorithm, lets DHash maintain data durability while minimizing the number of copies of data that the system sends over limited-capacity links. === by Frank Dabek. === Ph.D. |
author2 |
Robert T. Morris. |
author_facet |
Robert T. Morris. Dabek, Frank (Frank Edward), 1977- |
author |
Dabek, Frank (Frank Edward), 1977- |
author_sort |
Dabek, Frank (Frank Edward), 1977- |
title |
A distributed Hash table |
title_short |
A distributed Hash table |
title_full |
A distributed Hash table |
title_fullStr |
A distributed Hash table |
title_full_unstemmed |
A distributed Hash table |
title_sort |
distributed hash table |
publisher |
Massachusetts Institute of Technology |
publishDate |
2008 |
url |
http://dspace.mit.edu/handle/1721.1/35525 http://hdl.handle.net/1721.1/35525 |
work_keys_str_mv |
AT dabekfrankfrankedward1977 adistributedhashtable AT dabekfrankfrankedward1977 dhashtable AT dabekfrankfrankedward1977 distributedhashtable |
_version_ |
1719030604712902656 |