Font Size: a A A

High bandwidth data dissemination for large-scale distributed systems

Posted on:2006-02-24Degree:Ph.DType:Dissertation
University:Duke UniversityCandidate:Kostic, Dejan ManojloFull Text:PDF
GTID:1458390008471900Subject:Computer Science
Abstract/Summary:
Built as logical networks on top of the Internet, overlay networks have recently emerged as a fundamental building block for evolving the network architecture. Sample uses include application-level multicast, scalable object location and routing, improving end-to-end path characteristics, service discovery, and increasing resistance to DoS attacks. This dissertation focuses on the multi-receiver data dissemination problem. Traditionally, researchers thought that this problem would be solved by multicasting data at the IP (network) level. Although IP multicast was unsuccessful for a variety of reasons, its tree-based distribution approach was carried over to overlays. I argue however, that trees have two fundamental limitations for data dissemination. First, since all data comes from a single parent, participants are forced to continuously expend bandwidth for probing in search of a parent with an acceptable level of bandwidth. Second, due to packet losses and failures, the bandwidth in an overlay tree is monotonically decreasing down the tree.; In this dissertation, I describe Bullet, a data dissemination mesh that takes advantage of the computational and storage capabilities of end hosts to create a distribution structure where a node receives data in parallel from multiple peers. For the mesh to deliver improved bandwidth and reliability, we need to solve several key problems: (i) disseminating disjoint data over the mesh, (ii) locating missing content, (iii) finding who to peer with (peering strategy), (iv) retrieving data at the right rate from all peers (flow control), and (v) recovering from failures and adapting to dynamically changing network conditions. Additionally, the system should be self-adjusting and should have few user-adjustable parameter settings. I describe my approach to addressing all of these problems in a working, deployed system across the Internet. Bullet outperforms the state-of-the-art systems, including BitTorrent, by 25--70%, and exhibits strong performance and reliability in a range of deployment settings. In addition, I find that, relative to tree-based solutions, Bullet reduces the need to perform expensive bandwidth probing.
Keywords/Search Tags:Bandwidth, Data
Related items