Notice

This is not the latest version of this item. The latest version can be found at: https://www.research-collection.ethz.ch/handle/20.500.11850/284907

Show simple item record

dc.contributor.author
Kuratli, Jonas
dc.contributor.supervisor
Müller, Ingo
dc.contributor.supervisor
Barthels, Claude
dc.contributor.supervisor
Alonso, Gustavo
dc.date.accessioned
2018-09-04T09:59:39Z
dc.date.available
2018-08-27T14:06:13Z
dc.date.available
2018-09-04T09:59:39Z
dc.date.issued
2017-09-27
dc.identifier.uri
http://hdl.handle.net/20.500.11850/284907
dc.identifier.doi
10.3929/ethz-b-000284907
dc.description.abstract
Distributed Joins over a network have been researched for decades, usually focusing on adapting the join to the network connecting the nodes holding the relations. Most research has gone into optimizing the join itself, i.e. the identification of matching tuples, however the effective materialization of the join result is equally important. The main performance issue identified by materialization strategies is that the network performs significantly worse than the local processing nodes, i.e. the transfer speed between nodes is the limiting factor. The conclusion drawn from this is that a materialization approach should reduce the amount of transmitted data by spending CPU time on the creation of optimal transfer schedules. In this thesis, we explore the possible changes to this materialization approach by considering a high-performance network. We propose a late-materialization approach with two different strategies for the exchange of data. We focus on optimizing CPU time and interleave communication and computation for the exchange of data. We then perform experiments for a wide range of parameters. The results show that, despite the interleaving of communication and computation, the implementation is network bound, thus concluding that even in high-performance networks, the data transfer has to be optimized.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
Systems Group, Department of Computer Science, ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.title
Distributed Join Result Materialization over High-Performance Networks
en_US
dc.type
Master Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
dc.date.published
2018-09-04
ethz.journal.volume
176
en_US
ethz.size
65 p.
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02663 - Institut für Computing Platforms / Institute for Computing Platforms::03506 - Alonso, Gustavo / Alonso, Gustavo
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02663 - Institut für Computing Platforms / Institute for Computing Platforms::03506 - Alonso, Gustavo / Alonso, Gustavo
en_US
ethz.date.deposited
2018-08-27T14:06:14Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2018-09-04T09:59:47Z
ethz.rosetta.lastUpdated
2018-09-04T09:59:47Z
ethz.rosetta.exportRequired
false
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Distributed%20Join%20Result%20Materialization%20over%20High-Performance%20Networks&rft.date=2017-09-27&rft.volume=176&rft.au=Kuratli,%20Jonas&rft.genre=unknown&rft.btitle=Distributed%20Join%20Result%20Materialization%20over%20High-Performance%20Networks
 Search print copy at ETH Library

Files in this item

Thumbnail

Publication type

Show simple item record