Show simple item record

dc.contributor.author
Lodrant, Luka
dc.contributor.supervisor
Basin, David
dc.contributor.supervisor
Kubicek, Karel
dc.date.accessioned
2022-03-03T09:20:25Z
dc.date.available
2022-03-01T13:02:04Z
dc.date.available
2022-03-01T14:12:51Z
dc.date.available
2022-03-03T09:20:25Z
dc.date.issued
2022-01-17
dc.identifier.uri
http://hdl.handle.net/20.500.11850/534764
dc.identifier.doi
10.3929/ethz-b-000534764
dc.description.abstract
While users deserve security and privacy when using web services, these properties are at odds with the financial interests of website owners both in terms of work required to keep websites secure and revenues generated by exploiting sensitive data resulting in a violation of the user’s privacy. Countries, therefore, introduced regulations to balance the inequity. Namely, European Union’s General Data Protection Regulation (GDPR) specifies that any data collection and processing can only be done with the informed and specific consent of the user, including sharing of the said data with 3rd parties. Automated and large-scale detection of violations and security flaws is difficult because of the non-standardized behavior of website authentication mechanisms. We developed a web crawler for detecting and submitting mainly registration web forms. This crawler enables novel privacy and security research on a larger scale than was previously possible. The completely automated crawler can navigate the site to find the required form, fill the form, avoid bot detection mechanisms, submit the form, and validate the submission success. In 17 days, we crawled over 600,000 domains intending to create new user accounts. Our automated crawler detected a sign-up form on 22% of all the reachable websites with a 6.4% registration success rate. We have also received at least one email from 2.3% of all crawled pages. This significantly surpasses the prior version of this project and the best widely-used published tool.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.title
Designing a generic web forms crawler to enable legal compliance analysis of authentication sections
en_US
dc.type
Master Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
ethz.size
53 p.
en_US
ethz.code.ddc
DDC - DDC::0 - Computer science, information & general works::004 - Data processing, computer science
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02660 - Institut für Informationssicherheit / Institute of Information Security::03634 - Basin, David / Basin, David
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02660 - Institut für Informationssicherheit / Institute of Information Security::03634 - Basin, David / Basin, David
en_US
ethz.date.deposited
2022-03-01T13:02:09Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2022-03-03T09:20:33Z
ethz.rosetta.lastUpdated
2023-02-07T00:18:16Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Designing%20a%20generic%20web%20forms%20crawler%20to%20enable%20legal%20compliance%20analysis%20of%20authentication%20sections&rft.date=2022-01-17&rft.au=Lodrant,%20Luka&rft.genre=unknown&rft.btitle=Designing%20a%20generic%20web%20forms%20crawler%20to%20enable%20legal%20compliance%20analysis%20of%20authentication%20sections
 Search print copy at ETH Library

Files in this item

Thumbnail

Publication type

Show simple item record