Skip to content

wsg-ariadne/calliope

Repository files navigation

Calliope

📜 Language clarity model for Ariadne.

Calliope is a Naive-Bayes classifier that is meant to process the text from a cookie banner and classify it into the following classes:

  1. GOOD indicating that the language used in the extract is likely to be clear, descriptive, and provides options to provide or deny cookie consent
  2. BAD indicating that the language used in the extract is likely to be confusing, vague, and assuming that cookie consent will be provided

This classifier allows Ariadne to determine whether a website uses deceptive design in the form of unclear language on its cookie banner.

Usage

Requirements

Install Python 3.8+ (tested on 3.8.16) and the packages in requirements.txt using pip install -r requirements.txt.

Generating the model

Run python generate.py to generate the model, save the model as a pickle, and test the pickled model.

Using the model

Run python test.py to have the program provide an opportunity to input text and provide more explicit output regarding the result.

Details on the Model

Dataset

The dataset used to train this model includes photos selected from the Soe, Norberg, Guribye, and Slakovik made available here. The transcription through OCR, classification, and labeling applied for this project were done by the developers of ariadne.

@inproceedings{10.1145/3419249.3420132,
author = {Soe, Than Htut and Nordberg, Oda Elise and Guribye, Frode and Slavkovik, Marija},
title = {Circumvention by Design - Dark Patterns in Cookie Consent for Online News Outlets},
year = {2020},
isbn = {9781450375795},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3419249.3420132},
doi = {10.1145/3419249.3420132},
abstract = { To ensure that users of online services understand what data are collected and how they are used in algorithmic decision-making, the European Union’s General Data Protection Regulation (GDPR) specifies informed consent as a minimal requirement. For online news outlets consent is commonly elicited through interface design elements in the form of a pop-up. We have manually analyzed 300 data collection consent notices from news outlets that are built to ensure compliance with GDPR. The analysis uncovered a variety of strategies or dark patterns that circumvent the intent of GDPR by design. We further study the presence and variety of these dark patterns in these “cookie consents” and use our observations to specify the concept of dark pattern in the context of consent elicitation.},
booktitle = {Proceedings of the 11th Nordic Conference on Human-Computer Interaction: Shaping Experiences, Shaping Society},
articleno = {19},
numpages = {12},
keywords = {dark patterns, cookie consent notice, CCPA, GDPR},
location = {Tallinn, Estonia},
series = {NordiCHI '20}
}

Training

The training can be found in calliope.ipynb and follows DataCamp's guide here.

About

📜 Language clarity model for Ariadne

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published