GitHub - igorlfs/poc-code: Projeto Orientado em Computação I 2024/1

Analyze Classification Models by highlighting regions where performance is abnormal

Dependencies

Dependencies are managed with uv. Since it generates a requirements.txt, it should be compatible with other tools, like pip.

uv pip install -r requirements.txt

Input

There are 5 parameters, 3 of which are required:

A CSV containing the dataset. (required)
A CSV containing the model's errors. Each instance should have an error associated with each class. You can use a binary approach: if the prediction for a given instance is right, all of its errors are zero. Otherwise, if the prediction is wrong, all of its errors are zero except for the actual class, which receives an error of one. (required)
The dataset's column that represents the output. By default, it's "target".
The maximum number of subgroups to generate. By default, it's 20.
A class from the dataset, to filter the subgroups for that specific class. (required)
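The binary approach to building the errors CSV can be sketched as follows. This is a minimal illustration, not code from this repo: the helper names (binary_errors, write_errors_csv) are hypothetical, and the layout of one column per class, named after the class, is an assumption. Check ./data/errors/iris.csv for the exact format the tool expects.

```python
import csv
import io


def binary_errors(y_true, y_pred, classes):
    """Build one error row per instance using the binary scheme."""
    rows = []
    for actual, predicted in zip(y_true, y_pred):
        # Start with zero error for every class.
        row = {label: 0 for label in classes}
        if actual != predicted:
            # Wrong prediction: only the actual class receives an error of one.
            row[actual] = 1
        rows.append(row)
    return rows


def write_errors_csv(rows, classes, handle):
    """Write the rows as CSV, one column per class (assumed layout)."""
    writer = csv.DictWriter(handle, fieldnames=list(classes))
    writer.writeheader()
    writer.writerows(rows)


if __name__ == "__main__":
    # Illustrative labels only; real values would come from your model.
    classes = ["setosa", "versicolor", "virginica"]
    y_true = ["setosa", "versicolor", "virginica"]
    y_pred = ["setosa", "virginica", "virginica"]
    buffer = io.StringIO()
    write_errors_csv(binary_errors(y_true, y_pred, classes), classes, buffer)
    print(buffer.getvalue())
```

In this sketch the second instance is misclassified, so its row carries an error of one in the versicolor column (its actual class) and zero elsewhere.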

This repo contains a sample dataset (iris) and a model's errors to provide an example execution:

source .venv/bin/activate # assuming your virtual env is located at .venv
python -m src --data ./data/iris.csv --errors ./data/errors/iris.csv --class setosa

