Skip to content
This repository has been archived by the owner on Mar 16, 2024. It is now read-only.

Commit

Permalink
feat: first alpha version
Browse files Browse the repository at this point in the history
  • Loading branch information
MrlolDev committed Jun 11, 2023
1 parent 3e8c628 commit f2dbcb3
Show file tree
Hide file tree
Showing 8 changed files with 2,397 additions and 0 deletions.
4 changes: 4 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,10 @@ coverage

# Grunt intermediate storage (https://gruntjs.com/creating-plugins#storing-task-files)
.grunt
data/
steps/
cleaned_data/
log.txt

# Bower dependency directory (https://bower.io/)
bower_components
Expand Down
33 changes: 33 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,2 +1,35 @@
# Data-cleaning

A tool for cleaning datasets from turing ai

# Usage guide

1. Clone the repository

```
git clone https://github.com/TuringAI-Team/Data-cleaning.git
```

2. Install the requirements

```
npm install
```

3. Create folders, where you will store your datasets and results
Create a folder in root called data
Create a folder in root called steps
4. Export your dataset from turing ai to the data folder and name it data.csv
5. Run the CLI

```
npm run dev
```

6. Follow the instructions in the CLI
7. The results will be stored in the steps folder

# To do

- [ ] Fix bug where sometimes it answer with invalid json
- [ ] Improve cleaning to not remove so much data
Loading

0 comments on commit f2dbcb3

Please sign in to comment.