Prerequites:
- nltk.tokenize (sudo pip install -U nltk)
- tkinter
- enchant.checker
Input: file .tsv with many raws that are labeled
(" tweet_id positive The apple is gooood #fruits @pippo
tweet_id negative My moother eat fruits!!!!
....")
Output file .tsv cleaned
You can choose with an interactive GUI to clean your data using various techniques
To execute the GUI: launch cleaner_GUI.py
Article available at: http://ceur-ws.org/Vol-1748/paper-06.pdf