Skip to content

rasgaard/02462-text_classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Exam project in 02462 - Signals and data

This repository contains project work in the course "Signals and data" taught at the Technical University of Denmark. It serves as experiments for a comparison between two text classification methods.

The project is divided into two notebooks. Common functions are found in common.py.

The experiments for each methods can be found in in the notebook in their respective folders.

Directory structure

.
├── baseline
│   ├── baseline.ipynb
│   └── glove.6B.50d.txt
├── common.py
├── emails.csv
├── fasttext
│   ├── CV
│   ├── fasttext.ipynb
│   ├── news_fasttext_classifier.p
│   ├── news_train_emb.p
│   ├── spam_fasttext_classifier.p
│   ├── spam_train_emb.p
│   └── text_classifier.py 
├── __init__.py
├── news_data.npz
├── news_data.zip
├── readme.md
├── similar_news.txt
├── spam_data.npz
└── spam_data.zip

Articles used

'World' = 'blue' 0, 'sports' = 'red' 1, 'Business'= 'green' 2, 'Sci/Tec'='cyan' 3

formatted as real, could be, text(first paragraph). gathered from frontpage of https://www.bbc.com/news on 08-05-2020

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published