compare-nlp-models

Comparing how different Language Models and context free embeddings perform on very small amount of labeled training data.

Models to compare:
Baseline:Majority class (always predict majority class)\

FLAIR:

Glove
en-twitter (static word embedding)
Fasttext: en crawl
Fasttext: news wiki
ELMO
Flair-embedding
Bert base-cased
BPE - Byte Pair Embedding
GPT-1

Data sets:
Hatespeech Detection in Tweets Automated Hate Speech Detection and the Problem of Offensive Language
Paper: https://aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/view/15665
Data: https://github.com/t-davidson/hate-speech-and-offensive-language/

Sentiment analysis of Business news
Benchmarks and models for entity-oriented polarity detection
Paper: http://www.aclweb.org/anthology/N18-3016
Dataset: http://puls.cs.helsinki.fi/polarity

Train data sizes:
0, 100, 200, 500, 1000, 3000, 7000, 18783 (business news articles)
0, 100, 200, 500, 1000, 3000, 7000, 13718 (hatespeech Tweets)

Baseline
I will use predicting always majority class as a baseline, which models should improveover.

For Hatespeech data set, the majority class (1 - offensive language) represents 76 %of samples in test data (0.7596). Predicting always majority gives also f1-score of 0.76, which we will use as a lower bound.

For Business news data set, the majority class is 3 "positive" with 31.7 % of data.This gives f1 baseline of 0.317. Having total of 5 classes in news data, compared to 3 of hatespeech, the news dataset gives harder problem to score numerically high.

Results

F1 per train samples: Hatespeech / Tweets

F1 per train samples: Business new

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
conda environment		conda environment
dataset_businessnews/output		dataset_businessnews/output
dataset_hatespeech/output		dataset_hatespeech/output
images		images
.gitignore		.gitignore
1_download_data.ipynb		1_download_data.ipynb
2_preprocess_data-ver-HATESPEECH.ipynb		2_preprocess_data-ver-HATESPEECH.ipynb
2_preprocess_data-ver-NEWS.ipynb		2_preprocess_data-ver-NEWS.ipynb
3A-model-FLAIR-MAIN-batch-NEWS.ipynb		3A-model-FLAIR-MAIN-batch-NEWS.ipynb
3B-model-FLAIR-MAIN-batch-HATESPEECH.ipynb		3B-model-FLAIR-MAIN-batch-HATESPEECH.ipynb
3X(C)-model-tf-hub-bert-hatespeech.ipynb		3X(C)-model-tf-hub-bert-hatespeech.ipynb
3X(D)-model-tf-hub-univ_sentence-hatespeech.ipynb		3X(D)-model-tf-hub-univ_sentence-hatespeech.ipynb
4-analyze-NEWS.ipynb		4-analyze-NEWS.ipynb
4-analyze-hatespeech.ipynb		4-analyze-hatespeech.ipynb
LICENSE		LICENSE
README.md		README.md
loss.tsv		loss.tsv
mytable_f1_trainsize.tex		mytable_f1_trainsize.tex
mytable_f1_trainsize_NEWS.tex		mytable_f1_trainsize_NEWS.tex
test.tsv		test.tsv
training.log		training.log
weights.txt		weights.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

compare-nlp-models

About

Releases

Packages

Languages

License

jannenev/compare-nlp-models

Folders and files

Latest commit

History

Repository files navigation

compare-nlp-models

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages