reuters

This project tests different features and parameters for CNN and LSTM models and their effect on news article classification. GloVe is used as the base embedding.
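As a rough illustration of how pretrained GloVe vectors can be turned into an embedding matrix, here is a minimal sketch. It assumes the plain-text GloVe format (one token per line followed by its vector components). The function names, the toy vectors, and the zero-vector fallback for unknown words are assumptions made for this example, not necessarily what the notebooks do.

```python
import io


def load_glove(lines):
    """Parse GloVe text format: a token followed by space-separated floats."""
    embeddings = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        embeddings[parts[0]] = [float(x) for x in parts[1:]]
    return embeddings


def build_embedding_matrix(vocab, embeddings, dim):
    """Row i holds the vector for vocab[i]; words missing from GloVe get zeros."""
    return [embeddings.get(word, [0.0] * dim) for word in vocab]


# Toy 3-dimensional vectors, made up for illustration only.
# The published GloVe vectors are much larger.
toy_file = io.StringIO("market 0.1 0.2 0.3\noil -0.4 0.5 0.0\n")
glove = load_glove(toy_file)
matrix = build_embedding_matrix(["<pad>", "oil", "market", "unseen"], glove, dim=3)
```

A matrix built this way would typically be used to initialize the embedding layer of the CNN or LSTM model.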

The task is multi-label classification of news articles from the Reuters data set, which contains about 300,000 articles in XML form.
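In a multi-label setup each article can carry several labels at once, so the target is usually a multi-hot vector rather than a single class index. The sketch below shows one way to build such targets. The label codes and helper names are illustrative assumptions, not taken from the project's code.

```python
def build_label_index(label_sets):
    """Map every distinct label to a fixed column index (sorted for stability)."""
    labels = sorted({label for labels in label_sets for label in labels})
    return {label: i for i, label in enumerate(labels)}


def multi_hot(labels, label_index):
    """Return a 0/1 vector with a 1 in the column of each assigned label."""
    vec = [0] * len(label_index)
    for label in labels:
        vec[label_index[label]] = 1
    return vec


# Illustrative label codes (assumed for this example).
article_labels = [["C15", "CCAT"], ["GCAT"], ["CCAT", "M14", "MCAT"]]
label_index = build_label_index(article_labels)
targets = [multi_hot(labels, label_index) for labels in article_labels]
```

With targets of this shape, a model would normally end in one sigmoid output per label instead of a softmax over all labels.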

The code produces large files, especially reuters_all.pkl (435.8 MB) and its train/dev/test splits. These are listed in .gitignore so that they are not uploaded to GitHub; they can be produced locally by running the code instead.
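For orientation, a shuffle-and-split step that writes pickle files could look roughly like the sketch below. The split fractions, the seed, the output file names (`reuters_train.pkl` and so on) and the synthetic samples are all assumptions for this example; the preprocessing notebook may do this differently.

```python
import os
import pickle
import random
import tempfile


def split_dataset(samples, dev_frac=0.1, test_frac=0.1, seed=0):
    """Shuffle a copy of the samples and cut it into train/dev/test parts."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_dev = int(len(shuffled) * dev_frac)
    n_test = int(len(shuffled) * test_frac)
    return {
        "dev": shuffled[:n_dev],
        "test": shuffled[n_dev:n_dev + n_test],
        "train": shuffled[n_dev + n_test:],
    }


def save_splits(splits, out_dir):
    """Pickle each split to its own file and return the paths written."""
    paths = {}
    for name, part in splits.items():
        path = os.path.join(out_dir, f"reuters_{name}.pkl")  # hypothetical file name
        with open(path, "wb") as f:
            pickle.dump(part, f)
        paths[name] = path
    return paths


# Synthetic stand-in for the parsed articles: (text, labels) pairs.
samples = [(f"article {i}", [i % 3]) for i in range(100)]
splits = split_dataset(samples)
out_dir = tempfile.mkdtemp()
paths = save_splits(splits, out_dir)
```

Fixing the seed keeps the splits reproducible, so the ignored files can be regenerated identically on another machine.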

Results

Small amount of training data

3426 samples.

Even on small data, the CNN-based models converged robustly and very quickly, within a couple of epochs, while the LSTM-based models took a long time, a bit over 50 epochs, to start showing any sign of convergence. On small data the CNN had already reached its point of overfitting before the LSTM started to converge. The small data set was not enough for the LSTM, possibly because with 126 classes the number of samples for some individual classes remains small (we did not balance the training set, but used a random sample). The same was true for the LSTM+CNN architecture.
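The point about rare classes in an unbalanced random sample can be checked by counting label frequencies. The sketch below uses synthetic data with an assumed skewed label distribution; only the sample size (3426) and the number of classes (126) come from the text above. The helper names and the `min_count` threshold are hypothetical.

```python
import random
from collections import Counter


def label_counts(label_sets):
    """Count in how many samples each label occurs."""
    counts = Counter()
    for labels in label_sets:
        counts.update(set(labels))  # a label counts once per sample
    return counts


def rare_labels(counts, all_labels, min_count):
    """Labels that occur in fewer than min_count samples (including never)."""
    return sorted(label for label in all_labels if counts[label] < min_count)


# Synthetic, skewed label distribution (an assumption, not the real data).
rng = random.Random(0)
all_labels = [f"class_{i}" for i in range(126)]
weights = [1.0 / (i + 1) for i in range(126)]
sample = [rng.choices(all_labels, weights=weights, k=2) for _ in range(3426)]

counts = label_counts(sample)
rare = rare_labels(counts, all_labels, min_count=20)
```

Running the same count on the real training split would show how many of the 126 classes have too few examples for the LSTM to learn from.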

Training times were measured on a GTX 1070 GPU (8 GB).

Full training data

299773 samples. On the full data the CNN is still superior in training time, but here the LSTM can match it in accuracy. Test accuracy was roughly equal across models, with some fluctuation between runs. CNN+LSTM achieved an accuracy (0.837) quite similar to that of the best convolutional model.

There are no big differences in accuracy between the models. The accuracy of a single model also varies slightly between training runs with different random initializations.
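Because accuracy varies between runs, a small difference between two models is only meaningful relative to that run-to-run spread. A minimal sketch of such a comparison follows. The accuracy values are placeholders invented for the example, not results from this project, and the range-overlap check is a crude heuristic rather than a statistical test.

```python
import statistics


def summarize_runs(accuracies):
    """Mean, sample standard deviation and range over repeated runs."""
    return {
        "mean": statistics.mean(accuracies),
        "stdev": statistics.stdev(accuracies),
        "min": min(accuracies),
        "max": max(accuracies),
    }


def ranges_overlap(a, b):
    """True if the [min, max] ranges of two summaries overlap."""
    return a["min"] <= b["max"] and b["min"] <= a["max"]


# Placeholder accuracies from three hypothetical seeds per model.
model_a = summarize_runs([0.80, 0.82, 0.81])
model_b = summarize_runs([0.81, 0.83, 0.80])
overlap = ranges_overlap(model_a, model_b)
```

If the ranges overlap, as in this made-up case, the runs give no clear evidence that one model is more accurate than the other.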

