Model description

[More Information Needed]

Intended uses & limitations

[More Information Needed]

Training Procedure

[More Information Needed]

Hyperparameters

Click to expand
Hyperparameter Value
memory
steps [('count', CountVectorizer()), ('clf', MultinomialNB())]
verbose False
count CountVectorizer()
clf MultinomialNB()
count__analyzer word
count__binary False
count__decode_error strict
count__dtype <class 'numpy.int64'>
count__encoding utf-8
count__input content
count__lowercase True
count__max_df 1.0
count__max_features
count__min_df 1
count__ngram_range (1, 1)
count__preprocessor
count__stop_words
count__strip_accents
count__token_pattern (?u)\b\w\w+\b
count__tokenizer
count__vocabulary
clf__alpha 1.0
clf__class_prior
clf__fit_prior True
clf__force_alpha True

Model Plot

Pipeline(steps=[('count', CountVectorizer()), ('clf', MultinomialNB())])
In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

Evaluation Results

Metric Value
accuracy 0.822975
f1 score 0.822975

How to Get Started with the Model

[More Information Needed]

Model Card Authors

This model card is written by following authors:

[More Information Needed]

Model Card Contact

You can contact the model card authors through following channels: [More Information Needed]

Citation

Below you can find information related to citation.

BibTeX:

[More Information Needed]

citation_bibtex

bibtex @inproceedings{...,year={2020}}

get_started_code

import pickle with open(pkl_filename, 'rb') as file: clf = pickle.load(file)

model_card_authors

skops_user

limitations

This model is not ready to be used in production.

model_description

This is a Multinomial Naive Bayes model trained on 20 news groups dataset.Count vectorizer is used for vectorization.

eval_method

The model is evaluated on validation data from 20 news group's test split, using accuracy and F1-score with micro average.

Confusion matrix

Confusion matrix

Classification Report

Click to expand
index precision recall f1-score support
alt.atheism 0.927007 0.940741 0.933824 135
comp.graphics 0.6 0.831325 0.69697 166
comp.os.ms-windows.misc 0.833333 0.0588235 0.10989 170
comp.sys.ibm.pc.hardware 0.603306 0.802198 0.688679 182
comp.sys.mac.hardware 0.962687 0.704918 0.81388 183
comp.windows.x 0.665236 0.91716 0.771144 169
misc.forsale 0.905983 0.616279 0.733564 172
rec.autos 0.895028 0.848168 0.870968 191
rec.motorcycles 0.925532 0.878788 0.901554 198
rec.sport.baseball 0.987421 0.934524 0.960245 168
rec.sport.hockey 0.963415 0.969325 0.966361 163
sci.crypt 0.767932 0.933333 0.842593 195
sci.electronics 0.803468 0.785311 0.794286 177
sci.med 0.925287 0.936047 0.930636 172
sci.space 0.891192 0.977273 0.932249 176
soc.religion.christian 0.737069 0.93956 0.826087 182
talk.politics.guns 0.863874 0.953757 0.906593 173
talk.politics.mideast 0.795 0.99375 0.883333 160
talk.politics.misc 0.901316 0.878205 0.88961 156
talk.religion.misc 1 0.429907 0.601307 107
macro avg 0.847704 0.81647 0.802689 3395
weighted avg 0.843632 0.822975 0.805774 3395