Model description

[More Information Needed]

Intended uses & limitations

[More Information Needed]

Training Procedure

[More Information Needed]

Hyperparameters

Click to expand

Hyperparameter	Value
memory
steps	[('count', CountVectorizer()), ('clf', MultinomialNB())]
verbose	False
count	CountVectorizer()
clf	MultinomialNB()
count__analyzer	word
count__binary	False
count__decode_error	strict
count__dtype	<class 'numpy.int64'>
count__encoding	utf-8
count__input	content
count__lowercase	True
count__max_df	1.0
count__max_features
count__min_df	1
count__ngram_range	(1, 1)
count__preprocessor
count__stop_words
count__strip_accents
count__token_pattern	(?u)\b\w\w+\b
count__tokenizer
count__vocabulary
clf__alpha	1.0
clf__class_prior
clf__fit_prior	True
clf__force_alpha	True

Model Plot

Pipeline(steps=[('count', CountVectorizer()), ('clf', MultinomialNB())])

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

Evaluation Results

Metric	Value
accuracy	0.822975
f1 score	0.822975

How to Get Started with the Model

[More Information Needed]

Model Card Authors

This model card is written by following authors:

[More Information Needed]

Model Card Contact

You can contact the model card authors through following channels: [More Information Needed]

Citation

Below you can find information related to citation.

BibTeX:

[More Information Needed]

citation_bibtex

bibtex @inproceedings{...,year={2020}}

get_started_code

import pickle with open(pkl_filename, 'rb') as file: clf = pickle.load(file)

model_card_authors

skops_user

limitations

This model is not ready to be used in production.

model_description

This is a Multinomial Naive Bayes model trained on 20 news groups dataset.Count vectorizer is used for vectorization.

eval_method

The model is evaluated on validation data from 20 news group's test split, using accuracy and F1-score with micro average.

Confusion matrix

Classification Report

Click to expand

index	precision	recall	f1-score	support
alt.atheism	0.927007	0.940741	0.933824	135
comp.graphics	0.6	0.831325	0.69697	166
comp.os.ms-windows.misc	0.833333	0.0588235	0.10989	170
comp.sys.ibm.pc.hardware	0.603306	0.802198	0.688679	182
comp.sys.mac.hardware	0.962687	0.704918	0.81388	183
comp.windows.x	0.665236	0.91716	0.771144	169
misc.forsale	0.905983	0.616279	0.733564	172
rec.autos	0.895028	0.848168	0.870968	191
rec.motorcycles	0.925532	0.878788	0.901554	198
rec.sport.baseball	0.987421	0.934524	0.960245	168
rec.sport.hockey	0.963415	0.969325	0.966361	163
sci.crypt	0.767932	0.933333	0.842593	195
sci.electronics	0.803468	0.785311	0.794286	177
sci.med	0.925287	0.936047	0.930636	172
sci.space	0.891192	0.977273	0.932249	176
soc.religion.christian	0.737069	0.93956	0.826087	182
talk.politics.guns	0.863874	0.953757	0.906593	173
talk.politics.mideast	0.795	0.99375	0.883333	160
talk.politics.misc	0.901316	0.878205	0.88961	156
talk.religion.misc	1	0.429907	0.601307	107
macro avg	0.847704	0.81647	0.802689	3395
weighted avg	0.843632	0.822975	0.805774	3395