Chinese-Test-Classification

Use jieba to do Chinese Word Segmentation, then transform and persist as the type of Bunch which is defined in Scikit-Learn .Use TF-IDF method to vectorize it.Finally ,use Naive Bayes to train and test it.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
test_corpus_seg		test_corpus_seg
test_word_bag		test_word_bag
train_corpus_seg		train_corpus_seg
train_corpus_small		train_corpus_small
train_word_bag		train_word_bag
Predict.py		Predict.py
README.md		README.md
corpus_segment.py		corpus_segment.py
segment2Bunch.py		segment2Bunch.py
test2Bunch.py		test2Bunch.py
test_space.py		test_space.py
vector_space.py		vector_space.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chinese-Test-Classification

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Chinese-Test-Classification

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages