Skip to content

Chinese–English Stopword List (3,076 entries, including special symbols)

Notifications You must be signed in to change notification settings

endNone/stopwords

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

中文  |  English


Stopwords

Chinese and English Stopwords List (3,076 words, including some special characters)

This stopwords list aggregates resources from multiple authoritative sources, including the Harbin Institute of Technology stopwords list, Baidu stopwords list, Sichuan University Machine Intelligence Laboratory stopwords repository, as well as various resources from the CSDN and GitHub communities. It has been carefully curated and integrated to include common but semantically weak words and symbols in both Chinese and English, aimed at removing noise from text data to improve data processing quality and efficiency. This list is suitable for various text mining, sentiment analysis, keyword extraction and other scenarios.

Star History

Star History Chart

About

Chinese–English Stopword List (3,076 entries, including special symbols)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors