中文 | English
Chinese and English Stopwords List (3,076 words, including some special characters)
This stopwords list aggregates resources from multiple authoritative sources, including the Harbin Institute of Technology stopwords list, Baidu stopwords list, Sichuan University Machine Intelligence Laboratory stopwords repository, as well as various resources from the CSDN and GitHub communities. It has been carefully curated and integrated to include common but semantically weak words and symbols in both Chinese and English, aimed at removing noise from text data to improve data processing quality and efficiency. This list is suitable for various text mining, sentiment analysis, keyword extraction and other scenarios.