Negative words in Amazon dataset. This dataset contains an overview of negative words used in bad and good ratings. A list of negative words was downloaded from - https://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html#lexicon - http://www.cs.uic.edu/~liub/FBS/opinion-lexicon-English.rar A list of movie reviews was downloaded from - https://snap.stanford.edu/data/web-Movies.html - https://snap.stanford.edu/data/movies.txt.gz For 1-star and 5-star Amazon reviews, the fields "text" and "summary" were used. In these texts it was checked, which negative words occur. For each negative word, the number of reviews (containing the word) was counted. combined.tar.gz - List format: (word, count in 1-star docs, count in 5-star docs) - negative-words.pickle - negative-words.csv - Used code: https://github.com/EML4U/Drift-detector-comparison/blob/7713df4107ea9917c330aa6b952172649c80b89c/word2vec/opinion_words_amazon_combine.py extracted.tar.gz - Counted documents where single negative words occur at least once in 1-star and 5-star documents - negative-words-score-one.pickle - negative-words-score-five.pickle - Used code: https://github.com/EML4U/Drift-detector-comparison/blob/7713df4107ea9917c330aa6b952172649c80b89c/word2vec/opinion_words_amazon_extract.py Also available at Google Drive: https://docs.google.com/spreadsheets/d/1MB0uB1uJz2RqFNTPLA5UoA0Brq7cjGeEUTSs_pJ4jdI/edit?usp=sharing