This is a library for sentiment analysis in dictionary framework. Two dictionaries are provided in the library, namely, Harvard IV-4 and Loughran and McDonald Financial Sentiment Dictionaries, which are sentiment dictionaries for general and financial sentiment analysis.
See also http://www.wjh.harvard.edu/~inquirer/ and https://www3.nd.edu/~mcdonald/Word_Lists.html .
Positive
and Negative
are word counts for the words in positive and negative sets.
Polarity
and Subjectivity
are calculated in the same way of Lydia system.
See also http://www.cs.sunysb.edu/~skiena/lydia/
Install pysentiment2
:
pip install pysentiment2
A simple example:
import pysentiment2
# Do something with pysentiment2
To use the Harvard IV-4 dictionary, create an instance of the HIV4
class
import pysentiment2 as ps
hiv4 = ps.HIV4()
tokens = hiv4.tokenize(text) # text can be tokenized by other ways
# however, dict in HIV4 is preprocessed
# by the default tokenizer in the library
score = hiv4.get_score(tokens)
HIV4
is a subclass for pysentiment2.base.BaseDict
. BaseDict
can be inherited by
implmenting init_dict
to initialize _posset
and _negset
for the dictionary
to calculate 'positive' or 'negative' scores for terms.
Similarly, to use the Loughran and McDonald dictionary:
import pysentiment2 as ps
lm = ps.LM()
tokens = lm.tokenize(text)
score = lm.get_score(tokens)
See the documentation here.
pysentiment2
created by Nick DeRobertis but based on pysentiment
by Zhichao Han. GNU GPL License.