TKDE_SA.pdf (939.26 kB)
Cross-domain sentiment classification using a sentiment sensitive thesaurus
journal contribution
posted on 2023-06-08, 14:12 authored by Danushka Bollegala, David WeirDavid Weir, John CarrollAutomatic classification of sentiment is important for numerous applications such as opinion mining, opinion summarization, contextual advertising, and market analysis. However, sentiment is expressed differently in different domains, and annotating corpora for every possible domain of interest is costly. Applying a sentiment classifier trained using labeled data for a particular domain to classify sentiment of user reviews on a different domain often results in poor performance. We propose a method to overcome this problem in cross-domain sentiment classification. First, we create a sentiment sensitive distributional thesaurus using labeled data for the source domains and unlabeled data for both source and target domains. Sentiment sensitivity is achieved in the thesaurus by incorporating document level sentiment labels in the context vectors used as the basis for measuring the distributional similarity between words. Next, we use the created thesaurus to expand feature vectors during train and test times in a binary classifier. The proposed method significantly outperforms numerous baselines and returns results that are comparable with previously proposed cross-domain sentiment classification methods. We conduct an extensive empirical analysis of the proposed method on single and multi-source domain adaptation, unsupervised and supervised domain adaptation, and numerous similarity measures for creating the sentiment sensitive thesaurus.
History
Publication status
- Published
File Version
- Accepted version
Journal
IEEE Transactions on Knowledge and Data EngineeringISSN
1041-4347Publisher
Institute of Electrical and Electronics EngineersExternal DOI
Issue
8Volume
25Page range
1719-1731Department affiliated with
- Informatics Publications
Full text available
- Yes
Peer reviewed?
- Yes
Legacy Posted Date
2013-03-18First Open Access (FOA) Date
2013-03-18First Compliant Deposit (FCD) Date
2013-03-18Usage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC