acl acl2010 acl2010-123 knowledge-graph by maker-knowledge-mining

123 acl-2010-Generating Focused Topic-Specific Sentiment Lexicons


Source: pdf

Author: Valentin Jijkoun ; Maarten de Rijke ; Wouter Weerkamp

Abstract: We present a method for automatically generating focused and accurate topicspecific subjectivity lexicons from a general purpose polarity lexicon that allow users to pin-point subjective on-topic information in a set of relevant documents. We motivate the need for such lexicons in the field of media analysis, describe a bootstrapping method for generating a topic-specific lexicon from a general purpose polarity lexicon, and evaluate the quality of the generated lexicons both manually and using a TREC Blog track test set for opinionated blog post retrieval. Although the generated lexicons can be an order of magnitude more selective than the general purpose lexicon, they maintain, or even improve, the performance of an opin- ion retrieval system.

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 nl j Abstract We present a method for automatically generating focused and accurate topicspecific subjectivity lexicons from a general purpose polarity lexicon that allow users to pin-point subjective on-topic information in a set of relevant documents. [sent-3, score-0.931]

2 Although the generated lexicons can be an order of magnitude more selective than the general purpose lexicon, they maintain, or even improve, the performance of an opin- ion retrieval system. [sent-5, score-0.451]

3 Specifically, media analysis concerns the following system task: given a topic and list of documents (discussing the topic), find all instances of attitudes toward the topic (e. [sent-7, score-0.637]

4 How can technology developed for sentiment analysis be applied to media analysis? [sent-18, score-0.487]

5 In order to use a sentiment extraction system for a media analysis problem, a system would have to be able to determine which of the extracted sentiments are actually relevant, i. [sent-19, score-0.654]

6 , it would not only have to identify specific targets of all extracted sentiments, but also decide which of the targets are relevant for the topic at hand. [sent-21, score-1.018]

7 In sentiment retrieval, on the other hand, the topic is initially present in the task definition, but it is left to the user to identify sources and targets of sentiments, as systems typically return a list of documents ranked by relevance and opinionatedness. [sent-30, score-1.054]

8 To use a traditional sentiment retrieval system in media analysis, one would still have to manually go through ranked lists of documents returned by the system. [sent-31, score-0.715]

9 c As2s0o1c0ia Atisosnoc foiart Cionom fopru Ctaotmiopnuatla Lti on gaulis Lti cnsg,u piasgtiecs 585–594, To be able to support media analysis, we need to combine the specificity of (phrase- or word-level) sentiment analysis with the topicality provided by sentiment retrieval. [sent-34, score-0.834]

10 Moreover, we should be able to identify sources and specific targets of opinions. [sent-35, score-0.429]

11 In order to move towards the requirements of media analysis, in this paper we focus on two of the problems identified above: (1) pinpointing evidence for a system’s decisions about the presence of sentiment in text, and (2) identifying specific targets of sentiment. [sent-41, score-0.837]

12 We address these problems by introducing a special type of lexical resource: a topic-specific subjectivity lexicon that indicates specific relevant targets for which sentiments may be expressed; for a given topic, such a lexicon consists of pairs (syntactic clue, target). [sent-42, score-1.136]

13 We evaluate the quality of the lexicon both manually and in the setting of an opinionated blog post retrieval task. [sent-44, score-0.907]

14 We demonstrate that such a lexicon is highly focused, allowing one to effectively pinpoint evidence for sentiment, while being competetive with traditional subjectivity lexicons consisting of (a large number of) clue words. [sent-45, score-0.839]

15 Instead, we make an existing lexicon more focused, so that it can be used to actually pin-point subjectivity in documents relevant to a given topic. [sent-47, score-0.53]

16 We discuss related work in four parts: sentiment analysis in general, domain- and targetspecific sentiment analysis, product review mining and sentiment retrieval. [sent-49, score-1.105]

17 Kim and Hovy (2004) select candidate sentiment sentences and use word-based sentiment classifiers to classify unseen words into a negative or positive class. [sent-53, score-0.694]

18 (2005): a classifier is learnt that distinguishes between polar and neutral sentences, based on a prior polarity lexicon and an annotated corpus. [sent-58, score-0.39]

19 After this initial step, the sentiment sentences are classified as negative or positive; again, a prior polarity lexicon and syntactic features are used. [sent-60, score-0.744]

20 2 Domain- and target-specific sentiment The way authors express their attitudes varies with the domain: An unpredictable movie can be positive, but unpredictable politicians are usually something negative. [sent-66, score-0.488]

21 (2007) aim at measuring overall subjectivity or polarity towards a certain entity; they identify sentiments using domain-specific lexicons. [sent-69, score-0.464]

22 All named entites in a sentence containing a clue from a lexicon are 586 considered targets of sentiment for counting. [sent-71, score-1.15]

23 They identify “sentiment topics,” noun phrases assumed to be linked to a sentiment clue in the same expression. [sent-75, score-0.634]

24 Next, the sentiment topics are identified, and based on these sentiment topics and the current list of clues, new potential clues are extracted. [sent-78, score-0.898]

25 Fahrni and Klenner (2008) identify potential targets in a given domain, and create a targetspecific polarity adjective lexicon. [sent-80, score-0.58]

26 In contrast to much of the work in the literature, we need to specialize subjectivity lexicons not for a domain and target, but for “topics. [sent-86, score-0.391]

27 However, topics in our setting go beyond concrete products, and the diversity and generality of possible topics makes it difficult to apply such supervised or thesaurus-based methods to identify opinion targets. [sent-94, score-0.386]

28 Moreover, in our method we directly use associations between targets and opinions to extract both. [sent-95, score-0.417]

29 4 Sentiment retrieval At TREC, the Text REtrieval Conference, there has been interest in a specific type of sentiment analysis: opinion retrieval. [sent-97, score-0.753]

30 Finding blog posts that are not just about a topic, but also contain an opinion on the topic, proves to be a difficult task. [sent-100, score-0.647]

31 In stage (2) one commonly uses either a binary classifier to distinguish between opinionated and non-opinionated documents or applies reranking of the initial result list using some opinion score. [sent-103, score-0.603]

32 The authors use SentiWordNet and a corpus-derived lexicon to construct an opinion score for each post in an initial ranking of blog posts. [sent-107, score-0.813]

33 This opinion score is combined with the relevance score, and posts are reranked according to this new score. [sent-108, score-0.449]

34 This domain-specific lexicon is constructed using feedback-style learning: retrieve an initial list of documents and use the top documents as training data to learn an opinion lexicon. [sent-112, score-0.687]

35 3 Generating Topic-Specific Lexicons In this section we describe how we generate a lexicon of subjectivity clues and targets for a given topic and a list of relevant documents (e. [sent-115, score-1.125]

36 As the background corpus, we used the set of documents from the assessment pools of TREC 2006–2008 opinion retrieval tasks (described in detail in section 4). [sent-120, score-0.538]

37 1 Step 1: Extracting syntactic contexts We start with a general domain-independent prior polarity lexicon of 8,821 clue words (Wilson et al. [sent-126, score-0.686]

38 First, we identify syntactic contexts in which specific clue words can be used to express 1http : / / nlp . [sent-128, score-0.406]

39 shtml attitude: we try to find how a clue word can be syntactically linked to targets of sentiments. [sent-131, score-0.595]

40 Our entropy-driven selection of syntactic contexts of a clue word is based on the following assumption: Assumption 1: In text, targets of sentiments are more diverse than sources of sentiments or other accompanying attributes such as location, time, manner, etc. [sent-147, score-0.959]

41 To summarize, at the end of Step 1 of our method, we have extracted a list of pairs (clue word, syntactic context) such that for occurrences of the clue word, the words at the endpoint of the syntactic dependency are likely to be targets of sentiments. [sent-150, score-0.744]

42 2 Step 2: Selecting potential targets Here, we use the extracted syntantic clues to identify words that are likely to serve as specific targets for opinions about the topic in the relevant documents. [sent-153, score-1.184]

43 Table 1: Examples of subjective syntactic contexts of clue words (based on Stanford dependencies). [sent-160, score-0.419]

44 Assumption 2: The list of relevant documents contains a substantial number of documents on the topic which, moreover, contain sentiments about the topic. [sent-161, score-0.579]

45 To identify potential attitude targets in the relevant documents, we compare their frequency in the relevant documents to the frequency in the background corpus using the standard χ2 statistics. [sent-163, score-0.743]

46 This technique is based on the following assumption: Assumption 3: Sentiment targets related to the topic occur more often in subjective context in the set of relevant documents, than in the background corpus. [sent-164, score-0.684]

47 In other words, while the background corpus contains sentiments towards very diverse subjects, the relevant documents tend to express attitudes related to the topic. [sent-165, score-0.442]

48 As the result of Steps 1 and 2, as candidate targets for a given topic, we only select words that occur in subjective contexts, and that do so more often than we would normally expect. [sent-167, score-0.442]

49 Table 2 shows examples of extracted targets for three TREC topics (see below for a description of our experimental data). [sent-168, score-0.436]

50 For extrinsic evaluation we apply our lexicon generation method to a collection of documents containing opinionated utterances: blog posts. [sent-178, score-0.768]

51 The Blogs06 collection (Macdonald and Ounis, 2006) is a crawl of blog posts from 100,649 blogs over a period of 11 weeks (06/12/2005– 21/02/2006), with 3,215,171 posts in total. [sent-179, score-0.569]

52 TREC 2006–2008 came with the task of opinionated blog post retrieval (Ounis et al. [sent-184, score-0.699]

53 Every topic comes with a set of relevance judgments: Given a topic, a blog post can be either (i) nonrelevant, (ii) relevant, but not opinionated, or (iii) relevant and opinionated. [sent-192, score-0.618]

54 Note that for the opinion retrieval task a document is considered relevant if it is on topic and contains opinions or sentiments towards the topic. [sent-196, score-0.806]

55 For the quantative experiments in Section 6 we need a topical baseline: a set of blog posts potentially relevant to each topic. [sent-200, score-0.549]

56 Because our topic-specific lexicons consist of triples (clue word, syntactic context, target), they actually contain more words than topic-independent lexicons of the same size, but topic-specific entries are more selective, which makes the lexicon more focused. [sent-205, score-0.831]

57 Table 3 compares the application of topic-independent and topic-specific lexicons to on-topic blog text. [sent-206, score-0.503]

58 Table 3: Posts with highlighted targets (bold) and subjectivity clues (blue) using topic-independent (left) and topic-specific (right) lexicons. [sent-219, score-0.594]

59 ” We assigned 186 matches of lexicon entries in 30 documents into four classes: • REL: sentiment towards a relevant target; • CREOLN:T seEnXtiTm: esnentt tiomwearndt sto aw rearledvsa a target ;that iCs OirNreTleEvXaTnt: t soe tnhtiem topic odwuea tdos c aon ttaerxgte (e. [sent-221, score-0.954]

60 , opinion about a target “film”, but refering to • • a film different from the topic); IRREL: sentiment towards irrelevant target (e. [sent-223, score-0.586]

61 , “game” fmoern a topic ardbsou itrr a movie); NOSENT: no sentiment at all In total only 8% of matches were manually classified as REL, with 62% classified as NOSENT, 23% as CONTEXT, and 6% as IRREL. [sent-225, score-0.524]

62 On the other hand, among documents assessed as opionionated by TREC assessors, only 13% did not contain matches of the lexicon entries, compared to 27% of non-opinionated documents, which does indicate that our lexicon does attempt to separate non-opinionated documents from opinionated. [sent-226, score-0.697]

63 To this end we deploy our lexicons to the task of opinionated blog post retrieval (Ounis et al. [sent-228, score-0.955]

64 A commonly used approach to this task works in two stages: (1) identify topically rel- evant blog posts, and (2) classify these posts as being opinionated or not. [sent-230, score-0.648]

65 Our experiments have two goals: to compare the use of topic-independent and topic-specific lexicons for the opinionated post retrieval task, and to examine how different settings for the parameters of the lexicon generation affect the empirical quality. [sent-239, score-0.916]

66 1 Reranking using a lexicon To rerank a list of posts retrieved for a given topic, we opt to use the method that showed best performance at TREC 2008. [sent-241, score-0.423]

67 For the opinion score, terms from a (topic-independent) lexicon are matched against the post content, and weighted with the probability of term’s subjectivity. [sent-244, score-0.566]

68 2: Opinion(D) = X P(sub|w) · n(w,D), (2) wX∈O where O is the set of terms in the sentiment lexicon, P(sub|w) indicates the probability of term w being subjective, iacnadt n(w, D) ibs tbhiel ntyum ofbe terr omf times term w occurs in document D. [sent-251, score-0.373]

69 The opinion scoring can weigh lexicon terms differently, using P(sub|w); it normalizes scores to cancel out the Pef(fsecutb o|fw varying mdoacluizmesen stc soirzeess. [sent-252, score-0.505]

70 edu /mpqa / topic-independent lexicon to the topic-dependent lexicons our method generates, which are also plugged into the reranking of Lee et al. [sent-257, score-0.515]

71 1 Results and observations There are several parameters that we can vary when generating a topic-specific lexicon and when using it for reranking: D: the number of syntactic contexts per clue T: the number of extracted targets Sop(D) : the opinion scoring function. [sent-263, score-1.195]

72 Note that α does not affect the lexicon creation, but only how the lexicon is used in reranking. [sent-265, score-0.416]

73 Since we want to assess the quality of lexicons, not in the opinionated retrieval performance as such, we factor out α by selecting the best setting for each lexicon (including the topic-independent) and each evaluation measure. [sent-266, score-0.541]

74 In Table 4 we present the results of evaluation of several lexicons in the context of opinionated blog post retrieval. [sent-267, score-0.815]

75 When comparing topic-specific lexicons to the topic-independent one, most of the differences are not statistically significant, which is surprising given the fact that most topic-specific lexicons we evaluated are substantially smaller (see the two rightmost columns in the table). [sent-269, score-0.541]

76 The smallest lexicon in Table 4 is seven times more selective than the general one, in terms of the number of lexicon matches per document. [sent-270, score-0.512]

77 The only evaluation measure where the topicindependent lexicon consistently outperforms topic-specific ones, is Mean Reciprocal Rank that depends on a single relevant opinionated document high in a ranking. [sent-271, score-0.526]

78 317 42683 Table 4: Evaluation of topic-specific lexicons applied to the opinion retrieval task, compared to the topicindependent lexicon. [sent-284, score-0.667]

79 The two rightmost columns show the number of lexicon entries (average per topic) and the number of matches of lexicon entries in blog posts (average for top 1,000 posts). [sent-285, score-0.964]

80 is that the large general lexicon easily finds a few “obviously subjective” posts (those with heavily used subjective words), but is not better at detecting less obvious ones, as indicated by the recalloriented MAP and R-precision. [sent-286, score-0.456]

81 Interestingly, increasing the number of syntactic contexts considered for a clue word (parameter D) and the number of selected targets (parameter T) leads to substantially larger lexicons, but only gives marginal improvements when lexicons are used for opinion retrieval. [sent-287, score-1.182]

82 This shows that our bootstrapping method is effective at filtering out non-relevant sentiment targets and syntactic clues. [sent-288, score-0.745]

83 The evaluation results also show that the choice of opinion scoring function (Okapi or raw counts) depends on the lexicon size: for smaller, more focused lexicons unnormalized counts are more effective. [sent-289, score-0.733]

84 This also confirms our intuition that for small, focused lexicons simple presence of a sentiment clue in text is a good indication of subjectivity, while for larger lexicons an overall subjectivity scoring of texts has to be used, which can be hard to interpret for (media analysis) users. [sent-290, score-1.264]

85 2 Query expansion with lexicons In this section we evaluate the quality of targets extracted as part of the lexicons by using them for query expansion. [sent-292, score-1.074]

86 Query expansion is a commonly used technique in information retrieval, aimed at getting a better representation of the user’s information need by adding terms to the original retrieval query; for user-generated content, selective query expansion has proved very beneficial (Weerkamp et al. [sent-293, score-0.466]

87 We hypothesize that if our method manages to identify targets that correspond to issues, subtopics or features associated Run MAP P@ 10 Topical blog post retrieval Baseline 0. [sent-295, score-0.95]

88 with the topic, the extracted targets should be good candidates for query expansion. [sent-319, score-0.467]

89 For every test topic, we select the 20 top-scoring targets as expansion terms, and use Indri to return 1,000 most relevant documents for the expanded query. [sent-321, score-0.637]

90 We evaluate the resulting ranking using both topical retrieval and opinionated retrieval measures. [sent-322, score-0.547]

91 The results show that on topical retrieval query expansion using targets significantly improves retrieval performance, while using relevance models actually hurts all evaluation measures. [sent-325, score-0.934]

92 The failure of the latter expansion method can be attributed to the relatively large amount of noise in user-generated content, such as boilerplate 592 material, timestamps of blog posts, comments etc. [sent-326, score-0.374]

93 The consistent improvement for topical retrieval suggests that a topic-specific lexicon can be used both for query expansion (as described in this section) and for opinion reranking (as described in Section 6. [sent-330, score-0.888]

94 We have evaluated the quality of generated lexicons both manually and using a TREC Blog track test set for opinionated blog post retrieval. [sent-334, score-0.841]

95 Although the generated lexicons can be an order of magnitude more selective, they maintain, or even improve, the performance of an opinion retrieval system. [sent-335, score-0.635]

96 As to future work, we intend to combine our method with known methods for topic-specific lexicon expansion (our method is rather concerned with lexicon “restriction”). [sent-336, score-0.511]

97 Existing sentenceor phrase-level (trained) sentiment classifiers can also be used easily: when collecting/counting targets we can weigh them by “prior” score provided by such classifiers. [sent-337, score-0.73]

98 We would also like to extend potential opinion targets to include multi-word phrases (NPs and VPs), in addition to individual words. [sent-340, score-0.594]

99 Finally, we do not identify polarity yet: this can be partially inherited from the initial lexicon and refined automatically via bootstrapping. [sent-341, score-0.401]

100 A generative blog post retrieval model that uses query expansion based on external collections. [sent-458, score-0.682]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('targets', 0.355), ('sentiment', 0.347), ('lexicons', 0.256), ('blog', 0.247), ('clue', 0.24), ('opinion', 0.239), ('lexicon', 0.208), ('opinionated', 0.193), ('posts', 0.161), ('trec', 0.152), ('polarity', 0.146), ('retrieval', 0.14), ('topic', 0.136), ('sentiments', 0.136), ('subjectivity', 0.135), ('documents', 0.12), ('post', 0.119), ('media', 0.108), ('clues', 0.104), ('ounis', 0.096), ('expansion', 0.095), ('subjective', 0.087), ('query', 0.081), ('sop', 0.08), ('attitudes', 0.08), ('topical', 0.074), ('weerkamp', 0.07), ('relevant', 0.067), ('opinions', 0.062), ('movie', 0.061), ('selective', 0.055), ('rijke', 0.052), ('okapi', 0.052), ('reranking', 0.051), ('topics', 0.05), ('relevance', 0.049), ('contexts', 0.049), ('attitude', 0.048), ('indri', 0.048), ('identify', 0.047), ('wilson', 0.045), ('syntactic', 0.043), ('subtopics', 0.042), ('matches', 0.041), ('kim', 0.04), ('background', 0.039), ('polar', 0.036), ('macdonald', 0.036), ('croft', 0.036), ('entries', 0.035), ('atoms', 0.034), ('wiebe', 0.033), ('triples', 0.033), ('choi', 0.033), ('abramoff', 0.032), ('altheide', 0.032), ('boilerplate', 0.032), ('endpoint', 0.032), ('freezing', 0.032), ('godbole', 0.032), ('nosent', 0.032), ('penguins', 0.032), ('saturday', 0.032), ('snatching', 0.032), ('stepmum', 0.032), ('targetspecific', 0.032), ('topicindependent', 0.032), ('topicspecific', 0.032), ('tragic', 0.032), ('analysis', 0.032), ('extracted', 0.031), ('mrr', 0.031), ('march', 0.031), ('subjects', 0.031), ('lee', 0.031), ('scoring', 0.03), ('rightmost', 0.029), ('notebook', 0.028), ('predators', 0.028), ('rerank', 0.028), ('macbook', 0.028), ('dad', 0.028), ('awesome', 0.028), ('bowl', 0.028), ('christmas', 0.028), ('eggs', 0.028), ('errands', 0.028), ('fahrni', 0.028), ('kanayama', 0.028), ('weigh', 0.028), ('specific', 0.027), ('sub', 0.027), ('track', 0.026), ('programme', 0.026), ('retrieved', 0.026), ('evening', 0.026), ('nasukawa', 0.026), ('document', 0.026), ('sigir', 0.025), ('toward', 0.025)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 1.0000002 123 acl-2010-Generating Focused Topic-Specific Sentiment Lexicons

Author: Valentin Jijkoun ; Maarten de Rijke ; Wouter Weerkamp

Abstract: We present a method for automatically generating focused and accurate topicspecific subjectivity lexicons from a general purpose polarity lexicon that allow users to pin-point subjective on-topic information in a set of relevant documents. We motivate the need for such lexicons in the field of media analysis, describe a bootstrapping method for generating a topic-specific lexicon from a general purpose polarity lexicon, and evaluate the quality of the generated lexicons both manually and using a TREC Blog track test set for opinionated blog post retrieval. Although the generated lexicons can be an order of magnitude more selective than the general purpose lexicon, they maintain, or even improve, the performance of an opin- ion retrieval system.

2 0.453908 22 acl-2010-A Unified Graph Model for Sentence-Based Opinion Retrieval

Author: Binyang Li ; Lanjun Zhou ; Shi Feng ; Kam-Fai Wong

Abstract: There is a growing research interest in opinion retrieval as on-line users’ opinions are becoming more and more popular in business, social networks, etc. Practically speaking, the goal of opinion retrieval is to retrieve documents, which entail opinions or comments, relevant to a target subject specified by the user’s query. A fundamental challenge in opinion retrieval is information representation. Existing research focuses on document-based approaches and documents are represented by bag-of-word. However, due to loss of contextual information, this representation fails to capture the associative information between an opinion and its corresponding target. It cannot distinguish different degrees of a sentiment word when associated with different targets. This in turn seriously affects opinion retrieval performance. In this paper, we propose a sentence-based approach based on a new information representa- , tion, namely topic-sentiment word pair, to capture intra-sentence contextual information between an opinion and its target. Additionally, we consider inter-sentence information to capture the relationships among the opinions on the same topic. Finally, the two types of information are combined in a unified graph-based model, which can effectively rank the documents. Compared with existing approaches, experimental results on the COAE08 dataset showed that our graph-based model achieved significant improvement. 1

3 0.3317543 251 acl-2010-Using Anaphora Resolution to Improve Opinion Target Identification in Movie Reviews

Author: Niklas Jakob ; Iryna Gurevych

Abstract: unkown-abstract

4 0.31944892 208 acl-2010-Sentence and Expression Level Annotation of Opinions in User-Generated Discourse

Author: Cigdem Toprak ; Niklas Jakob ; Iryna Gurevych

Abstract: In this paper, we introduce a corpus of consumer reviews from the rateitall and the eopinions websites annotated with opinion-related information. We present a two-level annotation scheme. In the first stage, the reviews are analyzed at the sentence level for (i) relevancy to a given topic, and (ii) expressing an evaluation about the topic. In the second stage, on-topic sentences containing evaluations about the topic are further investigated at the expression level for pinpointing the properties (semantic orientation, intensity), and the functional components of the evaluations (opinion terms, targets and holders). We discuss the annotation scheme, the inter-annotator agreement for different subtasks and our observations.

5 0.24795395 209 acl-2010-Sentiment Learning on Product Reviews via Sentiment Ontology Tree

Author: Wei Wei ; Jon Atle Gulla

Abstract: Existing works on sentiment analysis on product reviews suffer from the following limitations: (1) The knowledge of hierarchical relationships of products attributes is not fully utilized. (2) Reviews or sentences mentioning several attributes associated with complicated sentiments are not dealt with very well. In this paper, we propose a novel HL-SOT approach to labeling a product’s attributes and their associated sentiments in product reviews by a Hierarchical Learning (HL) process with a defined Sentiment Ontology Tree (SOT). The empirical analysis against a humanlabeled data set demonstrates promising and reasonable performance of the proposed HL-SOT approach. While this paper is mainly on sentiment analysis on reviews of one product, our proposed HLSOT approach is easily generalized to labeling a mix of reviews of more than one products.

6 0.24319355 210 acl-2010-Sentiment Translation through Lexicon Induction

7 0.23421392 105 acl-2010-Evaluating Multilanguage-Comparability of Subjectivity Analysis Systems

8 0.22277404 18 acl-2010-A Study of Information Retrieval Weighting Schemes for Sentiment Analysis

9 0.2159576 134 acl-2010-Hierarchical Sequential Learning for Extracting Opinions and Their Attributes

10 0.18789279 188 acl-2010-Optimizing Informativeness and Readability for Sentiment Summarization

11 0.14858997 141 acl-2010-Identifying Text Polarity Using Random Walks

12 0.13341105 50 acl-2010-Bilingual Lexicon Generation Using Non-Aligned Signatures

13 0.13255887 42 acl-2010-Automatically Generating Annotator Rationales to Improve Sentiment Classification

14 0.11859902 204 acl-2010-Recommendation in Internet Forums and Blogs

15 0.11785912 85 acl-2010-Detecting Experiences from Weblogs

16 0.11247898 245 acl-2010-Understanding the Semantic Structure of Noun Phrase Queries

17 0.10358789 157 acl-2010-Last but Definitely Not Least: On the Role of the Last Sentence in Automatic Polarity-Classification

18 0.09732569 78 acl-2010-Cross-Language Text Classification Using Structural Correspondence Learning

19 0.094693467 176 acl-2010-Mood Patterns and Affective Lexicon Access in Weblogs

20 0.092293836 79 acl-2010-Cross-Lingual Latent Topic Extraction


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, -0.215), (1, 0.197), (2, -0.35), (3, 0.38), (4, -0.24), (5, 0.062), (6, -0.032), (7, 0.188), (8, 0.051), (9, -0.04), (10, 0.045), (11, -0.002), (12, 0.01), (13, 0.036), (14, -0.0), (15, 0.0), (16, -0.021), (17, -0.03), (18, -0.024), (19, -0.04), (20, -0.033), (21, -0.032), (22, 0.003), (23, 0.01), (24, 0.044), (25, -0.021), (26, -0.025), (27, -0.018), (28, -0.02), (29, -0.009), (30, 0.078), (31, -0.023), (32, 0.026), (33, -0.026), (34, -0.006), (35, -0.104), (36, -0.018), (37, -0.009), (38, -0.031), (39, 0.014), (40, 0.003), (41, -0.012), (42, 0.008), (43, 0.021), (44, 0.031), (45, 0.075), (46, -0.051), (47, 0.013), (48, 0.025), (49, 0.023)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.9687655 123 acl-2010-Generating Focused Topic-Specific Sentiment Lexicons

Author: Valentin Jijkoun ; Maarten de Rijke ; Wouter Weerkamp

Abstract: We present a method for automatically generating focused and accurate topicspecific subjectivity lexicons from a general purpose polarity lexicon that allow users to pin-point subjective on-topic information in a set of relevant documents. We motivate the need for such lexicons in the field of media analysis, describe a bootstrapping method for generating a topic-specific lexicon from a general purpose polarity lexicon, and evaluate the quality of the generated lexicons both manually and using a TREC Blog track test set for opinionated blog post retrieval. Although the generated lexicons can be an order of magnitude more selective than the general purpose lexicon, they maintain, or even improve, the performance of an opin- ion retrieval system.

2 0.885436 22 acl-2010-A Unified Graph Model for Sentence-Based Opinion Retrieval

Author: Binyang Li ; Lanjun Zhou ; Shi Feng ; Kam-Fai Wong

Abstract: There is a growing research interest in opinion retrieval as on-line users’ opinions are becoming more and more popular in business, social networks, etc. Practically speaking, the goal of opinion retrieval is to retrieve documents, which entail opinions or comments, relevant to a target subject specified by the user’s query. A fundamental challenge in opinion retrieval is information representation. Existing research focuses on document-based approaches and documents are represented by bag-of-word. However, due to loss of contextual information, this representation fails to capture the associative information between an opinion and its corresponding target. It cannot distinguish different degrees of a sentiment word when associated with different targets. This in turn seriously affects opinion retrieval performance. In this paper, we propose a sentence-based approach based on a new information representa- , tion, namely topic-sentiment word pair, to capture intra-sentence contextual information between an opinion and its target. Additionally, we consider inter-sentence information to capture the relationships among the opinions on the same topic. Finally, the two types of information are combined in a unified graph-based model, which can effectively rank the documents. Compared with existing approaches, experimental results on the COAE08 dataset showed that our graph-based model achieved significant improvement. 1

3 0.7586543 251 acl-2010-Using Anaphora Resolution to Improve Opinion Target Identification in Movie Reviews

Author: Niklas Jakob ; Iryna Gurevych

Abstract: unkown-abstract

4 0.71410173 42 acl-2010-Automatically Generating Annotator Rationales to Improve Sentiment Classification

Author: Ainur Yessenalina ; Yejin Choi ; Claire Cardie

Abstract: One ofthe central challenges in sentimentbased text categorization is that not every portion of a document is equally informative for inferring the overall sentiment of the document. Previous research has shown that enriching the sentiment labels with human annotators’ “rationales” can produce substantial improvements in categorization performance (Zaidan et al., 2007). We explore methods to automatically generate annotator rationales for document-level sentiment classification. Rather unexpectedly, we find the automatically generated rationales just as helpful as human rationales.

5 0.70423764 18 acl-2010-A Study of Information Retrieval Weighting Schemes for Sentiment Analysis

Author: Georgios Paltoglou ; Mike Thelwall

Abstract: Most sentiment analysis approaches use as baseline a support vector machines (SVM) classifier with binary unigram weights. In this paper, we explore whether more sophisticated feature weighting schemes from Information Retrieval can enhance classification accuracy. We show that variants of the classic tf.idf scheme adapted to sentiment analysis provide significant increases in accuracy, especially when using a sublinear function for term frequency weights and document frequency smoothing. The techniques are tested on a wide selection of data sets and produce the best accuracy to our knowledge.

6 0.69627166 208 acl-2010-Sentence and Expression Level Annotation of Opinions in User-Generated Discourse

7 0.68599731 209 acl-2010-Sentiment Learning on Product Reviews via Sentiment Ontology Tree

8 0.67733783 134 acl-2010-Hierarchical Sequential Learning for Extracting Opinions and Their Attributes

9 0.60674572 210 acl-2010-Sentiment Translation through Lexicon Induction

10 0.55891079 105 acl-2010-Evaluating Multilanguage-Comparability of Subjectivity Analysis Systems

11 0.5223195 176 acl-2010-Mood Patterns and Affective Lexicon Access in Weblogs

12 0.50212902 141 acl-2010-Identifying Text Polarity Using Random Walks

13 0.44246763 188 acl-2010-Optimizing Informativeness and Readability for Sentiment Summarization

14 0.40867361 157 acl-2010-Last but Definitely Not Least: On the Role of the Last Sentence in Automatic Polarity-Classification

15 0.35596606 204 acl-2010-Recommendation in Internet Forums and Blogs

16 0.35386804 256 acl-2010-Vocabulary Choice as an Indicator of Perspective

17 0.33543625 177 acl-2010-Multilingual Pseudo-Relevance Feedback: Performance Study of Assisting Languages

18 0.33291289 80 acl-2010-Cross Lingual Adaptation: An Experiment on Sentiment Classifications

19 0.32650542 85 acl-2010-Detecting Experiences from Weblogs

20 0.31776607 78 acl-2010-Cross-Language Text Classification Using Structural Correspondence Learning


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(7, 0.015), (25, 0.042), (39, 0.012), (42, 0.409), (59, 0.062), (72, 0.017), (73, 0.058), (76, 0.014), (78, 0.023), (83, 0.074), (84, 0.024), (98, 0.133)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.96163505 178 acl-2010-Non-Cooperation in Dialogue

Author: Brian Pluss

Abstract: This paper presents ongoing research on computational models for non-cooperative dialogue. We start by analysing different levels of cooperation in conversation. Then, inspired by findings from an empirical study, we propose a technique for measuring non-cooperation in political interviews. Finally, we describe a research programme towards obtaining a suitable model and discuss previous accounts for conflictive dialogue, identifying the differences with our work.

2 0.90345436 149 acl-2010-Incorporating Extra-Linguistic Information into Reference Resolution in Collaborative Task Dialogue

Author: Ryu Iida ; Syumpei Kobayashi ; Takenobu Tokunaga

Abstract: This paper proposes an approach to reference resolution in situated dialogues by exploiting extra-linguistic information. Recently, investigations of referential behaviours involved in situations in the real world have received increasing attention by researchers (Di Eugenio et al., 2000; Byron, 2005; van Deemter, 2007; Spanger et al., 2009). In order to create an accurate reference resolution model, we need to handle extra-linguistic information as well as textual information examined by existing approaches (Soon et al., 2001 ; Ng and Cardie, 2002, etc.). In this paper, we incorporate extra-linguistic information into an existing corpus-based reference resolution model, and investigate its effects on refer- ence resolution problems within a corpus of Japanese dialogues. The results demonstrate that our proposed model achieves an accuracy of 79.0% for this task.

same-paper 3 0.86368108 123 acl-2010-Generating Focused Topic-Specific Sentiment Lexicons

Author: Valentin Jijkoun ; Maarten de Rijke ; Wouter Weerkamp

Abstract: We present a method for automatically generating focused and accurate topicspecific subjectivity lexicons from a general purpose polarity lexicon that allow users to pin-point subjective on-topic information in a set of relevant documents. We motivate the need for such lexicons in the field of media analysis, describe a bootstrapping method for generating a topic-specific lexicon from a general purpose polarity lexicon, and evaluate the quality of the generated lexicons both manually and using a TREC Blog track test set for opinionated blog post retrieval. Although the generated lexicons can be an order of magnitude more selective than the general purpose lexicon, they maintain, or even improve, the performance of an opin- ion retrieval system.

4 0.76451123 150 acl-2010-Inducing Domain-Specific Semantic Class Taggers from (Almost) Nothing

Author: Ruihong Huang ; Ellen Riloff

Abstract: This research explores the idea of inducing domain-specific semantic class taggers using only a domain-specific text collection and seed words. The learning process begins by inducing a classifier that only has access to contextual features, forcing it to generalize beyond the seeds. The contextual classifier then labels new instances, to expand and diversify the training set. Next, a cross-category bootstrapping process simultaneously trains a suite of classifiers for multiple semantic classes. The positive instances for one class are used as negative instances for the others in an iterative bootstrapping cycle. We also explore a one-semantic-class-per-discourse heuristic, and use the classifiers to dynam- ically create semantic features. We evaluate our approach by inducing six semantic taggers from a collection of veterinary medicine message board posts.

5 0.69026864 214 acl-2010-Sparsity in Dependency Grammar Induction

Author: Jennifer Gillenwater ; Kuzman Ganchev ; Joao Graca ; Fernando Pereira ; Ben Taskar

Abstract: A strong inductive bias is essential in unsupervised grammar induction. We explore a particular sparsity bias in dependency grammars that encourages a small number of unique dependency types. Specifically, we investigate sparsity-inducing penalties on the posterior distributions of parent-child POS tag pairs in the posterior regularization (PR) framework of Graça et al. (2007). In ex- periments with 12 languages, we achieve substantial gains over the standard expectation maximization (EM) baseline, with average improvement in attachment accuracy of 6.3%. Further, our method outperforms models based on a standard Bayesian sparsity-inducing prior by an average of 4.9%. On English in particular, we show that our approach improves on several other state-of-the-art techniques.

6 0.68503183 22 acl-2010-A Unified Graph Model for Sentence-Based Opinion Retrieval

7 0.66690463 231 acl-2010-The Prevalence of Descriptive Referring Expressions in News and Narrative

8 0.66171515 251 acl-2010-Using Anaphora Resolution to Improve Opinion Target Identification in Movie Reviews

9 0.64705336 208 acl-2010-Sentence and Expression Level Annotation of Opinions in User-Generated Discourse

10 0.61180264 134 acl-2010-Hierarchical Sequential Learning for Extracting Opinions and Their Attributes

11 0.59853601 167 acl-2010-Learning to Adapt to Unknown Users: Referring Expression Generation in Spoken Dialogue Systems

12 0.53067756 42 acl-2010-Automatically Generating Annotator Rationales to Improve Sentiment Classification

13 0.5145129 142 acl-2010-Importance-Driven Turn-Bidding for Spoken Dialogue Systems

14 0.51040798 50 acl-2010-Bilingual Lexicon Generation Using Non-Aligned Signatures

15 0.50742829 105 acl-2010-Evaluating Multilanguage-Comparability of Subjectivity Analysis Systems

16 0.50494254 58 acl-2010-Classification of Feedback Expressions in Multimodal Data

17 0.50257766 199 acl-2010-Preferences versus Adaptation during Referring Expression Generation

18 0.50086021 172 acl-2010-Minimized Models and Grammar-Informed Initialization for Supertagging with Highly Ambiguous Lexicons

19 0.49826491 188 acl-2010-Optimizing Informativeness and Readability for Sentiment Summarization

20 0.49556673 82 acl-2010-Demonstration of a Prototype for a Conversational Companion for Reminiscing about Images