emnlp emnlp2010 emnlp2010-49 knowledge-graph by maker-knowledge-mining

49 emnlp-2010-Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields


Source: pdf

Author: Niklas Jakob ; Iryna Gurevych

Abstract: In this paper, we focus on the opinion target extraction as part of the opinion mining task. We model the problem as an information extraction task, which we address based on Conditional Random Fields (CRF). As a baseline we employ the supervised algorithm by Zhuang et al. (2006), which represents the state-of-the-art on the employed data. We evaluate the algorithms comprehensively on datasets from four different domains annotated with individual opinion target instances on a sentence level. Furthermore, we investigate the performance of our CRF-based approach and the baseline in a single- and cross-domain opinion target extraction setting. Our CRF-based approach improves the performance by 0.077, 0.126, 0.071 and 0. 178 regarding F-Measure in the single-domain extraction in the four domains. In the crossdomain setting our approach improves the performance by 0.409, 0.242, 0.294 and 0.343 regarding F-Measure over the baseline.

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Abstract In this paper, we focus on the opinion target extraction as part of the opinion mining task. [sent-3, score-1.543]

2 We evaluate the algorithms comprehensively on datasets from four different domains annotated with individual opinion target instances on a sentence level. [sent-7, score-0.895]

3 Furthermore, we investigate the performance of our CRF-based approach and the baseline in a single- and cross-domain opinion target extraction setting. [sent-8, score-0.889]

4 As opinion mining is typically an enabling technology for another task, this overlaying system defines requirements regarding the level of granularity. [sent-20, score-0.881]

5 All of these tasks have in common that in order to fulfill them, the opinion mining system must be capable of identifying what the opinions in the individual sentences are about, hence extract the opinion targets. [sent-33, score-1.454]

6 Our goal in this work is to extract opinion targets from user-generated discourse, a discourse type which is quite frequently encountered today, due to the explosive growth of Web 2. [sent-34, score-0.983]

7 The opinion targets which we aim to extract are underlined in the sentences, the corresponding opinion expressions are shown in italics. [sent-37, score-1.71]

8 The extraction of opinion targets can be considered as an instance of an information extraction (IE) task (Cowie and Lehnert, 1996). [sent-40, score-1.171]

9 In the opinion mining context this question has been prominently investigated with respect to opinion polarity analysis (sentiment analysis) in previous research (Aue and Gamon, 2005; Blitzer et al. [sent-46, score-1.378]

10 Terms as “unpredictable” can express a positive opinion when uttered about the storyline of a movie but a negative opinion when the handling of a car is described. [sent-48, score-1.392]

11 However to the best of our knowledge, these effects have not been investigated regarding the extraction of opinion targets. [sent-50, score-0.971]

12 The contribution of this paper is a CRF-based approach for opinion targets extraction which tackles the problem of domain portability. [sent-51, score-1.137]

13 2 Related Work In the following we will discuss the related work regarding opinion target extraction and domain adaptation in opinion mining. [sent-60, score-1.779]

14 The discussion of the related work on opinion target extraction is separated in supervised and unsupervised approaches. [sent-61, score-0.843]

15 We conclude with a discussion of the related work on domain adaptation in opinion mining. [sent-62, score-0.755]

16 1 Unsupervised Opinion Target Extraction The first work on opinion target extraction was done on customer reviews of consumer electronics. [sent-64, score-0.927]

17 A dataset of customer reviews from five domains was annotated by the authors regarding mentioned product features with respective opinion polarities. [sent-67, score-1.137]

18 They present and evaluate a complete system for opinion extraction which is based on a statistical analysis based on the Likelihood Ratio Test for opinion target extraction. [sent-73, score-1.5]

19 00 in the task of opinion target (product feature) extraction, recall values are not reported. [sent-76, score-0.769]

20 Popescu and Etzioni (2005) present the OPINE system for opinion mining on product reviews. [sent-77, score-0.723]

21 They evaluate the opinion target extraction separately on the dataset by Hu and Liu (2004). [sent-79, score-0.955]

22 (2007) manually create taxonomies of opinion targets for two datasets. [sent-82, score-0.983]

23 With a handcrafted set of dependency tree paths their algorithm identifies related opinion expressions and targets. [sent-83, score-0.872]

24 Due to the lack of a dataset annotated with opinion expressions and targets, they just evaluate the accuracy of several aspects of their algorithm by man- ually assessing an output sample. [sent-84, score-0.91]

25 Kim and Hovy (2006) aim at extracting opinion holders and opinion targets in newswire with semantic role labeling. [sent-87, score-1.682]

26 They define a mapping of the semantic roles identified with FrameNet to the respective opinion elements. [sent-88, score-0.657]

27 As a baseline, they implement an approach based on a dependency parser, which identifies the targets following the dependencies of opinion expressions. [sent-89, score-1.041]

28 (2006) present a supervised algorithm for the extraction of opinion expression - opinion target pairs. [sent-97, score-1.594]

29 Their algorithm learns the opinion target candidates and a combination of dependency and part-of-speech paths connecting such pairs from an annotated dataset. [sent-98, score-0.958]

30 488 in the task of extracting opinion target - opinion expression pairs. [sent-103, score-1.5]

31 Kessler and Nicolov (2009) solely focus on identifying which opinion expression is linked to which opinion target in a sentence. [sent-104, score-1.501]

32 They present a dataset of car and camera reviews in which opinion expressions and opinion targets are annotated. [sent-105, score-1.948]

33 Starting with this information, they train a machine learning classifier for identifying related opinion expressions and targets. [sent-106, score-0.749]

34 Their algorithm receives the opinion expression and opinion target annotations as input during runtime. [sent-107, score-1.5]

35 As the development data, we used 29 documents from the movies dataset, 23 documents from the web-services dataset and 15 documents from the cars & cameras datasets. [sent-126, score-0.937]

36 If the vocabulary of targets is rather compact for a certain domain (corresponding to a low target type / target ratio), the training data is likely to contain the majority of the target types, which should hence be a good indicator. [sent-129, score-0.697]

37 At the same time, the CRF algorithm is provided with additional information to extract opinion targets which are multiword expressions, i. [sent-138, score-1.027]

38 Short Dependency Path Previous research has successfully employed paths in the dependency parse tree to link opinion expressions and the corresponding targets (Zhuang et al. [sent-142, score-1.195]

39 Both works identify direct dependency relations such as “amod” and “nsubj” as the most frequent and at the same time highly accurate connections between a target and an opinion expression. [sent-144, score-0.807]

40 We hence label all tokens which have a direct dependency relation to an opinion expression in a sentence. [sent-145, score-0.823]

41 (2006) we can infer that opinion expressions and their target(s) are not always connected via short paths in the dependency parse tree. [sent-149, score-0.851]

42 Since we cannot capture such paths with the abovementioned feature we introduce another feature which acts as heuristic for identifying the target to a given opinion expression. [sent-150, score-0.891]

43 (2003) have shown that (base) noun phrases are good candidates for opinion targets in the datasets of product reviews. [sent-152, score-1.106]

44 We therefore label the token(s) in the closest noun phrase regarding word distance to each opinion expression in a sentence. [sent-153, score-0.911]

45 Opinion Sentence With this feature, we simply label all tokens occurring in a sentence containing an opinion expression. [sent-155, score-0.657]

46 This feature shall enable the CRF algorithm to distinguish between the occurence of a certain token in a sentence which contains an opinion vs. [sent-156, score-0.76]

47 shtml 1038 Our goal is to extract individual instances of opinion targets from sentences which contain an opinion expression. [sent-162, score-1.64]

48 We represent the possible labels following the IOB scheme: B-Target, identifying the beginning of an opinion target, I-Target identifying the continuation of a target, and O for other (non-target) tokens. [sent-171, score-0.701]

49 (2006) and annotated regarding opinion target - opinion expression pairs. [sent-181, score-1.69]

50 The version of the dataset used in our experiments consists of 179 blog postings regarding different digital cameras and 336 reviews of different cars. [sent-187, score-0.698]

51 In the description of their annotation guidelines, Kessler and Nicolov (2009) refer to opinion targets as mentions. [sent-188, score-0.983]

52 However, not only mentions which occur as opinion targets were originally annotated, but also mentions which occur in non-opinion sentences. [sent-190, score-1.037]

53 In our experiments, we only use the mentions which occur as targets of opinion expressions. [sent-191, score-1.01]

54 edu/ All three datasets contain annotations regarding the antecedents of anaphoric opinion targets. [sent-195, score-0.904]

55 In our experimental setup, we do not require the algorithms to also correctly resolve the antecedent of an opinion target representy by a pronoun, as we are solely interested in evaluating the opinion target extraction not any anaphora resolution. [sent-196, score-1.592]

56 As shown in rows 4 and 5 of Table 1, the documents from the cars and the cameras datasets exhibit a much higher density of opinions per document. [sent-197, score-0.711]

57 5% of the sentences from the cars dataset contain an opinion and in the cameras dataset even 56. [sent-199, score-1.444]

58 Furthermore in the cars and the cameras datasets the lexical variability regarding the opinion targets is substantially larger than in the other two datasets: We calculate target types by counting the number of distinct opinion targets in a dataset. [sent-203, score-2.892]

59 We divide this by the sum of all opinion target instances in the dataset. [sent-204, score-0.749]

60 In terms of reviews this means, that in the movie reviews the same movie aspects are repeatedly commented on, while in the cars and the cameras datasets many different aspects of these entities are discussed, which in turn each occur infrequently. [sent-210, score-0.922]

61 2 Baseline System In the task of opinion target extraction the supervised algorithm by Zhuang et al. [sent-212, score-0.864]

62 A set of paths in a dependency tree which iden- tify valid opinion target - opinion expression pairs In our experiments, we learn the full set of opinion targets from the labeled training data in the first step. [sent-217, score-2.586]

63 , 2006), but we expect that this modification should be beneficial for the overall performance in terms of recall, as we do not remove any learned opinion targets from the candidate list. [sent-219, score-1.001]

64 For each opinion target opinion expression pair from the gold standard, the shortest path connecting them is extracted from the dependency graph. [sent-221, score-1.592]

65 Thereby, the baseline system is also capable of extracting multiword opinion targets. [sent-225, score-0.727]

66 3 Metrics We employ the following requirements in our evaluation of the opinion target extraction: An opinion target must be extracted with exactly the span boundaries as annotated in the gold standard. [sent-227, score-1.572]

67 5 Results and Discussion We investigate the performance of the baseline and the CRF-based approach for opinion target extraction in a single- and cross-domain setting. [sent-237, score-0.889]

68 2 we present the results of the two systems in the cross-domain opinion target extraction. [sent-244, score-0.749]

69 In our task, the algorithm uses the opinion expression annotation from the gold standard. [sent-259, score-0.774]

70 We do not remove any learned opinion target candidates from the training data (See Section 4. [sent-261, score-0.783]

71 We assume that the recall of the algorithm is limited by a large variety of possible dependency paths between opinion targets and opinion expressions, since the algorithm cannot link targets and opinion expressions in the testing data if there is no valid candidate dependency path. [sent-264, score-2.977]

72 Furthermore, we observe that for the cars dataset the size of the dependency path candidate list (6642 entries) was approximately five times larger than the dependency graph candidate list for the web-services dataset (1237 entries), which has a comparable size regarding documents. [sent-265, score-0.879]

73 We assume that a large number of both the target candidates as well as the dependency path candidates introduces many false positives during the target extraction, hence lowering the precision of the algorithm on the cars dataset considerably. [sent-267, score-0.751]

74 2 Our CRF-based Approach Table 3 shows the results of the opinion target extraction using the CRF algorithm. [sent-270, score-0.843]

75 Although the CRF-based approach clearly outperforms the baseline system on all four datasets, we also observe the same general trend regarding the individual results: The CRF yields the best results on the movies dataset and the worst results on the cars & cameras dataset. [sent-276, score-1.185]

76 We observe that the higher the lexical variability of the opinion targets is in a dataset, the lower the results are. [sent-278, score-1.06]

77 The precision is considerably increased on all datasets, on the movies and cars & cameras datasets even reaching the overall highest value. [sent-281, score-0.857]

78 The observation made in previous research that short paths in the dependency graph are a high precision indicator of related opinion expressions - opinion targets (Kessler and Nicolov, 2009) is confirmed on all datasets. [sent-283, score-1.876]

79 Adding the information regarding opinion sentences to the basic features ofthe token string and 1041 the part-of-speech tag (row 4) yields the biggest improvements regarding F-Measure on the movies and web-services dataset (+0. [sent-284, score-1.458]

80 On the cars & cameras dataset the recall is relatively low again. [sent-287, score-0.695]

81 We assume that this is again due to the high lexical variability, so that the CRF algorithm will encounter many actual opinion targets in the testing data which have not occurred in the training data and will hence not be extracted. [sent-288, score-1.061]

82 We conclude that these two features are complementary, as they apparently indicate different kinds of opinion targets which are then correctly extracted by the CRF. [sent-290, score-1.008]

83 If we combine each of the opinion expression related features with the label which identifies opinion sentences in general (rows 6 & 7), we observe that this feature is also complementary to the others. [sent-291, score-1.467]

84 We have run some additional experiments in which we did not rely on the annotated opinion expressions, but employed a general pupose subjectivity lexicon4. [sent-300, score-0.705]

85 We observe that most of the recall errors result from one-word opinion targets or the beginning of opinion targets (B-Targets) being missclassified as non-targets (movies 83%, web-services 73%, cars 68%, cameras 64%). [sent-310, score-2.602]

86 Our results indicate that the dLn and wDs features are complementary, but apparently there are quite a few cases in which the opinion target is neither directly related to the opinion expression in the dependency graph nor close to it in the sentence. [sent-313, score-1.581]

87 None ofthe actual targets “lens cap”, “strap” and “camera” have a short dependency path to the opinion expression and “speed” is simply the closest noun to it. [sent-317, score-1.146]

88 Note that although both “speed” and “usability” are attributes of a camera, the opinion in this sentence is about the “lens cap” and “strap”, hence only these attributes are annotated as targets. [sent-318, score-0.722]

89 1 Zhuang Baseline Table 4 shows the results of the opinion target extraction with the state-of-the-art system in the crossdomain setting. [sent-325, score-0.911]

90 A quantitative error analysis has revealed that there is hardly any overlap in the opinion target candidates between domains, as reflected by the low recall in all configurations. [sent-327, score-0.823]

91 The vocabularies of the opinion targets are too different, hence the performance of the algorithm by Zhuang et al. [sent-328, score-1.066]

92 As we also observed in the analysis of the baseline results, the overlap of the opinion target vocabularies between domains is low, which resulted in a very small number of targets extracted by the CRF. [sent-352, score-1.198]

93 We noticed that on the cameras dataset the results regarding F-Measure remained stable if the token feature is not used in the training. [sent-361, score-0.738]

94 1043 In isolation, training only on the cars data yields the second highest results on the movies and webservices datasets and the highest results regarding F-Measure on the cameras data. [sent-362, score-1.06]

95 However, the results of the cars + cameras training data combination indicate that the cameras data does not contribute any additional information during the learning, since the results on both the movies and the web-services datasets are lower than when training only on the cameras data. [sent-363, score-1.524]

96 On the cars and the cameras dataset the cross-domain results are even closer to the single-domain results. [sent-368, score-0.675]

97 6 Conclusions In this paper, we have shown how a CRF-based approach for opinion target extraction performs in a single- and cross-domain setting. [sent-370, score-0.843]

98 Our error analysis indicates that additional features, which can capture opinions in more complex sentences, are required to improve the performance of the opinion target extraction. [sent-373, score-0.789]

99 The features we employ scale well across domains, given that the opinion target vocabularies are substantially different. [sent-375, score-0.797]

100 Since three of the features we employed in 1044 our CRF-based approach are based on the respective opinion expressions, it is to investigate how to mitigate the possible negative effects introduced by errors in the opinion expression identification if they are not annotated in the gold standard. [sent-378, score-1.496]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('opinion', 0.657), ('cameras', 0.345), ('targets', 0.326), ('cars', 0.218), ('movies', 0.205), ('regarding', 0.181), ('zhuang', 0.17), ('dataset', 0.112), ('extraction', 0.094), ('target', 0.092), ('crf', 0.081), ('expression', 0.073), ('kessler', 0.072), ('expressions', 0.07), ('crossdomain', 0.068), ('dln', 0.066), ('datasets', 0.066), ('paths', 0.066), ('nicolov', 0.062), ('reviews', 0.06), ('domain', 0.06), ('dependency', 0.058), ('token', 0.055), ('movie', 0.053), ('darmstadt', 0.053), ('wds', 0.053), ('observe', 0.053), ('domains', 0.05), ('sentiment', 0.045), ('yields', 0.045), ('mining', 0.043), ('row', 0.042), ('camera', 0.041), ('opinions', 0.04), ('aue', 0.04), ('strap', 0.04), ('adaptation', 0.038), ('isolation', 0.035), ('hence', 0.035), ('blitzer', 0.035), ('bloom', 0.034), ('lens', 0.034), ('candidates', 0.034), ('path', 0.032), ('cap', 0.031), ('annotated', 0.03), ('hu', 0.029), ('feature', 0.027), ('mentions', 0.027), ('commented', 0.027), ('cowie', 0.027), ('hochschulstra', 0.027), ('niklas', 0.027), ('opine', 0.027), ('ssn', 0.027), ('toprak', 0.027), ('vocabularies', 0.027), ('baseline', 0.026), ('apparently', 0.025), ('car', 0.025), ('variability', 0.024), ('customer', 0.024), ('product', 0.023), ('precision', 0.023), ('gold', 0.023), ('battery', 0.023), ('iryna', 0.023), ('multiword', 0.023), ('fields', 0.023), ('rows', 0.023), ('string', 0.022), ('testing', 0.022), ('identifying', 0.022), ('employ', 0.021), ('extracting', 0.021), ('investigated', 0.021), ('algorithm', 0.021), ('jakob', 0.021), ('technische', 0.021), ('holders', 0.021), ('germany', 0.021), ('investigate', 0.02), ('overlap', 0.02), ('recall', 0.02), ('aspects', 0.02), ('counted', 0.019), ('review', 0.019), ('usability', 0.019), ('graph', 0.019), ('documents', 0.019), ('combinations', 0.019), ('speed', 0.019), ('employed', 0.018), ('stable', 0.018), ('slight', 0.018), ('effects', 0.018), ('gamon', 0.018), ('popescu', 0.018), ('usa', 0.018), ('candidate', 0.018)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99999988 49 emnlp-2010-Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields

Author: Niklas Jakob ; Iryna Gurevych

Abstract: In this paper, we focus on the opinion target extraction as part of the opinion mining task. We model the problem as an information extraction task, which we address based on Conditional Random Fields (CRF). As a baseline we employ the supervised algorithm by Zhuang et al. (2006), which represents the state-of-the-art on the employed data. We evaluate the algorithms comprehensively on datasets from four different domains annotated with individual opinion target instances on a sentence level. Furthermore, we investigate the performance of our CRF-based approach and the baseline in a single- and cross-domain opinion target extraction setting. Our CRF-based approach improves the performance by 0.077, 0.126, 0.071 and 0. 178 regarding F-Measure in the single-domain extraction in the four domains. In the crossdomain setting our approach improves the performance by 0.409, 0.242, 0.294 and 0.343 regarding F-Measure over the baseline.

2 0.48540705 70 emnlp-2010-Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid

Author: Xin Zhao ; Jing Jiang ; Hongfei Yan ; Xiaoming Li

Abstract: Discovering and summarizing opinions from online reviews is an important and challenging task. A commonly-adopted framework generates structured review summaries with aspects and opinions. Recently topic models have been used to identify meaningful review aspects, but existing topic models do not identify aspect-specific opinion words. In this paper, we propose a MaxEnt-LDA hybrid model to jointly discover both aspects and aspect-specific opinion words. We show that with a relatively small amount of training data, our model can effectively identify aspect and opinion words simultaneously. We also demonstrate the domain adaptability of our model.

3 0.090286568 120 emnlp-2010-What's with the Attitude? Identifying Sentences with Attitude in Online Discussions

Author: Ahmed Hassan ; Vahed Qazvinian ; Dragomir Radev

Abstract: Mining sentiment from user generated content is a very important task in Natural Language Processing. An example of such content is threaded discussions which act as a very important tool for communication and collaboration in the Web. Threaded discussions include e-mails, e-mail lists, bulletin boards, newsgroups, and Internet forums. Most of the work on sentiment analysis has been centered around finding the sentiment toward products or topics. In this work, we present a method to identify the attitude of participants in an online discussion toward one another. This would enable us to build a signed network representation of participant interaction where every edge has a sign that indicates whether the interaction is positive or negative. This is different from most of the research on social networks that has focused almost exclusively on positive links. The method is exper- imentally tested using a manually labeled set of discussion posts. The results show that the proposed method is capable of identifying attitudinal sentences, and their signs, with high accuracy and that it outperforms several other baselines.

4 0.087874971 64 emnlp-2010-Incorporating Content Structure into Text Analysis Applications

Author: Christina Sauper ; Aria Haghighi ; Regina Barzilay

Abstract: In this paper, we investigate how modeling content structure can benefit text analysis applications such as extractive summarization and sentiment analysis. This follows the linguistic intuition that rich contextual information should be useful in these tasks. We present a framework which combines a supervised text analysis application with the induction of latent content structure. Both of these elements are learned jointly using the EM algorithm. The induced content structure is learned from a large unannotated corpus and biased by the underlying text analysis task. We demonstrate that exploiting content structure yields significant improvements over approaches that rely only on local context.1

5 0.081676327 41 emnlp-2010-Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models

Author: Amarnag Subramanya ; Slav Petrov ; Fernando Pereira

Abstract: We describe a new scalable algorithm for semi-supervised training of conditional random fields (CRF) and its application to partof-speech (POS) tagging. The algorithm uses a similarity graph to encourage similar ngrams to have similar POS tags. We demonstrate the efficacy of our approach on a domain adaptation task, where we assume that we have access to large amounts of unlabeled data from the target domain, but no additional labeled data. The similarity graph is used during training to smooth the state posteriors on the target domain. Standard inference can be used at test time. Our approach is able to scale to very large problems and yields significantly improved target domain accuracy.

6 0.072866812 83 emnlp-2010-Multi-Level Structured Models for Document-Level Sentiment Classification

7 0.06929379 119 emnlp-2010-We're Not in Kansas Anymore: Detecting Domain Changes in Streams

8 0.061738066 104 emnlp-2010-The Necessity of Combining Adaptation Methods

9 0.05741914 58 emnlp-2010-Holistic Sentiment Analysis Across Languages: Multilingual Supervised Latent Dirichlet Allocation

10 0.057222076 46 emnlp-2010-Evaluating the Impact of Alternative Dependency Graph Encodings on Solving Event Extraction Tasks

11 0.056118254 102 emnlp-2010-Summarizing Contrastive Viewpoints in Opinionated Text

12 0.056093425 33 emnlp-2010-Cross Language Text Classification by Model Translation and Semi-Supervised Learning

13 0.05480041 100 emnlp-2010-Staying Informed: Supervised and Semi-Supervised Multi-View Topical Analysis of Ideological Perspective

14 0.047729924 25 emnlp-2010-Better Punctuation Prediction with Dynamic Conditional Random Fields

15 0.046795778 39 emnlp-2010-EMNLP 044

16 0.044378929 11 emnlp-2010-A Semi-Supervised Approach to Improve Classification of Infrequent Discourse Relations Using Feature Vector Extension

17 0.044280235 122 emnlp-2010-WikiWars: A New Corpus for Research on Temporal Expressions

18 0.043923728 30 emnlp-2010-Confidence in Structured-Prediction Using Confidence-Weighted Models

19 0.043519251 67 emnlp-2010-It Depends on the Translation: Unsupervised Dependency Parsing via Word Alignment

20 0.038125388 8 emnlp-2010-A Multi-Pass Sieve for Coreference Resolution


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.171), (1, 0.145), (2, -0.145), (3, -0.076), (4, 0.076), (5, -0.061), (6, 0.46), (7, -0.026), (8, 0.048), (9, 0.021), (10, -0.073), (11, 0.08), (12, 0.393), (13, -0.25), (14, -0.049), (15, -0.358), (16, -0.068), (17, 0.04), (18, 0.182), (19, -0.049), (20, -0.068), (21, -0.012), (22, 0.012), (23, 0.067), (24, -0.006), (25, 0.042), (26, -0.009), (27, -0.026), (28, -0.015), (29, 0.018), (30, 0.058), (31, -0.097), (32, -0.036), (33, -0.006), (34, 0.044), (35, -0.088), (36, -0.052), (37, -0.017), (38, 0.036), (39, -0.035), (40, 0.02), (41, -0.005), (42, -0.001), (43, 0.03), (44, -0.009), (45, -0.029), (46, -0.001), (47, -0.007), (48, -0.013), (49, 0.016)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.97935367 49 emnlp-2010-Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields

Author: Niklas Jakob ; Iryna Gurevych

Abstract: In this paper, we focus on the opinion target extraction as part of the opinion mining task. We model the problem as an information extraction task, which we address based on Conditional Random Fields (CRF). As a baseline we employ the supervised algorithm by Zhuang et al. (2006), which represents the state-of-the-art on the employed data. We evaluate the algorithms comprehensively on datasets from four different domains annotated with individual opinion target instances on a sentence level. Furthermore, we investigate the performance of our CRF-based approach and the baseline in a single- and cross-domain opinion target extraction setting. Our CRF-based approach improves the performance by 0.077, 0.126, 0.071 and 0. 178 regarding F-Measure in the single-domain extraction in the four domains. In the crossdomain setting our approach improves the performance by 0.409, 0.242, 0.294 and 0.343 regarding F-Measure over the baseline.

2 0.9546563 70 emnlp-2010-Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid

Author: Xin Zhao ; Jing Jiang ; Hongfei Yan ; Xiaoming Li

Abstract: Discovering and summarizing opinions from online reviews is an important and challenging task. A commonly-adopted framework generates structured review summaries with aspects and opinions. Recently topic models have been used to identify meaningful review aspects, but existing topic models do not identify aspect-specific opinion words. In this paper, we propose a MaxEnt-LDA hybrid model to jointly discover both aspects and aspect-specific opinion words. We show that with a relatively small amount of training data, our model can effectively identify aspect and opinion words simultaneously. We also demonstrate the domain adaptability of our model.

3 0.2732811 120 emnlp-2010-What's with the Attitude? Identifying Sentences with Attitude in Online Discussions

Author: Ahmed Hassan ; Vahed Qazvinian ; Dragomir Radev

Abstract: Mining sentiment from user generated content is a very important task in Natural Language Processing. An example of such content is threaded discussions which act as a very important tool for communication and collaboration in the Web. Threaded discussions include e-mails, e-mail lists, bulletin boards, newsgroups, and Internet forums. Most of the work on sentiment analysis has been centered around finding the sentiment toward products or topics. In this work, we present a method to identify the attitude of participants in an online discussion toward one another. This would enable us to build a signed network representation of participant interaction where every edge has a sign that indicates whether the interaction is positive or negative. This is different from most of the research on social networks that has focused almost exclusively on positive links. The method is exper- imentally tested using a manually labeled set of discussion posts. The results show that the proposed method is capable of identifying attitudinal sentences, and their signs, with high accuracy and that it outperforms several other baselines.

4 0.21142979 64 emnlp-2010-Incorporating Content Structure into Text Analysis Applications

Author: Christina Sauper ; Aria Haghighi ; Regina Barzilay

Abstract: In this paper, we investigate how modeling content structure can benefit text analysis applications such as extractive summarization and sentiment analysis. This follows the linguistic intuition that rich contextual information should be useful in these tasks. We present a framework which combines a supervised text analysis application with the induction of latent content structure. Both of these elements are learned jointly using the EM algorithm. The induced content structure is learned from a large unannotated corpus and biased by the underlying text analysis task. We demonstrate that exploiting content structure yields significant improvements over approaches that rely only on local context.1

5 0.20678864 41 emnlp-2010-Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models

Author: Amarnag Subramanya ; Slav Petrov ; Fernando Pereira

Abstract: We describe a new scalable algorithm for semi-supervised training of conditional random fields (CRF) and its application to partof-speech (POS) tagging. The algorithm uses a similarity graph to encourage similar ngrams to have similar POS tags. We demonstrate the efficacy of our approach on a domain adaptation task, where we assume that we have access to large amounts of unlabeled data from the target domain, but no additional labeled data. The similarity graph is used during training to smooth the state posteriors on the target domain. Standard inference can be used at test time. Our approach is able to scale to very large problems and yields significantly improved target domain accuracy.

6 0.20086601 119 emnlp-2010-We're Not in Kansas Anymore: Detecting Domain Changes in Streams

7 0.19412929 102 emnlp-2010-Summarizing Contrastive Viewpoints in Opinionated Text

8 0.16490296 104 emnlp-2010-The Necessity of Combining Adaptation Methods

9 0.16082188 46 emnlp-2010-Evaluating the Impact of Alternative Dependency Graph Encodings on Solving Event Extraction Tasks

10 0.15643016 100 emnlp-2010-Staying Informed: Supervised and Semi-Supervised Multi-View Topical Analysis of Ideological Perspective

11 0.15523128 83 emnlp-2010-Multi-Level Structured Models for Document-Level Sentiment Classification

12 0.13769361 122 emnlp-2010-WikiWars: A New Corpus for Research on Temporal Expressions

13 0.13693261 25 emnlp-2010-Better Punctuation Prediction with Dynamic Conditional Random Fields

14 0.13290536 39 emnlp-2010-EMNLP 044

15 0.13066955 67 emnlp-2010-It Depends on the Translation: Unsupervised Dependency Parsing via Word Alignment

16 0.12960196 9 emnlp-2010-A New Approach to Lexical Disambiguation of Arabic Text

17 0.12492181 62 emnlp-2010-Improving Mention Detection Robustness to Noisy Input

18 0.12412853 30 emnlp-2010-Confidence in Structured-Prediction Using Confidence-Weighted Models

19 0.12338918 114 emnlp-2010-Unsupervised Parse Selection for HPSG

20 0.12109073 115 emnlp-2010-Uptraining for Accurate Deterministic Question Parsing


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(3, 0.014), (10, 0.022), (12, 0.043), (29, 0.073), (30, 0.026), (45, 0.243), (52, 0.033), (56, 0.112), (62, 0.015), (66, 0.173), (72, 0.083), (76, 0.022), (87, 0.015), (89, 0.018)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.86256939 110 emnlp-2010-Turbo Parsers: Dependency Parsing by Approximate Variational Inference

Author: Andre Martins ; Noah Smith ; Eric Xing ; Pedro Aguiar ; Mario Figueiredo

Abstract: We present a unified view of two state-of-theart non-projective dependency parsers, both approximate: the loopy belief propagation parser of Smith and Eisner (2008) and the relaxed linear program of Martins et al. (2009). By representing the model assumptions with a factor graph, we shed light on the optimization problems tackled in each method. We also propose a new aggressive online algorithm to learn the model parameters, which makes use of the underlying variational representation. The algorithm does not require a learning rate parameter and provides a single framework for a wide family of convex loss functions, includ- ing CRFs and structured SVMs. Experiments show state-of-the-art performance for 14 languages.

same-paper 2 0.81713903 49 emnlp-2010-Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields

Author: Niklas Jakob ; Iryna Gurevych

Abstract: In this paper, we focus on the opinion target extraction as part of the opinion mining task. We model the problem as an information extraction task, which we address based on Conditional Random Fields (CRF). As a baseline we employ the supervised algorithm by Zhuang et al. (2006), which represents the state-of-the-art on the employed data. We evaluate the algorithms comprehensively on datasets from four different domains annotated with individual opinion target instances on a sentence level. Furthermore, we investigate the performance of our CRF-based approach and the baseline in a single- and cross-domain opinion target extraction setting. Our CRF-based approach improves the performance by 0.077, 0.126, 0.071 and 0. 178 regarding F-Measure in the single-domain extraction in the four domains. In the crossdomain setting our approach improves the performance by 0.409, 0.242, 0.294 and 0.343 regarding F-Measure over the baseline.

3 0.67201239 120 emnlp-2010-What's with the Attitude? Identifying Sentences with Attitude in Online Discussions

Author: Ahmed Hassan ; Vahed Qazvinian ; Dragomir Radev

Abstract: Mining sentiment from user generated content is a very important task in Natural Language Processing. An example of such content is threaded discussions which act as a very important tool for communication and collaboration in the Web. Threaded discussions include e-mails, e-mail lists, bulletin boards, newsgroups, and Internet forums. Most of the work on sentiment analysis has been centered around finding the sentiment toward products or topics. In this work, we present a method to identify the attitude of participants in an online discussion toward one another. This would enable us to build a signed network representation of participant interaction where every edge has a sign that indicates whether the interaction is positive or negative. This is different from most of the research on social networks that has focused almost exclusively on positive links. The method is exper- imentally tested using a manually labeled set of discussion posts. The results show that the proposed method is capable of identifying attitudinal sentences, and their signs, with high accuracy and that it outperforms several other baselines.

4 0.66984838 69 emnlp-2010-Joint Training and Decoding Using Virtual Nodes for Cascaded Segmentation and Tagging Tasks

Author: Xian Qian ; Qi Zhang ; Yaqian Zhou ; Xuanjing Huang ; Lide Wu

Abstract: Many sequence labeling tasks in NLP require solving a cascade of segmentation and tagging subtasks, such as Chinese POS tagging, named entity recognition, and so on. Traditional pipeline approaches usually suffer from error propagation. Joint training/decoding in the cross-product state space could cause too many parameters and high inference complexity. In this paper, we present a novel method which integrates graph structures of two subtasks into one using virtual nodes, and performs joint training and decoding in the factorized state space. Experimental evaluations on CoNLL 2000 shallow parsing data set and Fourth SIGHAN Bakeoff CTB POS tagging data set demonstrate the superiority of our method over cross-product, pipeline and candidate reranking approaches.

5 0.66487467 35 emnlp-2010-Discriminative Sample Selection for Statistical Machine Translation

Author: Sankaranarayanan Ananthakrishnan ; Rohit Prasad ; David Stallard ; Prem Natarajan

Abstract: Production of parallel training corpora for the development of statistical machine translation (SMT) systems for resource-poor languages usually requires extensive manual effort. Active sample selection aims to reduce the labor, time, and expense incurred in producing such resources, attaining a given performance benchmark with the smallest possible training corpus by choosing informative, nonredundant source sentences from an available candidate pool for manual translation. We present a novel, discriminative sample selection strategy that preferentially selects batches of candidate sentences with constructs that lead to erroneous translations on a held-out development set. The proposed strategy supports a built-in diversity mechanism that reduces redundancy in the selected batches. Simulation experiments on English-to-Pashto and Spanish-to-English translation tasks demon- strate the superiority of the proposed approach to a number of competing techniques, such as random selection, dissimilarity-based selection, as well as a recently proposed semisupervised active learning strategy.

6 0.66445315 82 emnlp-2010-Multi-Document Summarization Using A* Search and Discriminative Learning

7 0.6610496 11 emnlp-2010-A Semi-Supervised Approach to Improve Classification of Infrequent Discourse Relations Using Feature Vector Extension

8 0.65585822 58 emnlp-2010-Holistic Sentiment Analysis Across Languages: Multilingual Supervised Latent Dirichlet Allocation

9 0.65405256 43 emnlp-2010-Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping

10 0.65329325 45 emnlp-2010-Evaluating Models of Latent Document Semantics in the Presence of OCR Errors

11 0.65199518 107 emnlp-2010-Towards Conversation Entailment: An Empirical Investigation

12 0.64944959 104 emnlp-2010-The Necessity of Combining Adaptation Methods

13 0.64789903 86 emnlp-2010-Non-Isomorphic Forest Pair Translation

14 0.64749002 25 emnlp-2010-Better Punctuation Prediction with Dynamic Conditional Random Fields

15 0.64722246 9 emnlp-2010-A New Approach to Lexical Disambiguation of Arabic Text

16 0.64722186 85 emnlp-2010-Negative Training Data Can be Harmful to Text Classification

17 0.6471417 92 emnlp-2010-Predicting the Semantic Compositionality of Prefix Verbs

18 0.64597797 105 emnlp-2010-Title Generation with Quasi-Synchronous Grammar

19 0.64533108 20 emnlp-2010-Automatic Detection and Classification of Social Events

20 0.64480531 67 emnlp-2010-It Depends on the Translation: Unsupervised Dependency Parsing via Word Alignment