emnlp emnlp2011 emnlp2011-105 knowledge-graph by maker-knowledge-mining

105 emnlp-2011-Predicting Thread Discourse Structure over Technical Web Forums

Source: pdf

Author: Li Wang ; Marco Lui ; Su Nam Kim ; Joakim Nivre ; Timothy Baldwin

Abstract: Online discussion forums are a valuable means for users to resolve specific information needs, both interactively for the participants and statically for users who search/browse over historical thread data. However, the complex structure of forum threads can make it difficult for users to extract relevant information. The discourse structure of web forum threads, in the form of labelled dependency relationships between posts, has the potential to greatly improve information access over web forum archives. In this paper, we present the task of parsing user forum threads to determine the labelled dependencies between posts. Three methods, including a dependency parsing approach, are proposed to jointly classify the links (relationships) between posts and the dialogue act (type) of each link. The proposed methods significantly surpass an informed baseline. We also experiment with “in situ” classification of evolving threads, and establish that our best methods are able to perform equivalently well over partial threads as complete threads.

Reference: text

Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 net Abstract Online discussion forums are a valuable means for users to resolve specific information needs, both interactively for the participants and statically for users who search/browse over historical thread data. [sent-12, score-0.741]

2 However, the complex structure of forum threads can make it difficult for users to extract relevant information. [sent-13, score-0.385]

3 The discourse structure of web forum threads, in the form of labelled dependency relationships between posts, has the potential to greatly improve information access over web forum archives. [sent-14, score-0.533]

4 In this paper, we present the task of parsing user forum threads to determine the labelled dependencies between posts. [sent-15, score-0.461]

5 Three methods, including a dependency parsing approach, are proposed to jointly classify the links (relationships) between posts and the dialogue act (type) of each link. [sent-16, score-0.895]

6 We also experiment with “in situ” classification of evolving threads, and establish that our best methods are able to perform equivalently well over partial threads as complete threads. [sent-18, score-0.312]

7 This research aims at enhancing information access and support sharing, by mining the discourse structure of troubleshooting-oriented web user forum threads. [sent-28, score-0.386]

8 Previous research has shown that simple thread structure information (e. [sent-29, score-0.563]

9 In doing so, we hope to be able to perform richer visualisation of thread structure (e. [sent-34, score-0.563]

10 highlighting the key posts which appear to have led to a successful resolution to a problem), and more finegrained weighting of posts in threads for search purposes. [sent-36, score-0.965]

11 To illustrate the task, we use an example thread, made up of 5 posts from 4 distinct participants, from the CNET forum dataset of Kim et al. [sent-37, score-0.491]

12 The discourse structure of the thread is modelled as a rooted directed acyclic graph ProceEed i n bgusr ogfh t,h Sec 2o0t1la1n Cd,o UnfKer, Jeunlcye o 2n7– E3m1,p 2ir0ic1a1l. [sent-39, score-0.704]

13 c e2th0o1d1s A ins Nocaitautiroanl L foarn Cguoamgpeu Ptartoicoensaslin Lgin,g puaigsetisc 1s3–25, 0+Question-QØuestion Figure 1: A snippeted and annotated CNET thread (DAG) with a dialogue act label associated with each edge of the graph. [sent-41, score-0.915]

14 In this example, UserA initiates the thread with a question (dialogue act = QuestionQuestion) in the first post, by asking how to create an interactive input box on a webpage. [sent-42, score-0.684]

15 UserA responds to UserC to confirm the details of the solution (dialogue act = Answer-Confirmation), and at the same time, adds extra information to his/her original question (dialogue act = Question-Add); i. [sent-44, score-0.304]

16 , this one post has two distinct dependency links associated with it. [sent-46, score-0.292]

17 To predict thread discourse structure of this type, we jointly classify the links and dialogue acts between posts, experimenting with a variety of supervised classification methods, namely dependency parsing and linear-chain conditional random fields. [sent-48, score-1.229]

18 (2010b) who first proposed the task of thread discourse analysis, but only carried out experiments on post linking and post dialogue act classification as separate tasks. [sent-50, score-1.551]

19 , 2010b), whereby a novel post-level dialogue act set was proposed, and used as the basis for annotation of a set of threads taken from CNET. [sent-54, score-0.584]

20 In the original work, we proposed a set of novel features, which we applied to the separate tasks ofpost link classification and dialogue act classification. [sent-55, score-0.601]

21 We later applied the same basic methodology to dialogue act classification over one-on-one live chat data with provided message dependencies (Kim et al. [sent-56, score-0.472]

22 In both cases, however, we tackled only a single task, either link classification (optionally given dialogue act tags) or dialogue act classification, but never the two together. [sent-58, score-0.986]

23 In this paper, we take the obvious step of exploring joint classification of post link and dialogue act tags, to generate full thread discourse structures. [sent-59, score-1.511]

24 link classification) and dialogue act tagging have been studied largely as independent tasks. [sent-62, score-0.514]

25 Discourse disentanglement is the task of dividing a conversation thread (Elsner and Charniak, 2008; Lemon et al. [sent-63, score-0.557]

26 , 2002) or document thread (Wolf and Gibson, 2005) into a set of distinct sub-discourses. [sent-64, score-0.53]

27 For a more complete review of models for discourse disentanglement and dialogue act tagging, see Kim et al. [sent-83, score-0.553]

28 (1997) jointly performed segmentation and dialogue act classification over a German spontaneous speech corpus. [sent-87, score-0.472]

29 In their approach, the predictions of a multi-layer perceptron classifier on dialogue act boundaries were fed into an n-gram language model, which was used for the joint segmentation and classification of dialogue acts. [sent-88, score-0.808]

30 These results suggest that the thread structural representation used in this research, which includes both linking struc15 ture and the dialogue act associated with each link, could potentially provide even greater leverage in these retrieval tasks. [sent-104, score-0.938]

31 Another related research area is post-level classification, such as general post quality classification (Weimer et al. [sent-105, score-0.291]

32 , 2008; Lui and Baldwin, 2009) that thread discourse structure can significantly improve the classification accuracy for postlevel tasks. [sent-112, score-0.791]

33 question–answer, assessment–agreement, and blame–denial) from online forums have the potential to enhance thread summarisation or automatically generate knowledge bases for Community Question Answering (cQA) services such as Yahoo! [sent-115, score-0.657]

34 Our thread discourse structure prediction task includes joint classification of post roles (i. [sent-121, score-1.03]

35 3 Task Description and Data Set The main task performed in this research is joint classification of inter-post links (Link) and dialogue acts (DA) within forum threads. [sent-124, score-0.575]

36 In this, we assume that a post can only link to an earlier post (or a virtual root node), and that dialogue acts are labels on edges. [sent-125, score-0.848]

37 if a post both confirms the validity of an answer and adds extra information to the original question (as happens in Post4 in Figure 1). [sent-128, score-0.295]

38 We experiment with two different approaches to joint classification: (1) a linear-chain CRF over combined Link/DA post labels; and (2) a dependency parser. [sent-129, score-0.293]

39 in a 4-post thread, posts 2 and 3 may be dependent on post 1, and post 4 dependent on post 2; around 2% of the threads in our dataset contain nonprojective dependencies. [sent-133, score-1.194]

40 multi-headedness: it is possible for a given post to have multiple heads, including the possibility of multiple dependency links to the same post (e. [sent-134, score-0.496]

41 adding extra information to a question [Question-Add] as well as retracting information from the original question [QuestionCorrection]); around 6% of the threads in our dataset contain multi-headed dependencies. [sent-136, score-0.285]

42 in instances where a user hijacks a thread to ask their own unrelated question, or submit an unrelated spam post; around 2% of the threads in our dataset contain disconnected sub-graphs. [sent-139, score-0.903]

43 In addition to performing evaluation in batch mode over complete threads, we consider the task of “in situ thread classification”, whereby we predict the discourse structure of a thread after each post. [sent-144, score-1.33]

44 This is intended to simulate the more realistic setting of incrementally crawling/updating thread data, but needing to predict discourse structure for partial 16 threads. [sent-145, score-0.73]

45 (2010b),1 which contains 1332 annotated posts spanning 315 threads, collected from the Operating System, Software, Hardware and Web Development sub-forums of cnet. [sent-149, score-0.383]

46 2 Each post is labelled with one or more links (including the possibility of null-links, where the post doesn’t link to any other post), and each link is labelled with a dialogue act. [sent-150, score-1.035]

47 The dialogue act set is made up of 5 super-categories: Question, Answer, Resolution (confirmation of the question being resolved), Reproduction (external confirmation of a proposed so- lution working) and Other. [sent-151, score-0.457]

48 For full details of the dialogue act tagset, see Kim et al. [sent-157, score-0.385]

49 Unless otherwise noted, evaluation is over the combined link and dialogue act tag, including the combination of superclass and subclass for the Question and Answer dialogue acts. [sent-162, score-0.798]

50 1 Learners To predict thread discourse structure, we use a structured classification approach based on the findings of Kim et al. [sent-167, score-0.758]

51 In our case, our tokens are thread posts, with much greater scope for feature engineering than single words, and techni- cal challenges in scaling the underlying implementations to handle potentially much larger feature sets. [sent-183, score-0.53]

52 In presenting the thread data to MaltParser, we represent the nulllink from the initial post of each thread, as well as any disconnected posts, as the root. [sent-190, score-0.83]

53 To the best of our knowledge, there is no past work on using dependency parsing to learn thread discourse structure. [sent-191, score-0.764]

54 In our choice of parsing algorithm, we are also unable to detect posts with multiple heads, but can potentially detect disconnected sub-graphs. [sent-194, score-0.518]

55 2 Features The features used in our classifiers are as follows: Structural Features: Initiator a binary feature indicating whether the current post’s author is the thread initiator. [sent-196, score-0.552]

56 Position the relative position of the current post, as a ratio over the total number of posts in the thread. [sent-197, score-0.383]

57 the UserProf features for the standalone linking task take the form of the link labels (and not dialogue act labels) of the posts by the relevant author in the training data. [sent-207, score-0.919]

58 Table 1 shows the feature representation of the third post in a thread FeatureValueExplanation Initiator1. [sent-208, score-0.734]

59 0 most similar to post 1 UserProf counts for posts of each class from the same author in the training data x Table 1: The feature presentation of the third post in a thread of length 8 of length 8. [sent-215, score-1.368]

60 5 Classification Methodology All our experiments were carried out based on stratified 10-fold cross-validation, stratifying at the thread level to ensure that all posts from a given thread occur in a single fold. [sent-219, score-1.443]

61 the proportion of threads where all posts have been correctly classified4), where space allows. [sent-222, score-0.582]

62 Initial experiments showed it is hard for learners to discover which posts have multiple links, largely due to the sparsity of multi-headed posts (which account for less than 5% of the total posts). [sent-225, score-0.794]

63 html # pars ingalg 4Classification accuracy = F-score at the thread-level, each thread is assigned a single label of correct or incorrect. [sent-228, score-0.53]

64 Even if the same number of labels is predicted for both Link and DA, if multiple tags are predicted in both cases, we are left with the problem of determining which link label to combine with which dialogue act label. [sent-233, score-0.514]

65 2 In Situ Thread Classification One of the biggest challenges in classifying the discourse structure of a forum thread is that threads evolve over time, as new posts are posted. [sent-237, score-1.425]

66 In order to capture this phenomenon, and compare the accuracy of different models when applied to partial thread data (artificially cutting off a thread at post N) vs. [sent-238, score-1.29]

67 5 This is done in the following way: classification over the first two posts only ([1, 2]), the first four posts ([1, 4]), the first six posts ([1, 6]), the first eight posts ([1, 8]), and all posts ([all]). [sent-240, score-2.002]

68 In each case, we limit the test data only, meaning that the only variable in play is the extent of thread context used to learn the thread discourse structure for the given set of posts. [sent-241, score-1.234]

69 5In practice, completeness is defined at a given point in time, when the crawl was done, and it is highly likely that some of the “complete” threads had extra posts after the crawl. [sent-245, score-0.608]

70 A stronger baseline is to classify all first posts as 0+QuestionQuestion and all subsequent posts as 1+Answeranswer, which achieves a post-level F-score of 0. [sent-258, score-0.766]

71 2 Post Position-based Result Breakdown One question in thread discourse structure classification is how accurate the predictions are at different depths in a thread (e. [sent-357, score-1.391]

72 A breakdown of results across posts at different positions is presented in Figure 2. [sent-361, score-0.426]

73 The overall trend for both CRFSGD and MaltParser is that it becomes increasingly hard to classify posts as we continue through a thread, due to greater variability in discourse structure and greater sparsity in the data. [sent-362, score-0.557]

74 However, it is interesting to note that the results for CRFSGD actually improve from posts 7 and 8 ([7, 8]) to posts 9 and onwards ([9, ]). [sent-363, score-0.766]

75 To further investigate this effect, we performed class decomposition over the joint classification predictions, and performed a similar breakdown of posts 20 Figure 2: Breakdown of post-level Link-DA results for CRFSGD and MaltParser based on post position Decomposed Link Fµ0. [sent-364, score-0.777]

76 It is clear that the anomaly for CRFSGD comes from the DA component, due to there being greater predictability in the dialogue for final posts in a thread (users tend to confirm a successful resolution of the problem, or report on successful external reproduction of the solution). [sent-367, score-1.174]

77 MaltParser seems less adept at identifying that a post is at the end of a thread, and predicting the dialogue act accordingly. [sent-368, score-0.617]

78 The higher results for Link are to be expected, as throughout the thread, most posts tend to link locally. [sent-370, score-0.512]

79 l7]38 Table 5: Post-level Link-DA F-score for CRFSGD/MaltParser, based on in situ classification over sub-threads of different lengths (indicated in the rows), broken down over different post extents (indicated in the columns) 6. [sent-381, score-0.408]

80 2, we simulate in situ thread discourse structure prediction by removing differing numbers ofposts from the tail ofthe thread, and applying the trained model over the resultant sub-threads. [sent-383, score-0.8]

81 we cannot return results for posts 1–4 ([1, 4]) when the size of the test thread was only two posts ([1, 2]). [sent-387, score-1.296]

82 From this, we can conclude that it is possible to apply our method to partial threads without any reduction in effectiveness relative to classification over complete threads. [sent-389, score-0.312]

83 To gain a deeper insight into the behaviour of the feature, we binned the posts according to the number oftimes the author had posted in the training data, evaluated based on a 21 BHiinghus2c24or. [sent-393, score-0.43]

84 742∼ 14 8134505793 3097957 Table 6: Statistics for the 4 groups of users user score (uscore) for each user: uscorei=Pjn=in1ispi,j where ni is the number of posts by user i, and spi,j is the number of posts by user ithat occur as training instances for other posts by the same author. [sent-395, score-1.428]

85 uscore reflects the average training–test post ratio per user in cross-validation. [sent-396, score-0.346]

86 Note that as we include all posts from a given thread in a single partition during crossvalidation, it is possible for an author to have posted 4 times, but have a uscore of 0 due to those posts all occurring in the same thread. [sent-397, score-1.382]

87 The users were binned into 4 groups of roughly equal post size. [sent-399, score-0.274]

88 The detailed statistics are shown in Table 6, noting that the high-frequency bin (“High”) contains posts from a single user. [sent-400, score-0.383]

89 We present the post-level micro-averaged F-score for posts in each bin based on CRFSGD, with and without user profile features, in Figure 4. [sent-401, score-0.5]

90 In fact, a statistically significant difference was observed only for users with no posts in the training data (uscore = 0), where the F-score jumped over 10% in absolute terms for both the Low and Very Low bins. [sent-403, score-0.428]

91 Our explanation for this effect is that the Post-level joinUts celra Gsrosuipfication Figure 4: results for users binned by uscore, based on CRFSGD with and without UserProf features) lack of user profile information is predictive of the sort of posts we can expect from a user (i. [sent-404, score-0.648]

92 7 Conclusions and Future Work In this research, we explored the joint classification of web user forum thread discourse structure, in the form of a rooted directed acyclic graph over posts, with edges labelled with dialogue acts. [sent-407, score-1.303]

93 Three classification approaches were proposed: separately predicting Link and DA labels, and composing them into a joint class; predicting a combined Link-DA class using a structured classifier; and applying dependency parsing to the problem. [sent-408, score-0.296]

94 We also examined the task of in situ classification of dialogue structure, in the form of predicting the discourse structure of partial threads, as contrasted with classifying only complete threads. [sent-410, score-0.703]

95 We found that there was no drop in F-score over different subextents of the thread in classifying partial threads, despite the relative lack of thread context. [sent-411, score-1.117]

96 Our user profile features were found to be the pick of our features, but counter-intuitively, to bene22 fit users with no posts in the training data, rather than prolific users. [sent-414, score-0.545]

97 Acknowledgements The authors wish to acknowledge the development efforts of Johan Hall in configuring MaltParser to handle numeric features, and be able to parse thread structures. [sent-416, score-0.53]

98 It pays to be picky: An evaluation of thread retrieval in online forums. [sent-482, score-0.58]

99 Automatic instant messaging dialogue using statistical models and dialogue acts. [sent-507, score-0.549]

100 Towards internet-age pharmacovigilance: Extracting adverse drug reactions from user posts in healthrelated social networks. [sent-531, score-0.461]

similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('thread', 0.53), ('posts', 0.383), ('crfsgd', 0.34), ('dialogue', 0.261), ('post', 0.204), ('threads', 0.199), ('maltparser', 0.161), ('discourse', 0.141), ('link', 0.129), ('act', 0.124), ('userprof', 0.117), ('forum', 0.108), ('forums', 0.1), ('disconnected', 0.096), ('situ', 0.096), ('classification', 0.087), ('lui', 0.085), ('user', 0.078), ('da', 0.074), ('kim', 0.071), ('uscore', 0.064), ('ros', 0.055), ('dependency', 0.054), ('acts', 0.05), ('initiator', 0.046), ('seo', 0.046), ('users', 0.045), ('breakdown', 0.043), ('carolyn', 0.042), ('confirmation', 0.042), ('weimer', 0.042), ('predictions', 0.04), ('parsing', 0.039), ('profile', 0.039), ('labelled', 0.037), ('joakim', 0.037), ('cong', 0.037), ('dummy', 0.036), ('joint', 0.035), ('answer', 0.035), ('baldwin', 0.034), ('nivre', 0.034), ('links', 0.034), ('crf', 0.033), ('newsgroup', 0.033), ('structure', 0.033), ('cnet', 0.032), ('fscores', 0.032), ('initiation', 0.032), ('shriberg', 0.032), ('signifies', 0.032), ('wanas', 0.032), ('wolf', 0.032), ('classifying', 0.031), ('question', 0.03), ('timothy', 0.03), ('wang', 0.029), ('composition', 0.029), ('predicting', 0.028), ('learners', 0.028), ('carvalho', 0.027), ('messaging', 0.027), ('disentanglement', 0.027), ('online', 0.027), ('web', 0.026), ('extra', 0.026), ('email', 0.026), ('mcdonald', 0.026), ('partial', 0.026), ('class', 0.025), ('binned', 0.025), ('nam', 0.025), ('pages', 0.024), ('chan', 0.024), ('retrieval', 0.023), ('elsner', 0.023), ('subclass', 0.023), ('proceedings', 0.022), ('author', 0.022), ('decomposed', 0.022), ('classifications', 0.022), ('lemon', 0.022), ('labelling', 0.022), ('conference', 0.021), ('answeranswer', 0.021), ('conor', 0.021), ('decomp', 0.021), ('elsas', 0.021), ('excount', 0.021), ('extents', 0.021), ('fortuna', 0.021), ('hayes', 0.021), ('interactively', 0.021), ('mcaarltflpsagrsder', 0.021), ('muthmann', 0.021), ('postsim', 0.021), ('qucount', 0.021), ('questionquestion', 0.021), ('schuth', 0.021), ('sondhi', 0.021)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 1.0000005 105 emnlp-2011-Predicting Thread Discourse Structure over Technical Web Forums

Author: Li Wang ; Marco Lui ; Su Nam Kim ; Joakim Nivre ; Timothy Baldwin

2 0.20379986 27 emnlp-2011-Classifying Sentences as Speech Acts in Message Board Posts

Author: Ashequl Qadir ; Ellen Riloff

Abstract: This research studies the text genre of message board forums, which contain a mixture of expository sentences that present factual information and conversational sentences that include communicative acts between the writer and readers. Our goal is to create sentence classifiers that can identify whether a sentence contains a speech act, and can recognize sentences containing four different speech act classes: Commissives, Directives, Expressives, and Representatives. We conduct experiments using a wide variety of features, including lexical and syntactic features, speech act word lists from external resources, and domain-specific semantic class features. We evaluate our results on a collection of message board posts in the domain of veterinary medicine.

3 0.14252537 126 emnlp-2011-Structural Opinion Mining for Graph-based Sentiment Representation

Author: Yuanbin Wu ; Qi Zhang ; Xuanjing Huang ; Lide Wu

Abstract: Based on analysis of on-line review corpus we observe that most sentences have complicated opinion structures and they cannot be well represented by existing methods, such as frame-based and feature-based ones. In this work, we propose a novel graph-based representation for sentence level sentiment. An integer linear programming-based structural learning method is then introduced to produce the graph representations of input sentences. Experimental evaluations on a manually labeled Chinese corpus demonstrate the effectiveness of the proposed approach.

4 0.1159291 38 emnlp-2011-Data-Driven Response Generation in Social Media

Author: Alan Ritter ; Colin Cherry ; William B. Dolan

Abstract: Ottawa, Ontario, K1A 0R6 Co l . Cherry@ nrc-cnrc . gc . ca in Redmond, WA 98052 bi l ldol @mi cro so ft . com large corpus of status-response pairs found on Twitter to create a system that responds to Twitter status We present a data-driven approach to generating responses to Twitter status posts, based on phrase-based Statistical Machine Translation. We find that mapping conversational stimuli onto responses is more difficult than translating between languages, due to the wider range of possible responses, the larger fraction of unaligned words/phrases, and the presence of large phrase pairs whose alignment cannot be further decomposed. After addressing these challenges, we compare approaches based on SMT and Information Retrieval in a human evaluation. We show that SMT outperforms IR on this task, and its output is preferred over actual human responses in 15% of cases. As far as we are aware, this is the first work to investigate the use of phrase-based SMT to directly translate a linguistic stimulus into an appropriate response.

5 0.094828919 142 emnlp-2011-Unsupervised Discovery of Discourse Relations for Eliminating Intra-sentence Polarity Ambiguities

Author: Lanjun Zhou ; Binyang Li ; Wei Gao ; Zhongyu Wei ; Kam-Fai Wong

Abstract: Polarity classification of opinionated sentences with both positive and negative sentiments1 is a key challenge in sentiment analysis. This paper presents a novel unsupervised method for discovering intra-sentence level discourse relations for eliminating polarity ambiguities. Firstly, a discourse scheme with discourse constraints on polarity was defined empirically based on Rhetorical Structure Theory (RST). Then, a small set of cuephrase-based patterns were utilized to collect a large number of discourse instances which were later converted to semantic sequential representations (SSRs). Finally, an unsupervised method was adopted to generate, weigh and filter new SSRs without cue phrases for recognizing discourse relations. Experimental results showed that the proposed methods not only effectively recognized the defined discourse relations but also achieved significant improvement by integrating discourse information in sentence-level polarity classification.

6 0.068010211 4 emnlp-2011-A Fast, Accurate, Non-Projective, Semantically-Enriched Parser

7 0.066966973 94 emnlp-2011-Modelling Discourse Relations for Arabic

8 0.063678339 75 emnlp-2011-Joint Models for Chinese POS Tagging and Dependency Parsing

9 0.058691613 24 emnlp-2011-Bootstrapping Semantic Parsers from Conversations

10 0.058300357 104 emnlp-2011-Personalized Recommendation of User Comments via Factor Models

11 0.055139851 50 emnlp-2011-Evaluating Dependency Parsing: Robust and Heuristics-Free Cross-Annotation Evaluation

12 0.054722164 41 emnlp-2011-Discriminating Gender on Twitter

13 0.053866848 92 emnlp-2011-Minimally Supervised Event Causality Identification

14 0.053026736 84 emnlp-2011-Learning the Information Status of Noun Phrases in Spoken Dialogues

15 0.049791183 62 emnlp-2011-Generating Subsequent Reference in Shared Visual Scenes: Computation vs Re-Use

16 0.049451955 28 emnlp-2011-Closing the Loop: Fast, Interactive Semi-Supervised Annotation With Queries on Features and Instances

17 0.048920039 102 emnlp-2011-Parse Correction with Specialized Models for Difficult Attachment Types

18 0.047102679 8 emnlp-2011-A Model of Discourse Predictions in Human Sentence Processing

19 0.043046776 117 emnlp-2011-Rumor has it: Identifying Misinformation in Microblogs

20 0.040441655 7 emnlp-2011-A Joint Model for Extended Semantic Role Labeling

similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.153), (1, -0.081), (2, 0.012), (3, 0.081), (4, 0.012), (5, -0.072), (6, 0.013), (7, 0.022), (8, -0.0), (9, -0.237), (10, -0.133), (11, -0.192), (12, -0.034), (13, 0.041), (14, -0.04), (15, 0.082), (16, 0.092), (17, -0.045), (18, 0.109), (19, -0.086), (20, 0.043), (21, 0.09), (22, -0.415), (23, -0.021), (24, 0.06), (25, 0.112), (26, -0.104), (27, -0.111), (28, -0.067), (29, -0.129), (30, 0.029), (31, 0.023), (32, -0.22), (33, 0.09), (34, -0.005), (35, -0.029), (36, 0.202), (37, 0.046), (38, -0.012), (39, -0.06), (40, -0.018), (41, -0.03), (42, -0.151), (43, -0.027), (44, -0.079), (45, 0.024), (46, -0.026), (47, -0.059), (48, -0.006), (49, 0.018)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.94434094 105 emnlp-2011-Predicting Thread Discourse Structure over Technical Web Forums

Author: Li Wang ; Marco Lui ; Su Nam Kim ; Joakim Nivre ; Timothy Baldwin

2 0.82479346 27 emnlp-2011-Classifying Sentences as Speech Acts in Message Board Posts

Author: Ashequl Qadir ; Ellen Riloff

3 0.45723638 126 emnlp-2011-Structural Opinion Mining for Graph-based Sentiment Representation

Author: Yuanbin Wu ; Qi Zhang ; Xuanjing Huang ; Lide Wu

4 0.32361978 38 emnlp-2011-Data-Driven Response Generation in Social Media

Author: Alan Ritter ; Colin Cherry ; William B. Dolan

5 0.31066242 142 emnlp-2011-Unsupervised Discovery of Discourse Relations for Eliminating Intra-sentence Polarity Ambiguities

Author: Lanjun Zhou ; Binyang Li ; Wei Gao ; Zhongyu Wei ; Kam-Fai Wong

6 0.28157002 24 emnlp-2011-Bootstrapping Semantic Parsers from Conversations

7 0.24157336 94 emnlp-2011-Modelling Discourse Relations for Arabic

8 0.23837806 84 emnlp-2011-Learning the Information Status of Noun Phrases in Spoken Dialogues

9 0.20895876 104 emnlp-2011-Personalized Recommendation of User Comments via Factor Models

10 0.19421275 75 emnlp-2011-Joint Models for Chinese POS Tagging and Dependency Parsing

11 0.19095746 117 emnlp-2011-Rumor has it: Identifying Misinformation in Microblogs

12 0.18599138 4 emnlp-2011-A Fast, Accurate, Non-Projective, Semantically-Enriched Parser

13 0.18425199 62 emnlp-2011-Generating Subsequent Reference in Shared Visual Scenes: Computation vs Re-Use

14 0.16688953 148 emnlp-2011-Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation.

15 0.16503513 28 emnlp-2011-Closing the Loop: Fast, Interactive Semi-Supervised Annotation With Queries on Features and Instances

16 0.16308889 32 emnlp-2011-Computing Logical Form on Regulatory Texts

17 0.16193499 41 emnlp-2011-Discriminating Gender on Twitter

18 0.1616191 7 emnlp-2011-A Joint Model for Extended Semantic Role Labeling

19 0.16025071 12 emnlp-2011-A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents

20 0.15896659 102 emnlp-2011-Parse Correction with Specialized Models for Difficult Attachment Types

similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(17, 0.324), (23, 0.095), (36, 0.03), (37, 0.032), (45, 0.062), (53, 0.021), (54, 0.02), (57, 0.023), (62, 0.015), (64, 0.026), (66, 0.019), (79, 0.037), (82, 0.026), (87, 0.038), (90, 0.021), (96, 0.062), (98, 0.031)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.74552613 105 emnlp-2011-Predicting Thread Discourse Structure over Technical Web Forums

Author: Li Wang ; Marco Lui ; Su Nam Kim ; Joakim Nivre ; Timothy Baldwin

2 0.58775175 79 emnlp-2011-Lateen EM: Unsupervised Training with Multiple Objectives, Applied to Dependency Grammar Induction

Author: Valentin I. Spitkovsky ; Hiyan Alshawi ; Daniel Jurafsky

Abstract: We present new training methods that aim to mitigate local optima and slow convergence in unsupervised training by using additional imperfect objectives. In its simplest form, lateen EM alternates between the two objectives of ordinary “soft” and “hard” expectation maximization (EM) algorithms. Switching objectives when stuck can help escape local optima. We find that applying a single such alternation already yields state-of-the-art results for English dependency grammar induction. More elaborate lateen strategies track both objectives, with each validating the moves proposed by the other. Disagreements can signal earlier opportunities to switch or terminate, saving iterations. De-emphasizing fixed points in these ways eliminates some guesswork from tuning EM. An evaluation against a suite of unsupervised dependency parsing tasks, for a vari- ety of languages, showed that lateen strategies significantly speed up training of both EM algorithms, and improve accuracy for hard EM.

3 0.43463773 27 emnlp-2011-Classifying Sentences as Speech Acts in Message Board Posts

Author: Ashequl Qadir ; Ellen Riloff

4 0.41039822 75 emnlp-2011-Joint Models for Chinese POS Tagging and Dependency Parsing

Author: Zhenghua Li ; Min Zhang ; Wanxiang Che ; Ting Liu ; Wenliang Chen ; Haizhou Li

Abstract: Part-of-speech (POS) is an indispensable feature in dependency parsing. Current research usually models POS tagging and dependency parsing independently. This may suffer from error propagation problem. Our experiments show that parsing accuracy drops by about 6% when using automatic POS tags instead of gold ones. To solve this issue, this paper proposes a solution by jointly optimizing POS tagging and dependency parsing in a unique model. We design several joint models and their corresponding decoding algorithms to incorporate different feature sets. We further present an effective pruning strategy to reduce the search space of candidate POS tags, leading to significant improvement of parsing speed. Experimental results on Chinese Penn Treebank 5 show that our joint models significantly improve the state-of-the-art parsing accuracy by about 1.5%. Detailed analysis shows that the joint method is able to choose such POS tags that are more helpful and discriminative from parsing viewpoint. This is the fundamental reason of parsing accuracy improvement.

5 0.40809497 108 emnlp-2011-Quasi-Synchronous Phrase Dependency Grammars for Machine Translation

Author: Kevin Gimpel ; Noah A. Smith

Abstract: We present a quasi-synchronous dependency grammar (Smith and Eisner, 2006) for machine translation in which the leaves of the tree are phrases rather than words as in previous work (Gimpel and Smith, 2009). This formulation allows us to combine structural components of phrase-based and syntax-based MT in a single model. We describe a method of extracting phrase dependencies from parallel text using a target-side dependency parser. For decoding, we describe a coarse-to-fine approach based on lattice dependency parsing of phrase lattices. We demonstrate performance improvements for Chinese-English and UrduEnglish translation over a phrase-based baseline. We also investigate the use of unsupervised dependency parsers, reporting encouraging preliminary results.

6 0.40008953 136 emnlp-2011-Training a Parser for Machine Translation Reordering

7 0.39838982 28 emnlp-2011-Closing the Loop: Fast, Interactive Semi-Supervised Annotation With Queries on Features and Instances

8 0.39655808 59 emnlp-2011-Fast and Robust Joint Models for Biomedical Event Extraction

9 0.39249191 98 emnlp-2011-Named Entity Recognition in Tweets: An Experimental Study

10 0.39167425 1 emnlp-2011-A Bayesian Mixture Model for PoS Induction Using Multiple Features

11 0.3915447 68 emnlp-2011-Hypotheses Selection Criteria in a Reranking Framework for Spoken Language Understanding

12 0.39137134 87 emnlp-2011-Lexical Generalization in CCG Grammar Induction for Semantic Parsing

13 0.3911849 123 emnlp-2011-Soft Dependency Constraints for Reordering in Hierarchical Phrase-Based Translation

14 0.39048213 128 emnlp-2011-Structured Relation Discovery using Generative Models

15 0.39004779 134 emnlp-2011-Third-order Variational Reranking on Packed-Shared Dependency Forests

16 0.39004192 137 emnlp-2011-Training dependency parsers by jointly optimizing multiple objectives

17 0.38876069 132 emnlp-2011-Syntax-Based Grammaticality Improvement using CCG and Guided Search

18 0.38815257 81 emnlp-2011-Learning General Connotation of Words using Graph-based Algorithms

19 0.38803217 53 emnlp-2011-Experimental Support for a Categorical Compositional Distributional Model of Meaning

20 0.38620654 22 emnlp-2011-Better Evaluation Metrics Lead to Better Machine Translation