acl acl2011 acl2011-328 knowledge-graph by maker-knowledge-mining

328 acl-2011-Using Cross-Entity Inference to Improve Event Extraction


Source: pdf

Author: Yu Hong ; Jianfeng Zhang ; Bin Ma ; Jianmin Yao ; Guodong Zhou ; Qiaoming Zhu

Abstract: Event extraction is the task of detecting certain specified types of events that are mentioned in the source language data. The state-of-the-art research on the task is transductive inference (e.g. cross-event inference). In this paper, we propose a new method of event extraction that makes good use of cross-entity inference. In contrast to previous inference methods, we regard entity-type consistency as a key feature to predict event mentions. We adopt this inference method to improve the traditional sentence-level event extraction system. Experiments show that we can get an 8.6% gain in trigger (event) identification, and more than an 11.8% gain for argument (role) classification in ACE event extraction.

Reference: text


Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 Event extraction is the task of detecting certain specified types of events that are mentioned in the source language data. [sent-3, score-0.312]

2 The state-of-the-art research on the task is transductive inference (e.g. cross-event inference). [sent-4, score-0.26]

3 In this paper, we propose a new method of event extraction that makes good use of cross-entity inference. [sent-7, score-0.854]

4 In contrast to previous inference methods, we regard entity-type consistency as a key feature to predict event mentions. [sent-8, score-1.094]

5 We adopt this inference method to improve the traditional sentence-level event extraction system. [sent-9, score-0.985]

6 Experiments show that we can get an 8.6% gain in trigger (event) identification, and more than 11. [sent-11, score-0.172]

7 8% gain for argument (role) classification in ACE event extraction. [sent-12, score-0.849]

8 1 Introduction The event extraction task in ACE (Automatic Content Extraction) evaluation involves three challenging issues: distinguishing events of different types, finding the participants of an event, and determining the roles of the participants. [sent-13, score-1.871]

9 Recent research on the task shows the availability of transductive inference, in methods such as cross-document, cross-sentence, and cross-event inference. [sent-14, score-0.149]

10 Transductive inference is a process that uses known instances to predict the attributes of unknown instances. [sent-15, score-0.226]

11 As an example, given a target event, cross-event inference can predict its type by exploiting the related events that co-occur with it within the same document. [sent-16, score-0.44]

12 For sentence (1), it is hard to tell whether it is a Transport event in ACE, which means that he left the place, or an End-Position event, which means that he retired from the company. [sent-18, score-0.841]

13 But cross-event inference can use a related event “Then he went shopping” within the same document to identify it as a Transport event correctly. [sent-19, score-1.677]

14 As the above example might suggest, the availability of transductive inference for event extraction relies heavily on known evidence of an event occurrence under specific conditions. [sent-20, score-1.985]

15 However, the evidence supporting the inference is normally unclear or absent. [sent-21, score-0.224]

16 For instance, the relation among events is the key clue for cross-event inference to predict a target event type, as shown in the inference process for sentence (1). [sent-22, score-1.299]

17 But event relation extraction itself is a hard task in Information Extraction. [sent-23, score-0.895]

18 So cross-event inference often suffers from some false evidence (viz. [sent-24, score-0.209]

19 , misleading by unrelated events) or lack of valid evidence (viz. [sent-25, score-0.049]

20 In this paper, we propose a new method of transductive inference, named cross-entity inference, for event extraction that makes good use of the relations among entities. [sent-27, score-0.983]

21 This method is first motivated by the inherent ability of entity types to reveal event types. [sent-28, score-0.997]

22 It is easy to identify sentence (2) as a Transport event in ACE, which means that he left the place, because nobody would retire (End-Position type) from a bathroom. [sent-31, score-0.818]

23 And compared to the entities in sentences (1) and (2), the entity “Microsoft” in (3) would give us more confidence to tag the “left” event as an End-Position type, because people are used to giving the full name of the place where they retired. [sent-32, score-1.16]

24 The cross-entity inference is also motivated by the phenomenon that the entities of the same type often attend similar events. [sent-33, score-0.41]

25 That gives us a way to predict event type based on entity-type consistency. [sent-34, score-0.909]

26 For sentence (4), it is hard to identify it as an Elect event in ACE, which means Obama wins the Presidential Election, [sent-36, score-0.813]

27 or an Attack event, which means Obama roughs somebody up. [sent-38, score-0.02]

28 But if we have the prior knowledge that the sentence “Bush beats McCain” is an Elect event, and that “Obama” was a presidential contender just like “Bush” (strict type consistency), we have ample evidence to predict that sentence (4) is also an Elect event. [sent-39, score-0.319]

29 Indeed, the above cross-entity inference for event-type identification is not the only use of entity-type consistency. [sent-40, score-0.159]

30 As we shall describe below, we can make use of it for all three issues of event extraction:

31 For event type: the entities of the same type are most likely to attend similar events. [sent-42, score-1.052]

32 And the events often use consistent or synonymous triggers. [sent-43, score-0.173]

33 For event argument (participant): the entities of the same type normally co-occur with similar participants in the events of the same type. [sent-45, score-1.335]

34 For argument role: the arguments of the same type, for the most part, play the same roles in similar events. [sent-47, score-0.172]

35 With the help of the above characteristics of entities, we can perform a step-by-step inference in this order (a sketch follows the three steps below):

36 Step 1: predicting event type and labeling trigger given the entities of the same type. [sent-49, score-1.181]

37 Step 2: identifying arguments of a certain event, given the prior entity type and the event type and trigger obtained in Step 1. [sent-51, score-2.161]

38 Step 3: determining argument roles in a certain event, given the entity type, event type, trigger, and arguments obtained in Steps 1 and 2. [sent-53, score-2.15]
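Sentences 36-38 describe the three-step pipeline. Below is a minimal, hypothetical Python sketch of that flow; the EventMention record, the exact string match standing in for entity-type consistency (the paper instead compares Web-derived type descriptions), and the toy training event are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of the three-step cross-entity inference described above.
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

@dataclass
class EventMention:
    event_type: str                                   # e.g. "Attack", "End-Position"
    trigger: str                                      # trigger word
    arguments: List[Tuple[str, str, str]] = field(default_factory=list)  # (entity, entity type, role)

def same_entity_type(type_a: str, type_b: str) -> bool:
    """Stand-in for the paper's consistency test, which compares Web-derived
    type descriptions rather than literal type strings."""
    return type_a == type_b

def cross_entity_inference(candidate_entities: List[Tuple[str, str]],
                           training_events: List[EventMention]) -> Optional[EventMention]:
    """candidate_entities: (entity, entity type) pairs found in a test sentence."""
    for entity, entity_type in candidate_entities:
        for known in training_events:
            for _, known_type, known_role in known.arguments:
                if same_entity_type(entity_type, known_type):
                    # Step 1: predict the event type and borrow the trigger.
                    # Step 2: identify the type-consistent entity as an argument.
                    # Step 3: assign it the role its congener played.
                    return EventMention(known.event_type, known.trigger,
                                        [(entity, entity_type, known_role)])
    return None  # no entity of a consistent type found among the training events

# Toy usage mirroring examples (6)/(7): a known Attack event whose Target is a
# terrorist leader lets us label a new mention containing an entity of the same type.
train = [EventMention("Attack", "torture",
                      [("Saddam", "terrorist-leader", "Target")])]
print(cross_entity_inference([("al Qaeda chief", "terrorist-leader")], train))
```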

39 On this basis, we give a blind cross-entity inference method for event extraction in this paper. [sent-54, score-0.985]

40 In this method, we first regard entities as queries to retrieve their related documents from large-scale language resources, and use the global evidence of the documents to generate entity-type descriptions. [sent-55, score-0.221]

41 Second, we determine the type consistency of entities by measuring the similarity of the type descriptions. [sent-56, score-0.442]

42 Finally, given the prior attributes of events in the training data, and with the help of the entities of the same type, we perform the step-by-step cross-entity inference on the attributes of test events (candidate sentences). [sent-57, score-0.775]
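Sentences 40-42 outline how entity-type descriptions are built from retrieved documents and then compared. The sketch below assumes the retrieval step has already returned a few related snippets per entity query, collapses them into a crude description, and uses TF-IDF cosine similarity with an invented threshold as a stand-in for the paper's consistency decision.

```python
# Sketch: deciding entity-type consistency from retrieved text snippets.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def type_description(snippets):
    """Collapse the retrieved snippets for one entity into a single description."""
    return " ".join(snippets)

def entity_type_consistent(snippets_a, snippets_b, threshold=0.3):
    """Approximate the type-consistency test with TF-IDF cosine similarity.
    The threshold is an illustrative value, not one taken from the paper."""
    vectorizer = TfidfVectorizer(stop_words="english")
    matrix = vectorizer.fit_transform([type_description(snippets_a),
                                       type_description(snippets_b)])
    return cosine_similarity(matrix[0], matrix[1])[0, 0] >= threshold

# Toy usage: two "presidential contender" entities versus an unrelated one.
obama = ["Obama ran in the presidential election as a candidate"]
bush = ["Bush was a presidential candidate in the election campaign"]
volcano = ["Vesuvius is an active volcano that can erupt"]
print(entity_type_consistent(obama, bush))     # expected True
print(entity_type_consistent(obama, volcano))  # expected False
```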

43 In contrast to other transductive inference methods for event extraction, cross-entity inference makes every effort to strengthen the effect of entities in predicting event occurrences. [sent-58, score-2.081]

44 Thus the inferential process can benefit from the following aspects: 1) less false evidence, viz. [sent-59, score-0.049]

45 less false entity-type consistency (the key clue of cross-entity inference), because the consistency can be determined more precisely with the help of a full entity-type description obtained from related information on the Web; 2) more valid evidence, viz. [sent-60, score-0.285]

46 more entities of the same type (the key references for the inference), because no entity lacks congeners. [sent-61, score-0.43]

47 2 Task Description The event extraction task we are addressing is that of the Automatic Content Extraction (ACE) evaluations, where an event is defined as a specific occurrence involving participants. [sent-62, score-1.66]

48 The event extraction task requires that certain specified types of events mentioned in the source language data be detected. [sent-63, score-1.085]

49 Entity: an object or a set of objects in one of the semantic categories of interest, referred to in the document by one or more (co-referential) entity mentions. [sent-65, score-0.194]

50 Entity mention: a reference to an entity (typically, a noun phrase). [sent-67, score-0.215]

51 Event trigger: the main word that most clearly expresses an event occurrence (An ACE event trigger is generally a verb or a noun). [sent-69, score-1.751]

52 Event arguments: the entity mentions that are involved in an event (viz. [sent-71, score-1.043]

53 Argument roles: the relation of arguments to the event where they participate. [sent-74, score-0.85]

54 Event mention: a phrase or sentence within which an event is described, including trigger and arguments. [sent-76, score-0.945]

55 The 2005 ACE evaluation had 8 types of events, with 33 subtypes; for the purpose of this paper, we will treat these simply as 33 separate event types and do not consider the hierarchical structure among them. [sent-77, score-0.833]

56 In addition, the ACE evaluation plan defines the following standards to determine the correctness of an event extraction (a small sketch follows the three criteria below):

57 A trigger is correctly labeled if its event type and offset (viz. [sent-79, score-1.057]

58 , the position of the trigger word in text) match a reference trigger. [sent-80, score-0.193]

59 An argument is correctly identified if its event type and offsets match any of the reference argument mentions; in other words, this means correctly recognizing the participants in an event. [sent-82, score-1.111]

60 An argument is correctly classified if its role matches any of the reference argument mentions. [sent-84, score-0.234]
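A minimal sketch of the three correctness checks listed in sentences 57-60, using dictionaries with illustrative field names for system output and reference annotations; this is not the official ACE scorer.

```python
# Sketch of the three ACE correctness criteria described above
# (field names are illustrative; this is not the official ACE scoring tool).

def trigger_correct(sys_trigger, ref_triggers):
    """Correct if event type and trigger offset both match some reference trigger."""
    return any(sys_trigger["event_type"] == r["event_type"]
               and sys_trigger["offset"] == r["offset"]
               for r in ref_triggers)

def argument_identified(sys_arg, ref_args):
    """Correct if event type and argument offsets match some reference argument."""
    return any(sys_arg["event_type"] == r["event_type"]
               and sys_arg["offsets"] == r["offsets"]
               for r in ref_args)

def argument_classified(sys_arg, ref_args):
    """Correct if the argument is identified and its role also matches."""
    return any(argument_identified(sys_arg, [r]) and sys_arg["role"] == r["role"]
               for r in ref_args)
```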

61 Consider the sentence: (5) It has refused in the last five years to revoke the license of a single doctor for committing medical errors. [sent-85, score-0.137]

62 Among these, name identification is another hard task in the ACE evaluation and not the focus of this paper. [sent-87, score-0.07]

63 So we skip this phase and instead directly use the entity labels provided by ACE. [sent-88, score-0.194]

64 3 Related Work Almost all the current ACE event extraction systems focus on processing one sentence at a time (Grishman et al. [sent-89, score-0.854]

65 However, there have been several studies using high-level information from a wider scope: Maslennikov and Chua (2007) use discourse trees and local syntactic dependencies in a pattern-based framework to incorporate wider context to refine the performance of relation extraction. [sent-92, score-0.116]

66 They claimed that discourse information could filter noisy dependency paths as well as increase the reliability of dependency path extraction. [sent-93, score-0.044]

67 Finkel et al. (2005) used Gibbs sampling, a simple Monte Carlo method used to perform approximate inference in factored probabilistic models. [sent-95, score-0.131]

68 By using simulated annealing in place of Viterbi decoding in sequence models such as HMMs, CMMs, and CRFs, it is possible to incorporate non-local structure while preserving tractable inference. [sent-96, score-0.029]

69 They used this technique to augment an information extraction system with long-distance dependency models, enforcing label consistency and extraction template consistency constraints. [sent-97, score-0.39]

70 02” in 1129 rowsky, 1995); they extended the scope from a single document to a cluster of topic-related documents and employed a rule-based approach to propagate consistent trigger classification and event arguments across sentences and documents. [sent-101, score-1.003]

71 Combining global evidence from related documents with local decisions, they obtained an appreciable improvement in both event and event argument identification. [sent-102, score-1.712]

72 This unified probabilistic model allows the two components to jointly make decisions based upon both the local evidence surrounding each phrase and the “peripheral vision”. [sent-104, score-0.07]

73 Liao and Grishman (2010) propose document-level cross-event inference to improve event extraction. [sent-106, score-0.904]

74 In contrast to Gupta’s work, Liao and Grishman do not limit themselves to time information for events, but rather use related events and event-type consistency to make predictions or resolve ambiguities regarding a given event. [sent-107, score-0.287]

75 4 Motivation In event extraction, current transductive inference methods focus on the issue that many events are missing or spuriously tagged because the local information is not sufficient to make a confident decision. [sent-108, score-1.248]

76 The solution is to mine credible evidence of event occurrences from global information and regard it as prior knowledge to predict unknown event attributes, as in the cross-document and cross-event inference methods. [sent-109, score-1.87]

77 However, by analyzing the sentence-level baseline event extraction, we found that the entities within a sentence, as the most important local information, actually contain sufficient clues for event detection. [sent-110, score-1.711]

78 This holds only on the premise that we know the backgrounds of the entities beforehand. [sent-111, score-0.174]

79 For instance, if we knew that the entity “Vesuvius” is an active volcano, we could easily identify the word “erupt”, which co-occurred with the entity, as the trigger of a “volcanic eruption” event rather than of a “spotty rash”. [sent-112, score-1.159]

80 In spite of that, it is actually difficult to use an entity to directly infer an event occurrence, because we normally do not know the inherent connection between the background of the entity and the event attributes. [sent-113, score-2.089]

81 But we can make good use of the entities of the same background to perform the inference. [sent-114, score-0.171]

82 In detail, if we first know that entity(a) has the same background as entity(b), and we also know that entity(a) participates in a specific event in a certain role, then we can predict that entity(b) might participate in a similar event in the same role. [sent-115, score-0.932]

83 (6) Bush should torture the al Qaeda chief operations officer. [sent-117, score-0.039]

84 The sentences are two event mentions which have the same attributes (event type: Attack; argument role: Target). [sent-118, score-0.849]

85 From the sentences, we can find that the entities “Saddam” and “Qaeda chief” have the same background (viz. [sent-120, score-0.171]

86 terrorist leader), and they are both arguments of Attack events in the role of Target. [sent-121, score-0.293]

87 So if we previously know either of the event mentions, we can infer the other one with the help of entities of the same background. [sent-122, score-0.947]

88 In a word, the cross-entity inference we propose for event extraction is based on the hypothesis: entities of a consistent type normally participate in similar events in the same role. [sent-123, score-1.082]

89 As we will show below, statistical data from the ACE training corpus support this hypothesis, showing the consistency of event type and role in event mentions where entities of the same type occur. [sent-124, score-2.105]

90 4.1 Entity Consistency and Distribution Within the ACE corpus, there is strong entity consistency: if one entity mention appears in a type [sent-126, score-0.525]

91 of event, other entity mentions of the same type will appear in similar events, and even use the same word to trigger the events. [sent-131, score-0.534]

92 To see this, we calculated the conditional probability (in the ACE corpus) of a certain entity type appearing in the 33 ACE event subtypes. [sent-132, score-1.146]

93 (Caption: conditional probability of an entity type appearing as the 34 ACE role types; only Population-Center, Exploding, and Air entities shown as examples.) As there are 33 event subtypes and 43 entity types, there are potentially 33*43 = 1,419 entity-event combinations. [sent-135, score-1.541]
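Sentence 92 describes computing, from the ACE corpus, the conditional probability of an event subtype given an entity type. A small sketch of that co-occurrence statistic follows; the helper name and the toy counts are invented for illustration and are not ACE figures.

```python
# Sketch: conditional probability of an event subtype given an entity type,
# computed from (entity_type, event_subtype) co-occurrence pairs in a corpus.
from collections import Counter, defaultdict

def conditional_probabilities(cooccurrences):
    """cooccurrences: iterable of (entity_type, event_subtype) pairs."""
    joint = Counter(cooccurrences)
    per_entity = Counter(entity for entity, _ in cooccurrences)
    probs = defaultdict(dict)
    for (entity_type, event_subtype), count in joint.items():
        probs[entity_type][event_subtype] = count / per_entity[entity_type]
    return probs

# Toy usage (counts invented): Population-Center mostly co-occurs with Attack/Transport.
pairs = ([("Population-Center", "Attack")] * 6 +
         [("Population-Center", "Transport")] * 3 +
         [("Population-Center", "Die")] * 1)
print(conditional_probabilities(pairs)["Population-Center"])
# {'Attack': 0.6, 'Transport': 0.3, 'Die': 0.1}
```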

94 For example, the Population-Center entities only occur in 4 types of event mentions with the conditional probability more than 0. [sent-137, score-1.054]

95 From Table 3, we can find that only Attack and Transport events co-occur frequently with Population-Center entities (see Figure 1 and Table 3). [sent-139, score-0.317]

96 Actually, we find that most entity types appear in more restricted event mentions than the Population-Center entity. [sent-145, score-1.073]

97 For example, the Air entity type only co-occurs with 5 event types (Attack, Transport, Die, Transfer-Ownership and Injure), and the Exploding entity type co-occurs with 4 event types (see Figure 1). [sent-146, score-2.015]

98 In particular, they only co-occur with one or two event types with the conditional probability more than 0. [sent-147, score-0.834]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('event', 0.773), ('ace', 0.265), ('entity', 0.194), ('events', 0.173), ('trigger', 0.172), ('entities', 0.144), ('inference', 0.131), ('transductive', 0.129), ('consistency', 0.114), ('transport', 0.106), ('type', 0.092), ('extraction', 0.081), ('mentions', 0.076), ('argument', 0.076), ('elect', 0.072), ('exploding', 0.072), ('attack', 0.062), ('arguments', 0.058), ('priori', 0.052), ('attributes', 0.051), ('obama', 0.051), ('bush', 0.05), ('evidence', 0.049), ('evidences', 0.045), ('subtypes', 0.045), ('mention', 0.045), ('normally', 0.044), ('predict', 0.044), ('presidential', 0.043), ('revoke', 0.043), ('qaeda', 0.043), ('doctor', 0.043), ('attend', 0.043), ('role', 0.041), ('beats', 0.039), ('chief', 0.039), ('grishman', 0.039), ('air', 0.039), ('roles', 0.038), ('liao', 0.037), ('fillers', 0.035), ('gupta', 0.035), ('participants', 0.033), ('occurrence', 0.033), ('regard', 0.032), ('license', 0.031), ('conditional', 0.031), ('know', 0.03), ('types', 0.03), ('false', 0.029), ('place', 0.029), ('certain', 0.028), ('discussing', 0.028), ('clue', 0.028), ('appearing', 0.028), ('identification', 0.028), ('background', 0.027), ('wider', 0.026), ('left', 0.025), ('discourse', 0.024), ('hard', 0.022), ('local', 0.021), ('inef', 0.021), ('cooccurs', 0.021), ('revoked', 0.021), ('spuriously', 0.021), ('volcano', 0.021), ('vesuvius', 0.021), ('tar', 0.021), ('terrorist', 0.021), ('inevitable', 0.021), ('retired', 0.021), ('jianmin', 0.021), ('reference', 0.021), ('ji', 0.021), ('correctly', 0.02), ('name', 0.02), ('availability', 0.02), ('committing', 0.02), ('somebody', 0.02), ('ces', 0.02), ('appreciable', 0.02), ('claimed', 0.02), ('rem', 0.02), ('lro', 0.02), ('nobody', 0.02), ('credible', 0.02), ('knew', 0.02), ('esa', 0.02), ('inferential', 0.02), ('election', 0.02), ('maslennikov', 0.02), ('injure', 0.02), ('step', 0.019), ('relation', 0.019), ('ues', 0.018), ('wins', 0.018), ('iat', 0.018), ('peripheral', 0.018), ('chua', 0.018)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.9999997 328 acl-2011-Using Cross-Entity Inference to Improve Event Extraction

Author: Yu Hong ; Jianfeng Zhang ; Bin Ma ; Jianmin Yao ; Guodong Zhou ; Qiaoming Zhu

Abstract: Event extraction is the task of detecting certain specified types of events that are mentioned in the source language data. The state-of-the-art research on the task is transductive inference (e.g. cross-event inference). In this paper, we propose a new method of event extraction by well using cross-entity inference. In contrast to previous inference methods, we regard entitytype consistency as key feature to predict event mentions. We adopt this inference method to improve the traditional sentence-level event extraction system. Experiments show that we can get 8.6% gain in trigger (event) identification, and more than 11.8% gain for argument (role) classification in ACE event extraction. 1

2 0.6940468 65 acl-2011-Can Document Selection Help Semi-supervised Learning? A Case Study On Event Extraction

Author: Shasha Liao ; Ralph Grishman

Abstract: Annotating training data for event extraction is tedious and labor-intensive. Most current event extraction tasks rely on hundreds of annotated documents, but this is often not enough. In this paper, we present a novel self-training strategy, which uses Information Retrieval (IR) to collect a cluster of related documents as the resource for bootstrapping. Also, based on the particular characteristics of this corpus, global inference is applied to provide more confident and informative data selection. We compare this approach to self-training on a normal newswire corpus and show that IR can provide a better corpus for bootstrapping and that global inference can further improve instance selection. We obtain gains of 1.7% in trigger labeling and 2.3% in role labeling through IR and an additional 1.1% in trigger labeling and 1.3% in role labeling by applying global inference. 1

3 0.63061428 244 acl-2011-Peeling Back the Layers: Detecting Event Role Fillers in Secondary Contexts

Author: Ruihong Huang ; Ellen Riloff

Abstract: The goal of our research is to improve event extraction by learning to identify secondary role filler contexts in the absence of event keywords. We propose a multilayered event extraction architecture that progressively “zooms in” on relevant information. Our extraction model includes a document genre classifier to recognize event narratives, two types of sentence classifiers, and noun phrase classifiers to extract role fillers. These modules are organized as a pipeline to gradually zero in on event-related information. We present results on the MUC-4 event extraction data set and show that this model performs better than previous systems.

4 0.55286574 122 acl-2011-Event Extraction as Dependency Parsing

Author: David McClosky ; Mihai Surdeanu ; Christopher Manning

Abstract: Nested event structures are a common occurrence in both open domain and domain specific extraction tasks, e.g., a “crime” event can cause a “investigation” event, which can lead to an “arrest” event. However, most current approaches address event extraction with highly local models that extract each event and argument independently. We propose a simple approach for the extraction of such structures by taking the tree of event-argument relations and using it directly as the representation in a reranking dependency parser. This provides a simple framework that captures global properties of both nested and flat event structures. We explore a rich feature space that models both the events to be parsed and context from the original supporting text. Our approach obtains competitive results in the extraction of biomedical events from the BioNLP’09 shared task with a F1 score of 53.5% in development and 48.6% in testing.

5 0.23000513 293 acl-2011-Template-Based Information Extraction without the Templates

Author: Nathanael Chambers ; Dan Jurafsky

Abstract: Standard algorithms for template-based information extraction (IE) require predefined template schemas, and often labeled data, to learn to extract their slot fillers (e.g., an embassy is the Target of a Bombing template). This paper describes an approach to template-based IE that removes this requirement and performs extraction without knowing the template structure in advance. Our algorithm instead learns the template structure automatically from raw text, inducing template schemas as sets of linked events (e.g., bombings include detonate, set off, and destroy events) associated with semantic roles. We also solve the standard IE task, using the induced syntactic patterns to extract role fillers from specific documents. We evaluate on the MUC-4 terrorism dataset and show that we induce template structure very similar to handcreated gold structure, and we extract role fillers with an F1 score of .40, approaching the performance of algorithms that require full knowledge of the templates.

6 0.21137078 18 acl-2011-A Latent Topic Extracting Method based on Events in a Document and its Application

7 0.16397458 12 acl-2011-A Generative Entity-Mention Model for Linking Entities with Knowledge Base

8 0.14455567 121 acl-2011-Event Discovery in Social Media Feeds

9 0.14372991 226 acl-2011-Multi-Modal Annotation of Quest Games in Second Life

10 0.13812201 186 acl-2011-Joint Training of Dependency Parsing Filters through Latent Support Vector Machines

11 0.13598168 196 acl-2011-Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models

12 0.13567518 286 acl-2011-Social Network Extraction from Texts: A Thesis Proposal

13 0.12610576 277 acl-2011-Semi-supervised Relation Extraction with Large-scale Word Clustering

14 0.11542336 116 acl-2011-Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers

15 0.11428124 191 acl-2011-Knowledge Base Population: Successful Approaches and Challenges

16 0.10449297 129 acl-2011-Extending the Entity Grid with Entity-Specific Features

17 0.10435433 23 acl-2011-A Pronoun Anaphora Resolution System based on Factorial Hidden Markov Models

18 0.099380232 86 acl-2011-Coreference for Learning to Extract Relations: Yes Virginia, Coreference Matters

19 0.088754259 126 acl-2011-Exploiting Syntactico-Semantic Structures for Relation Extraction

20 0.083746552 128 acl-2011-Exploring Entity Relations for Named Entity Disambiguation


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.172), (1, 0.122), (2, -0.377), (3, 0.039), (4, 0.485), (5, 0.343), (6, -0.167), (7, -0.127), (8, 0.358), (9, 0.056), (10, -0.048), (11, 0.066), (12, -0.031), (13, 0.052), (14, 0.047), (15, 0.068), (16, 0.106), (17, 0.075), (18, -0.022), (19, 0.028), (20, 0.026), (21, -0.032), (22, 0.023), (23, 0.024), (24, -0.021), (25, 0.049), (26, -0.015), (27, 0.007), (28, 0.001), (29, 0.035), (30, 0.013), (31, 0.056), (32, -0.018), (33, 0.0), (34, 0.002), (35, -0.001), (36, -0.0), (37, -0.02), (38, 0.001), (39, -0.011), (40, -0.008), (41, -0.019), (42, -0.006), (43, 0.007), (44, 0.01), (45, -0.002), (46, 0.002), (47, 0.021), (48, 0.017), (49, -0.025)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99232012 328 acl-2011-Using Cross-Entity Inference to Improve Event Extraction

Author: Yu Hong ; Jianfeng Zhang ; Bin Ma ; Jianmin Yao ; Guodong Zhou ; Qiaoming Zhu

Abstract: Event extraction is the task of detecting certain specified types of events that are mentioned in the source language data. The state-of-the-art research on the task is transductive inference (e.g. cross-event inference). In this paper, we propose a new method of event extraction by well using cross-entity inference. In contrast to previous inference methods, we regard entitytype consistency as key feature to predict event mentions. We adopt this inference method to improve the traditional sentence-level event extraction system. Experiments show that we can get 8.6% gain in trigger (event) identification, and more than 11.8% gain for argument (role) classification in ACE event extraction. 1

2 0.95392823 244 acl-2011-Peeling Back the Layers: Detecting Event Role Fillers in Secondary Contexts

Author: Ruihong Huang ; Ellen Riloff

Abstract: The goal of our research is to improve event extraction by learning to identify secondary role filler contexts in the absence of event keywords. We propose a multilayered event extraction architecture that progressively “zooms in” on relevant information. Our extraction model includes a document genre classifier to recognize event narratives, two types of sentence classifiers, and noun phrase classifiers to extract role fillers. These modules are organized as a pipeline to gradually zero in on event-related information. We present results on the MUC-4 event extraction data set and show that this model performs better than previous systems.

3 0.93517458 65 acl-2011-Can Document Selection Help Semi-supervised Learning? A Case Study On Event Extraction

Author: Shasha Liao ; Ralph Grishman

Abstract: Annotating training data for event extraction is tedious and labor-intensive. Most current event extraction tasks rely on hundreds of annotated documents, but this is often not enough. In this paper, we present a novel self-training strategy, which uses Information Retrieval (IR) to collect a cluster of related documents as the resource for bootstrapping. Also, based on the particular characteristics of this corpus, global inference is applied to provide more confident and informative data selection. We compare this approach to self-training on a normal newswire corpus and show that IR can provide a better corpus for bootstrapping and that global inference can further improve instance selection. We obtain gains of 1.7% in trigger labeling and 2.3% in role labeling through IR and an additional 1.1% in trigger labeling and 1.3% in role labeling by applying global inference. 1

4 0.86670554 122 acl-2011-Event Extraction as Dependency Parsing

Author: David McClosky ; Mihai Surdeanu ; Christopher Manning

Abstract: Nested event structures are a common occurrence in both open domain and domain specific extraction tasks, e.g., a “crime” event can cause a “investigation” event, which can lead to an “arrest” event. However, most current approaches address event extraction with highly local models that extract each event and argument independently. We propose a simple approach for the extraction of such structures by taking the tree of event-argument relations and using it directly as the representation in a reranking dependency parser. This provides a simple framework that captures global properties of both nested and flat event structures. We explore a rich feature space that models both the events to be parsed and context from the original supporting text. Our approach obtains competitive results in the extraction of biomedical events from the BioNLP’09 shared task with a F1 score of 53.5% in development and 48.6% in testing.

5 0.62896603 293 acl-2011-Template-Based Information Extraction without the Templates

Author: Nathanael Chambers ; Dan Jurafsky

Abstract: Standard algorithms for template-based information extraction (IE) require predefined template schemas, and often labeled data, to learn to extract their slot fillers (e.g., an embassy is the Target of a Bombing template). This paper describes an approach to template-based IE that removes this requirement and performs extraction without knowing the template structure in advance. Our algorithm instead learns the template structure automatically from raw text, inducing template schemas as sets of linked events (e.g., bombings include detonate, set off, and destroy events) associated with semantic roles. We also solve the standard IE task, using the induced syntactic patterns to extract role fillers from specific documents. We evaluate on the MUC-4 terrorism dataset and show that we induce template structure very similar to handcreated gold structure, and we extract role fillers with an F1 score of .40, approaching the performance of algorithms that require full knowledge of the templates.

6 0.52479911 121 acl-2011-Event Discovery in Social Media Feeds

7 0.51966059 226 acl-2011-Multi-Modal Annotation of Quest Games in Second Life

8 0.43123999 186 acl-2011-Joint Training of Dependency Parsing Filters through Latent Support Vector Machines

9 0.36447716 286 acl-2011-Social Network Extraction from Texts: A Thesis Proposal

10 0.36073059 18 acl-2011-A Latent Topic Extracting Method based on Events in a Document and its Application

11 0.30760399 191 acl-2011-Knowledge Base Population: Successful Approaches and Challenges

12 0.29076228 12 acl-2011-A Generative Entity-Mention Model for Linking Entities with Knowledge Base

13 0.27555636 126 acl-2011-Exploiting Syntactico-Semantic Structures for Relation Extraction

14 0.2700409 40 acl-2011-An Error Analysis of Relation Extraction in Social Media Documents

15 0.25787127 196 acl-2011-Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models

16 0.25316671 291 acl-2011-SystemT: A Declarative Information Extraction System

17 0.25283572 209 acl-2011-Lexically-Triggered Hidden Markov Models for Clinical Document Coding

18 0.22927371 129 acl-2011-Extending the Entity Grid with Entity-Specific Features

19 0.22004864 116 acl-2011-Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers

20 0.21845636 190 acl-2011-Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(5, 0.019), (17, 0.054), (26, 0.026), (31, 0.012), (37, 0.067), (39, 0.033), (41, 0.482), (55, 0.015), (59, 0.065), (72, 0.017), (91, 0.025), (96, 0.069), (97, 0.011)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.94351465 328 acl-2011-Using Cross-Entity Inference to Improve Event Extraction

Author: Yu Hong ; Jianfeng Zhang ; Bin Ma ; Jianmin Yao ; Guodong Zhou ; Qiaoming Zhu

Abstract: Event extraction is the task of detecting certain specified types of events that are mentioned in the source language data. The state-of-the-art research on the task is transductive inference (e.g. cross-event inference). In this paper, we propose a new method of event extraction by well using cross-entity inference. In contrast to previous inference methods, we regard entitytype consistency as key feature to predict event mentions. We adopt this inference method to improve the traditional sentence-level event extraction system. Experiments show that we can get 8.6% gain in trigger (event) identification, and more than 11.8% gain for argument (role) classification in ACE event extraction. 1

2 0.94053042 83 acl-2011-Contrasting Multi-Lingual Prosodic Cues to Predict Verbal Feedback for Rapport

Author: Siwei Wang ; Gina-Anne Levow

Abstract: Verbal feedback is an important information source in establishing interactional rapport. However, predicting verbal feedback across languages is challenging due to languagespecific differences, inter-speaker variation, and the relative sparseness and optionality of verbal feedback. In this paper, we employ an approach combining classifier weighting and SMOTE algorithm oversampling to improve verbal feedback prediction in Arabic, English, and Spanish dyadic conversations. This approach improves the prediction of verbal feedback, up to 6-fold, while maintaining a high overall accuracy. Analyzing highly weighted features highlights widespread use of pitch, with more varied use of intensity and duration.

3 0.91706944 219 acl-2011-Metagrammar engineering: Towards systematic exploration of implemented grammars

Author: Antske Fokkens

Abstract: When designing grammars of natural language, typically, more than one formal analysis can account for a given phenomenon. Moreover, because analyses interact, the choices made by the engineer influence the possibilities available in further grammar development. The order in which phenomena are treated may therefore have a major impact on the resulting grammar. This paper proposes to tackle this problem by using metagrammar development as a methodology for grammar engineering. Iargue that metagrammar engineering as an approach facilitates the systematic exploration of grammars through comparison of competing analyses. The idea is illustrated through a comparative study of auxiliary structures in HPSG-based grammars for German and Dutch. Auxiliaries form a central phenomenon of German and Dutch and are likely to influence many components of the grammar. This study shows that a special auxiliary+verb construction significantly improves efficiency compared to the standard argument-composition analysis for both parsing and generation.

4 0.90553391 189 acl-2011-K-means Clustering with Feature Hashing

Author: Hajime Senuma

Abstract: One of the major problems of K-means is that one must use dense vectors for its centroids, and therefore it is infeasible to store such huge vectors in memory when the feature space is high-dimensional. We address this issue by using feature hashing (Weinberger et al., 2009), a dimension-reduction technique, which can reduce the size of dense vectors while retaining sparsity of sparse vectors. Our analysis gives theoretical motivation and justification for applying feature hashing to Kmeans, by showing how much will the objective of K-means be (additively) distorted. Furthermore, to empirically verify our method, we experimented on a document clustering task.

5 0.89479101 139 acl-2011-From Bilingual Dictionaries to Interlingual Document Representations

Author: Jagadeesh Jagarlamudi ; Hal Daume III ; Raghavendra Udupa

Abstract: Mapping documents into an interlingual representation can help bridge the language barrier of a cross-lingual corpus. Previous approaches use aligned documents as training data to learn an interlingual representation, making them sensitive to the domain of the training data. In this paper, we learn an interlingual representation in an unsupervised manner using only a bilingual dictionary. We first use the bilingual dictionary to find candidate document alignments and then use them to find an interlingual representation. Since the candidate alignments are noisy, we de- velop a robust learning algorithm to learn the interlingual representation. We show that bilingual dictionaries generalize to different domains better: our approach gives better performance than either a word by word translation method or Canonical Correlation Analysis (CCA) trained on a different domain.

6 0.89380932 143 acl-2011-Getting the Most out of Transition-based Dependency Parsing

7 0.89054221 232 acl-2011-Nonparametric Bayesian Machine Transliteration with Synchronous Adaptor Grammars

8 0.88492578 56 acl-2011-Bayesian Inference for Zodiac and Other Homophonic Ciphers

9 0.80970484 185 acl-2011-Joint Identification and Segmentation of Domain-Specific Dialogue Acts for Conversational Dialogue Systems

10 0.62659311 65 acl-2011-Can Document Selection Help Semi-supervised Learning? A Case Study On Event Extraction

11 0.60682029 94 acl-2011-Deciphering Foreign Language

12 0.59908801 196 acl-2011-Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models

13 0.59685695 223 acl-2011-Modeling Wisdom of Crowds Using Latent Mixture of Discriminative Experts

14 0.5814842 33 acl-2011-An Affect-Enriched Dialogue Act Classification Model for Task-Oriented Dialogue

15 0.57125598 135 acl-2011-Faster and Smaller N-Gram Language Models

16 0.56919485 244 acl-2011-Peeling Back the Layers: Detecting Event Role Fillers in Secondary Contexts

17 0.56121933 40 acl-2011-An Error Analysis of Relation Extraction in Social Media Documents

18 0.55915272 12 acl-2011-A Generative Entity-Mention Model for Linking Entities with Knowledge Base

19 0.55667263 316 acl-2011-Unary Constraints for Efficient Context-Free Parsing

20 0.55054802 126 acl-2011-Exploiting Syntactico-Semantic Structures for Relation Extraction