acl acl2013 acl2013-16 knowledge-graph by maker-knowledge-mining

16 acl-2013-A Novel Translation Framework Based on Rhetorical Structure Theory

Source: pdf

Author: Mei Tu ; Yu Zhou ; Chengqing Zong

Abstract: Rhetorical structure theory (RST) is widely used for discourse understanding, which represents a discourse as a hierarchically semantic structure. In this paper, we propose a novel translation framework with the help of RST. In our framework, the translation process mainly includes three steps: 1) Source RST-tree acquisition: a source sentence is parsed into an RST tree; 2) Rule extraction: translation rules are extracted from the source tree and the target string via bilingual word alignment; 3) RST-based translation: the source RST-tree is translated with translation rules. Experiments on Chinese-to-English show that our RST-based approach achieves improvements of 2.3/0.77/1.43 BLEU points on NIST04/NIST05/CWMT2008 respectively. 1

Reference: text

Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 A Novel Translation Framework Based on Rhetorical Structure Theory Mei Tu Yu Zhou Chengqing Zong National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences { mtu yzhou cqz ong } @nlpr . [sent-1, score-0.058]

2 cn , , Abstract Rhetorical structure theory (RST) is widely used for discourse understanding, which represents a discourse as a hierarchically semantic structure. [sent-4, score-0.331]

3 In this paper, we propose a novel translation framework with the help of RST. [sent-5, score-0.207]

4 1 Introduction For statistical machine translation (SMT), a crucial issue is how to build a translation model to extract as much accurate and generative translation knowledge as possible. [sent-11, score-0.528]

5 We think the deep reason is that those models only extract translation information on lexical or syntactic level, but fail to give an overall understanding of source sentences on semantic level of discourse. [sent-14, score-0.263]

6 , 2011; Wong and Kit, 2012) build discourse-based translation models to ensure the lexical coherence or consistency. [sent-17, score-0.176]

7 Although some lexicons can be translated better by their models, the overall structure still remains unnatural. [sent-18, score-0.084]

8 (2000) design a discourse structure transferring module, but leave much work to do, especially on how to integrate this module into SMT and how to automatically analyze the structures. [sent-20, score-0.161]

9 Those reasons urge us to seek a new translation framework under the idea of “translation with overall understanding”. [sent-21, score-0.207]

10 Rhetorical structure theory (RST) (Mann and Thompson, 1988) provides us with a good perspective and inspiration to build such a framework. [sent-22, score-0.083]

11 Generally, an RST tree can explicitly show the minimal spans with semantic functional integrity, which are called elementary discourse units (edus) (Marcu et al. [sent-23, score-0.308]

12 , 2000), and it also depicts the hierarchical relations among edus. [sent-24, score-0.031]

13 Furthermore, since different languages’ edus are usually equivalent on semantic level, it is intuitive to create a new framework based on RST by directly mapping the source edus to target ones. [sent-25, score-0.437]

14 1 Annotation of Chinese RST Tree Similar to (Soricut and Marcu, 2003), a node of RST tree is represented as a tuple R-[s, m, e], which means the relation R controls two semantic spans U1 and U2 , U1 starts from word position s and stops at word position m. [sent-29, score-0.34]

15 1 Although the rupe 's nominal rate against he dol ar was held down , India's real exchange rate rosebecause of high inflation . [sent-35, score-0.029]

16 rhetorical relations for Chinese particularly, upon which our Chinese RST parser is developed. [sent-37, score-0.311]

17 Figure 1 illustrates an example of Chinese RST tree and its alignment to the English string. [sent-38, score-0.105]

18 The Antithesis relation controls U1 from 0 to 9 and U2 from 10 to 21. [sent-40, score-0.1]

19 Different shadow blocks denote the alignments of different edus. [sent-42, score-0.029]

20 Links between source and target words are alignments of cue words. [sent-43, score-0.24]

21 Cue words are viewed as the strongest clues for rhetorical relation recognition and always found at the beginning of text (Reitter, 2003), such as “即使(although), 由于(because of)”. [sent-44, score-0.293]

22 With the cue words included, the relations are much easier to be analyzed. [sent-45, score-0.191]

23 So we focus on the explicit relations with cue words in this paper as our first try. [sent-46, score-0.191]

24 One is the segmentation of edu and the other is the relation tagging between two semantic spans. [sent-49, score-0.097]

25 Inspired by the features used in English RST parser (Soricut and Marcu, 2003; Reitter, 2003; Duverle and Prendinger, 2009; Hernault et al. [sent-53, score-0.067]

26 , 2010a), we design a Bayesian model to build a joint parser for segmentation and tagging simultaneously. [sent-54, score-0.114]

27 In the table, punctuations include comma, semicolons, period and question mark. [sent-56, score-0.043]

28 We view explicit connectives as cue words in this paper. [sent-57, score-0.16]

29 Figure 2 illustrates the conditional independences of 9 features which are denoted with F1~F9. [sent-58, score-0.105]

30 F1F2mF8F3RelF4F5Fe6F7F9 Figure 2: The graph for conditional independences of 9 features. [sent-59, score-0.105]

31 The segmentation and parsing conditional probabilities are computed as follows: P(mjF19) = P(mjF13; F8) (1) P(ejF19) = P(ejF47;F9) (2) P(ReljF19) = P(ReljF34) (3) where Fn represents the nth feature , Fnl means features from n to l. [sent-60, score-0.131]

32 (1) and (2) describe the conditional probabilities of m and e. [sent-62, score-0.084]

33 Finally, the relation is figured out by Formula (3). [sent-69, score-0.05]

34 A complete RST tree con- structs until the end of the iterative process for this sentence. [sent-71, score-0.091]

35 It is plausible in our cases, because we only have a small scale of manually-annotated Chinese RST corpus, which prefers simple rather than complicated models. [sent-73, score-0.056]

36 1 Translation Model Rule Extraction As shown in Figure 1, the RST tree-to-string alignment provides us with two types of translation rules. [sent-75, score-0.219]

37 The other is RST tree-tostring rule, and it’s defined as, relation ::U1(®; X)=U2(°; Y ) ) U1(tr(®); tr(X)) » U2(tr(°); tr(Y )) where the terminal characters α and γ represent the cue words which are optimum match for maximizing Formula (3). [sent-78, score-0.21]

38 The operator ~ is an operator to indicate that the order of tr(U1) and tr(U2) is monotone or reverse. [sent-81, score-0.072]

39 During rules’ extraction, if the mean position of all the words in tr(U1) precedes that in tr(U2), ~ is monotone. [sent-82, score-0.029]

40 For example in Figure 1, the Reason relation controls U1: [10,13] and U2: [14,21]. [sent-84, score-0.1]

41 Because the mean position of tr(U2) is before that of tr(U1), the reverse order is selected. [sent-85, score-0.029]

42 We list the RSTbased rules for Example 1in Figure 1. [sent-86, score-0.054]

43 2 Probabilities Estimation For the phrase-based translation rules, we use four common probabilities and the probabilities’ estimation is the same with those in (Koehn et al. [sent-88, score-0.221]

44 While the probabilities of RST-based translation rules are given as follows, (1) P(rejrf;Rel) CCouounnt(tr(er;frf;r;erlealtaiotino)n): where = re is the target side of the rule, ignorance of the order, i. [sent-90, score-0.304]

45 U1(tr(®); tr(X)) » U2(tr(°); tr(Y )) with two directions, rf is the source side, i. [sent-92, score-0.165]

46 U1(®; X)=U2(°; Y) , and Rel means the relation type. [sent-94, score-0.05]

47 ¿ 2 fmonotone; It is the conditional probability of re-ordering. [sent-96, score-0.039]

48 4 Decoding The decoding procedure of a discourse can be derived from the original decoding formula e1I = argmaxe1IP(e1I jfJ1) . [sent-97, score-0.329]

49 es is the target string combined by series of en (translations of fn). [sent-99, score-0.029]

50 eu1 and eu2 are translations of U1 and U2 respectively. [sent-101, score-0.055]

51 fcp and ecp are cue-words pair of source and target sides. [sent-103, score-0.212]

52 The first and second factors are just the probabilities introduced in Section 3. [sent-104, score-0.045]

53 Suppose the best rules selected by (4) are just those written in the figure, Then span [11,13] and [14,21] are firstly translated by (5) and (6). [sent-107, score-0.147]

54 Their translations are then re-packaged by the rule of Reason- = = ; ; ; ; [10,13,21]. [sent-108, score-0.106]

55 Iteratively, the translations of span [1,9] and [10,21] are re-packaged by the rule of Antithesis-[0,9,21] to form the final translation. [sent-109, score-0.152]

56 In Figure 1, U1 and U2 of Reason node are firstly translated. [sent-111, score-0.04]

57 Then the translations of two spans of Antithesis node are re-ordered and constructed into the final translation. [sent-113, score-0.175]

58 In our decoders, language model(LM) is used for translating edus in Formula(5),(6),(7),(8), but not for reordering the upper spans because with the bottom-to-up combination, the spans become longer and harder to be judged by a traditional language model. [sent-116, score-0.357]

59 So we only use RST rules to guide the reordering. [sent-117, score-0.054]

60 1 Setup In order to do Chinese RST parser, we annotated over 1,000 complicated sentences on CTB (Xue et al. [sent-120, score-0.056]

61 We obtain the word alignment with the grow-diag-final-and strategy by GIZA++4. [sent-127, score-0.043]

62 For tuning and testing, we use NIST03 evaluation data as the development set, and extract the relatively long and complicated sentences from NIST04, NIST05 and CWMT085 evaluation data as the test set. [sent-133, score-0.056]

63 To create the baseline system, we use the toolkit Moses6 to build a phrase-based translation system. [sent-136, score-0.176]

64 (2009) have presented good results by dividing long and complicated sentences into subsentences only by punctuations during decoding, we re-implement their method for comparison. [sent-138, score-0.099]

65 The parsing errors mostly result from the segmentation errors, which are mainly caused by syntactic parsing errors. [sent-142, score-0.047]

66 On the other hand, the polysemous cue words, such as “而(but, and, thus)” may lead ambiguity for relation recognition, because they can be clues for different relations. [sent-143, score-0.24]

67 3 Results of Translation Table 3 presents the translation comparison results. [sent-149, score-0.176]

68 Observing and comparing the translation results, we find that our translation results are more readable by maintaining the semantic integrality of the edus and by giving more appreciate reorganization of the translated edus. [sent-161, score-0.562]

69 HomePage 373 6 Conclusion and Future Work In this paper, we present an RST-based translation framework for modeling semantic structures in translation model, so as to maintain the semantically functional integrity and hierarchical relations of edus during translating. [sent-171, score-0.67]

70 With respect to the existing models, we think our translation framework works more similarly to what human does, and we believe that this research is a crucial step towards discourse-oriented translation. [sent-172, score-0.207]

71 In the next step, we will study on the implicit discourse relations for Chinese and further modify the RST-based framework. [sent-173, score-0.155]

72 Besides, we will try to combine other current translation models such as syntactic model and hierarchical model into our framework. [sent-174, score-0.176]

73 Furthermore, the more accurate evaluation metric for discourse-oriented translation will be further studied. [sent-175, score-0.176]

74 A novel discourse parser based on support vector ma- chine classification. [sent-181, score-0.191]

75 Hilda: A discourse parser using support vector machine classification. [sent-195, score-0.191]

76 Rhetorical structure theory: Description and construction of text structures. [sent-204, score-0.037]

77 Rhetorical structure theory: A framework for the analysis of texts. [sent-208, score-0.068]

78 Rhetorical structure theory: Toward a functional theory of text organization. [sent-212, score-0.125]

79 Simple signals for complex rhetorics: On rhetorical analysis with rich-feature support vector models. [sent-221, score-0.213]

80 Sentence level discourse parsing using syntactic and lexical in- formation. [sent-225, score-0.124]

81 Extending machine translation evaluation metrics with lexical cohesion to document level. [sent-230, score-0.176]

82 The Penn Chinese treebank: Phrase structure annotation of a large corpus. [sent-244, score-0.037]

similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('rst', 0.559), ('rel', 0.273), ('rhetorical', 0.213), ('tr', 0.194), ('translation', 0.176), ('antithesis', 0.166), ('jfn', 0.166), ('edus', 0.163), ('cue', 0.16), ('rejrf', 0.133), ('chinese', 0.131), ('jre', 0.127), ('discourse', 0.124), ('ey', 0.121), ('rf', 0.114), ('formula', 0.109), ('jfx', 0.1), ('jfy', 0.1), ('spans', 0.08), ('hernault', 0.076), ('mann', 0.074), ('ex', 0.071), ('parser', 0.067), ('ecp', 0.066), ('fcp', 0.066), ('fpr', 0.066), ('independences', 0.066), ('jrf', 0.066), ('reitter', 0.066), ('marcu', 0.065), ('tree', 0.062), ('xiong', 0.06), ('duverle', 0.059), ('decoder', 0.058), ('complicated', 0.056), ('fp', 0.056), ('translations', 0.055), ('rules', 0.054), ('formulae', 0.054), ('soricut', 0.052), ('fn', 0.052), ('sandra', 0.051), ('rule', 0.051), ('source', 0.051), ('dtic', 0.051), ('gong', 0.051), ('integrity', 0.051), ('prendinger', 0.051), ('controls', 0.05), ('relation', 0.05), ('decoding', 0.048), ('segmentation', 0.047), ('translated', 0.047), ('mitsuru', 0.046), ('span', 0.046), ('theory', 0.046), ('probabilities', 0.045), ('xd', 0.045), ('punctuations', 0.043), ('alignment', 0.043), ('functional', 0.042), ('hugo', 0.041), ('node', 0.04), ('conditional', 0.039), ('structure', 0.037), ('operator', 0.036), ('reason', 0.036), ('helmut', 0.036), ('simplified', 0.035), ('translating', 0.034), ('hao', 0.034), ('wong', 0.032), ('ldc', 0.031), ('framework', 0.031), ('relations', 0.031), ('optimization', 0.031), ('smt', 0.03), ('clues', 0.03), ('xiao', 0.03), ('william', 0.03), ('billy', 0.029), ('dol', 0.029), ('enablement', 0.029), ('fnl', 0.029), ('hilda', 0.029), ('ishizuka', 0.029), ('ju', 0.029), ('mtu', 0.029), ('rhetorics', 0.029), ('semicolons', 0.029), ('shadow', 0.029), ('structs', 0.029), ('wenwen', 0.029), ('yzhou', 0.029), ('position', 0.029), ('xue', 0.029), ('target', 0.029), ('lm', 0.029), ('koehn', 0.028), ('approximately', 0.028)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99999934 16 acl-2013-A Novel Translation Framework Based on Rhetorical Structure Theory

Author: Mei Tu ; Yu Zhou ; Chengqing Zong

2 0.2400559 85 acl-2013-Combining Intra- and Multi-sentential Rhetorical Parsing for Document-level Discourse Analysis

Author: Shafiq Joty ; Giuseppe Carenini ; Raymond Ng ; Yashar Mehdad

Abstract: We propose a novel approach for developing a two-stage document-level discourse parser. Our parser builds a discourse tree by applying an optimal parsing algorithm to probabilities inferred from two Conditional Random Fields: one for intrasentential parsing and the other for multisentential parsing. We present two approaches to combine these two stages of discourse parsing effectively. A set of empirical evaluations over two different datasets demonstrates that our discourse parser significantly outperforms the stateof-the-art, often by a wide margin.

3 0.15242675 2 acl-2013-A Bayesian Model for Joint Unsupervised Induction of Sentiment, Aspect and Discourse Representations

Author: Angeliki Lazaridou ; Ivan Titov ; Caroline Sporleder

Abstract: We propose a joint model for unsupervised induction of sentiment, aspect and discourse information and show that by incorporating a notion of latent discourse relations in the model, we improve the prediction accuracy for aspect and sentiment polarity on the sub-sentential level. We deviate from the traditional view of discourse, as we induce types of discourse relations and associated discourse cues relevant to the considered opinion analysis task; consequently, the induced discourse relations play the role of opinion and aspect shifters. The quantitative analysis that we conducted indicated that the integration of a discourse model increased the prediction accuracy results with respect to the discourse-agnostic approach and the qualitative analysis suggests that the induced representations encode a meaningful discourse structure.

4 0.12062662 361 acl-2013-Travatar: A Forest-to-String Machine Translation Engine based on Tree Transducers

Author: Graham Neubig

Abstract: In this paper we describe Travatar, a forest-to-string machine translation (MT) engine based on tree transducers. It provides an open-source C++ implementation for the entire forest-to-string MT pipeline, including rule extraction, tuning, decoding, and evaluation. There are a number of options for model training, and tuning includes advanced options such as hypergraph MERT, and training of sparse features through online learning. The training pipeline is modeled after that of the popular Moses decoder, so users familiar with Moses should be able to get started quickly. We perform a validation experiment of the decoder on EnglishJapanese machine translation, and find that it is possible to achieve greater accuracy than translation using phrase-based and hierarchical-phrase-based translation. As auxiliary results, we also compare different syntactic parsers and alignment techniques that we tested in the process of developing the decoder. Travatar is available under the LGPL at http : / /phont ron . com/t ravat ar

5 0.11951277 314 acl-2013-Semantic Roles for String to Tree Machine Translation

Author: Marzieh Bazrafshan ; Daniel Gildea

Abstract: We experiment with adding semantic role information to a string-to-tree machine translation system based on the rule extraction procedure of Galley et al. (2004). We compare methods based on augmenting the set of nonterminals by adding semantic role labels, and altering the rule extraction process to produce a separate set of rules for each predicate that encompass its entire predicate-argument structure. Our results demonstrate that the second approach is effective in increasing the quality of translations.

6 0.11853765 229 acl-2013-Leveraging Synthetic Discourse Data via Multi-task Learning for Implicit Discourse Relation Recognition

7 0.11586376 80 acl-2013-Chinese Parsing Exploiting Characters

8 0.11368617 10 acl-2013-A Markov Model of Machine Translation using Non-parametric Bayesian Inference

9 0.11131318 193 acl-2013-Improving Chinese Word Segmentation on Micro-blog Using Rich Punctuations

10 0.11048167 223 acl-2013-Learning a Phrase-based Translation Model from Monolingual Data with Application to Domain Adaptation

11 0.10862643 164 acl-2013-FudanNLP: A Toolkit for Chinese Natural Language Processing

12 0.10336972 123 acl-2013-Discriminative Learning with Natural Annotations: Word Segmentation as a Case Study

13 0.099280782 44 acl-2013-An Empirical Examination of Challenges in Chinese Parsing

14 0.099185556 41 acl-2013-Aggregated Word Pair Features for Implicit Discourse Relation Disambiguation

15 0.09534736 7 acl-2013-A Lattice-based Framework for Joint Chinese Word Segmentation, POS Tagging and Parsing

16 0.093929455 71 acl-2013-Bootstrapping Entity Translation on Weakly Comparable Corpora

17 0.090991296 11 acl-2013-A Multi-Domain Translation Model Framework for Statistical Machine Translation

18 0.090915084 255 acl-2013-Name-aware Machine Translation

19 0.085525222 40 acl-2013-Advancements in Reordering Models for Statistical Machine Translation

20 0.085468367 320 acl-2013-Shallow Local Multi-Bottom-up Tree Transducers in Statistical Machine Translation

similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.214), (1, -0.125), (2, 0.004), (3, 0.089), (4, 0.03), (5, 0.066), (6, -0.031), (7, 0.034), (8, 0.009), (9, 0.147), (10, 0.082), (11, 0.063), (12, -0.037), (13, 0.089), (14, 0.061), (15, -0.084), (16, 0.102), (17, -0.111), (18, -0.108), (19, -0.125), (20, -0.0), (21, 0.054), (22, -0.018), (23, -0.085), (24, -0.086), (25, -0.017), (26, 0.071), (27, 0.077), (28, 0.073), (29, 0.057), (30, 0.056), (31, -0.054), (32, 0.051), (33, -0.035), (34, -0.006), (35, 0.03), (36, -0.081), (37, 0.016), (38, 0.038), (39, 0.02), (40, -0.023), (41, -0.001), (42, -0.023), (43, -0.032), (44, 0.009), (45, -0.037), (46, -0.091), (47, -0.045), (48, 0.002), (49, 0.016)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.91012818 16 acl-2013-A Novel Translation Framework Based on Rhetorical Structure Theory

Author: Mei Tu ; Yu Zhou ; Chengqing Zong

2 0.74506986 85 acl-2013-Combining Intra- and Multi-sentential Rhetorical Parsing for Document-level Discourse Analysis

Author: Shafiq Joty ; Giuseppe Carenini ; Raymond Ng ; Yashar Mehdad

3 0.67252642 229 acl-2013-Leveraging Synthetic Discourse Data via Multi-task Learning for Implicit Discourse Relation Recognition

Author: Man Lan ; Yu Xu ; Zhengyu Niu

Abstract: To overcome the shortage of labeled data for implicit discourse relation recognition, previous works attempted to automatically generate training data by removing explicit discourse connectives from sentences and then built models on these synthetic implicit examples. However, a previous study (Sporleder and Lascarides, 2008) showed that models trained on these synthetic data do not generalize very well to natural (i.e. genuine) implicit discourse data. In this work we revisit this issue and present a multi-task learning based system which can effectively use synthetic data for implicit discourse relation recognition. Results on PDTB data show that under the multi-task learning framework our models with the use of the prediction of explicit discourse connectives as auxiliary learning tasks, can achieve an averaged F1 improvement of 5.86% over baseline models.

4 0.63657331 41 acl-2013-Aggregated Word Pair Features for Implicit Discourse Relation Disambiguation

Author: Or Biran ; Kathleen McKeown

Abstract: We present a reformulation of the word pair features typically used for the task of disambiguating implicit relations in the Penn Discourse Treebank. Our word pair features achieve significantly higher performance than the previous formulation when evaluated without additional features. In addition, we present results for a full system using additional features which achieves close to state of the art performance without resorting to gold syntactic parses or to context outside the relation.

5 0.58966368 255 acl-2013-Name-aware Machine Translation

Author: Haibo Li ; Jing Zheng ; Heng Ji ; Qi Li ; Wen Wang

Abstract: We propose a Name-aware Machine Translation (MT) approach which can tightly integrate name processing into MT model, by jointly annotating parallel corpora, extracting name-aware translation grammar and rules, adding name phrase table and name translation driven decoding. Additionally, we also propose a new MT metric to appropriately evaluate the translation quality of informative words, by assigning different weights to different words according to their importance values in a document. Experiments on Chinese-English translation demonstrated the effectiveness of our approach on enhancing the quality of overall translation, name translation and word alignment over a high-quality MT baseline1 .

6 0.58283603 361 acl-2013-Travatar: A Forest-to-String Machine Translation Engine based on Tree Transducers

7 0.56441343 10 acl-2013-A Markov Model of Machine Translation using Non-parametric Bayesian Inference

8 0.56239051 180 acl-2013-Handling Ambiguities of Bilingual Predicate-Argument Structures for Statistical Machine Translation

9 0.55230713 71 acl-2013-Bootstrapping Entity Translation on Weakly Comparable Corpora

10 0.551337 312 acl-2013-Semantic Parsing as Machine Translation

11 0.54328668 92 acl-2013-Context-Dependent Multilingual Lexical Lookup for Under-Resourced Languages

12 0.53354234 2 acl-2013-A Bayesian Model for Joint Unsupervised Induction of Sentiment, Aspect and Discourse Representations

13 0.5171563 330 acl-2013-Stem Translation with Affix-Based Rule Selection for Agglutinative Languages

14 0.51459312 127 acl-2013-Docent: A Document-Level Decoder for Phrase-Based Statistical Machine Translation

15 0.51116616 320 acl-2013-Shallow Local Multi-Bottom-up Tree Transducers in Statistical Machine Translation

16 0.50720453 13 acl-2013-A New Syntactic Metric for Evaluation of Machine Translation

17 0.49733207 46 acl-2013-An Infinite Hierarchical Bayesian Model of Phrasal Translation

18 0.4952572 64 acl-2013-Automatically Predicting Sentence Translation Difficulty

19 0.48793021 314 acl-2013-Semantic Roles for String to Tree Machine Translation

20 0.48761567 137 acl-2013-Enlisting the Ghost: Modeling Empty Categories for Machine Translation

similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(0, 0.047), (6, 0.025), (11, 0.068), (16, 0.012), (24, 0.055), (26, 0.049), (28, 0.021), (35, 0.06), (42, 0.061), (48, 0.052), (70, 0.039), (84, 0.276), (88, 0.033), (90, 0.055), (95, 0.068)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.73768383 16 acl-2013-A Novel Translation Framework Based on Rhetorical Structure Theory

Author: Mei Tu ; Yu Zhou ; Chengqing Zong

2 0.6583091 297 acl-2013-Recognizing Partial Textual Entailment

Author: Omer Levy ; Torsten Zesch ; Ido Dagan ; Iryna Gurevych

Abstract: Textual entailment is an asymmetric relation between two text fragments that describes whether one fragment can be inferred from the other. It thus cannot capture the notion that the target fragment is “almost entailed” by the given text. The recently suggested idea of partial textual entailment may remedy this problem. We investigate partial entailment under the faceted entailment model and the possibility of adapting existing textual entailment methods to this setting. Indeed, our results show that these methods are useful for rec- ognizing partial entailment. We also provide a preliminary assessment of how partial entailment may be used for recognizing (complete) textual entailment.

3 0.64153934 9 acl-2013-A Lightweight and High Performance Monolingual Word Aligner

Author: Xuchen Yao ; Benjamin Van Durme ; Chris Callison-Burch ; Peter Clark

Abstract: Fast alignment is essential for many natural language tasks. But in the setting of monolingual alignment, previous work has not been able to align more than one sentence pair per second. We describe a discriminatively trained monolingual word aligner that uses a Conditional Random Field to globally decode the best alignment with features drawn from source and target sentences. Using just part-of-speech tags and WordNet as external resources, our aligner gives state-of-the-art result, while being an order-of-magnitude faster than the previous best performing system.

4 0.61791599 316 acl-2013-SenseSpotting: Never let your parallel data tie you to an old domain

Author: Marine Carpuat ; Hal Daume III ; Katharine Henry ; Ann Irvine ; Jagadeesh Jagarlamudi ; Rachel Rudinger

Abstract: Words often gain new senses in new domains. Being able to automatically identify, from a corpus of monolingual text, which word tokens are being used in a previously unseen sense has applications to machine translation and other tasks sensitive to lexical semantics. We define a task, SENSESPOTTING, in which we build systems to spot tokens that have new senses in new domain text. Instead of difficult and expensive annotation, we build a goldstandard by leveraging cheaply available parallel corpora, targeting our approach to the problem of domain adaptation for machine translation. Our system is able to achieve F-measures of as much as 80%, when applied to word types it has never seen before. Our approach is based on a large set of novel features that capture varied aspects of how words change when used in new domains.

5 0.52529061 155 acl-2013-Fast and Accurate Shift-Reduce Constituent Parsing

Author: Muhua Zhu ; Yue Zhang ; Wenliang Chen ; Min Zhang ; Jingbo Zhu

Abstract: Shift-reduce dependency parsers give comparable accuracies to their chartbased counterparts, yet the best shiftreduce constituent parsers still lag behind the state-of-the-art. One important reason is the existence of unary nodes in phrase structure trees, which leads to different numbers of shift-reduce actions between different outputs for the same input. This turns out to have a large empirical impact on the framework of global training and beam search. We propose a simple yet effective extension to the shift-reduce process, which eliminates size differences between action sequences in beam-search. Our parser gives comparable accuracies to the state-of-the-art chart parsers. With linear run-time complexity, our parser is over an order of magnitude faster than the fastest chart parser.

6 0.52439535 123 acl-2013-Discriminative Learning with Natural Annotations: Word Segmentation as a Case Study

7 0.52349216 174 acl-2013-Graph Propagation for Paraphrasing Out-of-Vocabulary Words in Statistical Machine Translation

8 0.51977187 226 acl-2013-Learning to Prune: Context-Sensitive Pruning for Syntactic MT

9 0.51816112 82 acl-2013-Co-regularizing character-based and word-based models for semi-supervised Chinese word segmentation

10 0.51785457 341 acl-2013-Text Classification based on the Latent Topics of Important Sentences extracted by the PageRank Algorithm

11 0.517281 196 acl-2013-Improving pairwise coreference models through feature space hierarchy learning

12 0.51714724 343 acl-2013-The Effect of Higher-Order Dependency Features in Discriminative Phrase-Structure Parsing

13 0.51678765 223 acl-2013-Learning a Phrase-based Translation Model from Monolingual Data with Application to Domain Adaptation

14 0.51652062 193 acl-2013-Improving Chinese Word Segmentation on Micro-blog Using Rich Punctuations

15 0.51578915 185 acl-2013-Identifying Bad Semantic Neighbors for Improving Distributional Thesauri

16 0.51503944 70 acl-2013-Bilingually-Guided Monolingual Dependency Grammar Induction

17 0.51455033 132 acl-2013-Easy-First POS Tagging and Dependency Parsing with Beam Search

18 0.51441383 267 acl-2013-PARMA: A Predicate Argument Aligner

19 0.51421958 276 acl-2013-Part-of-Speech Induction in Dependency Trees for Statistical Machine Translation

20 0.51387739 18 acl-2013-A Sentence Compression Based Framework to Query-Focused Multi-Document Summarization