acl acl2011 acl2011-200 knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Percy Liang ; Michael Jordan ; Dan Klein
Abstract: Compositional question answering begins by mapping questions to logical forms, but training a semantic parser to perform this mapping typically requires the costly annotation of the target logical forms. In this paper, we learn to map questions to answers via latent logical forms, which are induced automatically from question-answer pairs. In tackling this challenging learning problem, we introduce a new semantic representation which highlights a parallel between dependency syntax and efficient evaluation of logical forms. On two standard semantic parsing benchmarks (GEO and JOBS), our system obtains the highest published accuracies, despite requiring no annotated logical forms.
Reference: text
sentIndex sentText sentNum sentScore
1 Abstract Compositional question answering begins by mapping questions to logical forms, but training a semantic parser to perform this mapping typically requires the costly annotation of the target logical forms. [sent-8, score-0.999]
2 In this paper, we learn to map questions to answers via latent logical forms, which are induced automatically from question-answer pairs. [sent-9, score-0.51]
3 In tackling this challenging learning problem, we introduce a new semantic representation which highlights a parallel between dependency syntax and efficient evaluation of logical forms. [sent-10, score-0.495]
4 On two standard semantic parsing benchmarks (GEO and JOBS), our system obtains the highest published accuracies, despite requiring no annotated logical forms. [sent-11, score-0.49]
5 Answering these types of complex questions compositionally involves first mapping the questions into logical forms (semantic parsing). [sent-13, score-0.648]
6 Supervised semantic parsers (Zelle and Mooney, 1996; Tang and Mooney, 2001; Ge and Mooney, 2005; Zettlemoyer and Collins, 2005; Kate and Mooney, 2007; Zettlemoyer and Collins, 2007; Wong and Mooney, 2007; Kwiatkowski et al. [sent-14, score-0.09]
7 , 2010) rely on manual annotation of logical forms, which is expensive. [sent-15, score-0.375]
8 On the other hand, existing unsupervised semantic parsers (Poon and Domingos, 2009) do not handle deeper linguistic phenomena such as quantification, negation, and superlatives. [sent-16, score-0.133]
9 Like Clarke et al. (2010), we obviate the need for annotated logical forms by considering the end-to-end problem of mapping questions to answers. [sent-18, score-0.622]
10 However, we still model the logical form (now as a latent variable) to capture the complexities of language. [sent-19, score-0.445]
11 We represent logical forms z as labeled trees, induced automatically from (x, y) pairs. [sent-21, score-0.551]
12 We want to induce latent logical forms z (and parameters θ) given only question-answer pairs (x, y), which are much cheaper to obtain than (x, z) pairs. [sent-22, score-0.509]
13 The core problem that arises in this setting is program induction: finding a logical form z (over an exponentially large space of possibilities) that produces the target answer y. [sent-23, score-0.463]
14 Unlike standard semantic parsing, our end goal is only to generate the correct y, so we are free to choose the representation for z. [sent-24, score-0.053]
15 The dominant paradigm in compositional semantics is Montague semantics, which constructs lambda calculus forms in a bottom-up manner. [sent-26, score-0.447]
16 CCG is one instantiation (Steedman, 2000), which is used by many semantic parsers, e.g., Zettlemoyer and Collins (2005; 2007). [sent-27, score-0.053]
17 However, the logical forms there can become quite complex, and in the context of program induction, this would lead to an unwieldy search space. [sent-30, score-0.502]
18 Restricted representations, such as the one used by Clarke et al. (2010), are simpler but lack the full expressive power of lambda calculus. [sent-35, score-0.137]
19 The main technical contribution of this work is a new semantic representation, dependency-based compositional semantics (DCS), which is both simple and expressive (Section 2). [sent-36, score-0.346]
20 The logical forms in this framework are trees, which is desirable for two reasons: (i) they parallel syntactic dependency trees, which facilitates parsing and learning; and (ii) evaluating them to obtain the answer is computationally efficient. [sent-37, score-0.56]
21 Our system outperforms all existing systems despite using no annotated logical forms. [sent-39, score-0.375]
22 We present the basic version (Section 2.1) of dependency-based compositional semantics (DCS), which captures the core idea of using trees to represent formal semantics. [sent-41, score-0.369]
23 We then present the full version (Section 2.2), which handles linguistic phenomena such as quantification, where syntactic and semantic scope diverge. [sent-43, score-0.096]
24 We start with some definitions, using US geography as an example domain. [sent-44, score-0.032]
25 Let V be the set of all values, which includes primitives (e. [sent-45, score-0.035]
26 , 3, CA ∈ V) as well as sets and tuples formed from other values (e. [sent-47, score-0.045]
27 Let P be the set of predicates (e.g., state, count ∈ P), which are just symbols. [sent-52, score-0.031]
28 A world w is a mapping from each predicate p ∈ P to a set of tuples; for example, w(state) = {(CA), (OR), ...}. [sent-53, score-0.205]
29 Here each predicate is a relation (possibly infinite). [sent-61, score-0.192]
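To make this concrete, a minimal sketch of such a world in Python follows; the predicate names and tuples are illustrative stand-ins, not the paper's actual GEO database:

```python
# A world w: a mapping from each predicate symbol to a set of tuples.
# Toy US-geography world with made-up contents.
world = {
    "state": {("CA",), ("OR",), ("WA",)},
    "city": {("SF",), ("LA",), ("Portland",)},
    "major": {("SF",), ("LA",)},                       # "major" entities
    "loc": {("SF", "CA"), ("LA", "CA"), ("Portland", "OR")},
    "CA": {("CA",)},                                   # a constant as a unary predicate
}
```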
30 As another example, w(average) = {(S, x̄) : x̄ = |S1|⁻¹ Σ_{x∈S1} S(x)}, where a set of pairs S is treated as a set-valued function S(x) = {y : (x, y) ∈ S} with domain S1 = {x : (x, y) ∈ S}. [sent-66, score-0.038]
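Under this convention, the average predicate could be realized as a computed function rather than a stored table; the following sketch assumes (our simplification) that summing the set-valued S(x) means summing its members:

```python
def average_rel(S):
    """Realize w(average) as a function: treat the set of pairs S as a
    set-valued function S(x) = {y : (x, y) in S} with domain S1, and
    return the mean x̄ over the domain."""
    S1 = {x for (x, _) in S}
    total = sum(y for x in S1 for y in {b for (a, b) in S if a == x})
    return total / len(S1)

# e.g., average_rel({(1, 10.0), (2, 20.0)}) == 15.0
```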
31 The logical forms in DCS are called DCS trees, where nodes are labeled with predicates, and edges are labeled with relations. [sent-67, score-0.586]
32 Formally: Definition 1 (DCS trees) Let Z be the set of DCS trees, where each z ∈ Z consists of (i) a predicate [Table 1: Possible relations appearing on the edges of a DCS tree: j:j' (join), Σ (aggregate), Xi (execute), E (extract), Q (quantify), C (compare)] [sent-68, score-0.236]
33 and (ii) edges, each consisting of a relation r ∈ R (see Table 1) and a child tree. [sent-84, score-0.103]
34 Figure 2(a) shows an example of a DCS tree. [sent-90, score-0.032]
35 Although a DCS tree is a logical form, note that it looks like a syntactic dependency tree with predicates in place of words. [sent-92, score-0.605]
36 It is this transparency between syntax and semantics provided by DCS which leads to a simple and streamlined compositional semantics suitable for program induction. [sent-93, score-0.43]
37 2.1 Basic Version The basic version of DCS restricts R to join and aggregate relations (see Table 1). [sent-95, score-0.268]
38 Let us start by considering a DCS tree z with only join relations. [sent-96, score-0.216]
39 Such a z defines a constraint satisfaction problem (CSP) with nodes as variables. [sent-97, score-0.091]
40 The CSP has two types of constraints: (i) x ∈ w(p) for each node x labeled with predicate p ∈ P; and (ii) xj = yj' (the j-th component of x must equal the j'-th component of y) for each edge (x, y) labeled with j:j' ∈ R. [sent-98, score-0.407]
41 We say a value v is consistent for a node x if there exists a solution that assigns v to x. [sent-101, score-0.07]
42 The denotation ⟦z⟧w (z evaluated on w) is the set of [sent-102, score-0.289]
43 consistent values of the root node (see Figure 2 for an example). [sent-103, score-0.038]
44 Computation We can compute the denotation ⟦z⟧w of a DCS tree z by exploiting dynamic programming on trees (Dechter, 2003). [sent-105, score-0.405]
45 The recurrence is as follows: ⟦⟨p; j1:j1':c1; ··· ; jm:jm':cm⟩⟧w = w(p) ∩ ⋂_{i=1}^{m} {v : v_{ji} = t_{ji'}, t ∈ ⟦ci⟧w}. (1) [sent-106, score-0.031]
46 At each node, we compute the set of tuples v consistent with the predicate at that node (v ∈ w(p)), and [sent-107, score-0.193]
47 [Figure 2, example "major city in California": (a) DCS tree z = ⟨city; 1:1:⟨major⟩; 1:1:⟨loc; 2:1:⟨CA⟩⟩⟩; (b) lambda calculus formula λc.∃m∃ℓ∃s. city(c) ∧ major(m) ∧ loc(ℓ) ∧ CA(s) ∧ c1=m1 ∧ c1=ℓ1 ∧ ℓ2=s1; (c) denotation ⟦z⟧w = {SF, LA, ...}] [sent-108, score-0.038]
48 Figure 2: An example of a DCS tree (written in both the mathematical and graphical notation). [sent-111, score-0.068]
49 Each node is labeled with a predicate, and each edge is labeled with a relation. [sent-112, score-0.212]
50 (b) A DCS tree z with only join relations encodes a constraint satisfaction problem. [sent-113, score-0.307]
51 for each child i, the ji-th component of v must equal the ji'-th component of some t in the child's denotation (t ∈ ⟦ci⟧w). [sent-115, score-0.321]
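In code, the join-only recurrence (1) might look like the following sketch; the DCSTree container and the 1-indexed components are our own scaffolding, and the toy world from the earlier sketch is assumed:

```python
from dataclasses import dataclass, field

@dataclass
class DCSTree:
    """A DCS tree: a predicate plus edges; each edge is (j, j', child)."""
    pred: str
    edges: list = field(default_factory=list)

def denotation(z, world):
    """Bottom-up evaluation of [[z]]_w for join-only trees, per Eq. (1)."""
    result = set(world[z.pred])
    for j, j_prime, child in z.edges:
        child_den = denotation(child, world)
        # Keep tuples v whose j-th component equals the j'-th component
        # of some tuple t in the child's denotation (1-indexed as in the paper).
        result = {v for v in result
                  if any(v[j - 1] == t[j_prime - 1] for t in child_den)}
    return result

# Figure 2's "major city in California":
z = DCSTree("city", [
    (1, 1, DCSTree("major")),
    (1, 1, DCSTree("loc", [(2, 1, DCSTree("CA"))])),
])
print(denotation(z, world))  # {('SF',), ('LA',)} in the toy world
```

Because each node is visited once and only adjacent nodes interact, evaluation time is linear in the number of nodes (times the cost of the per-edge joins), which is the efficiency the tree structure buys.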
52 Now the dual importance of trees in DCS is clear: We have seen that trees parallel syntactic dependency structure, which will facilitate parsing. [sent-117, score-0.282]
53 In addition, trees enable efficient computation, thereby establishing a new connection between dependency syntax and efficient semantic evaluation. [sent-118, score-0.244]
54 Aggregate relation DCS trees that only use join relations can represent arbitrarily complex compositional structures, but they cannot capture higher-order phenomena in language. [sent-119, score-0.539]
55 For example, consider the phrase number of major cities, and suppose that number corresponds to the count predicate. [sent-120, score-0.031]
56 It is impossible to represent the semantics of this phrase with just a CSP, so we introduce a new aggregate relation, notated Σ. [sent-121, score-0.198]
57 Consider a tree ⟨Σ:c⟩, whose root is connected to a child c via the aggregate relation Σ. [sent-122, score-0.134]
58 If the denotation of c is a set of values s, the parent's denotation is then a singleton set containing s. [sent-123, score-0.249]
59 Formally, ⟦⟨Σ:c⟩⟧w = {⟦c⟧w}; the denotation of the middle node is {s} in this example. [sent-126,
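A minimal way to graft the aggregate relation onto the join-only evaluator above (a sketch; packaging the child's denotation as a frozenset, and realizing count lazily rather than as a stored infinite relation, are our simplifications):

```python
def aggregate(child_denotation):
    """[[<Σ:c>]]_w = {[[c]]_w}: wrap the child's denotation as a single value."""
    return {(frozenset(child_denotation),)}

# "number of major cities": aggregate the set of major cities, then apply
# count, conceptually the infinite relation {(S, |S|)}.
majors = DCSTree("city", [(1, 1, DCSTree("major"))])
agg = aggregate(denotation(majors, world))
print({(len(s),) for (s,) in agg})  # {(2,)} in the toy world
```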
wordName wordTfidf (topN-words)
[('dcs', 0.638), ('logical', 0.375), ('denotation', 0.213), ('csp', 0.17), ('compositional', 0.16), ('join', 0.148), ('jzkw', 0.128), ('trees', 0.124), ('predicate', 0.123), ('mooney', 0.117), ('forms', 0.095), ('uc', 0.089), ('lambda', 0.089), ('geo', 0.085), ('jcikw', 0.085), ('zettlemoyer', 0.085), ('semantics', 0.085), ('aggregate', 0.082), ('node', 0.07), ('tree', 0.068), ('calculus', 0.065), ('quantification', 0.065), ('questions', 0.064), ('kate', 0.062), ('benchmarks', 0.062), ('predicates', 0.06), ('jobs', 0.059), ('satisfaction', 0.057), ('answer', 0.056), ('semantic', 0.053), ('clarke', 0.052), ('mapping', 0.05), ('labeled', 0.049), ('expressive', 0.048), ('berkeley', 0.048), ('tuples', 0.045), ('edge', 0.044), ('phenomena', 0.043), ('edges', 0.041), ('collins', 0.04), ('ii', 0.04), ('latent', 0.039), ('zelle', 0.038), ('oant', 0.038), ('ala', 0.038), ('nths', 0.038), ('denotations', 0.038), ('zs', 0.038), ('mantics', 0.038), ('foo', 0.038), ('aith', 0.038), ('alu', 0.038), ('cel', 0.038), ('fig', 0.038), ('iani', 0.038), ('obviate', 0.038), ('sts', 0.038), ('parsers', 0.037), ('child', 0.036), ('component', 0.036), ('notation', 0.036), ('kwiatkowski', 0.035), ('ves', 0.035), ('aend', 0.035), ('jh', 0.035), ('ero', 0.035), ('rela', 0.035), ('enx', 0.035), ('primitives', 0.035), ('transparency', 0.035), ('relations', 0.034), ('dependency', 0.034), ('nodes', 0.034), ('syntax', 0.033), ('wor', 0.032), ('rfe', 0.032), ('apn', 0.032), ('geography', 0.032), ('rea', 0.032), ('wz', 0.032), ('tahe', 0.032), ('world', 0.032), ('answering', 0.032), ('induced', 0.032), ('tr', 0.032), ('ca', 0.032), ('program', 0.032), ('complexities', 0.031), ('dto', 0.031), ('ato', 0.031), ('execute', 0.031), ('notated', 0.031), ('tang', 0.031), ('sre', 0.031), ('recurrence', 0.031), ('count', 0.031), ('nal', 0.03), ('higherorder', 0.03), ('hp', 0.03), ('ccl', 0.028)]
simIndex simValue paperId paperTitle
same-paper 1 0.99999988 200 acl-2011-Learning Dependency-Based Compositional Semantics
Author: Percy Liang ; Michael Jordan ; Dan Klein
Abstract: Compositional question answering begins by mapping questions to logical forms, but training a semantic parser to perform this mapping typically requires the costly annotation of the target logical forms. In this paper, we learn to map questions to answers via latent logical forms, which are induced automatically from question-answer pairs. In tackling this challenging learning problem, we introduce a new semantic representation which highlights a parallel between dependency syntax and efficient evaluation of logical forms. On two standard semantic parsing benchmarks (GEO and JOBS), our system obtains the highest published accuracies, despite requiring no annotated logical forms.
2 0.1639501 79 acl-2011-Confidence Driven Unsupervised Semantic Parsing
Author: Dan Goldwasser ; Roi Reichart ; James Clarke ; Dan Roth
Abstract: Current approaches for semantic parsing take a supervised approach requiring a considerable amount of training data which is expensive and difficult to obtain. This supervision bottleneck is one of the major difficulties in scaling up semantic parsing. We argue that a semantic parser can be trained effectively without annotated data, and introduce an unsupervised learning algorithm. The algorithm takes a self training approach driven by confidence estimation. Evaluated over Geoquery, a standard dataset for this task, our system achieved 66% accuracy, compared to 80% of its fully supervised counterpart, demonstrating the promise of unsupervised approaches for this task.
3 0.11689694 3 acl-2011-A Bayesian Model for Unsupervised Semantic Parsing
Author: Ivan Titov ; Alexandre Klementiev
Abstract: We propose a non-parametric Bayesian model for unsupervised semantic parsing. Following Poon and Domingos (2009), we consider a semantic parsing setting where the goal is to (1) decompose the syntactic dependency tree of a sentence into fragments, (2) assign each of these fragments to a cluster of semantically equivalent syntactic structures, and (3) predict predicate-argument relations between the fragments. We use hierarchical Pitman-Yor processes to model statistical dependencies between meaning representations of predicates and those of their arguments, as well as the clusters of their syntactic realizations. We develop a modification of the Metropolis-Hastings split-merge sampler, resulting in an efficient inference algorithm for the model. The method is experimentally evaluated by using the induced semantic representation for the question answering task in the biomedical domain.
4 0.083356321 190 acl-2011-Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations
Author: Raphael Hoffmann ; Congle Zhang ; Xiao Ling ; Luke Zettlemoyer ; Daniel S. Weld
Abstract: Information extraction (IE) holds the promise of generating a large-scale knowledge base from the Web’s natural language text. Knowledge-based weak supervision, using structured data to heuristically label a training corpus, works towards this goal by enabling the automated learning of a potentially unbounded number of relation extractors. Recently, researchers have developed multiinstance learning algorithms to combat the noisy training data that can come from heuristic labeling, but their models assume relations are disjoint — for example they cannot extract the pair Founded(Jobs, Apple) and CEO-of(Jobs, Apple). This paper presents a novel approach for multi-instance learning with overlapping relations that combines a sentence-level extraction model with a simple, corpus-level component for aggregating the individual facts. We apply our model to learn extractors for NY Times text using weak supervision from Freebase. Experiments show that the approach runs quickly and yields surprising gains in accuracy, at both the aggregate and sentence level.
5 0.064055346 206 acl-2011-Learning to Transform and Select Elementary Trees for Improved Syntax-based Machine Translations
Author: Bing Zhao ; Young-Suk Lee ; Xiaoqiang Luo ; Liu Li
Abstract: We propose a novel technique of learning how to transform the source parse trees to improve the translation qualities of syntax-based translation models using synchronous context-free grammars. We transform the source tree phrasal structure into a set of simpler structures, expose such decisions to the decoding process, and find the least expensive transformation operation to better model word reordering. In particular, we integrate synchronous binarizations, verb regrouping, removal of redundant parse nodes, and incorporate a few important features such as translation boundaries. We learn the structural preferences from the data in a generative framework. The syntax-based translation system integrating the proposed techniques outperforms the best Arabic-English unconstrained system in NIST08 evaluations by 1.3 absolute BLEU, which is statistically significant.
6 0.063051529 269 acl-2011-Scaling up Automatic Cross-Lingual Semantic Role Annotation
7 0.061942592 144 acl-2011-Global Learning of Typed Entailment Rules
9 0.057505306 170 acl-2011-In-domain Relation Discovery with Meta-constraints via Posterior Regularization
10 0.052872609 169 acl-2011-Improving Question Recommendation by Exploiting Information Need
11 0.052183777 28 acl-2011-A Statistical Tree Annotator and Its Applications
12 0.052108794 167 acl-2011-Improving Dependency Parsing with Semantic Classes
13 0.051091924 30 acl-2011-Adjoining Tree-to-String Translation
14 0.050603725 282 acl-2011-Shift-Reduce CCG Parsing
15 0.047107413 39 acl-2011-An Ensemble Model that Combines Syntactic and Semantic Clustering for Discriminative Dependency Parsing
16 0.047042422 166 acl-2011-Improving Decoding Generalization for Tree-to-String Translation
17 0.046581592 25 acl-2011-A Simple Measure to Assess Non-response
18 0.046335969 322 acl-2011-Unsupervised Learning of Semantic Relation Composition
19 0.045174912 8 acl-2011-A Corpus of Scope-disambiguated English Text
20 0.043366965 173 acl-2011-Insertion Operator for Bayesian Tree Substitution Grammars
topicId topicWeight
[(0, 0.119), (1, -0.014), (2, -0.047), (3, -0.074), (4, 0.011), (5, -0.016), (6, -0.02), (7, 0.018), (8, -0.035), (9, -0.053), (10, 0.037), (11, 0.002), (12, 0.027), (13, 0.034), (14, -0.034), (15, -0.064), (16, -0.052), (17, -0.116), (18, -0.031), (19, -0.007), (20, 0.013), (21, 0.047), (22, -0.052), (23, 0.006), (24, 0.015), (25, -0.058), (26, -0.053), (27, -0.027), (28, 0.008), (29, -0.002), (30, -0.047), (31, -0.023), (32, 0.05), (33, 0.022), (34, 0.034), (35, 0.007), (36, -0.066), (37, -0.088), (38, 0.068), (39, -0.089), (40, -0.066), (41, -0.05), (42, -0.013), (43, -0.045), (44, 0.009), (45, -0.062), (46, -0.051), (47, -0.048), (48, -0.047), (49, -0.042)]
simIndex simValue paperId paperTitle
same-paper 1 0.94806987 200 acl-2011-Learning Dependency-Based Compositional Semantics
Author: Percy Liang ; Michael Jordan ; Dan Klein
Abstract: Compositional question answering begins by mapping questions to logical forms, but training a semantic parser to perform this mapping typically requires the costly annotation of the target logical forms. In this paper, we learn to map questions to answers via latent logical forms, which are induced automatically from question-answer pairs. In tackling this challenging learning problem, we introduce a new semantic representation which highlights a parallel between dependency syntax and efficient evaluation of logical forms. On two standard semantic parsing benchmarks (GEO and JOBS), our system obtains the highest published accuracies, despite requiring no annotated logical forms.
2 0.64904004 322 acl-2011-Unsupervised Learning of Semantic Relation Composition
Author: Eduardo Blanco ; Dan Moldovan
Abstract: This paper presents an unsupervised method for deriving inference axioms by composing semantic relations. The method is independent of any particular relation inventory. It relies on describing semantic relations using primitives and manipulating these primitives according to an algebra. The method was tested using a set of eight semantic relations yielding 78 inference axioms which were evaluated over PropBank.
3 0.62475777 3 acl-2011-A Bayesian Model for Unsupervised Semantic Parsing
Author: Ivan Titov ; Alexandre Klementiev
Abstract: We propose a non-parametric Bayesian model for unsupervised semantic parsing. Following Poon and Domingos (2009), we consider a semantic parsing setting where the goal is to (1) decompose the syntactic dependency tree of a sentence into fragments, (2) assign each of these fragments to a cluster of semantically equivalent syntactic structures, and (3) predict predicate-argument relations between the fragments. We use hierarchical Pitman-Yor processes to model statistical dependencies between meaning representations of predicates and those of their arguments, as well as the clusters of their syntactic realizations. We develop a modification of the Metropolis-Hastings split-merge sampler, resulting in an efficient inference algorithm for the model. The method is experimentally evaluated by using the induced semantic representation for the question answering task in the biomedical domain.
4 0.60556656 79 acl-2011-Confidence Driven Unsupervised Semantic Parsing
Author: Dan Goldwasser ; Roi Reichart ; James Clarke ; Dan Roth
Abstract: Current approaches for semantic parsing take a supervised approach requiring a considerable amount of training data which is expensive and difficult to obtain. This supervision bottleneck is one of the major difficulties in scaling up semantic parsing. We argue that a semantic parser can be trained effectively without annotated data, and introduce an unsupervised learning algorithm. The algorithm takes a self training approach driven by confidence estimation. Evaluated over Geoquery, a standard dataset for this task, our system achieved 66% accuracy, compared to 80% of its fully supervised counterpart, demonstrating the promise of unsupervised approaches for this task.
5 0.59071428 25 acl-2011-A Simple Measure to Assess Non-response
Author: Anselmo Penas ; Alvaro Rodrigo
Abstract: There are several tasks where is preferable not responding than responding incorrectly. This idea is not new, but despite several previous attempts there isn’t a commonly accepted measure to assess non-response. We study here an extension of accuracy measure with this feature and a very easy to understand interpretation. The measure proposed (c@1) has a good balance of discrimination power, stability and sensitivity properties. We show also how this measure is able to reward systems that maintain the same number of correct answers and at the same time decrease the number of incorrect ones, by leaving some questions unanswered. This measure is well suited for tasks such as Reading Comprehension tests, where multiple choices per question are given, but only one is correct.
6 0.53623301 294 acl-2011-Temporal Evaluation
7 0.52800679 169 acl-2011-Improving Question Recommendation by Exploiting Information Need
8 0.50374168 174 acl-2011-Insights from Network Structure for Text Mining
10 0.47191975 170 acl-2011-In-domain Relation Discovery with Meta-constraints via Posterior Regularization
11 0.44987264 269 acl-2011-Scaling up Automatic Cross-Lingual Semantic Role Annotation
12 0.44842756 324 acl-2011-Unsupervised Semantic Role Induction via Split-Merge Clustering
13 0.44443455 28 acl-2011-A Statistical Tree Annotator and Its Applications
14 0.43533581 190 acl-2011-Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations
15 0.42231056 320 acl-2011-Unsupervised Discovery of Domain-Specific Knowledge from Text
17 0.41601002 284 acl-2011-Simple Unsupervised Grammar Induction from Raw Text with Cascaded Finite State Models
18 0.41371134 68 acl-2011-Classifying arguments by scheme
19 0.40245712 262 acl-2011-Relation Guided Bootstrapping of Semantic Lexicons
20 0.3974705 295 acl-2011-Temporal Restricted Boltzmann Machines for Dependency Parsing
topicId topicWeight
[(5, 0.02), (17, 0.073), (26, 0.026), (37, 0.115), (39, 0.04), (41, 0.039), (55, 0.045), (59, 0.049), (67, 0.257), (72, 0.019), (91, 0.101), (96, 0.114)]
simIndex simValue paperId paperTitle
same-paper 1 0.81568533 200 acl-2011-Learning Dependency-Based Compositional Semantics
Author: Percy Liang ; Michael Jordan ; Dan Klein
Abstract: Compositional question answering begins by mapping questions to logical forms, but training a semantic parser to perform this mapping typically requires the costly annotation of the target logical forms. In this paper, we learn to map questions to answers via latent logical forms, which are induced automatically from question-answer pairs. In tackling this challenging learning problem, we introduce a new semantic representation which highlights a parallel between dependency syntax and efficient evaluation of logical forms. On two standard semantic parsing benchmarks (GEO and JOBS), our system obtains the highest published accuracies, despite requiring no annotated logical forms.
2 0.60395914 126 acl-2011-Exploiting Syntactico-Semantic Structures for Relation Extraction
Author: Yee Seng Chan ; Dan Roth
Abstract: In this paper, we observe that there exists a second dimension to the relation extraction (RE) problem that is orthogonal to the relation type dimension. We show that most of these second dimensional structures are relatively constrained and not difficult to identify. We propose a novel algorithmic approach to RE that starts by first identifying these structures and then, within these, identifying the semantic type of the relation. In the real RE problem where relation arguments need to be identified, exploiting these structures also allows reducing pipelined propagated errors. We show that this RE framework provides significant improvement in RE performance.
3 0.60222745 79 acl-2011-Confidence Driven Unsupervised Semantic Parsing
Author: Dan Goldwasser ; Roi Reichart ; James Clarke ; Dan Roth
Abstract: Current approaches for semantic parsing take a supervised approach requiring a considerable amount of training data which is expensive and difficult to obtain. This supervision bottleneck is one of the major difficulties in scaling up semantic parsing. We argue that a semantic parser can be trained effectively without annotated data, and introduce an unsupervised learning algorithm. The algorithm takes a self training approach driven by confidence estimation. Evaluated over Geoquery, a standard dataset for this task, our system achieved 66% accuracy, compared to 80% of its fully supervised counterpart, demonstrating the promise of unsupervised approaches for this task.
4 0.6021806 313 acl-2011-Two Easy Improvements to Lexical Weighting
Author: David Chiang ; Steve DeNeefe ; Michael Pust
Abstract: We introduce two simple improvements to the lexical weighting features of Koehn, Och, and Marcu (2003) for machine translation: one which smooths the probability of translating word f to word e by simplifying English morphology, and one which conditions it on the kind of training data that f and e co-occurred in. These new variations lead to improvements of up to +0.8 BLEU, with an average improvement of +0.6 BLEU across two language pairs, two genres, and two translation systems.
5 0.59939259 331 acl-2011-Using Large Monolingual and Bilingual Corpora to Improve Coordination Disambiguation
Author: Shane Bergsma ; David Yarowsky ; Kenneth Church
Abstract: Resolving coordination ambiguity is a classic hard problem. This paper looks at coordination disambiguation in complex noun phrases (NPs). Parsers trained on the Penn Treebank are reporting impressive numbers these days, but they don’t do very well on this problem (79%). We explore systems trained using three types of corpora: (1) annotated (e.g. the Penn Treebank), (2) bitexts (e.g. Europarl), and (3) unannotated monolingual (e.g. Google N-grams). Size matters: (1) is a million words, (2) is potentially billions of words and (3) is potentially trillions of words. The unannotated monolingual data is helpful when the ambiguity can be resolved through associations among the lexical items. The bilingual data is helpful when the ambiguity can be resolved by the order of words in the translation. We train separate classifiers with monolingual and bilingual features and iteratively improve them via co-training. The co-trained classifier achieves close to 96% accuracy on Treebank data and makes 20% fewer errors than a supervised system trained with Treebank annotations.
6 0.59849989 108 acl-2011-EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar
7 0.59570521 277 acl-2011-Semi-supervised Relation Extraction with Large-scale Word Clustering
8 0.59409845 86 acl-2011-Coreference for Learning to Extract Relations: Yes Virginia, Coreference Matters
9 0.59346581 145 acl-2011-Good Seed Makes a Good Crop: Accelerating Active Learning Using Language Modeling
10 0.59327513 85 acl-2011-Coreference Resolution with World Knowledge
11 0.59312654 241 acl-2011-Parsing the Internal Structure of Words: A New Paradigm for Chinese Word Segmentation
12 0.59181499 119 acl-2011-Evaluating the Impact of Coder Errors on Active Learning
13 0.59168893 170 acl-2011-In-domain Relation Discovery with Meta-constraints via Posterior Regularization
14 0.59045398 262 acl-2011-Relation Guided Bootstrapping of Semantic Lexicons
15 0.59014547 186 acl-2011-Joint Training of Dependency Parsing Filters through Latent Support Vector Machines
16 0.58964735 324 acl-2011-Unsupervised Semantic Role Induction via Split-Merge Clustering
17 0.5895806 190 acl-2011-Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations
18 0.58925039 103 acl-2011-Domain Adaptation by Constraining Inter-Domain Variability of Latent Feature Representation
19 0.58860755 284 acl-2011-Simple Unsupervised Grammar Induction from Raw Text with Cascaded Finite State Models
20 0.5881629 110 acl-2011-Effective Use of Function Words for Rule Generalization in Forest-Based Translation