acl acl2011 acl2011-167 knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Eneko Agirre ; Kepa Bengoetxea ; Koldo Gojenola ; Joakim Nivre
Abstract: This paper presents the introduction of WordNet semantic classes in a dependency parser, obtaining improvements on the full Penn Treebank for the first time. We tried different combinations of some basic semantic classes and word sense disambiguation algorithms. Our experiments show that selecting the adequate combination of semantic features on development data is key for success. Given the basic nature of the semantic classes and word sense disambiguation algorithms used, we think there is ample room for future improvements. 1
Reference: text
sentIndex sentText sentNum sentScore
1 + Abstract This paper presents the introduction of WordNet semantic classes in a dependency parser, obtaining improvements on the full Penn Treebank for the first time. [sent-8, score-0.737]
2 We tried different combinations of some basic semantic classes and word sense disambiguation algorithms. [sent-9, score-0.725]
3 Our experiments show that selecting the adequate combination of semantic features on development data is key for success. [sent-10, score-0.366]
4 Given the basic nature of the semantic classes and word sense disambiguation algorithms used, we think there is ample room for future improvements. [sent-11, score-0.704]
5 In principle, dependency parsing offers good prospects for experimenting with word-to-word semantic relationships. [sent-14, score-0.354]
6 We present a set of experiments using semantic classes in dependency parsing of the Penn Treebank (PTB). [sent-15, score-0.706]
7 Our work follows Agirre et al. (2008), who used different types of semantic information, obtaining significant improvements in two constituency parsers and showing how semantic information helps in constituency parsing. [sent-17, score-0.743]
8 We will evaluate the parser on both the full PTB (Marcus et al. [sent-19, score-0.201]
9 1993) and on a sense-annotated subset of the Brown Corpus portion of PTB, in order to investigate the upper bound performance of the models given gold-standard sense information, as in Agirre et al. [sent-20, score-0.293]
10 Agirre et al. (2008) trained two state-of-the-art statistical parsers (Charniak, 2000; Bikel, 2004) on semantically-enriched input, where content words had been substituted with their semantic classes. [sent-23, score-0.325]
11 This was done to overcome the limitations of lexicalized approaches to parsing (Magerman, 1995; Collins, 1996; Charniak, 1997; Collins, 2003), where related words, like scissors and knife, cannot be generalized over. [sent-24, score-0.275]
12 This simple method allowed incorporating lexical semantic information into the parser. [sent-25, score-0.323]
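The substitution step described above can be sketched in a few lines of Python. The mini-lexicon here is a hypothetical stand-in for the WordNet class mapping, since the real lookup goes through synsets and semantic files:

```python
# Sketch of the substitution approach of Agirre et al. (2008): content words
# are replaced by a coarse semantic class before training the parser.
# This mini-lexicon is illustrative, not the real WordNet mapping.
SEMANTIC_FILE = {
    "scissors": "ARTIFACT",
    "knife": "ARTIFACT",
    "cutter": "ARTIFACT",
    "dog": "ANIMAL",
}

def substitute(tokens):
    """Replace each content word by its semantic class, if known;
    other tokens pass through unchanged."""
    return [SEMANTIC_FILE.get(tok.lower(), tok) for tok in tokens]
```

Trained on such input, related wordforms like scissors and knife share the same terminal symbol, which is exactly what lets the parser generalize across them.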
13 They tested the parsers in both a full parsing and a PP attachment context. [sent-26, score-0.472]
14 The experiments showed that semantic classes gave significant improvement relative to the baseline, demonstrating that a simplistic approach to incorporating lexical semantics into a parser significantly improves its performance. [sent-27, score-0.548]
15 This work presented the first results over both WordNet and the Penn Treebank to show that semantic processing helps parsing. [sent-28, score-0.273]
16 Collins (2000) tested a combined parsing/word sense disambiguation model based on WordNet, which did not obtain improvements in parsing. [sent-29, score-0.379]
17 Koo et al. (2008) presented a semi-supervised method for training dependency parsers, using word clusters derived from a large unannotated corpus as features. [sent-31, score-0.255]
18 They demonstrate the effectiveness of the approach in a series of dependency parsing experiments on PTB and the Prague Dependency Treebank, showing that the cluster-based features yield substantial gains in performance across a wide range of conditions. [sent-32, score-0.449]
19 Ciaramita and Attardi (2007) show that adding semantic features extracted by a named entity tagger (such as PERSON or MONEY) improves the accuracy of a dependency parser, yielding a 5. [sent-37, score-0.470]
20 Candito and Seddah (2010) performed experiments in statistical parsing of French, where terminal forms were replaced by more general symbols, particularly clusters of words obtained through unsupervised clustering. [sent-39, score-0.214]
21 The results showed that word clusters had a positive effect. [sent-40, score-0.057]
22 Regarding dependency parsing of the English PTB, Koo and Collins (2010) and Zhang and Nivre (2011) currently hold the best results, with 93. [sent-41, score-0.315]
23 Apart from these, there have been other attempts to make use of semantic information in different frameworks and languages, as in (Hektoen 1997; Xiong et al. [sent-45, score-0.273]
24 3 Experimental Framework In this section we will briefly describe the data-driven parser used for the experiments (subsection 3. [sent-48, score-0.107]
25 Finally, we will describe the types of semantic representation used in the experiments. [sent-51, score-0.326]
26 MaltParser (Nivre et al., 2006) is a trainable dependency parser that has been successfully applied to typologically different languages and treebanks. [sent-54, score-0.265]
27 The parser deterministically obtains a dependency tree in linear time in a single pass over the input, using two main data structures: a stack of partially analyzed items and the remaining input sequence. [sent-57, score-0.265]
28 To determine the best action at each step, the parser uses history-based feature models and SVM classifiers. [sent-58, score-0.107]
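The deterministic single-pass procedure can be illustrated with a minimal arc-standard transition loop. MaltParser itself supports several transition systems, and the guide below is a hand-written stand-in for its history-based SVM classifier, so this is only a sketch:

```python
def parse(words, guide):
    """Minimal arc-standard transition loop: a stack of partially analyzed
    items and an input buffer; `guide` picks the next action at each step
    (standing in for MaltParser's learned classifier)."""
    heads = {}                       # dependent index -> head index
    stack, buf = [], list(range(len(words)))
    while buf or len(stack) > 1:
        action = guide(stack, buf, words)
        if action == "SHIFT":
            stack.append(buf.pop(0))
        elif action == "LEFT-ARC":   # stack[-2] becomes a dependent of stack[-1]
            dep = stack.pop(-2)
            heads[dep] = stack[-1]
        elif action == "RIGHT-ARC":  # stack[-1] becomes a dependent of stack[-2]
            dep = stack.pop()
            heads[dep] = stack[-1]
    return heads

def toy_guide(stack, buf, words):
    # A hand-written stand-in for the learned classifier.
    if len(stack) >= 2 and words[stack[-2]] == "the":
        return "LEFT-ARC"            # determiner depends on the noun
    if len(stack) >= 2 and not buf:
        return "LEFT-ARC"            # attach what remains leftward
    return "SHIFT"
```

Each token is shifted and attached exactly once, which is what makes the procedure linear-time.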
29 One of the main reasons for using MaltParser for our experiments is that it easily allows the introduction of semantic information, adding new features and incorporating them in the training model. [sent-59, score-0.323]
30 3.2 Dataset We used two different datasets: the full PTB and the Semcor/PTB intersection (Agirre et al. [sent-61, score-0.153]
31 The full PTB allows for comparison with the state-of-the-art, and we followed the usual train-test split. [sent-63, score-0.094]
32 The Semcor/PTB intersection contains both gold-standard sense and parse tree annotations, and allows us to set an upper bound on the relative impact of a given semantic representation on parsing. [sent-64, score-0.678]
33 We will evaluate the parser via Labeled Attachment Score (LAS). [sent-68, score-0.107]
34 We will use Bikel’s randomized parsing evaluation comparator to test the statistical significance of the results using word sense information, relative to the respective baseline parser using only standard features. [sent-69, score-0.501]
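A paired randomization test of this kind can be sketched as follows. This is an illustrative reimplementation of the idea behind Bikel's comparator, not his actual code, and the per-sentence scores are assumed inputs:

```python
import random

def randomized_test(scores_a, scores_b, trials=10000, seed=0):
    """Paired randomization test: per-sentence scores of the two parsers
    are randomly swapped, and we count how often the shuffled difference
    is at least as large as the observed one.  Returns a smoothed p-value."""
    rng = random.Random(seed)
    observed = abs(sum(scores_a) - sum(scores_b))
    hits = 0
    for _ in range(trials):
        diff = 0.0
        for a, b in zip(scores_a, scores_b):
            # Swapping a sentence's two scores flips the sign of its difference.
            diff += (a - b) if rng.random() < 0.5 else (b - a)
        if abs(diff) >= observed:
            hits += 1
    return (hits + 1) / (trials + 1)
```

A small p-value indicates that the score difference between the semantically-enriched parser and the baseline is unlikely to be due to chance.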
35 We used PennConverter (Johansson and Nugues, 2007) to convert constituent trees in the Penn Treebank annotation style into dependency trees. [sent-70, score-0.158]
36 Although in general the results from parsing PennConverter’s output are lower than with other conversions, Johansson and Nugues (2007) claim that this conversion is better suited for semantic processing, with a richer structure and a more fine-grained set of dependency labels. [sent-71, score-0.634]
37 3.3 Semantic representation and disambiguation methods We will experiment with the range of semantic representations used in Agirre et al. [sent-75, score-0.402]
38 Words in WordNet (Fellbaum, 1998) are organized into sets of synonyms, called synsets (SS). [sent-78, score-0.103]
39 Each synset in turn belongs to a unique semantic file (SF). [sent-79, score-0.417]
40 There are a total of 45 SFs (1 for adverbs, 3 for adjectives, 15 for verbs, and 26 for nouns), based on syntactic and semantic categories. [sent-80, score-0.273]
41 For example, noun semantic files (SF_N) differentiate nouns denoting acts or actions, and nouns denoting animals, among others. [sent-81, score-0.562]
42 We experiment with both full synsets and SFs as instances of fine-grained and coarse-grained semantic representation, respectively. [sent-82, score-0.367]
43 As an example of the difference in these two representations, knife in its tool sense is in the EDGE TOOL USED AS A CUTTING INSTRUMENT singleton synset, and also in the ARTIFACT SF along with thousands of other words including cutter. [sent-83, score-0.316]
44 Note that these are the two extremes of semantic granularity in WordNet. [sent-84, score-0.273]
45 As a hybrid representation, we also tested the effect of merging words with their corresponding SF. [sent-85, score-0.054]
46 This is a form of semantic specialization rather than generalization, and allows the parser to discriminate between the different senses of each word, but not generalize across words. [sent-88, score-0.478]
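The three representations can be contrasted on the knife example with a toy two-word lexicon standing in for WordNet (the synset and semantic-file names below follow WordNet conventions but are illustrative):

```python
# Illustrative mini-WordNet: each entry is (synset_id, semantic_file).
# The real inventory has tens of thousands of synsets and 45 semantic files.
LEXICON = {
    "knife": ("edge_tool_used_as_a_cutting_instrument.n.01", "noun.artifact"),
    "cutter": ("cutter.n.01", "noun.artifact"),
}

def represent(word, kind):
    """SS = full synset (fine-grained), SF = semantic file (coarse-grained),
    WSF = wordform+SF (specialization rather than generalization)."""
    ss, sf = LEXICON[word]
    if kind == "SS":
        return ss
    if kind == "SF":
        return sf
    if kind == "WSF":
        return word + "+" + sf
    raise ValueError(kind)
```

SF maps knife and cutter to the same symbol (generalization across words), while WSF keeps them apart and only separates the senses of a single wordform (specialization).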
47 For each of these three semantic representations, we experimented with using each of: (1) all open-class POSs (nouns, verbs, adjectives and adverbs), (2) nouns only, and (3) verbs only. [sent-89, score-0.395]
48 There are thus a total of 9 combinations of representation type and target POS: SS (synset), SS_N (noun synsets), SS_V (verb synsets), SF (semantic file), SF_N (noun semantic files), SF_V (verb semantic files), WSF (wordform+SF), WSF_N (wordform+SF for nouns) and WSF_V (for verbs). [sent-90, score-0.659]
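The nine feature names follow mechanically from crossing the three representations with the three POS targets; a quick sketch:

```python
from itertools import product

REPRESENTATIONS = ["SS", "SF", "WSF"]
POS_TARGETS = ["all", "N", "V"]   # all open-class words, nouns only, verbs only

def feature_names():
    """Enumerate the 9 combinations of representation type and target POS,
    using the naming scheme of the paper (e.g. SF_N, WSF_V)."""
    names = []
    for rep, pos in product(REPRESENTATIONS, POS_TARGETS):
        names.append(rep if pos == "all" else rep + "_" + pos)
    return names
```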
49 For a given semantic representation, we need some form of WSD to determine the semantics of each token occurrence of a target word. [sent-91, score-0.273]
50 For the full Penn Treebank experiments, we only had access to the first sense, taken from WordNet 1. [sent-94, score-0.094]
51 4 Results In the following two subsections, we will first present the results in the SemCor/PTB intersection, with the option of using gold, 1st sense and automatic sense information (subsection 4. [sent-96, score-0.396]
52 Subsection 4.2 will show the results on the full PTB, using 1st sense information. [sent-98, score-0.292]
53 All results are shown as labelled attachment score (LAS). [sent-99, score-0.115]
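LAS simply counts the tokens whose predicted head and dependency label both match the gold standard; a minimal sketch:

```python
def las(gold, predicted):
    """Labeled Attachment Score: fraction of tokens whose predicted head
    AND dependency label both match the gold standard.
    Each sequence holds one (head_index, label) pair per token."""
    correct = sum(1 for g, p in zip(gold, predicted) if g == p)
    return correct / len(gold)

# Toy example: the second token gets the right head but the wrong label.
gold = [(2, "det"), (3, "nsubj"), (0, "root")]
pred = [(2, "det"), (3, "dobj"), (0, "root")]
```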
54 4.1 Semcor/PTB (GOLD/1ST/ASR) We conducted a series of experiments testing: • Each individual semantic feature, which gives 9 possibilities, also testing different learning configurations for each one. [sent-101, score-0.367]
55 • Combinations of semantic features; for instance, SF+SS_N+WSF would combine the three corresponding features. [sent-102, score-0.273]
56 Although there were hundreds of combinations, we took the best combination of semantic features on the development set for the final test. [sent-107, score-0.366]
57 For that reason, the table only presents 10 results for each disambiguation method, 9 for the individual features and one for the best combination. [sent-108, score-0.211]
58 Table 1 presents the results obtained for each of the disambiguation methods (gold standard sense information, 1st sense, and automatic sense ranking) and individual semantic feature. [sent-109, score-0.841]
59 In all cases except two, the use of semantic classes is beneficial. [sent-110, score-0.485]
60 Regarding individual features, the SF feature using GOLD senses gives the best improvement. [sent-115, score-0.112]
61 Regarding the best combination as selected in the training data, each WSD method yields a different combination, with best results for 1ST. [sent-118, score-0.054]
62 The improvement is statistically significant for both 1ST and GOLD. [sent-119, score-0.056]
63 The best results are obtained when using the first sense heuristic, but the difference is not statistically significant. [sent-121, score-0.254]
64 This shows that perfect WSD is not needed to obtain improvements, but it also shows that we reached the upper bound of our generalization and learning method. [sent-122, score-0.039]
65 4.2 Penn Treebank and 1st sense We only had 1st sense information available for the full PTB. [sent-124, score-0.490]
66 We tested MaltParser on the best configuration obtained for the reduced Semcor/PTB on the full treebank, taking sections 2-21 for training and section 23 for the final test. [sent-125, score-0.148]
67 Table 2 presents the results, showing that several of the individual features and the best combination give significant improvements. [sent-126, score-0.243]
68 To our knowledge, this is the first time that WordNet semantic classes help to obtain improvements on the full Penn Treebank. [sent-127, score-0.536]
69 702 It is interesting to mention that, although not shown on the tables, using lemmatization to assign semantic classes to wordforms gave a slight increase for all the tests (0. [sent-128, score-0.456]
70 This can be seen as an argument in favour of performing morphological analysis, an aspect that is often neglected when processing morphologically poor languages such as English. [sent-131, score-0.039]
71 5 Conclusions We tested the inclusion of several types of semantic information, in the form of WordNet semantic classes, in a dependency parser, showing that: • Semantic information improves transition-based deterministic dependency parsing. [sent-134, score-1.088]
72 • Feature combinations give an improvement over using a single feature. [sent-135, score-0.06]
73 Agirre et al. (2008) used a simple method of substituting wordforms with semantic information, which only allowed using a single semantic feature. [sent-137, score-0.611]
74 MaltParser allows the combination of several semantic features together with other features such as wordform, lemma or part of speech. [sent-138, score-0.405]
75 Although Tables 1 and 2 only show the best combination for each type of semantic information, this effect can be seen for GOLD and 1ST in Table 1. [sent-139, score-0.366]
76 Due to space reasons, we have only shown the best combination, but we can say that in general combining features gives significant increases over using a single semantic feature. [sent-140, score-0.312]
77 • The present work reports, for the first time, a statistically significant improvement on the full treebank using WordNet-based semantic information. [sent-141, score-0.582]
78 This goes beyond Agirre et al. (2008), which showed improvements on a subset of the PTB. [sent-143, score-0.051]
79 Given the basic nature of the semantic classes and WSD algorithms, we think there is room for future improvements, incorporating new kinds of semantic information, such as WordNet base concepts, Wikipedia concepts, or similarity measures. [sent-144, score-0.753]
80 Improving parsing and PP attachment performance with sense information. [sent-147, score-0.47]
81 Statistical parsing with a context-free grammar and word statistics. [sent-167, score-0.157]
wordName wordTfidf (topN-words)
[('semantic', 0.273), ('agirre', 0.257), ('sf', 0.223), ('ptb', 0.206), ('sense', 0.198), ('wordnet', 0.168), ('dependency', 0.158), ('parsing', 0.157), ('maltparser', 0.15), ('wordform', 0.145), ('wsd', 0.144), ('nivre', 0.134), ('koo', 0.12), ('knife', 0.118), ('pennconverter', 0.118), ('classes', 0.118), ('penn', 0.118), ('treebank', 0.116), ('attachment', 0.115), ('parser', 0.107), ('synsets', 0.103), ('las', 0.1), ('subsection', 0.1), ('full', 0.094), ('johansson', 0.092), ('bengoetxea', 0.09), ('koldo', 0.09), ('sfs', 0.09), ('wsf', 0.09), ('joakim', 0.082), ('asr', 0.081), ('synset', 0.079), ('nugues', 0.079), ('kepa', 0.079), ('disambiguation', 0.076), ('nouns', 0.074), ('collins', 0.071), ('fujita', 0.068), ('bikel', 0.068), ('file', 0.065), ('wordforms', 0.065), ('ciaramita', 0.065), ('candito', 0.062), ('fellbaum', 0.061), ('gold', 0.06), ('combinations', 0.06), ('intersection', 0.059), ('senses', 0.059), ('eneko', 0.058), ('mccarthy', 0.058), ('clusters', 0.057), ('statistically', 0.056), ('files', 0.055), ('showing', 0.054), ('combination', 0.054), ('tested', 0.054), ('representation', 0.053), ('individual', 0.053), ('christiane', 0.053), ('artifact', 0.053), ('parsers', 0.052), ('ratnaparkhi', 0.051), ('improvements', 0.051), ('carreras', 0.05), ('incorporating', 0.05), ('regarding', 0.049), ('xavier', 0.049), ('xiong', 0.049), ('bound', 0.048), ('verbs', 0.048), ('upper', 0.047), ('terry', 0.047), ('conversion', 0.046), ('constituency', 0.046), ('adverbs', 0.045), ('michael', 0.043), ('presents', 0.043), ('denoting', 0.043), ('suzuki', 0.043), ('series', 0.041), ('semisupervised', 0.04), ('neglected', 0.039), ('appreciated', 0.039), ('magerman', 0.039), ('upperbound', 0.039), ('wordnetbased', 0.039), ('nodalida', 0.039), ('tartu', 0.039), ('conversions', 0.039), ('semcor', 0.039), ('bler', 0.039), ('comparator', 0.039), ('ehu', 0.039), ('ingfi', 0.039), ('oakim', 0.039), ('prospects', 0.039), ('seddah', 0.039), ('specialization', 0.039), ('features', 
0.039), ('room', 0.039)]
simIndex simValue paperId paperTitle
same-paper 1 0.99999946 167 acl-2011-Improving Dependency Parsing with Semantic Classes
Author: Eneko Agirre ; Kepa Bengoetxea ; Koldo Gojenola ; Joakim Nivre
Abstract: This paper presents the introduction of WordNet semantic classes in a dependency parser, obtaining improvements on the full Penn Treebank for the first time. We tried different combinations of some basic semantic classes and word sense disambiguation algorithms. Our experiments show that selecting the adequate combination of semantic features on development data is key for success. Given the basic nature of the semantic classes and word sense disambiguation algorithms used, we think there is ample room for future improvements. 1
2 0.26870847 309 acl-2011-Transition-based Dependency Parsing with Rich Non-local Features
Author: Yue Zhang ; Joakim Nivre
Abstract: Transition-based dependency parsers generally use heuristic decoding algorithms but can accommodate arbitrarily rich feature representations. In this paper, we show that we can improve the accuracy of such parsers by considering even richer feature sets than those employed in previous systems. In the standard Penn Treebank setup, our novel features improve attachment score from 91.4% to 92.9%, giving the best results so far for transition-based parsing and rivaling the best results overall. For the Chinese Treebank, they give a significant improvement of the state of the art. An open source release of our parser is freely available.
3 0.22067755 143 acl-2011-Getting the Most out of Transition-based Dependency Parsing
Author: Jinho D. Choi ; Martha Palmer
Abstract: This paper suggests two ways of improving transition-based, non-projective dependency parsing. First, we add a transition to an existing non-projective parsing algorithm, so it can perform either projective or non-projective parsing as needed. Second, we present a bootstrapping technique that narrows down discrepancies between gold-standard and automatic parses used as features. The new addition to the algorithm shows a clear advantage in parsing speed. The bootstrapping technique gives a significant improvement to parsing accuracy, showing near state-of-the-art performance with respect to other parsing approaches evaluated on the same data set.
4 0.21446702 39 acl-2011-An Ensemble Model that Combines Syntactic and Semantic Clustering for Discriminative Dependency Parsing
Author: Gholamreza Haffari ; Marzieh Razavi ; Anoop Sarkar
Abstract: We combine multiple word representations based on semantic clusters extracted from the (Brown et al., 1992) algorithm and syntactic clusters obtained from the Berkeley parser (Petrov et al., 2006) in order to improve discriminative dependency parsing in the MSTParser framework (McDonald et al., 2005). We also provide an ensemble method for combining diverse cluster-based models. The two contributions together significantly improve unlabeled dependency accuracy from 90.82% to 92.13%.
5 0.21030551 198 acl-2011-Latent Semantic Word Sense Induction and Disambiguation
Author: Tim Van de Cruys ; Marianna Apidianaki
Abstract: In this paper, we present a unified model for the automatic induction of word senses from text, and the subsequent disambiguation of particular word instances using the automatically extracted sense inventory. The induction step and the disambiguation step are based on the same principle: words and contexts are mapped to a limited number of topical dimensions in a latent semantic word space. The intuition is that a particular sense is associated with a particular topic, so that different senses can be discriminated through their association with particular topical dimensions; in a similar vein, a particular instance of a word can be disambiguated by determining its most important topical dimensions. The model is evaluated on the SEMEVAL-2010 word sense induction and disambiguation task, on which it reaches state-of-the-art results.
6 0.19808505 111 acl-2011-Effects of Noun Phrase Bracketing in Dependency Parsing and Machine Translation
7 0.17858665 240 acl-2011-ParaSense or How to Use Parallel Corpora for Word Sense Disambiguation
8 0.17412752 127 acl-2011-Exploiting Web-Derived Selectional Preference to Improve Statistical Dependency Parsing
9 0.17270699 158 acl-2011-Identification of Domain-Specific Senses in a Machine-Readable Dictionary
10 0.15779623 333 acl-2011-Web-Scale Features for Full-Scale Parsing
11 0.14962822 269 acl-2011-Scaling up Automatic Cross-Lingual Semantic Role Annotation
12 0.14623202 282 acl-2011-Shift-Reduce CCG Parsing
13 0.14585529 3 acl-2011-A Bayesian Model for Unsupervised Semantic Parsing
14 0.14385183 48 acl-2011-Automatic Detection and Correction of Errors in Dependency Treebanks
15 0.13065855 230 acl-2011-Neutralizing Linguistically Problematic Annotations in Unsupervised Dependency Parsing Evaluation
16 0.13045168 164 acl-2011-Improving Arabic Dependency Parsing with Form-based and Functional Morphological Features
17 0.12277687 224 acl-2011-Models and Training for Unsupervised Preposition Sense Disambiguation
18 0.11812057 307 acl-2011-Towards Tracking Semantic Change by Visual Analytics
19 0.11709217 324 acl-2011-Unsupervised Semantic Role Induction via Split-Merge Clustering
20 0.11471716 304 acl-2011-Together We Can: Bilingual Bootstrapping for WSD
topicId topicWeight
[(0, 0.259), (1, -0.02), (2, -0.131), (3, -0.291), (4, -0.039), (5, -0.056), (6, 0.165), (7, 0.104), (8, 0.052), (9, -0.059), (10, 0.052), (11, -0.111), (12, 0.143), (13, -0.135), (14, -0.034), (15, -0.129), (16, 0.14), (17, 0.073), (18, -0.043), (19, 0.039), (20, -0.048), (21, -0.003), (22, -0.051), (23, 0.034), (24, 0.079), (25, -0.09), (26, 0.041), (27, -0.085), (28, -0.073), (29, 0.028), (30, -0.025), (31, -0.004), (32, 0.01), (33, 0.018), (34, 0.049), (35, 0.014), (36, 0.019), (37, -0.012), (38, 0.043), (39, 0.054), (40, -0.014), (41, 0.027), (42, -0.029), (43, 0.052), (44, 0.018), (45, 0.038), (46, 0.016), (47, 0.02), (48, 0.039), (49, -0.005)]
simIndex simValue paperId paperTitle
same-paper 1 0.97168469 167 acl-2011-Improving Dependency Parsing with Semantic Classes
Author: Eneko Agirre ; Kepa Bengoetxea ; Koldo Gojenola ; Joakim Nivre
Abstract: This paper presents the introduction of WordNet semantic classes in a dependency parser, obtaining improvements on the full Penn Treebank for the first time. We tried different combinations of some basic semantic classes and word sense disambiguation algorithms. Our experiments show that selecting the adequate combination of semantic features on development data is key for success. Given the basic nature of the semantic classes and word sense disambiguation algorithms used, we think there is ample room for future improvements. 1
2 0.77441514 39 acl-2011-An Ensemble Model that Combines Syntactic and Semantic Clustering for Discriminative Dependency Parsing
Author: Gholamreza Haffari ; Marzieh Razavi ; Anoop Sarkar
Abstract: We combine multiple word representations based on semantic clusters extracted from the (Brown et al., 1992) algorithm and syntactic clusters obtained from the Berkeley parser (Petrov et al., 2006) in order to improve discriminative dependency parsing in the MSTParser framework (McDonald et al., 2005). We also provide an ensemble method for combining diverse cluster-based models. The two contributions together significantly improve unlabeled dependency accuracy from 90.82% to 92.13%.
3 0.76120204 143 acl-2011-Getting the Most out of Transition-based Dependency Parsing
Author: Jinho D. Choi ; Martha Palmer
Abstract: This paper suggests two ways of improving transition-based, non-projective dependency parsing. First, we add a transition to an existing non-projective parsing algorithm, so it can perform either projective or non-projective parsing as needed. Second, we present a bootstrapping technique that narrows down discrepancies between gold-standard and automatic parses used as features. The new addition to the algorithm shows a clear advantage in parsing speed. The bootstrapping technique gives a significant improvement to parsing accuracy, showing near state-of-theart performance with respect to other parsing approaches evaluated on the same data set.
4 0.76117647 309 acl-2011-Transition-based Dependency Parsing with Rich Non-local Features
Author: Yue Zhang ; Joakim Nivre
Abstract: Transition-based dependency parsers generally use heuristic decoding algorithms but can accommodate arbitrarily rich feature representations. In this paper, we show that we can improve the accuracy of such parsers by considering even richer feature sets than those employed in previous systems. In the standard Penn Treebank setup, our novel features improve attachment score from 91.4% to 92.9%, giving the best results so far for transition-based parsing and rivaling the best results overall. For the Chinese Treebank, they give a significant improvement of the state of the art. An open source release of our parser is freely available.
5 0.75997329 230 acl-2011-Neutralizing Linguistically Problematic Annotations in Unsupervised Dependency Parsing Evaluation
Author: Roy Schwartz ; Omri Abend ; Roi Reichart ; Ari Rappoport
Abstract: Dependency parsing is a central NLP task. In this paper we show that the common evaluation for unsupervised dependency parsing is highly sensitive to problematic annotations. We show that for three leading unsupervised parsers (Klein and Manning, 2004; Cohen and Smith, 2009; Spitkovsky et al., 2010a), a small set of parameters can be found whose modification yields a significant improvement in standard evaluation measures. These parameters correspond to local cases where no linguistic consensus exists as to the proper gold annotation. Therefore, the standard evaluation does not provide a true indication of algorithm quality. We present a new measure, Neutral Edge Direction (NED), and show that it greatly reduces this undesired phenomenon.
6 0.75883466 127 acl-2011-Exploiting Web-Derived Selectional Preference to Improve Statistical Dependency Parsing
7 0.74538064 111 acl-2011-Effects of Noun Phrase Bracketing in Dependency Parsing and Machine Translation
8 0.71954876 198 acl-2011-Latent Semantic Word Sense Induction and Disambiguation
9 0.66872746 48 acl-2011-Automatic Detection and Correction of Errors in Dependency Treebanks
10 0.65990621 333 acl-2011-Web-Scale Features for Full-Scale Parsing
11 0.61677712 243 acl-2011-Partial Parsing from Bitext Projections
12 0.61327463 236 acl-2011-Optimistic Backtracking - A Backtracking Overlay for Deterministic Incremental Parsing
13 0.61055332 307 acl-2011-Towards Tracking Semantic Change by Visual Analytics
14 0.60991096 107 acl-2011-Dynamic Programming Algorithms for Transition-Based Dependency Parsers
15 0.60794169 295 acl-2011-Temporal Restricted Boltzmann Machines for Dependency Parsing
16 0.5935083 229 acl-2011-NULEX: An Open-License Broad Coverage Lexicon
17 0.58931583 334 acl-2011-Which Noun Phrases Denote Which Concepts?
18 0.58574086 282 acl-2011-Shift-Reduce CCG Parsing
19 0.57033247 59 acl-2011-Better Automatic Treebank Conversion Using A Feature-Based Approach
20 0.56116557 158 acl-2011-Identification of Domain-Specific Senses in a Machine-Readable Dictionary
topicId topicWeight
[(5, 0.018), (17, 0.039), (37, 0.135), (39, 0.064), (41, 0.071), (55, 0.024), (59, 0.074), (72, 0.022), (91, 0.031), (96, 0.1), (97, 0.359)]
simIndex simValue paperId paperTitle
1 0.87449682 315 acl-2011-Types of Common-Sense Knowledge Needed for Recognizing Textual Entailment
Author: Peter LoBue ; Alexander Yates
Abstract: Understanding language requires both linguistic knowledge and knowledge about how the world works, also known as common-sense knowledge. We attempt to characterize the kinds of common-sense knowledge most often involved in recognizing textual entailments. We identify 20 categories of common-sense knowledge that are prevalent in textual entailment, many of which have received scarce attention from researchers building collections of knowledge.
same-paper 2 0.76657665 167 acl-2011-Improving Dependency Parsing with Semantic Classes
Author: Eneko Agirre ; Kepa Bengoetxea ; Koldo Gojenola ; Joakim Nivre
Abstract: This paper presents the introduction of WordNet semantic classes in a dependency parser, obtaining improvements on the full Penn Treebank for the first time. We tried different combinations of some basic semantic classes and word sense disambiguation algorithms. Our experiments show that selecting the adequate combination of semantic features on development data is key for success. Given the basic nature of the semantic classes and word sense disambiguation algorithms used, we think there is ample room for future improvements. 1
3 0.74318099 10 acl-2011-A Discriminative Model for Joint Morphological Disambiguation and Dependency Parsing
Author: John Lee ; Jason Naradowsky ; David A. Smith
Abstract: Most previous studies of morphological disambiguation and dependency parsing have been pursued independently. Morphological taggers operate on n-grams and do not take into account syntactic relations; parsers use the “pipeline” approach, assuming that morphological information has been separately obtained. However, in morphologically-rich languages, there is often considerable interaction between morphology and syntax, such that neither can be disambiguated without the other. In this paper, we propose a discriminative model that jointly infers morphological properties and syntactic structures. In evaluations on various highly-inflected languages, this joint model outperforms both a baseline tagger in morphological disambiguation, and a pipeline parser in head selection.
4 0.69078553 336 acl-2011-Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method
Author: Yabin Zheng ; Lixing Xie ; Zhiyuan Liu ; Maosong Sun ; Yang Zhang ; Liyun Ru
Abstract: Chinese Pinyin input method is very important for Chinese language information processing. Users may make errors when they are typing in Chinese words. In this paper, we are concerned with the reasons that cause the errors. Inspired by the observation that pressing backspace is one of the most common user behaviors to modify the errors, we collect 54,309,334 error-correction pairs from a real-world data set that contains 2,277,786 users via backspace operations. In addition, we present a comparative analysis of the data to achieve a better understanding of users’ input behaviors. Comparisons with English typos suggest that some language-specific properties result in a part of Chinese input errors. 1
5 0.63712871 14 acl-2011-A Hierarchical Model of Web Summaries
Author: Yves Petinot ; Kathleen McKeown ; Kapil Thadani
Abstract: We investigate the relevance of hierarchical topic models to represent the content of Web gists. We focus our attention on DMOZ, a popular Web directory, and propose two algorithms to infer such a model from its manually-curated hierarchy of categories. Our first approach, based on information-theoretic grounds, uses an algorithm similar to recursive feature selection. Our second approach is fully Bayesian and derived from the more general model, hierarchical LDA. We evaluate the performance of both models against a flat 1-gram baseline and show improvements in terms of perplexity over held-out data.
6 0.57704419 158 acl-2011-Identification of Domain-Specific Senses in a Machine-Readable Dictionary
7 0.57246584 164 acl-2011-Improving Arabic Dependency Parsing with Form-based and Functional Morphological Features
8 0.54356164 229 acl-2011-NULEX: An Open-License Broad Coverage Lexicon
9 0.53424668 309 acl-2011-Transition-based Dependency Parsing with Rich Non-local Features
10 0.524903 111 acl-2011-Effects of Noun Phrase Bracketing in Dependency Parsing and Machine Translation
11 0.52077675 304 acl-2011-Together We Can: Bilingual Bootstrapping for WSD
12 0.51721728 222 acl-2011-Model-Portability Experiments for Textual Temporal Analysis
13 0.50743318 7 acl-2011-A Corpus for Modeling Morpho-Syntactic Agreement in Arabic: Gender, Number and Rationality
14 0.50265592 85 acl-2011-Coreference Resolution with World Knowledge
15 0.49869946 13 acl-2011-A Graph Approach to Spelling Correction in Domain-Centric Search
16 0.4981724 242 acl-2011-Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments
17 0.49801621 289 acl-2011-Subjectivity and Sentiment Analysis of Modern Standard Arabic
18 0.49047488 198 acl-2011-Latent Semantic Word Sense Induction and Disambiguation
19 0.48771203 297 acl-2011-That's What She Said: Double Entendre Identification
20 0.4862538 320 acl-2011-Unsupervised Discovery of Domain-Specific Knowledge from Text