acl acl2012 knowledge-graph by maker-knowledge-mining

acl 2012 knowledge graph


similar papers computed by tfidf model


similar papers computed by lsi model


similar papers computed by lda model


papers list:

1 acl-2012-ACCURAT Toolkit for Multi-Level Alignment and Information Extraction from Comparable Corpora

Author: Marcis Pinnis ; Radu Ion ; Dan Stefanescu ; Fangzhong Su ; Inguna Skadina ; Andrejs Vasiljevs ; Bogdan Babych

Abstract: The lack of parallel corpora and linguistic resources for many languages and domains is one of the major obstacles for the further advancement of automated translation. A possible solution is to exploit comparable corpora (non-parallel bi- or multi-lingual text resources) which are much more widely available than parallel translation data. Our presented toolkit deals with parallel content extraction from comparable corpora. It consists of tools bundled in two workflows: (1) alignment of comparable documents and extraction of parallel sentences and (2) extraction and bilingual mapping of terms and named entities. The toolkit pairs similar bilingual comparable documents and extracts parallel sentences and bilingual terminological and named entity dictionaries from comparable corpora. This demonstration focuses on the English, Latvian, Lithuanian, and Romanian languages.

2 acl-2012-A Broad-Coverage Normalization System for Social Media Language

Author: Fei Liu ; Fuliang Weng ; Xiao Jiang

Abstract: Social media language contains huge amount and wide variety of nonstandard tokens, created both intentionally and unintentionally by the users. It is of crucial importance to normalize the noisy nonstandard tokens before applying other NLP techniques. A major challenge facing this task is the system coverage, i.e., for any user-created nonstandard term, the system should be able to restore the correct word within its top n output candidates. In this paper, we propose a cognitivelydriven normalization system that integrates different human perspectives in normalizing the nonstandard tokens, including the enhanced letter transformation, visual priming, and string/phonetic similarity. The system was evaluated on both word- and messagelevel using four SMS and Twitter data sets. Results show that our system achieves over 90% word-coverage across all data sets (a . 10% absolute increase compared to state-ofthe-art); the broad word-coverage can also successfully translate into message-level performance gain, yielding 6% absolute increase compared to the best prior approach.

3 acl-2012-A Class-Based Agreement Model for Generating Accurately Inflected Translations

Author: Spence Green ; John DeNero

Abstract: When automatically translating from a weakly inflected source language like English to a target language with richer grammatical features such as gender and dual number, the output commonly contains morpho-syntactic agreement errors. To address this issue, we present a target-side, class-based agreement model. Agreement is promoted by scoring a sequence of fine-grained morpho-syntactic classes that are predicted during decoding for each translation hypothesis. For English-to-Arabic translation, our model yields a +1.04 BLEU average improvement over a state-of-the-art baseline. The model does not require bitext or phrase table annotations and can be easily implemented as a feature in many phrase-based decoders. 1

4 acl-2012-A Comparative Study of Target Dependency Structures for Statistical Machine Translation

Author: Xianchao Wu ; Katsuhito Sudoh ; Kevin Duh ; Hajime Tsukada ; Masaaki Nagata

Abstract: This paper presents a comparative study of target dependency structures yielded by several state-of-the-art linguistic parsers. Our approach is to measure the impact of these nonisomorphic dependency structures to be used for string-to-dependency translation. Besides using traditional dependency parsers, we also use the dependency structures transformed from PCFG trees and predicate-argument structures (PASs) which are generated by an HPSG parser and a CCG parser. The experiments on Chinese-to-English translation show that the HPSG parser’s PASs achieved the best dependency and translation accuracies. 1

5 acl-2012-A Comparison of Chinese Parsers for Stanford Dependencies

Author: Wanxiang Che ; Valentin Spitkovsky ; Ting Liu

Abstract: Stanford dependencies are widely used in natural language processing as a semanticallyoriented representation, commonly generated either by (i) converting the output of a constituent parser, or (ii) predicting dependencies directly. Previous comparisons of the two approaches for English suggest that starting from constituents yields higher accuracies. In this paper, we re-evaluate both methods for Chinese, using more accurate dependency parsers than in previous work. Our comparison of performance and efficiency across seven popular open source parsers (four constituent and three dependency) shows, by contrast, that recent higher-order graph-based techniques can be more accurate, though somewhat slower, than constituent parsers. We demonstrate also that n-way jackknifing is a useful technique for producing automatic (rather than gold) partof-speech tags to train Chinese dependency parsers. Finally, we analyze the relations produced by both kinds of parsing and suggest which specific parsers to use in practice.

6 acl-2012-A Comprehensive Gold Standard for the Enron Organizational Hierarchy

Author: Apoorv Agarwal ; Adinoyi Omuya ; Aaron Harnly ; Owen Rambow

Abstract: Many researchers have attempted to predict the Enron corporate hierarchy from the data. This work, however, has been hampered by a lack of data. We present a new, large, and freely available gold-standard hierarchy. Using our new gold standard, we show that a simple lower bound for social network-based systems outperforms an upper bound on the approach taken by current NLP systems.

7 acl-2012-A Computational Approach to the Automation of Creative Naming

Author: Gozde Ozbal ; Carlo Strapparava

Abstract: In this paper, we propose a computational approach to generate neologisms consisting of homophonic puns and metaphors based on the category of the service to be named and the properties to be underlined. We describe all the linguistic resources and natural language processing techniques that we have exploited for this task. Then, we analyze the performance of the system that we have developed. The empirical results show that our approach is generally effective and it constitutes a solid starting point for the automation ofthe naming process.

8 acl-2012-A Corpus of Textual Revisions in Second Language Writing

Author: John Lee ; Jonathan Webster

Abstract: This paper describes the creation of the first large-scale corpus containing drafts and final versions of essays written by non-native speakers, with the sentences aligned across different versions. Furthermore, the sentences in the drafts are annotated with comments from teachers. The corpus is intended to support research on textual revision by language learners, and how it is influenced by feedback. This corpus has been converted into an XML format conforming to the standards of the Text Encoding Initiative (TEI).

9 acl-2012-A Cost Sensitive Part-of-Speech Tagging: Differentiating Serious Errors from Minor Errors

Author: Hyun-Je Song ; Jeong-Woo Son ; Tae-Gil Noh ; Seong-Bae Park ; Sang-Jo Lee

Abstract: All types of part-of-speech (POS) tagging errors have been equally treated by existing taggers. However, the errors are not equally important, since some errors affect the performance of subsequent natural language processing (NLP) tasks seriously while others do not. This paper aims to minimize these serious errors while retaining the overall performance of POS tagging. Two gradient loss functions are proposed to reflect the different types of errors. They are designed to assign a larger cost to serious errors and a smaller one to minor errors. Through a set of POS tagging experiments, it is shown that the classifier trained with the proposed loss functions reduces serious errors compared to state-of-the-art POS taggers. In addition, the experimental result on text chunking shows that fewer serious errors help to improve the performance of sub- sequent NLP tasks.

10 acl-2012-A Discriminative Hierarchical Model for Fast Coreference at Large Scale

Author: Michael Wick ; Sameer Singh ; Andrew McCallum

Abstract: Sameer Singh Andrew McCallum University of Massachusetts University of Massachusetts 140 Governor’s Drive 140 Governor’s Drive Amherst, MA Amherst, MA s ameer@ cs .umas s .edu mccal lum@ c s .umas s .edu Hamming” who authored “The unreasonable effectiveness of mathematics.” Features of the mentions Methods that measure compatibility between mention pairs are currently the dominant ap- proach to coreference. However, they suffer from a number of drawbacks including difficulties scaling to large numbers of mentions and limited representational power. As these drawbacks become increasingly restrictive, the need to replace the pairwise approaches with a more expressive, highly scalable alternative is becoming urgent. In this paper we propose a novel discriminative hierarchical model that recursively partitions entities into trees of latent sub-entities. These trees succinctly summarize the mentions providing a highly compact, information-rich structure for reasoning about entities and coreference uncertainty at massive scales. We demonstrate that the hierarchical model is several orders of magnitude faster than pairwise, allowing us to perform coreference on six million author mentions in under four hours on a single CPU.

11 acl-2012-A Feature-Rich Constituent Context Model for Grammar Induction

Author: Dave Golland ; John DeNero ; Jakob Uszkoreit

Abstract: We present LLCCM, a log-linear variant ofthe constituent context model (CCM) of grammar induction. LLCCM retains the simplicity of the original CCM but extends robustly to long sentences. On sentences of up to length 40, LLCCM outperforms CCM by 13.9% bracketing F1 and outperforms a right-branching baseline in regimes where CCM does not.

12 acl-2012-A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction

Author: Seokhwan Kim ; Gary Geunbae Lee

Abstract: Although researchers have conducted extensive studies on relation extraction in the last decade, supervised approaches are still limited because they require large amounts of training data to achieve high performances. To build a relation extractor without significant annotation effort, we can exploit cross-lingual annotation projection, which leverages parallel corpora as external resources for supervision. This paper proposes a novel graph-based projection approach and demonstrates the merits of it by using a Korean relation extraction system based on projected dataset from an English-Korean parallel corpus.

13 acl-2012-A Graphical Interface for MT Evaluation and Error Analysis

Author: Meritxell Gonzalez ; Jesus Gimenez ; Lluis Marquez

Abstract: Error analysis in machine translation is a necessary step in order to investigate the strengths and weaknesses of the MT systems under development and allow fair comparisons among them. This work presents an application that shows how a set of heterogeneous automatic metrics can be used to evaluate a test bed of automatic translations. To do so, we have set up an online graphical interface for the ASIYA toolkit, a rich repository of evaluation measures working at different linguistic levels. The current implementation of the interface shows constituency and dependency trees as well as shallow syntactic and semantic annotations, and word alignments. The intelligent visualization of the linguistic structures used by the metrics, as well as a set of navigational functionalities, may lead towards advanced methods for automatic error analysis.

14 acl-2012-A Joint Model for Discovery of Aspects in Utterances

Author: Asli Celikyilmaz ; Dilek Hakkani-Tur

Abstract: We describe a joint model for understanding user actions in natural language utterances. Our multi-layer generative approach uses both labeled and unlabeled utterances to jointly learn aspects regarding utterance’s target domain (e.g. movies), intention (e.g., finding a movie) along with other semantic units (e.g., movie name). We inject information extracted from unstructured web search query logs as prior information to enhance the generative process of the natural language utterance understanding model. Using utterances from five domains, our approach shows up to 4.5% improvement on domain and dialog act performance over cascaded approach in which each semantic component is learned sequentially and a supervised joint learning model (which requires fully labeled data).

15 acl-2012-A Meta Learning Approach to Grammatical Error Correction

Author: Hongsuck Seo ; Jonghoon Lee ; Seokhwan Kim ; Kyusong Lee ; Sechun Kang ; Gary Geunbae Lee

Abstract: We introduce a novel method for grammatical error correction with a number of small corpora. To make the best use of several corpora with different characteristics, we employ a meta-learning with several base classifiers trained on different corpora. This research focuses on a grammatical error correction task for article errors. A series of experiments is presented to show the effectiveness of the proposed approach on two different grammatical error tagged corpora. 1.

16 acl-2012-A Nonparametric Bayesian Approach to Acoustic Model Discovery

Author: Chia-ying Lee ; James Glass

Abstract: We investigate the problem of acoustic modeling in which prior language-specific knowledge and transcribed data are unavailable. We present an unsupervised model that simultaneously segments the speech, discovers a proper set of sub-word units (e.g., phones) and learns a Hidden Markov Model (HMM) for each induced acoustic unit. Our approach is formulated as a Dirichlet process mixture model in which each mixture is an HMM that represents a sub-word unit. We apply our model to the TIMIT corpus, and the results demonstrate that our model discovers sub-word units that are highly correlated with English phones and also produces better segmentation than the state-of-the-art unsupervised baseline. We test the quality of the learned acoustic models on a spoken term detection task. Compared to the baselines, our model improves the relative precision of top hits by at least 22.1% and outper- forms a language-mismatched acoustic model.

17 acl-2012-A Novel Burst-based Text Representation Model for Scalable Event Detection

Author: Xin Zhao ; Rishan Chen ; Kai Fan ; Hongfei Yan ; Xiaoming Li

Abstract: Mining retrospective events from text streams has been an important research topic. Classic text representation model (i.e., vector space model) cannot model temporal aspects of documents. To address it, we proposed a novel burst-based text representation model, denoted as BurstVSM. BurstVSM corresponds dimensions to bursty features instead of terms, which can capture semantic and temporal information. Meanwhile, it significantly reduces the number of non-zero entries in the representation. We test it via scalable event detection, and experiments in a 10-year news archive show that our methods are both effective and efficient.

18 acl-2012-A Probabilistic Model for Canonicalizing Named Entity Mentions

Author: Dani Yogatama ; Yanchuan Sim ; Noah A. Smith

Abstract: We present a statistical model for canonicalizing named entity mentions into a table whose rows represent entities and whose columns are attributes (or parts of attributes). The model is novel in that it incorporates entity context, surface features, firstorder dependencies among attribute-parts, and a notion of noise. Transductive learning from a few seeds and a collection of mention tokens combines Bayesian inference and conditional estimation. We evaluate our model and its components on two datasets collected from political blogs and sports news, finding that it outperforms a simple agglomerative clustering approach and previous work.

19 acl-2012-A Ranking-based Approach to Word Reordering for Statistical Machine Translation

Author: Nan Yang ; Mu Li ; Dongdong Zhang ; Nenghai Yu

Abstract: Long distance word reordering is a major challenge in statistical machine translation research. Previous work has shown using source syntactic trees is an effective way to tackle this problem between two languages with substantial word order difference. In this work, we further extend this line of exploration and propose a novel but simple approach, which utilizes a ranking model based on word order precedence in the target language to reposition nodes in the syntactic parse tree of a source sentence. The ranking model is automatically derived from word aligned parallel data with a syntactic parser for source language based on both lexical and syntactical features. We evaluated our approach on largescale Japanese-English and English-Japanese machine translation tasks, and show that it can significantly outperform the baseline phrase- based SMT system.

20 acl-2012-A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining

Author: Hassan Sajjad ; Alexander Fraser ; Helmut Schmid

Abstract: We propose a novel model to automatically extract transliteration pairs from parallel corpora. Our model is efficient, language pair independent and mines transliteration pairs in a consistent fashion in both unsupervised and semi-supervised settings. We model transliteration mining as an interpolation of transliteration and non-transliteration sub-models. We evaluate on NEWS 2010 shared task data and on parallel corpora with competitive results.

21 acl-2012-A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle

22 acl-2012-A Topic Similarity Model for Hierarchical Phrase-based Translation

23 acl-2012-A Two-step Approach to Sentence Compression of Spoken Utterances

24 acl-2012-A Web-based Evaluation Framework for Spatial Instruction-Giving Systems

25 acl-2012-An Exploration of Forest-to-String Translation: Does Translation Help or Hurt Parsing?

26 acl-2012-Applications of GPC Rules and Character Structures in Games for Learning Chinese Characters

27 acl-2012-Arabic Retrieval Revisited: Morphological Hole Filling

28 acl-2012-Aspect Extraction through Semi-Supervised Modeling

29 acl-2012-Assessing the Effect of Inconsistent Assessors on Summarization Evaluation

30 acl-2012-Attacking Parsing Bottlenecks with Unlabeled Data and Relevant Factorizations

31 acl-2012-Authorship Attribution with Author-aware Topic Models

32 acl-2012-Automated Essay Scoring Based on Finite State Transducer: towards ASR Transcription of Oral English Speech

33 acl-2012-Automatic Event Extraction with Structured Preference Modeling

34 acl-2012-Automatically Learning Measures of Child Language Development

35 acl-2012-Automatically Mining Question Reformulation Patterns from Search Log Data

36 acl-2012-BIUTEE: A Modular Open-Source System for Recognizing Textual Entailment

37 acl-2012-Baselines and Bigrams: Simple, Good Sentiment and Topic Classification

38 acl-2012-Bayesian Symbol-Refined Tree Substitution Grammars for Syntactic Parsing

39 acl-2012-Beefmoves: Dissemination, Diversity, and Dynamics of English Borrowings in a German Hip Hop Forum

40 acl-2012-Big Data versus the Crowd: Looking for Relationships in All the Right Places

41 acl-2012-Bootstrapping a Unified Model of Lexical and Phonetic Acquisition

42 acl-2012-Bootstrapping via Graph Propagation

43 acl-2012-Building Trainable Taggers in a Web-based, UIMA-Supported NLP Workbench

44 acl-2012-CSNIPER - Annotation-by-query for Non-canonical Constructions in Large Corpora

45 acl-2012-Capturing Paradigmatic and Syntagmatic Lexical Relations: Towards Accurate Chinese Part-of-Speech Tagging

46 acl-2012-Character-Level Machine Translation Evaluation for Languages with Ambiguous Word Boundaries

47 acl-2012-Chinese Comma Disambiguation for Discourse Analysis

48 acl-2012-Classifying French Verbs Using French and English Lexical Resources

49 acl-2012-Coarse Lexical Semantic Annotation with Supersenses: An Arabic Case Study

50 acl-2012-Collective Classification for Fine-grained Information Status

51 acl-2012-Collective Generation of Natural Image Descriptions

52 acl-2012-Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation

53 acl-2012-Combining Textual Entailment and Argumentation Theory for Supporting Online Debates Interactions

54 acl-2012-Combining Word-Level and Character-Level Models for Machine Translation Between Closely-Related Languages

55 acl-2012-Community Answer Summarization for Multi-Sentence Question with Group L1 Regularization

56 acl-2012-Computational Approaches to Sentence Completion

57 acl-2012-Concept-to-text Generation via Discriminative Reranking

58 acl-2012-Coreference Semantics from Web Features

59 acl-2012-Corpus-based Interpretation of Instructions in Virtual Environments

60 acl-2012-Coupling Label Propagation and Constraints for Temporal Fact Extraction

61 acl-2012-Cross-Domain Co-Extraction of Sentiment and Topic Lexicons

62 acl-2012-Cross-Lingual Mixture Model for Sentiment Classification

63 acl-2012-Cross-lingual Parse Disambiguation based on Semantic Correspondence

64 acl-2012-Crosslingual Induction of Semantic Roles

65 acl-2012-Crowdsourcing Inference-Rule Evaluation

66 acl-2012-DOMCAT: A Bilingual Concordancer for Domain-Specific Computer Assisted Translation

67 acl-2012-Deciphering Foreign Language by Combining Language Models and Context Vectors

68 acl-2012-Decoding Running Key Ciphers

69 acl-2012-Deep Learning for NLP (without Magic)

70 acl-2012-Demonstration of IlluMe: Creating Ambient According to Instant Message Logs

71 acl-2012-Dependency Hashing for n-best CCG Parsing

72 acl-2012-Detecting Semantic Equivalence and Information Disparity in Cross-lingual Documents

73 acl-2012-Discriminative Learning for Joint Template Filling

74 acl-2012-Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach

75 acl-2012-Discriminative Strategies to Integrate Multiword Expression Recognition and Parsing

76 acl-2012-Distributional Semantics in Technicolor

77 acl-2012-Ecological Evaluation of Persuasive Messages Using Google AdWords

78 acl-2012-Efficient Search for Transformation-based Inference

79 acl-2012-Efficient Tree-Based Topic Modeling

80 acl-2012-Efficient Tree-based Approximation for Entailment Graph Learning

81 acl-2012-Enhancing Statistical Machine Translation with Character Alignment

82 acl-2012-Entailment-based Text Exploration with Application to the Health-care Domain

83 acl-2012-Error Mining on Dependency Trees

84 acl-2012-Estimating Compact Yet Rich Tree Insertion Grammars

85 acl-2012-Event Linking: Grounding Event Reference in a News Archive

86 acl-2012-Exploiting Latent Information to Predict Diffusions of Novel Topics on Social Networks

87 acl-2012-Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars

88 acl-2012-Exploiting Social Information in Grounded Language Learning via Grammatical Reduction

89 acl-2012-Exploring Deterministic Constraints: from a Constrained English POS Tagger to an Efficient ILP Solution to Chinese Word Segmentation

90 acl-2012-Extracting Narrative Timelines as Temporal Dependency Structures

91 acl-2012-Extracting and modeling durations for habits and events from Twitter

92 acl-2012-FLOW: A First-Language-Oriented Writing Assistant System

93 acl-2012-Fast Online Lexicon Learning for Grounded Language Acquisition

94 acl-2012-Fast Online Training with Frequency-Adaptive Learning Rates for Chinese Word Segmentation and New Word Detection

95 acl-2012-Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining

96 acl-2012-Fast and Robust Part-of-Speech Tagging Using Dynamic Model Selection

97 acl-2012-Fast and Scalable Decoding with Language Model Look-Ahead for Phrase-based Statistical Machine Translation

98 acl-2012-Finding Bursty Topics from Microblogs

99 acl-2012-Finding Salient Dates for Building Thematic Timelines

100 acl-2012-Fine Granular Aspect Analysis using Latent Structural Models

101 acl-2012-Fully Abstractive Approach to Guided Summarization

102 acl-2012-Genre Independent Subgroup Detection in Online Discussion Threads: A Study of Implicit Attitude using Textual Latent Semantics

103 acl-2012-Grammar Error Correction Using Pseudo-Error Sentences and Domain Adaptation

104 acl-2012-Graph-based Semi-Supervised Learning Algorithms for NLP

105 acl-2012-Head-Driven Hierarchical Phrase-based Translation

106 acl-2012-Head-driven Transition-based Parsing with Top-down Prediction

107 acl-2012-Heuristic Cube Pruning in Linear Time

108 acl-2012-Hierarchical Chunk-to-String Translation

109 acl-2012-Higher-order Constituent Parsing and Parser Combination

110 acl-2012-Historical Analysis of Legal Opinions with a Sparse Mixed-Effects Latent Variable Model

111 acl-2012-How Are Spelling Errors Generated and Corrected? A Study of Corrected and Uncorrected Spelling Errors Using Keystroke Logs

112 acl-2012-Humor as Circuits in Semantic Networks

113 acl-2012-INPROwidth.3emiSS: A Component for Just-In-Time Incremental Speech Synthesis

114 acl-2012-IRIS: a Chat-oriented Dialogue System based on the Vector Space Model

115 acl-2012-Identifying High-Impact Sub-Structures for Convolution Kernels in Document-level Sentiment Classification

116 acl-2012-Improve SMT Quality with Automatically Extracted Paraphrase Rules

117 acl-2012-Improving Word Representations via Global Context and Multiple Word Prototypes

118 acl-2012-Improving the IBM Alignment Models Using Variational Bayes

119 acl-2012-Incremental Joint Approach to Word Segmentation, POS Tagging, and Dependency Parsing in Chinese

120 acl-2012-Information-theoretic Multi-view Domain Adaptation

121 acl-2012-Iterative Viterbi A* Algorithm for K-Best Sequential Decoding

122 acl-2012-Joint Evaluation of Morphological Segmentation and Syntactic Parsing

123 acl-2012-Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT

124 acl-2012-Joint Inference of Named Entity Recognition and Normalization for Tweets

125 acl-2012-Joint Learning of a Dual SMT System for Paraphrase Generation

126 acl-2012-Labeling Documents with Timestamps: Learning from their Time Expressions

127 acl-2012-Large-Scale Syntactic Language Modeling with Treelets

128 acl-2012-Learning Better Rule Extraction with Translation Span Alignment

129 acl-2012-Learning High-Level Planning from Text

130 acl-2012-Learning Syntactic Verb Frames using Graphical Models

131 acl-2012-Learning Translation Consensus with Structured Label Propagation

132 acl-2012-Learning the Latent Semantics of a Concept from its Definition

133 acl-2012-Learning to "Read Between the Lines" using Bayesian Logic Programs

134 acl-2012-Learning to Find Translations and Transliterations on the Web

135 acl-2012-Learning to Temporally Order Medical Events in Clinical Text

136 acl-2012-Learning to Translate with Multiple Objectives

137 acl-2012-Lemmatisation as a Tagging Task

138 acl-2012-LetsMT!: Cloud-Based Platform for Do-It-Yourself Machine Translation

139 acl-2012-MIX Is Not a Tree-Adjoining Language

140 acl-2012-Machine Translation without Words through Substring Alignment

141 acl-2012-Maximum Expected BLEU Training of Phrase and Lexicon Translation Models

142 acl-2012-Mining Entity Types from Query Logs via User Intent Modeling

143 acl-2012-Mixing Multiple Translation Models in Statistical Machine Translation

144 acl-2012-Modeling Review Comments

145 acl-2012-Modeling Sentences in the Latent Space

146 acl-2012-Modeling Topic Dependencies in Hierarchical Text Categorization

147 acl-2012-Modeling the Translation of Predicate-Argument Structure for SMT

148 acl-2012-Modified Distortion Matrices for Phrase-Based Statistical Machine Translation

149 acl-2012-Movie-DiC: a Movie Dialogue Corpus for Research and Development

150 acl-2012-Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia

151 acl-2012-Multilingual Subjectivity and Sentiment Analysis

152 acl-2012-Multilingual WSD with Just a Few Lines of Code: the BabelNet API

153 acl-2012-Named Entity Disambiguation in Streaming Data

154 acl-2012-Native Language Detection with Tree Substitution Grammars

155 acl-2012-NiuTrans: An Open Source Toolkit for Phrase-based and Syntax-based Machine Translation

156 acl-2012-Online Plagiarized Detection Through Exploiting Lexical, Syntax, and Semantic Information

157 acl-2012-PDTB-style Discourse Annotation of Chinese Text

158 acl-2012-PORT: a Precision-Order-Recall MT Evaluation Metric for Tuning

159 acl-2012-Pattern Learning for Relation Extraction with a Hierarchical Topic Model

160 acl-2012-Personalized Normalization for a Multilingual Chat System

161 acl-2012-Polarity Consistency Checking for Sentiment Dictionaries

162 acl-2012-Post-ordering by Parsing for Japanese-English Statistical Machine Translation

163 acl-2012-Prediction of Learning Curves in Machine Translation

164 acl-2012-Private Access to Phrase Tables for Statistical Machine Translation

165 acl-2012-Probabilistic Integration of Partial Lexical Information for Noise Robust Haptic Voice Recognition

166 acl-2012-Qualitative Modeling of Spatial Prepositions and Motion Expressions

167 acl-2012-QuickView: NLP-based Tweet Search

168 acl-2012-Reducing Approximation and Estimation Errors for Chinese Lexical Processing with Heterogeneous Annotations

169 acl-2012-Reducing Wrong Labels in Distant Supervision for Relation Extraction

170 acl-2012-Robust Conversion of CCG Derivations to Phrase Structure Trees

171 acl-2012-SITS: A Hierarchical Nonparametric Model using Speaker Identity for Topic Segmentation in Multiparty Conversations

172 acl-2012-Selective Sharing for Multilingual Dependency Parsing

173 acl-2012-Self-Disclosure and Relationship Strength in Twitter Conversations

174 acl-2012-Semantic Parsing with Bayesian Tree Transducers

175 acl-2012-Semi-supervised Dependency Parsing using Lexical Affinities

176 acl-2012-Sentence Compression with Semantic Role Constraints

177 acl-2012-Sentence Dependency Tagging in Online Question Answering Forums

178 acl-2012-Sentence Simplification by Monolingual Machine Translation

179 acl-2012-Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the l0-norm

180 acl-2012-Social Event Radar: A Bilingual Context Mining and Sentiment Analysis Summarization System

181 acl-2012-Spectral Learning of Latent-Variable PCFGs

182 acl-2012-Spice it up? Mining Refinements to Online Instructions from User Generated Content

183 acl-2012-State-of-the-Art Kernels for Natural Language Processing

184 acl-2012-String Re-writing Kernel

185 acl-2012-Strong Lexicalization of Tree Adjoining Grammars

186 acl-2012-Structuring E-Commerce Inventory

187 acl-2012-Subgroup Detection in Ideological Discussions

188 acl-2012-Subgroup Detector: A System for Detecting Subgroups in Online Discussions

189 acl-2012-Syntactic Annotations for the Google Books NGram Corpus

190 acl-2012-Syntactic Stylometry for Deception Detection

191 acl-2012-Temporally Anchored Relation Extraction

192 acl-2012-Tense and Aspect Error Correction for ESL Learners Using Global Context

193 acl-2012-Text-level Discourse Parsing with Rich Linguistic Features

194 acl-2012-Text Segmentation by Language Using Minimum Description Length

195 acl-2012-The Creation of a Corpus of English Metalanguage

196 acl-2012-The OpenGrm open-source finite-state grammar software libraries

197 acl-2012-Tokenization: Returning to a Long Solved Problem A Survey, Contrastive Experiment, Recommendations, and Toolkit

198 acl-2012-Topic Models, Latent Space Models, Sparse Coding, and All That: A Systematic Understanding of Probabilistic Semantic Extraction in Large Corpus

199 acl-2012-Topic Models for Dynamic Translation Model Adaptation

200 acl-2012-Toward Automatically Assembling Hittite-Language Cuneiform Tablet Fragments into Larger Texts

201 acl-2012-Towards the Unsupervised Acquisition of Discourse Relations

202 acl-2012-Transforming Standard Arabic to Colloquial Arabic

203 acl-2012-Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information

204 acl-2012-Translation Model Size Reduction for Hierarchical Phrase-based Statistical Machine Translation

205 acl-2012-Tweet Recommendation with Graph Co-Ranking

206 acl-2012-UWN: A Large Multilingual Lexical Knowledge Base

207 acl-2012-Unsupervised Morphology Rivals Supervised Morphology for Arabic MT

208 acl-2012-Unsupervised Relation Discovery with Sense Disambiguation

209 acl-2012-Unsupervised Semantic Role Induction with Global Role Ordering

210 acl-2012-Unsupervized Word Segmentation: the Case for Mandarin Chinese

211 acl-2012-Using Rejuvenation to Improve Particle Filtering for Bayesian Word Segmentation

212 acl-2012-Using Search-Logs to Improve Query Tagging

213 acl-2012-Utilizing Dependency Language Models for Graph-based Dependency Parsing Models

214 acl-2012-Verb Classification using Distributional Similarity in Syntactic and Semantic Structures

215 acl-2012-WizIE: A Best Practices Guided Development Environment for Information Extraction

216 acl-2012-Word Epoch Disambiguation: Finding How Words Change Over Time

217 acl-2012-Word Sense Disambiguation Improves Information Retrieval

218 acl-2012-You Had Me at Hello: How Phrasing Affects Memorability

219 acl-2012-langid.py: An Off-the-shelf Language Identification Tool