acl acl2012 acl2012-186 acl2012-186-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Karin Mauge ; Khash Rohanimanesh ; Jean-David Ruvini
Abstract: Large e-commerce enterprises feature millions of items entered daily by a large variety of sellers. While some sellers provide rich, structured descriptions of their items, a vast majority of them provide unstructured natural language descriptions. In the paper we present a 2 steps method for structuring items into descriptive properties. The first step consists in unsupervised property discovery and extraction. The second step involves supervised property synonym discovery using a maximum entropy based clustering algorithm. We evaluate our method on a year worth of ecommerce data and show that it achieves excellent precision with good recall.