Record Details

Text similarity measurement using concept representation of texts

DSpace at IIT Bombay

View Archive Info
 
 
Field Value
 
Title Text similarity measurement using concept representation of texts
 
Creator PANDYA, A
BHATTACHARYYA, P
 
Description Measuring semantic nearness of documents is important for accurate information retrieval, automated text categorization and classification. Inspired by the observation that text documents contain semantically coherent set of ideas/topics, this paper presents the design and experimental evaluation of a method to represent a text document as a set of concepts. Based on this, we propose a method to measure semantic nearness of texts. Our method makes use of WordNet which is a lexico-semantic network of words. We bypass word sense disambiguation. In order to show the effectiveness of our representation of texts, we compare experimental results of text classification and clustering with the results of classification and clustering with standard techniques.
 
Publisher SPRINGER-VERLAG BERLIN
 
Date 2011-10-23T18:17:07Z
2011-12-15T09:11:17Z
2011-10-23T18:17:07Z
2011-12-15T09:11:17Z
2005
 
Type Article; Proceedings Paper
 
Identifier PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS,3776,678-683
3-540-30506-8
0302-9743
http://dspace.library.iitb.ac.in/xmlui/handle/10054/15199
http://hdl.handle.net/100/1967
 
Source 1st International Conference on Pattern Recognition and Machine Intelligence,Calcutta, INDIA,DEC 20-22, 2005
 
Language English