THE PECULIARITIES OF THE TEXT DOCUMENT REPRESENTATION, USING ONTOLOGY AND TAGGING-BASED CLUSTERING TECHNIQUE

Authors

  • Marijus Bernotas Šiauliai University
  • Kazys Karklius Šiauliai University
  • Remigijus Laurutis Šiauliai University
  • Asta Slotkienė Šiauliai University

Abstract

Text documents are very significant in the contemporary organizations, moreover their constant accumulation enlarges the scope of document storage. Standard text mining and information retrieval techniques of text document usually rely on word matching. An alternative way of information retrieval is clustering. In this paper we suggest to complement the traditional clustering method by document repre-sentation based on tagging, and to improve clustering results by using knowledge technology – ontology. The proposed method solves locally applied language incompact usage in the process of document clus-tering.

Downloads

Published

2007-07-04

Issue

Section

Articles