Abstract:
This paper describes a similarity-based retrieval framework that addresses the challenges of handling text documents stored in a relational database. The proposed system automatically classifies documents based on the meanings of words and the relationships between groups of meanings, or concepts. Similar documents are identified through a set of common keywords and retrieved according to their degree of relevance, which is measured by the relative frequency of those keywords. In the case study, the system measures the similarity between new and existing thesis title descriptions in order to detect duplicates.
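As a rough illustration of relevance measured by relative keyword frequency, the following minimal sketch compares a new thesis title with existing ones. It is not the paper's implementation: the function names (relative_frequencies, cosine_similarity, rank_by_similarity), the simple tokenizer, and the choice of cosine similarity as the comparison measure are all assumptions made here for illustration.

```python
import math
import re
from collections import Counter


def relative_frequencies(text):
    """Map each lowercase word to its share of all words in the text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    counts = Counter(words)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {word: count / total for word, count in counts.items()}


def cosine_similarity(freq_a, freq_b):
    """Cosine similarity of two frequency maps (assumed measure, not from the paper)."""
    shared = set(freq_a) & set(freq_b)
    dot = sum(freq_a[w] * freq_b[w] for w in shared)
    norm_a = math.sqrt(sum(v * v for v in freq_a.values()))
    norm_b = math.sqrt(sum(v * v for v in freq_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(new_title, old_titles):
    """Return (score, title) pairs for old titles, most similar first."""
    new_freq = relative_frequencies(new_title)
    scored = [
        (cosine_similarity(new_freq, relative_frequencies(title)), title)
        for title in old_titles
    ]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    # Hypothetical titles, used only to show the call pattern.
    existing = [
        "Image compression using wavelets",
        "Similarity retrieval of document text",
    ]
    for score, title in rank_by_similarity("Document similarity retrieval", existing):
        print(f"{score:.3f}  {title}")
```

Under this sketch, a new title whose top score exceeds some chosen threshold could be flagged as a possible duplicate; the threshold value would have to be tuned on real data and is not specified here.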