Computer Engineering & Information Technology , Amir Kabir University of Technology
This paper presents a new method for analyzing words in the Persian language context to find orthographical and structural errors regardless of the meaning. This technique tokenizes each word in a statement then tries to detect the kind of word, and analyses its correctness in terms of orthography and morphology by means of a lexicon. It should be noted that some words in the Persian language have the same stem, which are constructed by adding particles to them according to certain rules. For these words the researchers present a new method to reduce the size/volume of the lexicon and to quicken in searching.