Other Journals Published by Timeline Publication Pvt. Ltd.
Preprocessing and Morphological Analysis in Text Mining
-
Krishna Kumar Mohbey; Sachin Tiwari
- This paper is based on the preprocessing
activities which is performed by the software or language
translators before applying mining algorithms on the huge
data. Text mining is an important area of Data mining and it
plays a vital role for extracting useful information from the
huge database or data ware house. But before applying the
text mining or information extraction process, preprocessing
is must because the given data or dataset have the noisy,
incomplete, inconsistent, dirty and unformatted data. In this
paper we try to collect the necessary requirements for
preprocessing. When we complete the preprocess task then
we can easily extract the knowledgful information using
mining strategy. This paper also provides the information
about the analysis of data like tokenization, stemming and
semantic analysis like phrase recognition and parsing. This
paper also collect the procedures for preprocessing data i.e.
it describe that how the stemming, tokenization or parsing
are applied.
- Select Volume / Issues:
- Year:
- 2011
- Type of Publication:
- Article
- Keywords:
- Morphological analysis; parsing; stemming; Tokenization
- Journal:
- IJECCE
- Volume:
- 2
- Number:
- 2
- Pages:
- 116-122
- Month:
- October
Hits: 8387