DNA SEQUENCE DATABASE CLASSIFICATION AND REDUCTION: ROUGH SETS THEORY APPROACH

A Rahman , M Nordin and M Saman , M Yazid and Ahmad , Aziz and M Tap, A Osman (2007) DNA SEQUENCE DATABASE CLASSIFICATION AND REDUCTION: ROUGH SETS THEORY APPROACH. In: 2nd International Conference on Informatics (Informatics 2007), Hilton Petaling Jaya Hotel, Petaling Jaya, Selangor, Malaysia.

[img] PDF
AIA.pdf

Download (3MB)
Official URL: http://dspace.fsktm.um.edu.my/handle/1812/344

Abstract

Data classification is a vital task in large scale data mining application. DNA sequences are the basis of life and they encode all the necessary information needed to reproduce life. The size of public DNA sequence databases are growing doubling every year. This situation makes automatic classification and reduction of DNA sequences has become important for effective sequence similarity search problem. A challenge in DNA sequence similarity search is that the sequence record structure does not have any attribute that can be used for implementing classification process. In this paper, by means of filtering process an automaton based exact string matching is employed to generate a special attribute used for DNA sequence database classification and reduction. Rough sets theory provides an indiscernibility relation technique which can be used to classify and reduct the database based on some definition of ‘equivalence’. The generated attribute is used for constructing indiscernibility relation among sequences. With computational implementation, the experiments are executed to investigate the effectiveness of rough sets theory on generating DNA sequence database classification and reduction. Moreover, the experiments will demonstrate that the DNA sequence similarity search performance is significantly improved by using this approach.

Item Type: Conference or Workshop Item (Paper)
Subjects: Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Divisions: Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science
Depositing User: MS SITI NUR ATIKAH MOHAMAD RUSDI
Date Deposited: 16 Jul 2013 02:30
Last Modified: 16 Jul 2013 02:30
URI: http://repository.um.edu.my/id/eprint/428

Actions (login required)

View Item View Item