Study On Clustering Techniques And Application To Microarray Gene Expression Bioinformatics Data

Kumar, Dhiraj (2009) Study On Clustering Techniques And Application To Microarray Gene Expression Bioinformatics Data. MTech by Research thesis.

Abstract

With the explosive growth in the amount of publicly available genomic data, a new field of computer science, bioinformatics, has emerged. It focuses on the use of computing systems to efficiently derive, store, and analyze the character strings of the genome in order to help solve problems in molecular biology. The flood of data from biology, mainly in the form of DNA, RNA, and protein sequences, places heavy demands on computers and computational scientists. At the same time, it demands a transformation of the basic ethos of the biological sciences. Data mining techniques can therefore be used to efficiently explore the hidden patterns underlying biological data. Unsupervised classification, also known as clustering, is a branch of data mining that can be applied to biological data, and this can lead to an era of rapid medical development and drug discovery. In the past decade, the advent of efficient genome sequencing tools has led to enormous progress in the life sciences. Among the most important innovations, microarray technology allows the expression of thousands of genes to be quantified simultaneously. The characteristics that distinguish these data from typical machine-learning and pattern-recognition data include a fair amount of random noise, missing values, a dimension in the range of thousands, and a sample size of only a few dozen. A particular application of microarray technology is in cancer research, where the goal is the precise and early detection of tumorous cells with high accuracy. The challenge for biologists and computer scientists is to provide solutions in terms of automation, quality, and efficiency.

Item Type: Thesis (MTech by Research)
Uncontrolled Keywords: Brute Force, Fuzzy C-Means, Genetic Algorithm, Simulated Annealing
Subjects: Engineering and Technology > Computer and Information Science > Data Mining
Divisions: Engineering and Technology > Department of Computer Science
ID Code: 2798
Deposited By: Hemanta Biswal
Deposited On: 01 Aug 2011 09:56
Last Modified: 01 Aug 2011 09:56
Supervisor(s): Rath, S K
