FUZZY ARTMAP ALGORITHM
FOR DATA MINING IN BIOINFORMATICS
Bioinformatics is the application of information technology to the
management of biological data. It is an interdisciplinary area of science
where mathematics, statistics, and computer science are applied to data
produced by experimental work in biochemistry, cell biology, and genetics.
The need to merge the biological sciences with the world of IT and
computer science has arisen mainly from the huge amount of information
being produced by the study of genetic material.
Mapping is the process of splitting each chromosome into smaller
fragments, which can be propagated, characterized, and placed back in the
correct order on each chromosome. Sequencing is the process of determining
the order of the nucleotides (base sequence) in a DNA or RNA molecule, or
the order of amino acids in a protein. Genomics is concerned with the
number of genes, the function of genes, and the location and regulation
of genes.
In data acquisition, the DNA sample of the person first has to be
retrieved. A DNA sample can be obtained from any tissue, including blood.
The DNA sample is then ionized using ESI. Electrospray ionization (ESI)
allows production of molecular ions directly from samples in solution.
It can be used for small and large molecular-weight biopolymers (peptides,
proteins, carbohydrates, and DNA fragments), and lipids. It is a continuous
ionization method that is suitable for use as an interface with HPLC or
capillary electrophoresis. Multiply charged ions are usually produced. ESI
should be considered a complement to MALDI.
The sample must be soluble, stable in solution, polar, and relatively
clean (free of nonvolatile buffers, detergents, salts, etc.).
Laboratories preprocess the genetic data obtained from the ionization
method using a series of modular programs. These are most commonly written
in Perl; however, other languages that are often used include Python, XML,
and Java. This preprocessing basically involves organizing the sequence
data and checking data integrity. After these processes are carried out,
the data can be imported into a database. Sequence data comes in the
form of base strings
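
The preprocessing described above (organizing the sequence data, then
checking its integrity before import) might be sketched in Python roughly
as follows. This is only an illustration under stated assumptions: the
function names, the accepted alphabet (A, C, G, T, U, plus N for an unknown
base), and the rule that empty records are rejected are all hypothetical
and are not taken from any particular laboratory pipeline.

```python
from __future__ import annotations

# Assumption: valid characters are the DNA/RNA bases plus N for "unknown".
VALID_BASES = frozenset("ACGTUN")


def clean_sequence(raw: str) -> str:
    """Organize a raw base string: remove all whitespace and upper-case it."""
    return "".join(raw.split()).upper()


def invalid_positions(seq: str) -> list[int]:
    """Return the 0-based positions of characters that are not valid bases."""
    return [i for i, base in enumerate(seq) if base not in VALID_BASES]


def preprocess(
    records: dict[str, str],
) -> tuple[dict[str, str], dict[str, list[int]]]:
    """Split records into those ready for import and those that fail the check.

    Rejected records map to the list of offending positions; an empty
    sequence is rejected with an empty list (hypothetical rule).
    """
    accepted: dict[str, str] = {}
    rejected: dict[str, list[int]] = {}
    for name, raw in records.items():
        seq = clean_sequence(raw)
        bad = invalid_positions(seq)
        if seq and not bad:
            accepted[name] = seq
        else:
            rejected[name] = bad
    return accepted, rejected
```

Calling preprocess() on a dictionary of named raw strings returns one
dictionary of cleaned sequences that could be passed on to a database
import step, and one dictionary of rejected records for manual review.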