Approximation of Missing Values in DNA Microarray Gene Expression Data

Reet Kamal; Sukhwinder Bir; Amanjot Kaur

Call for Paper

April Edition

IJCA solicits high quality original research papers for the upcoming April edition of the journal. The last date of research paper submission is 20 March 2026

Submit your paper

Know more

The week's pick

Explainable Hybrid Deep Learning for Automated Diagnosis of Canine Mammary Tumors

Elham Shawky Salama Heba Askr Ashraf Darwish Aboul Ella Hassanien

Random Articles

Reseach Article

Approximation of Missing Values in DNA Microarray Gene Expression Data

by Reet Kamal, Sukhwinder Bir, Amanjot Kaur

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 4 - Number 3

Year of Publication: 2010

Authors: Reet Kamal, Sukhwinder Bir, Amanjot Kaur

10.5120/810-1150

Reet Kamal, Sukhwinder Bir, Amanjot Kaur . Approximation of Missing Values in DNA Microarray Gene Expression Data. International Journal of Computer Applications. 4, 3 ( July 2010), 20-25. DOI=10.5120/810-1150

@article{ 10.5120/810-1150,

author = { Reet Kamal, Sukhwinder Bir, Amanjot Kaur },

title = { Approximation of Missing Values in DNA Microarray Gene Expression Data },

journal = { International Journal of Computer Applications },

issue_date = { July 2010 },

volume = { 4 },

number = { 3 },

month = { July },

year = { 2010 },

issn = { 0975-8887 },

pages = { 20-25 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume4/number3/810-1150/ },

doi = { 10.5120/810-1150 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T19:52:05.076889+05:30

%A Reet Kamal

%A Sukhwinder Bir

%A Amanjot Kaur

%T Approximation of Missing Values in DNA Microarray Gene Expression Data

%J International Journal of Computer Applications

%@ 0975-8887

%V 4

%N 3

%P 20-25

%D 2010

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In the past few years, there has been a detonation of data in the field of biotechnology. Gene expression microarray experiments produce datasets with numerous missing expression values due to various reasons, e.g. insufficient resolution, image corruption, dust or scratches on the slides, or experimental error during the laboratory process.. To improve these missing values, many algorithms for gene expression analysis oblige a complete matrix of gene array values as input, such as K nearest neighbor impute method, Bayesian principal components analysis impute method, etc. Accurate estimation of missing values is an important requirement for efficient data analysis. Main problem of existing methods for microarray data is that there is no external information but the estimation is based exclusively on the expression data. We conjectured that utilizing a priori information on functional similarities available from public databases facilitates the missing value estimation. Robust missing value estimation methods are required since many algorithms for gene expression analysis entail a complete matrix of gene array values. Either genes with missing values can be removed, or the missing values can be replaced using prediction. Current methods for estimating the missing values include sample mean and K-nearest neighbors (KNN). Whether the accuracy of estimation methods depends on the actual gene expression has not been thoroughly investigated. Under this setting, we examine how the accuracy depends on the actual expression level and propose new method that provides improvements in accuracy relative to the current methods in certain ranges of gene expression.

References

Index Terms

Computer Science

Information Sciences

Keywords

Clustering DNA Microarray Fuzzy Logic