CFP last date
20 January 2025
Reseach Article

A Web based Database for Hypothetical Genes in the Human Genome

by Sivashankari Selvarajan, Piramanayagam Shanmughavel
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 14 - Number 4
Year of Publication: 2011
Authors: Sivashankari Selvarajan, Piramanayagam Shanmughavel
10.5120/1834-2460

Sivashankari Selvarajan, Piramanayagam Shanmughavel . A Web based Database for Hypothetical Genes in the Human Genome. International Journal of Computer Applications. 14, 4 ( January 2011), 10-13. DOI=10.5120/1834-2460

@article{ 10.5120/1834-2460,
author = { Sivashankari Selvarajan, Piramanayagam Shanmughavel },
title = { A Web based Database for Hypothetical Genes in the Human Genome },
journal = { International Journal of Computer Applications },
issue_date = { January 2011 },
volume = { 14 },
number = { 4 },
month = { January },
year = { 2011 },
issn = { 0975-8887 },
pages = { 10-13 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume14/number4/1834-2460/ },
doi = { 10.5120/1834-2460 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:02:30.445852+05:30
%A Sivashankari Selvarajan
%A Piramanayagam Shanmughavel
%T A Web based Database for Hypothetical Genes in the Human Genome
%J International Journal of Computer Applications
%@ 0975-8887
%V 14
%N 4
%P 10-13
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Due to accumulation of genomic data, the function of a vast amount of genes and the proteins encoded by them are unknown. Unless, the function of proteome encoded by the entire genome is not known, the biochemical processes and their importance cannot be understood. Also, the computational annotation returns a gene without any homolog in the protein database it encodes it as ‘hypothetical’. Due to advancements in annotation projects, many genes whose evidence for expression invivo is not known and due to lack of similar protein could not be assigned function. This pose a challenge to functional genomics and automatic annotation of hypothetical genes are done at a faster rate using developed annotation tools to know the function of the hypothetical genes. Moreover, when the hypothetical genes are present in human, it is really a lacuna and hence functional annotation of the hypothetical genes in the human genome is the need of the hour. Hence, this work attempts to annotate the hypothetical genes in Human and makes the results publicly accessible using a web based database using PHP and MySQL.

References
  1. J. C. Venter et al., Science 291, 1304 (2001).
  2. Zarembinski, T. I., Hung, L.-W., Mueller-Dieckmann, H.-J., Kim, K.-K., Yokota, H ., Kim, R. & Kim, S.-H. (1998). Proc. Natl Acad. Sci. USA, 95, 15189-15193
  3. Todd AE, Orengo CA, Thornton JM, Evolution of function in protein superfamilies, from a structural perspective. J Mol Biol. 2001 Apr 6;307(4):1113-43.
  4. Gough, J., Karplus, K., Hughey, R. and Chothia, C. (2001). "Assignment of Homology to Genome Sequences using a Library of Hidden Markov Models that Represent all Proteins of Known Structure." J. Mol. Biol., 313(4), 903-919.
  5. Roman L Tatusov, Natalie D Fedorova, John D Jackson, Aviva R Jacobs, Boris Kiryutin, Eugene V Koonin, Dmitri M Krylov, Raja Mazumder, Sergei L Mekhedov, Anastasia N Nikolskaya, B Sridhar Rao, Sergei Smirnov, Alexander V Sverdlov, Sona Vasudevan, Yuri I Wolf, Jodie J Yin, and Darren A Natale , The COG database: an updated version includes eukaryotes, BMC Bioinformatics. 2003;4: 41.
  6. Krogh A, Larsson B, von Heijne G, Sonnhammer EL, Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes, J Mol Biol. 2001 Jan 19;305(3):567-80.
  7. Wu CH, Apweiler R, Bairoch A, Natale DA, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, et al. The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Res. 2006;34:D187–D191.
  8. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997; 25:3389–3402.
  9. Gough, J., Karplus, K., Hughey, R. and Chothia, C. (2001). "Assignment of Homology to Genome Sequences using a Library of Hidden Markov Models that Represent all Proteins of Known Structure." J. Mol. Biol., 313(4), 903-919.
  10. Antonina Andreeva1,Dave Howorth1, John-Marc Chandonia, Steven E. Brenner, Tim J. P. Hubbard, Cyrus Chothia5 and Alexey G. Murzin, Data growth and its impact on the SCOP database: new developments Nucleic Acids Research, 2008, Vol. 36, Database issue D419–D425.
  11. Pruitt KD, Tatusova, T, Maglott DR, NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins, Nucleic Acids Res 2007 Jan 1;35(Database issue):D61-5
  12. Stefan Götz, Juan Miguel García-Gómez, Javier Terol, Tim D. Williams, María José Nueda, Montserrat Robles, Manuel Talón, Joaquín Dopazo and Ana Conesa, High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 2008 June; 36(10): 3420–3435.
Index Terms

Computer Science
Information Sciences

Keywords

Hypothetical database human hypothetical genes