研究生: |
丁白龍 Carlos R. Arias |
---|---|
論文名稱: |
Network Based Disease Gene Prioritization 以網路為基礎之疾病基因排序 |
指導教授: |
蘇豐文
Soo, Von-Wun |
口試委員: |
陳朝欽
Chen, Chaur-Chin 賴尚宏 Lai, Shang-Hong 蔡宗翰 Tsai, Tzong-Han 許聞廉 Hsu, Wen-Lian 洪炯宗 Horng, Jorng-Tzong 蘇豐文 Soo, Von-Wun |
學位類別: |
博士 Doctor |
系所名稱: |
電機資訊學院 - 資訊系統與應用研究所 Institute of Information Systems and Applications |
論文出版年: | 2012 |
畢業學年度: | 100 |
語文別: | 中文 |
論文頁數: | 109 |
中文關鍵詞: | 基因排序 、最短路徑 、微陣列數據 、蛋白質相互作用網絡 、前列腺癌 |
外文關鍵詞: | Gene Prioritization, Shortest Paths, Microarray Data, Protein Interaction Network, Prostate Cancer |
相關次數: | 點閱:3 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
Many biological processes, at any level of organization from cellular to ecosystem, can be modeled as a complex network. In ecosystems, the objects in the network are the organisms involved in the model and the relationships are how the organisms interact with each other. At the cellular level, objects range from genes to metabolites and the relationship represent the interactions between them. In recent years, we have witnessed an explosion of available biological data that started with the Human Genome Project, and then it matured with the birth of a new field of study called systems biology. In this field, available data is integrated and viewed from the systems perspective. A vast amount of data has become publicly available, and a significant amount of it can be modeled using networks. A subfield of systems biology, network biology takes special interest in the biological network models. Network biology helps the biomedical community to unravel the mysteries of life, and also of diseases.
Identification of diseases is a long time research subject, in this thesis we will present a method for disease gene identification using biological networks approach. The approach is based on protein interaction networks and microarray expression data. It integrates techniques like random walk with restarts with filtering purposes, shortest paths analysis for the core of the prioritization, and topological features of the network to help identify key genes. The contribution of this thesis is an integrated method for disease gene prioritization, that was tested using the prostate cancer as domain, obtaining the best performance for the top 50 rank compared to other state of the art methods.
[1] XGMML (eXtensible Graph Markup and Modeling Language), 1.0 edition,
2001. Accesed 26-Nov-2010.
[2] C Abate-Shen and MM Shen. Molecular Genetics of prostate cancer. Genes
& Development, 14(19):2410{2434, OCT 1 2000.
[3] Euan A. Adie, Richard R. Adams, Kathy L. Evans, David J. Porteous, and
Ben S. Pickard. Speeding disease gene discovery by sequence based candidate
prioritization. BMC Bioinformatics, 6(1), March 2005.
[4] Euan A. Adie, Richard R. Adams, Kathy L. Evans, David J. Porteous,
and Ben S. Pickard. Suspects: enabling fast and eective prioritization of
positional candidates. Bioinformatics, 22(6):773{4, 2006.
[5] Stein Aerts, Diether Lambrechts, Sunit Maity, Peter Van Loo, Bert Coessens,
Frederik De Smet, Leon-Charles C. Tranchevent, Bart De Moor,
Peter Marynen, Bassem Hassan, Peter Carmeliet, and Yves Moreau. Gene
prioritization through genomic data fusion. Nature biotechnology, 24(5):537{
544, May 2006.
[6] Alfred V. Aho, John E. Hopcroft, and Jerey D. Ullman. Data Structures
and Algorithms. Addison-Wesley, Massachusetts, USA, 1 edition, 1983.
[7] B. Aranda, P. Achuthan, Y. Alam-Faruque, I. Armean, A. Bridge, C. Derow,
M. Feuermann, A. T. Ghanbarian, S. Kerrien, J. Khadake, J. Kerssemakers,
C. Leroy, M. Menden, M. Michaut, L. Montecchi-Palazzi, S. N. Neuhauser,
S. Orchard, V. Perreau, B. Roechert, K. van Eijk, and H. Hermjakob. The
IntAct molecular interaction database in 2010. Nucleics Acids Research,
38(Suppl. 1):D525{D531, JAN 2010.
[8] Carlos Roberto Arias and Von-Wun Soo. Computing all pairs shortest paths
on graphs with articulation points. Journal of Computer Technology and
Application, 2011.
[9] Carlos Roberto Arias, Hsiang-Yuan Yeh, and Von-Wun Soo. Bioinformatics,
chapter Disease Gene Prioritization. INTECH, Rijeka, Croatia, 2011.
[10] Carlos Roberto Arias, Hsiang-Yuan Yeh, and Von-Wun Soo. Biomarker
identication for prostate cancer and lymph node metastasis from microarray
data and protein interaction network using gene prioritization method. The
Scientic World Journal, 2011. Accepted 2011-12-26.
[11] M. Ashburner, C. A. Ball, J. A. Blake, D. Botstein, H. Butler, J. M. Cherry,
A. P. Davis, K. Dolinski, S. S. Dwight, J. T. Eppig, M. A. Harris, D. P.
Hill, L. Issel-Tarver, A. Kasarskis, S. Lewis, J. C. Matese, J. E. Richardson,
M. Ringwald, G. M. Rubin, and G. Sherlock. Gene ontology: tool for
the unication of biology. the gene ontology consortium. Nature Genetics,
25(1):25{29, May 2000.
[12] Gary D. Bader, Doron Betel, and Christopher W. V. Hogue. BIND:
the Biomolecular Interaction Network Database. Nucleics Acids Research,
31(1):248{250, JAN 1 2003.
[13] Alberto Laszlo Barabasi and Reka Albert. Emergence of scaling in random
networks. Science, 286(5439):509{512, OCT 15 1999.
[14] Vladimir Batagelj and Andrej Mrvar. Pajek. http://pajek.imfm.si/, 2010.
[15] Roble G. Bedolla, Yu Wang, Alfredo Asuncion, Karim Chamie, Salma Siddiqui,
Maria M. Mudryj, Thomas J. Prihoda, Javed Siddiqui, Arul M. Chinnaiyan,
Rohit Mehra, Ralph W. de Vere White, and Paramita M. Ghosh. Nuclear
versus Cytoplasmic Localization of Filamin A in Prostate Cancer: Immunohistochemical
Correlation with Metastases. Clinical Cancer Research,
15(3):788{796, FEB 1 2009.
[16] Mikael Benson and Rainer Breitling. Network theory to understand microarray
studies of complex diseases. Current Molecura Medicine, 6(6):695{701,
SEP 2006.
[17] Arif Bilgin, John Ellson, Emden Gansner, Yifan Hu, and Stephen North.
Graphviz. http://www.graphviz.org, 2010.
[18] Uday Bondhugula, Ananth Devulapalli, Joseph Fernando, Pete Wycko,
and P. Sadayappan. Parallel fpga-based all-pairs shortest-paths in a directed
graph. In 20th International Parallel and Distributed Processing Symposium.
Rhodes Island, Greece, 2006, pages 10 pp.{, Cambridge, MA, USA, April
2006. The MIT Press.
[19] Steve Borgatti, Martin Everett, and Lin Freeman. UCINET. Analytic Technologies,
2010.
[20] Karsten M. Borgwardt, Cheng Soon Ong, Stefan Schonauer, S.V.N. Vishwanathan,
Alex J. Smola, and Hans-Peter Kriegel. Protein function prediction
via graph kernels. Bioinformatics, 21(1):47{56, 2005.
[21] Ulrik Brandes. A faster algorithm for betweenness centrality. Journal of
Mathematical Sociology, 25(2):163{177, 2001.
[22] Ase Bratland, Piet J. Boender, Hanne K. Hoifodt, Ingrid H. G. Ostensen,
Rob Ruijtenbeek, Meng yu Wang, Jens P. Berg, Wolfgang Lilleby, Oystein
Fodstad, and Anne Hansen Ree. Osteoblast-induced EGFR/ERBB2 signaling
in androgen-sensitive prostate carcinoma cells characterized by multiplex
kinase activity proling. Clinical & Experimental Metastasis, 26(5):485{496,
JUN 2009.
[23] Bobby-Joe Breitkreutz, Chris Stark, Teresa Reguly, Lorrie Boucher, Ashton
Breitkreutz, Michael Livstone, Rose Oughtred, Daniel H. Lackner, Jurg
Bahler, ValerieWood, Kara Dolinski, and Mike Tyers. The BioGRID interaction
database: 2008 update. Nucleics Acids Research, 36(Sp. Iss. SI):D637{
D640, JAN 2008.
[24] Lukas Bubendorf, Juha Kononen, Pasi Koivisto, Peter Schraml, Holger
Moch, Thomas C. Gasser, Niels Willi, Michael J. Mihatsch, Guido Sauter,
and Olli-P. Kallioniemi. Survey of gene amplications during prostate cancer
progression by high throughput
uorescence in situ hybridization on tissue
microarrays. Cancer Research, 59(4):803{806, FEB 15 1999.
[25] James J. Cai, Elhanan Borenstein, and Dmitri A. Petrov. Broker genes in
human disease. Genome Biology and Evolution, 2:815{825, 2010.
[26] Arnaud Ceol, Andrew Chatr Aryamontri, Luana Licata, Daniele Peluso,
Leonardo Briganti, Livia Perfetto, Luisa Castagnoli, and Gianni Cesareni.
MINT, the molecular interaction database: 2009 update. Nucleics Acids
Research, 38(Suppl. 1):D532{D539, JAN 2010.
[27] Christian Lenz Cesar. Graph Foundation Classes for Java (GFC).
http://www.alphaworks.ibm.com/tech/gfc, 1999.
[28] Jing Chen, Eric E. Bardes, Bruce J. Aronow, and Anil G. Jegga. Toppgene
suite for gene list enrichment analysis and candidate gene prioritization.
Nucleics Acids Research, 37(Web Server issue):gkp427+, July 2009.
[29] Xing Chen, Guiying Yan, Wei Ren, and Ji-Bin Qu. Modularized random
walk with restart for candidate disease genes prioritization. Systems Biology,
pages 353{360, 2009.
[30] Boris V. Cherkassky, Andrew V. Goldberg, and Tomasz Radzik. Shortest
paths algorithms: Theory and experimental evaluation. Mathematical Pro-
gramming, 73:129{174, 1993.
[31] Gene Ontology Consortium. Gene ontology database, 7 2010. Release Type:
assocdb; Release Name: 2010-07.
[32] Thomas H Cormen, Charles E Leiserson, Ronald L Rivest, and Cliord
Stein. Introduction to Algorithms. MIT Press, Cambridge, MA, USA, second
edition, 2001.
[33] Jean C. de Borda. Memoire sur les Elections au Scrutin. Histoire de
l'Academie Royale des Sciences, Paris, 1781.
[34] Tiago de Paula Peixoto. graph-tool. http://projects.skewed.de/graph-tool/,
2010.
[35] Jerey Dean and Sanjay Ghemawat. Mapreduce: Simplied data processing
on large clusters. Communications of the ACM, 51(1):107{113, JAN 2008.
[36] S. Dey and P.K. Srimani. Fast parallel algorithm for all-pairs shortest path
problem and its vlsi implementation. Computers and Digital Techniques,
IEE Proceedings E, 136(2):85{89, Mar 1989.
[37] Zhihu Ding, Chang-Jiun Wu, Gerald C. Chu, Yonghong Xiao, Dennis Ho,
Jingfang Zhang, Samuel R. Perry, Emma S. Labrot, Xiaoqiu Wu, Rosina
Lis, Yujin Hoshida, David Hiller, Baoli Hu, Shan Jiang, Hongwu Zheng,
Alexander H. Stegh, Kenneth L. Scott, Sabina Signoretti, Nabeel Bardeesy,
Y. Alan Wang, David E. Hill, Todd R. Golub, Meir J. Stampfer, Wing H.
Wong, Massimo Loda, Lorelei Mucci, Lynda Chin, and Ronald A. DePinho.
SMAD4-dependent barrier constrains prostate cancer growth and metastatic
progression. Nature, 470(7333):269+, FEB 10 2011.
[38] Mary E. Dolan, Li Ni, Evelyn Camon, and Judith A. Blake. A procedure for
assessing go annotation consistency. Bioinformatics, 21(suppl 1):i136{143,
June 2005.
[39] Aled M. Edwards, Bart Kus, Ronald Jansen, Dov Greenbaum, Jack Greenblatt,
and Mark Gerstein. Bridging structural biology and genomics: assessing
protein interaction data with known complexes. Trends in Genetics,
18(10):529{536, OCT 2002.
[40] Joanne Edwards and JMS Bartlett. The androgen receptor and signaltransduction
pathways in hormone-refractory prostate cancer. Part 1: modi
cations to the androgen receptor. BJU International, 95(9):1320{1326,
JUN 2005.
[41] Sinan Erten, Gurkan Bebek, and Mehmet Koyutuerk. Disease gene prioritization
based on topological similarity in protein-protein interaction networks.
In V Bafna and SC Sahinalp, editors, Research in Computational
Biology, volume 6577 of Lecture Notes in Bioinformatics, pages 54{68.
SPRINGER-VERLAG, 2011. 15th Annual International Conference on Research
in Computational Molecular Biology, Simon Fraser Univ, Lab Comp
Biol, Vancouver, Canada, MAR 28-31, 2011.
[42] Sinan Erten and Mehmet Koyuturk. Role of centrality in network-based
prioritization of disease genes. In Clara Pizzuti, Marylyn Ritchie, and Mario
Giacobini, editors, Evolutionary Computation, Machine Learning and Data
Mining in Bioinformatics, volume 6023 of Lecture Notes in Computer Sci-
ence, pages 13{25. Springer Berlin / Heidelberg, 2010.
[43] Claudio Festuccia, Giovanni Luca Gravina, Leda Biordi, Sandra D'Ascenzo,
Vincenza Dolo, Corrado Ficorella, Enrico Ricevuto, and Vincenzo
Tombolini. Eects of EGFR Tyrosine Kinase Inhibitor Erlotinib in Prostate
Cancer Cells In Vitro. Prostate, 69(14):1529{1537, OCT 1 2009.
[44] Linton C. Freeman. Centrality in social networks: Conceptual clarication.
Social Networks, 1(3):215{239, 1979.
[45] Lori S. Friedman, Elizabeth A. Ostermeyer, Csilla I. Szabo, Patrick Dowd,
Erick D. Lynch, Sarah E. Rowell, and Mary-Claire King. Conrmation of
BRCA1 LAY Analysis of Germline Mutations Linked to Breast and Ovarian
Cancer in 10 Families. Nature Genetics, 8(4):399{404, DEC 1994.
[46] P. Andrew Futreal, Lachlan Coin, Mhairi Marshall, Thomas Down, Timothy
Hubbard, Richard Wooster, Nazneen Rahman, and Michael R. Stratton. A
census of human cancer genes. Nature Reviews Cancer, 4(3):177{183, March
2004.
[47] Adrian Lopez Garcia De Lomana, Qasim K. Beg, G. De Fabritiis, and Jordi
Villa-Freixa. Statistical Analysis of Global Connectivity and Activity Distributions
in Cellular Networks. Journal of Computational Biology, 17(7):869{
878, JUL 2010.
[48] Robert Gentleman and Ross Ihaka. R. http://www.r-project.org/, 1997.
[49] Thomas G. Graeber and David Eisenberg. Bioinformatic identication of
potential autocrine signaling loops in cancers from gene expression proles.
Nature Genetics, 29(3):295{300, NOV 2001.
[50] Andrei Grigoriev. A relationship between gene expression and protein interactions
on the proteome scale: analysis of the bacteriophage T7 and the
yeast Saccharomyces cerevisiae. Nucleics Acids Research, 29(17):3513{3519,
SEP 1 2001.
[51] Bora Gurel, Tsuyoshi Iwata, Cheryl M. Koh, Robert B. Jenkins, Fusheng
Lan, Chi Van Dang, Jessica L. Hicks, James Morgan, Toby C. Cornish,
Siobhan Sutclie, William B. Isaacs, Jun Luo, and Angelo M. De Marzo.
Nuclear MYC protein overexpression is an early alteration in human prostate
carcinogenesis. Modern Pathology, 21(9):1156{1167, SEP 2008.
[52] Ada Hamosh, Alan F. Scott, Joanna S. Amberger, Carol. A. Bocchini, and
Victor. A. McKusick. Online mendelian inheritance in man (omim), a knowledgebase
of human genes and genetic disorders. Nucleics Acids Research,
33(Database issue), January 2005.
[53] Shyh-Min Huang and Paul M. Harari. Epidermal growth factor receptor inhibition
in cancer therapy: Biology, rationale and preliminary clinical results.
Investigational New Drugs, 17:259{269, 1999. 10.1023/A:1006384521198.
[54] Jeremy Hubble, Janos Demeter, Heng Jin, Maria Mao, Michael Nitzberg,
T. B. Reddy, Farrell Wymore, Zachariah K. Zachariah, Gavin Sherlock,
and Catherine A. Ball. Implementation of genepattern within the stanford
microarray database. Nucleics Acids Research, 37(Database issue), January
2009.
[55] Sarah Hunter, Rolf Apweiler, Teresa K. Attwood, Amos Bairoch, Alex
Bateman, David Binns, Peer Bork, Ujjwal Das, Louise Daugherty, Lauranne
Duquenne, Robert D. Finn, Julian Gough, Daniel Haft, Nicolas Hulo,
Daniel Kahn, Elizabeth Kelly, Aurelie Laugraud, Ivica Letunic, David Lonsdale,
Rodrigo Lopez, Martin Madera, John Maslen, Craig McAnulla, Jennifer
McDowall, Jaina Mistry, Alex Mitchell, Nicola Mulder, Darren Natale,
Christine Orengo, Antony F. Quinn, Jeremy D. Selengut, Christian J. A.
Sigrist, Manjula Thimma, Paul D. Thomas, Franck Valentin, Derek Wilson,
Cathy H. Wu, and Corin Yeats. InterPro: the integrative protein sigNature
database. Nucleics Acids Research, 37(Sp. Iss. SI):D211{D215, JAN 2009.
[56] Janna E. Hutz, Aldi T. Kraja, Howard L. McLeod, and Michael A. Province.
Candid: a
exible method for prioritizing candidate genes for complex human
traits. Genetic Epidemiology, 32(8):779{790, 2008.
[57] JGraph. JGraph. http://www.jgraph.com/, 2010.
[58] Donald B. Johnson. Ecient algorithms for shortest paths in sparse networks.
J. ACM, 24(1):1{13, 1977.
[59] Bjorn H. Junker and Falk Schreiber, editors. Analysis of Biological Networks.
Wiley Series on Bioinformatics: Computational Techniques and Engineering.
John Wiley & Sons, Inc., Hoboken, New Jersey, USA, 2008.
[60] Minoru Kanehisa, Susumu Goto, Shuichi Kawashima, Yasushi Okuno, and
Masahiro Hattori. The kegg resource for deciphering the genome. Nucleics
Acids Research, 32(Database issue):D277{280, January 2004.
[61] Shaul Karni, Hermona Soreq, and Roded Sharan. A network-based method
for predicting disease-causing genes. Journal of Computational Biology,
16(2):181{189, 2009.
[62] Jeroen Kazius, Ross McGuire, and Roberta Bursi. Derivation and validation
of toxicophores for mutagenicity prediction. Journal of Medicinal Chemistry,
48(1):312{320, 2005.
[63] Evan T. Keller, Zheng Fu, and Meghan Brennan. The biology of a prostate
cancer metastasis suppressor protein: Raf kinase inhibitor protein. Journal
of Cellular Biochemestry, 94(2):273{278, FEB 1 2005.
[64] Sebastian Koehler, Sebastian Bauer, Denise Horn, and Peter N. Robinson.
Walking the interactome for prioritization of candidate disease genes. Amer-
ican Journal of Human Genetics, 82(4):949{958, APR 2008.
[65] Kakajan Komurov, Michael A. White, and Prahlad T. Ram. Use of Data-
Biased Random Walks on Graphs for the Retrieval of Context-Specic Networks
from Genomic Data. Plos Computational Biology, 6(8), AUG 2010.
[66] Jacques Lapointe, Chunde Li, John P. Higgins, Matt van de Rijn, Eric Bair,
Kelli Montgomery, Michelle Ferrari, Lars Egevad, Walter Rayford, Ulf Bergerheim,
Peter Ekman, Angelo M. DeMarzo, Robert Tibshirani, David Botstein,
Patrick O. Brown, James D. Brooks, and Jonathan R. Pollack. Gene
expression proling identies clinically relevant subtypes of prostate cancer.
Proceedings of the National Academy of Sciences of the United States
of America, 101(3):811{816, 2004.
[67] Long-Cheng Li, Hong Zhao, Hiroaki Shiina, Christopher J. Kane, and Rajvir
Dahiya. Pgdb: a curated and integrated database of genes related to the
prostate. Nucleics Acids Research, 31(1):291{293, 2003.
[68] Hsue-Chuan Liu, Carlos Roberto Arias, and Von-Wun Soo. Bioir: An approach
to public domain resource integration of human protein-protein interaction.
In The proceeding of the Seventh Asia Pacic Bioinformatics
Conference, 2009.
[69] Nuria. Lopez-Bigas and Christos. A. Ouzounis. Genome-wide identication
of genes likely to be involved in human genetic disease. Nucleic Acids Res,
32(10):3108{3114, 2004.
[70] Hong-Wu Ma and An-Ping Zeng. The connectivity structure, giant
strong component and centrality of metabolic networks. Bioinformatics,
19(11):1423{1430, July 2003.
[71] Xiaotu Ma, Hyunju Lee, Li Wang, and Fengzhu Sun. Cgi: a new approach
for prioritizing genes by combining gene expression and protein-protein interaction
data. Bioinformatics, 23(2):215{221, January 2007.
[72] Kartik M. Mani, Celine Lefebvre, Kai Wang, Wei Keat K. Lim, Katia Basso,
Riccardo Dalla-Favera, and Andrea Califano. A systems biology approach
to prediction of oncogenes and molecular perturbation targets in b-cell lymphomas.
Molecular systems biology, 4, February 2008.
[73] Yoshio Miki, Je Swensen, Donna Shattuck-Eidens, P. Andrew Futreal,
Keith Harshman, Sean Tavtigian, Qingyun Liu, Charles Cochran,
L. Michelle Bennett, Wei Ding, and Al Et. A strong candidate for the
breast and ovarian cancer susceptibility gene brca1. Science, 266(5182):66{
71, October 1994.
[74] Michael Mitzenmacher and Eli Upfal. Probability and computing: random-
ized algorithms and probabilistic analysis. Cambridge University Press, New
York, NY, USA, 2005.
[75] Julie L Morrison, Rainer Breitling, Desmond J Higham, and David R
Gilbert. Generank: using search engine technology for the analysis of microarray
experiments. BMC Bioinformatics, 6:233, 2005.
[76] R. A. Oldenburg, H. Meijers-Heijboer, C. J. Cornelisse, and P. Devilee.
Genetic susceptibility for breast cancer: How many more genes to be found?
Critical Reviews in Oncology Hematology, 63(2):125{149, AUG 2007.
[77] Joshua O'Madadhain, Danyel Fisher, Scott White, and Yan-Biao Boey. The
jung (java universal network/graph) framework. Technical report, School of
Information and Computer Science, University of California, Irvine, 2003.
[78] Arzucan Ozgur, Thuy Vu, Gunes Erkan, and Dragomir R. Radev. Identifying
gene-disease associations using centrality on a literature mined geneinteraction
network. In ISMB, pages 277{285, 2008.
[79] Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The
pagerank citation ranking: Bringing order to the web. Technical report,
Stanford Digital Library Technologies Project, 1998.
[80] Philipp Pagel, Stefan Kovac, Matthias Oesterheld, Barbara Brauner, Irmtraud
Dunger-Kaltenbach, Goar Frishman, Corinna Montrone, Pekka Mark,
Volker Stump
en, Hans-Werner Mewes, Andreas Ruepp, and Dmitrij Frishman.
The MIPS mammalian protein-protein interaction database. Bioin-
formatics, 21(6):832{834, MAR 15 2005.
[81] Serk In Park, Jing Zhang, Kacy A. Phihips, John C. Araujo, Amer M.
Najjar, Andrei Y. Volgin, Juri G. Gelovani, Sun-Jin Kim, Zhengxin Wang,
and Gary E. Gallick. Targeting src family kinases inhibits growth and lymph
node metastases of prostate cancer in an orthotopic nude mouse model.
Cancer Research, 68(9):3323{3333, MAY 1 2008.
[82] Rosario M. Piro, Ivan Molineris, Ugo Ala, Paolo Provero, and Ferdinando
Di Cunto. Candidate gene prioritization based on spatially mapped gene
expression: an application to XLMR. Bioinformatics, 26(18):I618{I624, SEP
2010. 9th European Conference on Computational Biology, Ghent, Belgium,
SEP 26-29, 2010.
[83] T. S. Keshava Prasad, Renu Goel, Kumaran Kandasamy, Shivakumar
Keerthikumar, Sameer Kumar, Suresh Mathivanan, Deepthi Telikicherla,
Rajesh Raju, Beema Shafreen, Abhilash Venugopal, Lavanya Balakrishnan,
Arivusudar Marimuthu, Sutopa Banerjee, Devi S. Somanathan, Aimy Sebastian,
Sandhya Rani, Somak Ray, C. J. Harrys Kishore, Sashi Kanth,
Mukhtar Ahmed, Manoj K. Kashyap, Riaz Mohmood, Y. L. Ramachandra,
V. Krishna, B. Abdul Rahiman, Sujatha Mohan, Prathibha Ranganathan,
Subhashri Ramabadran, Raghothama Chaerkady, and Akhilesh Pandey. Human
Protein Reference Database-2009 update. Nucleics Acids Research,
37(Sp. Iss. SI):D767{D772, JAN 2009.
[84] Carlos Prieto and Javier De Las Rivas. APID: Agile Protein Interaction
DataAnalyzer. Nucleics Acids Research, 34(Sp. Iss. SI):W298{W302, JUL 1
2006.
[85] Lukasz Salwinski, Christopher S. Miller, Adam J. Smith, Frank K. Pettit,
James U. Bowie, and David Eisenberg. The Database of Interacting Proteins:
2004 update. Nucleics Acids Research, 32(Sp. Iss. SI):D449{D451, JAN 1
2004.
[86] Erick Sayers and David Wheeler. Building Customized Data Pipelines Using
the Entrez Programming Utilities (eUtils). NCBI Coursework. NCBI, 2004.
[87] Andreas Schlicker, Thomas Lengauer, and Mario Albrecht. Improving disease
gene prioritization using the semantic similarity of Gene Ontology
terms. Bioinformatics, 26(18):i561{i567, SEP 2010. 9th European Conference
on Computational Biology, Ghent, Belgium, SEP 26-29, 2010.
[88] Andreas Schlicker, Joerg Rahnenfuehrer, Mario Albrecht, Thomas Lengauer,
and Francisco S. Domingues. GOTax: investigating biological processes and
biochemical activities along the taxonomic tree. Genome Biology, 8(3), 2007.
[89] Paul Shannon, Andrew Markiel, Owen Ozier, Nitin S. Baliga, Jonathan T.
Wang, Daniel Ramage, Nada Amin, Benno Schwikowski, and Trey Ideker.
Cytoscape: A software environment for integrated models of biomolecular
interaction networks. Genome Research, 13(11):2498{2504, NOV 2003. 3rd
International Conference on Systems Biology 2002, STOCKHOLM, SWEDEN,
DEC 13-15, 2002.
[90] Jitesh Shetty and Jafar Adibi. Discovering important nodes through graph
entropy the case of enron email database. In LinkKDD '05: Proceedings of
the 3rd international workshop on Link discovery, pages 74{81, New York,
NY, USA, August 2005. ACM.
[91] Gabor Simonyi. Graph entropy: A survey. Combinatorial Optimization,
20:399{441, 1995.
[92] Robert L. Strausberg, Andrew J.G. Simpson, and Richard Wooster.
Sequence-based cancer genomics: Progress, lessons and opportunities. Na-
ture Reviews Genetics, 4(6):409{418, JUN 2003.
[93] Yuxin Tang, Yunquan Zhang, and Hu Chen. A parallel shortest path algorithm
based on graph-partitioning and iterative correcting. In Proceedings of
the 2008 10th IEEE International Conference on High Performance Comput-
ing and Communications, Dalian, China, 2008, pages 155{161, Washington,
DC, USA, Sept. 2008. IEEE Computer Society.
[94] The Gene Ontology Consortium. The gene ontology project in 2008. Nucl.
Acids Res., 36(36 Database issue):440{444, January 2008.
[95] Yuanyuan Tian and Jignesh M. Patel. Tale: A tool for approximate large
graph matching. In International Conference on Data Engineering, pages
963{972, Los Alamitos, CA, USA, April 2008. IEEE Computer Society.
[96] Nicki Tin, Janet F. Kelso, Alan R. Powell, Hong Pan, Vladimir B. Bajic,
and Winston A. Hide. Integration of text- and data-mining using ontologies
successfully selects disease gene candidates. Nucleics Acids Research,
33(5):1544{1552, 2005.
[97] Nicki Tin, Miguel A. Andrade, and Carolina Perez-Iratxeta. Linking genes
to diseases: it's all in the data. Genome Medicine, 1(8):77, 2009.
[98] Olga Troyanskaya, Michael Cantor, Gavin Sherlock, Pat Brown, Trevor
Hastie, Robert Tibshirani, David Botstein, and Russ B. Altman. Missing
value estimation methods for dna microarrays. Bioinformatics, 17(6):520{
525, June 2001.
[99] Marc A. van Driel, Koen Cuelenaere, Patrick P.C.W. Kemmeren, Jack A.M.
Leunissen, and Han G. Brunner. A new web-based data mining tool for
the identication of candidate genes for human genetic disorders. European
Journal of Human Genetics, 11(1):57{63, JAN 2003.
[100] Merijn van Erp and Lambert Schomaker. Variants of the borda count method
for combining ranked classier hypotheses. In In the Seventh International
Workshop of Frontiers in Handwriting Recognition. 2000. AMSTERDAM
Learning Methodology Inspired by Human's Intelligence Bo Zhang, Dayong
Ding, AND Ling Zhang, pages 443{452, 2000.
[101] Carla Vandenberg, Xin-Yuan Guan, Daniel Vonho, Robert Jenkins,
Michael Bittner, Constance Grin, Olio Kallioniemi, Tapio Visakorpi, John
McGill, John Herath, Jonathan Epstein, Michael Sarosdy, Paul Meltzer,
and Jerey Trent. DNA-Sequence Amplication in Human Prostate-Cancer
Identied by Chromosome Microdissection - Potential Prognostic Implications.
Clinical Cancer Research, 1(1):11{18, JAN 1995.
[102] Bert Vogelstein and Kenneth W. Kinzler. Cancer genes and the pathways
they control. Nature medicine, 10(8):789{799, August 2004.
[103] Wikipedia. Gml. Online: http://en.wikipedia.org/wiki/XGMML, 2010. Accesed
26-Nov-2010.
[104] Wikipedia. Web service. http://en.wikipedia.org/wiki/Web service, 2010.
Accesed 26-Nov-2010.
[105] Wikipedia. Xgmml. Online: http://en.wikipedia.org/wiki/XGMML, 2010.
Accesed 26-Nov-2010.
[106] Chuu-Yun A. Wong, Hada Wuriyanghan, Yan Xie, Ming-Fong Lin, Peter W.
Abel, and Yaping Tu. Epigenetic Regulation of Phosphatidylinositol 3,4,5-
Triphosphate-dependent Rac Exchanger 1 Gene Expression in Prostate Cancer
Cells. Journal of Biological Chemestry, 286(29):25813{25822, JUL 22
2011.
[107] Richard Wooster, Graham Bignell, Jonathan Lancaster, Sally Swift, Sheila
Seal, Jonathan Mangion, Nadine Collins, Simon Gregory, Curtis Gumbs, and
Gos Micklem. Identication of the breast cancer susceptibility gene brca2.
Nature, 378(6559):789{792, 1995.
[108] Xuebing Wu, Rui Jiang, Michael Q. Zhang, and Shao Li. Network-based
global inference of human disease genes. Molecular Systems Biology, 4, MAY
2008.
[109] Hsiang-Yuan Yeh, Shih-Wu Cheng, Yu-Chun Lin, Cheng-Yu Yeh, Shih-Fang
Lin, and Von-Wun Soo. Identifying signicant genetic regulatory networks
in the prostate cancer from microarray data based on transcription factor
analysis and conditional independency. BMC Medical Genomics, 2, DEC 21
2009.
[110] yWorks GmbH. yFILES Developer's Guide. 2004. Accesed 26-Nov-2010.
[111] Xianghong Zhou, Ming-Chih J. Kao, and Wing-HungWong. Transitive functional
annotation by shortest-path analysis of gene expression data. Proceed-
ings of the National Academy of Sciences of the United States of America,
99(20):12783{12788, OCT 1 2002.