简介:Thecommonapproachtofindco-regulatedgenesistoclustergenesbasedongeneexpression.However,duetothelimitedinformationpresentinanydataset,genesinthesameclustermightbeco-expressedbutnotnecessarilyco-regulated.Inthispaper,weproposetointegrateknowntranscriptionfactorbindingsiteinformationandgeneexpressiondataintoasingleclusteringscheme.Thisschemewillfindclustersofco-regulatedgenesthatarenotonlyexpressedsimilarlyunderthemeasuredconditions,butalsosharearegulatorystructurethatmayexplaintheircommonregulation.Wedemonstratetheutilityofthisapproachonamicroarraydatasetofyeastgrownunderdifferentnutrientandoxygenlimitations.Ourintegratedclusteringmethodnotonlyunravelsmanyregulatorymodulesthatareconsistentwithcurrentbiologicalknowledge,butalsoprovidesamoreprofoundunderstandingoftheunderlyingprocess.Theaddedvalueofourapproach,comparedwiththeclusteringsolelybasedongeneexpression,isitsabilitytouncoverclustersofgenesthatareinvolvedinmorespecificbiologicalprocessesandareevidentlyregulatedbyasetoftranscriptionfactors.