Abstract
The multicore evolution has stimulated renewed interest in scaling up applications on shared-memory multiprocessors, significantly improving the scalability of many applications. But this scalability is limited to a single node; programmers therefore still have to redesign applications to scale out over multiple nodes. This paper revisits the design and implementation of distributed shared memory (DSM) as a way to scale out applications optimized for non-uniform memory access (NUMA) architectures over a well-connected cluster. It presents MAGI, an efficient DSM system that provides a transparent shared address space with scalable performance on a cluster with fast network interfaces. MAGI is unique in that it presents a NUMA abstraction to fully harness the multicore resources in each node through hierarchical synchronization and memory management. MAGI also exploits the memory access patterns of big-data applications and leverages a set of optimizations for remote direct memory access (RDMA) to reduce the number of page faults and the cost of the coherence protocol. MAGI has been implemented as a user-space library with pthread-compatible interfaces and can run existing multithreaded applications with minimal modifications. We deployed MAGI over an 8-node RDMA-enabled cluster. Experimental evaluation shows that MAGI achieves up to 9.25× speedup compared with an unoptimized implementation, leading to scalable performance for large-scale data-intensive applications.
Publication date
January 11, 2019 (date of first online posting on the China Journal Net platform; this does not represent the paper's publication date)