Research on Acceleration Method of Graph Data Processing with Correlation Analysis
Authors: Zheng Ce (郑策); You Jiali (尤佳莉)
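Abstract:
Video-service scenarios hold massive amounts of relational data about movie resources, and existing graph-based recommendation algorithms must map this relational data into a graph structure before processing it. Because the resulting graphs are large, the semantic matching algorithms used in traditional graph data processing methods become less efficient and incur higher communication overhead. To address this, this paper incorporates correlation analysis and proposes a semantic-matching-based scheme for accelerating graph data processing: a subgraph matching acceleration method for a sequence of query graphs in a single large graph. The method uses time-aware correlation to quickly locate the range in which the useful information lies within the massive data, thereby alleviating low lookup efficiency and high communication overhead. The method is also analyzed experimentally to verify its effectiveness.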
Keywords: correlation analysis; graph data; subgraph matching
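Foundation: Strategic Priority Research Program of the Chinese Academy of Sciences, "SEANET Technology Standardization Research and System Development" (No. XDC02010701); Youth Innovation Promotion Association of the Chinese Academy of Sciences (No. Y529111601)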
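The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea it describes: use a time constraint to shrink the data graph first, then run subgraph matching on the reduced graph. It is not the authors' method. The function names (`time_window_subgraph`, `match_subgraph`), the `(user, movie, timestamp)` edge format, and the sample data are all hypothetical, and the matcher is a plain backtracking search chosen for brevity.

```python
from collections import defaultdict


def time_window_subgraph(edges, t_start, t_end):
    """Build an undirected adjacency map from edges with t_start <= timestamp <= t_end."""
    adj = defaultdict(set)
    for u, v, t in edges:
        if t_start <= t <= t_end:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def match_subgraph(adj, labels, q_edges, q_labels):
    """Return all label-preserving embeddings of the query graph in adj (backtracking)."""
    q_adj = defaultdict(set)
    for a, b in q_edges:
        q_adj[a].add(b)
        q_adj[b].add(a)
    order = sorted(q_labels)  # fixed visiting order of query vertices
    results = []

    def extend(mapping):
        if len(mapping) == len(order):
            results.append(dict(mapping))
            return
        q = order[len(mapping)]
        for cand in list(adj):
            # skip vertices already used or carrying the wrong label
            if cand in mapping.values() or labels[cand] != q_labels[q]:
                continue
            # every already-mapped query neighbour must be adjacent in the data graph
            if all(mapping[n] in adj[cand] for n in q_adj[q] if n in mapping):
                mapping[q] = cand
                extend(mapping)
                del mapping[q]

    extend({})
    return results


# Hypothetical viewing records: (user, movie, timestamp)
edges = [("u1", "m1", 1), ("u1", "m2", 2), ("u2", "m1", 8),
         ("u2", "m3", 9), ("u3", "m3", 9)]
labels = {"u1": "user", "u2": "user", "u3": "user",
          "m1": "movie", "m2": "movie", "m3": "movie"}

# Query: two distinct users (a, c) who watched the same movie (b)
q_labels = {"a": "user", "b": "movie", "c": "user"}
q_edges = [("a", "b"), ("c", "b")]

recent = time_window_subgraph(edges, 8, 10)        # time-restricted search range
matches = match_subgraph(recent, labels, q_edges, q_labels)
all_matches = match_subgraph(time_window_subgraph(edges, 1, 10),
                             labels, q_edges, q_labels)
```

In this toy data, restricting the window to timestamps 8 through 10 leaves fewer vertices and edges for the matcher to explore than the full range does, which is the kind of search-range reduction the abstract attributes to time-aware correlation.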