Education background
Bachelor of Computer Science, Tsinghua University, Beijing, China, 1991;
Master of Computer Science, Tsinghua University, Beijing, China, 1993;
Ph.D. in Computer Science, Tsinghua University, Beijing, China, 2006.
Social service
Department of Computer Science and Technology, Tsinghua University: Vice Dean (2007-);
Instructional Sub-Committee of Professional Teaching of Computer Science and Technology in Colleges and Universities, Ministry of Education: Expert (2009-2010).
Areas of Research Interests/ Research Projects
Database Management Systems, Data Security and Privacy Preservation
Information Retrieval
Joint Research Project: Research and Development of a Large Column-Oriented DBMS (2010-2012);
National Natural Science Foundation of China: Research on Storage Technology and Performance Evaluation of Flash Database (2009-2011);
National 863 High-Tech Program: Research on Keyword Search over Massive Heterogeneous Data (2007-2010);
The National Basic Research Program of China (The 973 Program): Information Representation and Knowledge Discovery of Visual Media Data (2006-2011);
National Natural Science Foundation of China: Research on Key Issues of Native XML Database Management Systems (2006-2008).
Research Status
I am engaging in the research and development of database-related work. I have series of research outputs and have published high quality academic papers in native XML database management system, keyword search over heterogeneous data, data security and privacy preservation, and new types of database management systems such as a column-oriented database.
In the area of native XML database management system, I have proposed encoding methods of XML data, methods of semantic cache, and structure-and-twig join algorithms. I have also proposed an "implicit condition homomorphism" method, which effectively resolves the problems of storage, query processing and homomorphism judgment of the XML data, and builds the general framework of query processing and query optimization for native XML database management system.
In the area of keyword search over heterogeneous data, I have proposed a graph model for heterogeneous data, built the indexes and ranking mechanism for keyword search over heterogeneous data, effectively resolved the problem of the unification and the query evaluation of heterogeneous data, and formed a keyword search system with auto-completion, fuzzy match and type-ahead features. At the same time, I have worked together with foreign researchers on column-oriented database and its query optimization technologies such as delay instantiation, block iteration, specific column compression and invisible join. I am currently developing a large-scale column-oriented database management system-HUABASE.
Combining various areas of database, data mining and information retrieval, my research achievements have broadened the field of database research, which also attracted wide attentions from the academic society. My published papers have been cited more than 180 times (130+ by others) in total. Two papers published in ACM SIGMOD and ACM CIKM conferences respectively have been cited more than 80 times from more than 10 countries in two years. There are more than 40 citations from top conferences and journals in database research, such as ACM SIGMOD, VLDB, IEEE ICDE, EDBT and ACM TODS. Researchers who have cited my papers are mainly from leading universities such as MIT, UIUC, DUKE, Wisconsin, Arizona State University, Ohio State University, NUS, MPI, etc.
Honors And Awards
First YOCSEF Young Scientists Award by China Computer Federation (2010);
Beijing Educational Achievement Award, First Class-High-Level and Innovative Doctoral Training Program of Computer Science Major (2009);
HP Labs Innovation Research Award by HP Labs-Efficient Scheme-aware Keyword Search on Heterogeneous Data (2008).
Academic Achievement
[1] Guoliang Li, Jianhua Feng, Xiaofang Zhou, Jianyong Wang. Providing Built-in Keyword Search Capabilities in RDBMS. Accepted by The VLDB Journal, 2010.
[2] Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou. KEMB: A Keyword-Based XML Message Broker. Accepted by IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE), 2010.
[3] Guoliang Li, Shengyue Ji, Chen Li, Jiannan Wang, Jianhua Feng: Efficient fuzzy type-ahead search in TASTIER. Proc. 26th International Conference on Data Engineering (ICDE 2010), Long Beach, California, USA, IEEE 2010, pp. 1105-1108.
[4] Jianhua Feng, Guoliang Li, Jianyong Wang, Lizhu Zhou: Finding and ranking compact connected trees for effective keyword proximity search in XML documents. Information Systems, vol. 35, no. 2, pp. 186-203, 2010.
[5] Guoliang Li, Jianhua Feng, Jianyong Wang: Structure-aware indexing for keyword search in databases. Proc. 18th ACM Conference on Information and Knowledge Management (CIKM 2009), Hong Kong, China, ACM 2009, PP. 1453-1456.
[6] Guoliang Li, Xiaofang Zhou, Jianhua Feng, Jianyong Wang: Progressive Keyword Search in Relational Databases. Proc. 25th International Conference on Data Engineering (ICDE 2009), Shanghai, China, IEEE 2009, pp. 1183-1186.
[7] Yang Ye, Yu Zheng, Yukun Chen, Jianhua Feng, Xing Xie: Mining Individual Life Pattern Based on Location History. Proc. 10th International Conference on Mobile Data Management (MDM 2009), Taipei, Taiwan, IEEE 2009, pp. 1-10.
[8] Guoliang Li, Shengyue Ji, Chen Li, Jianhua Feng: Efficient type-ahead search on relational data: a TASTIER approach. Proc. ACM SIGMOD International Conference on Management of Data (SIGMOD 2009), Providence, Rhode Island, USA, ACM 2009, pp. 695-706.
[9] Shengyue Ji, Guoliang Li, Chen Li, Jianhua Feng: Efficient interactive fuzzy keyword search. Proc. 18th International Conference on World Wide Web (WWW 2009), Madrid, Spain, ACM 2009, pp. 371-380.
[10] Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou: Incremental sequence-based frequent query pattern mining from XML queries. Data Mining and Knowledge Discovery (DMKD), vol. 18, no. 3, pp. 472-516, 2009.
[11] Guoliang Li, Chen Li, Jianhua Feng, Lizhu Zhou: SAIL: Structure-aware indexing for effective and progressive top-k keyword search over XML documents. Information Sciences, vol. 179, no. 21, pp. 3745-3762, 2009.
[12] Zhiping Zeng, Anthony K. H. Tung, Jianyong Wang, Jianhua Feng, Lizhu Zhou: Comparing Stars: On Approximating Graph Edit Distance. Proceedings of the VLDB Endowment (PVLDB), vol. 2, no. 1, pp. 25-36, 2009.
[13] Guoliang Li, Jianhua Feng, Lizhu Zhou: Retune: Retrieving and Materializing Tuple Units for Effective Keyword Search over Relational Databases. Proc. 27th International Conference on Conceptual Modeling (ER 2008), Barcelona, Spain, 2008, Lecture Notes in Computer Science, vol. 5231, pp. 469-483.
[14] Guoliang Li, Beng Chin Ooi, Jianhua Feng, Jianyong Wang, Lizhu Zhou: EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. Proc. ACM SIGMOD International Conference on Management of Data (SIGMOD 2008), Vancouver, BC, Canada, ACM 2008, pp. 903-914.
[15] Guoliang Li, Xuhui Liu, Jianhua Feng, Lizhu Zhou: Efficient Similarity Search for Tree-Structured Data. Proc. 20th International Conference on Scientific and Statistical Database Management (SSDBM 2008), Hong Kong, China, 2008, Lecture Notes in Computer Science, vol. 5069, pp. 131-149.
[16] Jianhua Feng, Guoliang Li, Na Ta: A Semantic Cache Framework for Secure XML Queries. Journal of Computer Science and Technology (JCST), vol. 23, no. 6, pp. 988-997, 2008.
[17] Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou: Effective keyword search for valuable lcas over xml documents. Proc. 16th ACM Conference on Information and Knowledge Management (CIKM 2007), Lisbon, Portugal, ACM 2007, pp. 31-40.
[18] Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua Feng, Mohammed Javeed Zaki: Xproj: a framework for projected structural clustering of xml documents. Proc. 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD 2007), San Jose, California, USA, ACM 2007, pp. 46-55.
[19] Yi Wang, Shi-Xia Liu, Jianhua Feng, Lizhu Zhou: Mining Naturally Smooth Evolution of Clusters from Dynamic Data. Proc. 7th SIAM International Conference on Data Mining (SDM 2007), Minneapolis, Minnesota, USA, SIAM 2007.
[20] Jianhua Feng, Qian Qian, Jianyong Wang, Li-Zhu Zhou: Efficient Mining of Frequent Closed XML Query Pattern. Journal of Computer Science and Technology (JCST), vol. 22, no. 5, pp. 725-735, 2007.
[21] Jianhua Feng, Yuguo Liao, Yong Zhang: HCH for Checking Containment of XPath Fragment. Journal of Computer Science and Technology (JCST), vol. 22, no. 5, pp. 736-748, 2007.
[22] Yi Wang, Lizhu Zhou, Jianhua Feng, Jianyong Wang, Zhi-Qiang Liu: Mining Complex Time-Series Data by Learning Markovian Models. Proc. 6th IEEE International Conference on Data Mining (ICDM 2006), Hong Kong, China, IEEE 2006, pp. 1136-1140.
[23] Guoliang Li, Jianhua Feng, Jianyong Wang, Yong Zhang, Lizhu Zhou: Incremental Mining of Frequent Query Patterns from XML Queries for Caching. Proc. 6th IEEE International Conference on Data Mining (ICDM 2006), Hong Kong, China, IEEE 2006, pp. 350-361.