Publications

Journal Papers

  1. Jianmin Wang, Shaoxu Song, Xiaochen Zhu, Xuemin Lin, Jiaguang Sun. Efficient Recovery of Missing Events. IEEE Transactions on Knowledge and Data Engineering, TKDE 28(11): 2943-2957 (2016) [paper] [bibtex]
  2. Fei Gao, Shaoxu Song, Lei Chen, Jianmin Wang. Efficient Set-Correlation Operator Inside Databases. Journal of Computer Science and Technology 31(4): 683-701 (2016) [paper] [bibtex]
  3. Xiangdong Huang, Jianmin Wang, Yu Zhong, Shaoxu Song, Philip S. Yu. Optimizing data partition for scaling out NoSQL cluster. Concurrency and Computation: Practice and Experience 27(18): 5793-5809 (2015) [paper] [bibtex]
  4. Shaoxu Song, Lei Chen, Hong Cheng. Efficient Determination of Distance Thresholds for Differential Dependencies. IEEE Transactions on Knowledge and Data Engineering, TKDE 26(9): 2179-2192 (2014) [paper] [bibtex]
  5. Shaoxu Song, Han Zhu, Lei Chen. Probabilistic correlation-based similarity measure on text records. Information Sciences 289: 8-24 (2014) [paper] [bibtex]
  6. Shaoxu Song, Lei Chen, Philip S. Yu. Comparable Dependencies over Heterogeneous Data. The VLDB Journal, VLDBJ 22(2): 253-274 (2013) [paper] [bibtex]
  7. Shaoxu Song, Lei Chen. Efficient Discovery of Similarity Constraints for Matching Dependencies. Data & Knowledge Engineering, DKE 87: 146-166 (2013) [paper] [bibtex]
  8. Shaoxu Song, Lei Chen. Indexing Dataspaces with Partitions. World Wide Web Journal, WWWJ 16(2): 141-170 (2013) [paper] [bibtex]
  9. Shaoxu Song, Lei Chen. Differential Dependencies: Reasoning and Discovery. ACM Transactions on Database Systems, TODS 36(3): 16 (2011) [paper] [bibtex]
  10. Shaoxu Song, Lei Chen, Mingxuan Yuan. Materialization and Decomposition of Dataspaces for Efficient Search. IEEE Transactions on Knowledge and Data Engineering, TKDE 23(12): 1872-1887 (2011) [paper] [bibtex]
  11. Shaoxu Song, Lei Chen, Jeffrey Xu Yu. Answering Frequent Probabilistic Inference Queries in Databases. IEEE Transactions on Knowledge and Data Engineering, TKDE 23(4): 512-526 (2011) [paper] [bibtex]

Conference Papers

  1. Aoqian Zhang, Shaoxu Song, Jianmin Wang. Sequential Data Cleaning: A Statistical Approach. ACM SIGMOD International Conference on Management of Data, SIGMOD 2016: 909-924 [paper] [bibtex]
  2. Shaoxu Song, Han Zhu, Jianmin Wang. Constraint-Variance Tolerant Data Repairing. ACM SIGMOD International Conference on Management of Data, SIGMOD 2016: 877-892 [paper] [bibtex]
  3. Shaoxu Song, Yue Cao, Jianmin Wang. Cleaning Timestamps with Temporal Constraints. Proceedings of the VLDB Endowment, PVLDB 9(10): 708-719 (2016) [paper] [bibtex]
  4. Weiguo Zheng, Lei Zou, Wei Peng, Xifeng Yan, Shaoxu Song, Dongyan Zhao. Semantic SPARQL Similarity Search Over RDF Knowledge Graphs. Proceedings of the VLDB Endowment, PVLDB 9(11): 840-851 (2016)
  5. Shaoxu Song, Chunping Li, Xiaoquan Zhang. Turn Waste into Wealth: On Simultaneous Clustering and Cleaning over Dirty Data. ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD, 2015: [paper] [bibtex]
  6. Shaoxu Song, Aoqian Zhang, Lei Chen, Jianmin Wang. Enriching Data Imputation with Extensive Similarity Neighbors. Proceedings of the VLDB Endowment, PVLDB 8(11): 1286-1297 (2015) [paper] [bibtex]
  7. Shaoxu Song, Aoqian Zhang, Jianmin Wang, Philip S. Yu. SCREEN: Stream Data Cleaning under Speed Constraints. ACM SIGMOD International Conference on Management of Data, SIGMOD 2015: 827-841 [paper] [bibtex]
  8. Weiguo Zheng, Lei Zou, Xiang Lian, Jeffrey Xu Yu, Shaoxu Song, Dongyan Zhao. How to Build Templates for RDF Question/Answering: An Uncertain Graph Similarity Join Approach. ACM SIGMOD International Conference on Management of Data, SIGMOD, 2015: 1809-1824
  9. Jianmin Wang, Shaoxu Song, Xuemin Lin, Xiaochen Zhu, Jian Pei. Cleaning Structured Event Logs: A Graph Repair Approach. IEEE International Conference on Data Engineering, ICDE 2015: 30-41 [paper] [bibtex]
  10. Shaoxu Song, Hong Cheng, Jeffrey Xu Yu, Lei Chen. Repairing Vertex Labels under Neighborhood Constraints. Proceedings of the VLDB Endowment, PVLDB 7(11): 987-998 (2014) [paper] [bibtex]
  11. Shaoxu Song, Lei Chen, Hong Cheng. On Concise Set of Relative Candidate Keys. Proceedings of the VLDB Endowment, PVLDB 7(12): 1179-1190 (2014) [paper] [bibtex]
  12. Xiaochen Zhu, Shaoxu Song, Xiang Lian, Jianmin Wang, Lei Zou. Matching Heterogeneous Event Data. ACM SIGMOD International Conference on Management of Data, SIGMOD 2014: 1211-1222 [paper] [bibtex]
  13. Xiaochen Zhu, Shaoxu Song, Jianmin Wang, Philip S. Yu, Jiaguang Sun. Matching Heterogeneous Events with Patterns. IEEE International Conference on Data Engineering, ICDE 2014: 376-387 [paper] [bibtex]
  14. Jian Wu, Chunping Li, Yishu Miao, Shaoxu Song, Li Li, Qiang Ding. Context-aware reasoning middle ware applied in the mobile environment. International Conference on Machine Learning and Cybernetics, ICMLC 2013: 1829-1835
  15. Jianmin Wang, Shaoxu Song, Xiaochen Zhu, Xuemin Lin. Efficient Recovery of Missing Events. Proceedings of the VLDB Endowment, PVLDB 6(10): 841-852 (2013) [paper] [bibtex]
  16. Shaoxu Song, Lei Chen, Hong Cheng. Parameter-Free Determination of Distance Thresholds for Metric Distance Constraints. IEEE International Conference on Data Engineering, ICDE 2012: 846-857 [paper] [bibtex]
  17. Shaoxu Song, Lei Chen, Philip S. Yu. On Data Dependencies in Dataspaces. IEEE International Conference on Data Engineering, ICDE 2011: 470-481 [paper] [bibtex]
  18. Xiang Lian, Lei Chen, Shaoxu Song. Consistent Query Answers in Inconsistent Probabilistic Databases. ACM SIGMOD International Conference on Management of Data, SIGMOD 2010: 303-314
  19. Shaoxu Song, Lei Chen. Efficient Set-Correlation Operator inside Databases. ACM Conference on Information and Knowledge Management, CIKM 2010: 139-148 [paper] [bibtex]
  20. Shaoxu Song, Lei Chen, Jeffrey Xu Yu. Extending Matching Rules with Conditions. 8th International Workshop on Quality in Databases, QDB 2010: 7 [paper] [bibtex]
  21. Shaoxu Song, Lei Chen. Discovering Matching Dependencies. ACM Conference on Information and Knowledge Management, CIKM 2009: 1421-1424 [paper] [bibtex]
  22. Shaoxu Song, Lei Chen. Probabilistic Correlation-based Similarity Measure of Unstructured Records. ACM Conference on Information and Knowledge Management, CIKM 2007: 967-970 [paper] [bibtex]
  23. Shaoxu Song, Lei Chen. Similarity Joins of Text with Incomplete Information Formats. International Conference on Database Systems for Advanced Applications, DASFAA 2007: 313-324 [paper] [bibtex]
  24. Shaoxu Song, Chunping Li. Improved ROCK for Text Clustering Using Asymmetric Proximity. Theory and Practice of Computer Science, SOFSEM 2006, 3831: 501-510 [bibtex]
  25. Shaoxu Song, Chunping Li. Semantic Correlation Network Based Text Clustering. Advances in Artificial Intelligence, AI 2005, 3809: 604-613 [bibtex]
  26. Shaoxu Song, Jian Zhang, Chunping Li. Concept Chain Based Text Clustering. Computational Intelligence and Security, CIS 2005, 3801: 713-720 [bibtex]
  27. Shaoxu Song, Chunping Li. TCUAP: A Novel Approach of Text Clustering Using Asymmetric Proximity. Indian International Conference on Artificial Intelligence, IICAI 2005: 676-685 [bibtex]