Zhicheng Dou (窦志成)

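Zhicheng Dou received his bachelor's and Ph.D. degrees from Nankai University in 2003 and 2008, respectively. After graduating, he joined Microsoft Research Asia as a researcher. In September 2014 he joined Renmin University of China as a specially appointed researcher and associate professor. His main research interests are information retrieval, data mining, information extraction, and machine learning. He has published more than 20 papers at leading international conferences and in academic journals (such as SIGIR, WWW, CIKM, WSDM, EMNLP, and IEEE TKDE). He has served on the program committees of several international conferences (such as SIGIR, WWW, KDD, WSDM, and CIKM) and is a member of the Steering Committee of the Asia Information Retrieval Society.
Beyond research, he enjoys turning research ideas into working systems. While at Microsoft Research Asia, he took part in the development of several projects, including WebStudio, ProjectQ, and WebSensor. He holds multiple patents, and several technologies he helped develop have been successfully transferred into Microsoft products (such as Bing search and Office).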

Homepage: http://www.playbigdata.com/dou/

Email: dou at ruc.edu.cn

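Detailed Profile

Education

1999-2008 Nankai University, bachelor's, master's, and Ph.D.

Work Experience

2008-2014 Microsoft Research, Researcher

2014-present Renmin University of China, Associate Professor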

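Research Interests

Information retrieval, data mining, big data, information extraction, machine learning, text analysis, natural language processing

Courses Taught

Intelligent Information Retrieval

Web Development Technologies

Introduction to Research Methods in Computer Science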

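Student Training and Requirements

He has extensive experience training and mentoring students: during more than six years at Microsoft Research Asia, he mentored more than 20 interns.

Application requirements:

Purpose of graduate study: you want to build your project development or research skills during your master's or doctoral studies, laying a solid foundation for future work or further study, rather than merely obtaining a graduate diploma or a master's or doctoral degree;

Attitude: down-to-earth, diligent, and responsible, and able to take seriously the projects or research topics assigned by the advisor;

Foundation: reasonable hands-on programming and development ability, the ability to learn independently, and the ability to implement research ideas in code;

Student training:

Skills: out of a sense of responsibility toward students, he trains them in both system development (programming, system design, project management) and scientific research (paper reading, literature survey, problem analysis, method design, experimental analysis, paper writing, etc.), and tailors a training plan to each student's strengths and career plans;

Personal qualities: cultivating a sound work attitude, practicing verbal communication skills, and strengthening teamwork;

Students interested in pursuing a master's or doctoral degree are welcome to apply!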

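Research Projects

Current projects:

(1) Web search ranking algorithms, personalized search, and search result diversification

(2) Big data analysis of Web text; the current events probe system: http://websensor.playbigdata.com/fss3/

(3) Information extraction algorithms

(4) Renmin University of China campus search and analysis engine

(5) Web data mining

(6) Query intent analysis

(7) Information retrieval evaluation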

Publications

2016

•Zhicheng Dou, Zhengbao Jiang, Sha Hu, Ji-Rong Wen, Ruihua Song: Automatically Mining Facets for Queries from Their Search Results. IEEE Trans. Knowl. Data Eng. (TKDE) 28(2):385-397 (2016)

2015

•Zhongqi Lu, Zhicheng Dou, Xing Xie, Jianxun Lian, Qiang Yang. Content-based Collaborative Filtering for News Topic Recommendation. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2015), Austin, Texas, USA, Jan 25-29, 2015.

•Sha Hu, Zhicheng Dou, Xiaojie Wang, Tetsuya Sakai, and Ji-Rong Wen. 2015. Search Result Diversification Based on Hierarchical Intents. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM '15). ACM, New York, NY, USA, 63-72. DOI=http://dx.doi.org/10.1145/2806416.2806455

•Sha Hu, Zhicheng Dou, Xiao-Jie Wang, Ji-Rong Wen: Search Result Diversification Based on Query Facets. J. Comput. Sci. Technol. (JCST) 30(4):888-901 (2015)

2014

•Yiqun Liu, Ruihua Song, Min Zhang, Zhicheng Dou, Takehiro Yamamoto, Makoto Kato, Hiroaki Ohshima, Ke Zhou. Overview of the NTCIR-11 IMine Task. Proceedings of the 11th NTCIR conference.

•Fei Chen, Yiqun Liu, Zhicheng Dou, Keyang Xu, Yujie Cao, Min Zhang, and Shaoping Ma, Revisiting the Evaluation of Diversified Search Evaluation Metrics with User Preferences. Proceedings of the 10th Asia Information Retrieval Society Conference (AIRS 2014)

•Jingfei Li, Dawei Song, Peng Zhang, Ji-Rong Wen, and Zhicheng Dou, Personalizing Web Search Results Based on Subspace Projection, Proceedings of the 10th Asia Information Retrieval Society Conference (AIRS 2014)

•Shu Tang, Zhicheng Dou, Xing Xie, and Jun He, Detecting and Monitoring Dynamic Content Blocks of a Web Page by Merging its Historical Versions, in SIGIR 2014 Workshop on Temporal, Social and Spatially-aware Information Access (TAIA2014), 2014

2013

•Xiao Ding, Zhicheng Dou, Bing Qin, Ting Liu, and Ji-Rong Wen, Improving Web Search Ranking by Incorporating Structured Annotation of Queries, in Proceedings of EMNLP 2013, pages 468-478, October 2013

•Kosetsu Tsukuda, Tetsuya Sakai, Zhicheng Dou, and Katsumi Tanaka, Estimating Intent Types for Search Result Diversification, in Information Retrieval Technology, pages 25-37, Springer Berlin Heidelberg, 2013

•Ke Zhou, Tetsuya Sakai, Mounia Lalmas, Zhicheng Dou, and Joemon M. Jose, Evaluating Heterogeneous Information Access, in ACM SIGIR 2013 Workshop on Modeling User Behavior for Information Access Evaluation, 2013

•Qinglei Wang, Yanan Qian, Ruihua Song, Zhicheng Dou, Fan Zhang, Tetsuya Sakai, and Qinghua Zheng, Mining Subtopics from Text Fragments for a Web Query, in Information Retrieval 16(4) pages 484-503, 2013

•Tetsuya Sakai and Zhicheng Dou, Summaries, Ranked Retrieval and Sessions: A Unified Framework for Information Access Evaluation, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 473-482, ACM, 2013 (The Best Paper Runner-Up Award)

•Tetsuya Sakai, Zhicheng Dou, and Charles Clarke, The Impact of Intent Selection on Diversified Search Evaluation, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 921-924, ACM, 2013

•Tetsuya Sakai, Zhicheng Dou, Takehiro Yamamoto, Yiqun Liu, Min Zhang, Makoto Kato, Ruihua Song, and Mayu Iwata, Summary of the NTCIR-10 INTENT-2 Task: Subtopic Mining and Search Result Diversification, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 761-764, ACM, 2013

•Tetsuya Sakai, Zhicheng Dou, Takehiro Yamamoto, Yiqun Liu, Min Zhang, and Ruihua Song, Overview of the NTCIR-10 INTENT-2 Task, in Proceedings of the 10th NTCIR Conference, pages 94-123, June 18-21, 2013

•Kosetsu Tsukuda, Zhicheng Dou, and Tetsuya Sakai, Microsoft Research Asia at the NTCIR-10 Intent Task, in Proceedings of the 10th NTCIR Conference, June 2013

•Kazuya Narita, Tetsuya Sakai, Zhicheng Dou, and Young-In Song, MSRA at NTCIR-10 1CLICK-2, in Proceedings of the 10th NTCIR Conference, 2013

2012

•Tetsuya Sakai, Zhicheng Dou, Ruihua Song, and Noriko Kando, The Reusability of a Diversified Search Test Collection, in Information Retrieval Technology (AIRS 2012), pages 26-38, Springer Berlin Heidelberg, 20 December 2012 (The Best Paper Award)

2011

•Zhicheng Dou, Sha Hu, Kun Chen, Ruihua Song, and Ji-Rong Wen, Multi-dimensional Search Result Diversification, in Proceedings of the fourth ACM international conference on Web search and data mining (WSDM 2011), pages 475-484, ACM, February 2011

•Zhicheng Dou, Finding Dimensions for Queries, in Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM 2011), pages 1311-1320, ACM, 2011

•Jialong Han, Qinglei Wang, Naoki Orii, Zhicheng Dou, Tetsuya Sakai, and Ruihua Song, Microsoft Research Asia at the NTCIR-9 Intent Task, in Proceedings of the 9th NTCIR Conference (NTCIR-9), National Institute of Informatics, 2011

2010

•Tetsuya Sakai, Nick Craswell, Ruihua Song, Stephen Robertson, Zhicheng Dou, and Chin-Yew Lin, Simple Evaluation Metrics for Diversified Search Results, in Proceedings of the Third International Workshop on Evaluating Information Access (EVIA), Volume 26, pages 27, National Institute of Informatics, June 2010

•Ruihua Song, Zhicheng Dou, Hsiao-Wuen Hon, and Yong Yu, Learning Query Ambiguity Models by Using Search Logs, Journal of Computer Science and Technology, 25(4), pages 728-738, Springer, July 2010

2009

•Zhicheng Dou, Kun Chen, Ruihua Song, Yunxiao Ma, Shuming Shi, and Ji-Rong Wen, Microsoft Research Asia at the Web Track of TREC 2009, in Proceedings of TREC 2009, November 2009

•Ji-Rong Wen, Zhicheng Dou, and Ruihua Song, Personalized Web Search, in Encyclopedia of Database Systems, pages 2099-2103, Springer-Verlag, New York, USA, September 2009

•Zhicheng Dou, Ruihua Song, Jian-Yun Nie, and Ji-Rong Wen, Using Anchor Texts with Their Hyperlink Structure for Web Search, in Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2009), pages 227-234, ACM, July 2009

•Zhicheng Dou, Ruihua Song, Ji-Rong Wen, and Xiaojie Yuan, Evaluating the Effectiveness of Personalized Web Search, in IEEE Transactions on Knowledge and Data Engineering (TKDE), 21(8), pages 1178-1190, IEEE Computer Society Digital Library, Aug. 2009

2008

•Zhicheng Dou, Ruihua Song, Xiaojie Yuan, and Ji-Rong Wen, Are click-through data adequate for learning web search rankings?, in Proceedings of the 17th ACM conference on Information and knowledge management (CIKM 2008), pages 73-82, ACM, New York, NY, USA, 2008

•Zhicheng Dou, Xiaojie Yuan, and Songbai He, Analysis of Query Repetition in a Large-scale Chinese Search Log (大规模中文搜索日志中查询重复性分析), in Computer Engineering (In Chinese), Volume 21, 2008

•Xiaojie Yuan, Zhicheng Dou, Lu Zhang, and Fang Liu, Automatic User Goals Identification Based on Anchor Text and Click-through Data, in Wuhan University Journal of Natural Sciences (WISA2008), 13(4), pages 495-500, 2008

•Xiaojie Yuan, Zhicheng Dou, Fang Liu, and Lu Zhang, Personalized Web Search Based on Dynamic User Profile (一种基于动态用户模型的个性化Web搜索算法), in NDBC 2008: Proceedings of the 25th National Database Conference (In Chinese), 2008

•Lu Zhang, Xiaojie Yuan, Fang Liu, and Zhicheng Dou, Research on Distributed Index Mechanism for Large Dataset, Microelectronics & Computer, Volume 10, Pages 037, 2008

2007

•Zhicheng Dou, Ruihua Song, and Ji-Rong Wen, A large-scale evaluation and analysis of personalized search strategies, in Proceedings of the 16th international conference on World Wide Web (WWW2007), pages 581-590, ACM Press, New York, NY, USA, 2007

Professional Service

•Steering Committee member of Asia Information Retrieval Society (AIRS)

•Co-organizer: NTCIR-10 Intent-2 task, NTCIR-11 IMINE task and NTCIR-12 IMINE-2 task

•PC/Reviewer

◦SIGIR 2016, IJCAI 2016, ICDM 2016

◦SIGIR 2015, BigData 2015, CCIR 2015, NLPCC 2015, EMNLP 2015, WWW 2015, AAAI 2015, ICDE 2015, DASFAA 2015 demo track, HIA 2015

◦SIGIR 2014, CIKM 2014, WSDM 2014, KDD 2014, BigData 2014, NLPCC 2014

◦SIGIR 2013, CIKM 2013, IEEE BIG Data 2013, OAIR 2013, WWW 2013, SDM 2013

◦KDD 2012, WIDM 2009, KDD'08, WWW'07, KDD'07, APWeb'07, ICDM'06, etc.

◦Reviewer: TKDE, TIST, JCST, KAIS, GeoInformatica, Information Retrieval Journal, etc.

Honors and Awards

•Collaboration Award, Microsoft, 2013

•ACM SIGIR 2013 Best Paper Runner-Up Award

•AIRS 2012 Best Paper Award

•Microsoft Spot Award, 2012

•As one of the best interns at Microsoft Research Asia, visited Bill Gates' home in 2007