Xirong Li (李锡荣)

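Associate Professor and PhD supervisor. Li received bachelor's and master's degrees in computer science from Tsinghua University in 2005 and 2007, respectively, and a PhD in computer science from the University of Amsterdam, the Netherlands, in 2012. In May 2012, Li joined the Key Laboratory of Data Engineering and Knowledge Engineering (Ministry of Education) at Renmin University of China as a Lecturer, was promoted to Associate Professor in 2016, became a PhD supervisor in 2017, and was selected for the first cohort of Renmin University of China's Outstanding Scholar Support Program. Main research area: AI and Media Computing. Li has published more than 50 papers in major international conferences and journals in the field, with more than 1,800 citations on Google Scholar and an h-index of 19. Awards include the Outstanding Paper Award of the 2017 China Multimedia Conference, the ACM Multimedia 2016 Grand Challenge Award, PCM 2016 Best Paper Runner-up, the ACM SIGMM 2013 Best PhD Thesis Award, the IEEE Transactions on Multimedia 2012 Prize Paper Award, the 2011 Chinese Government Award for Outstanding Self-Financed Students Abroad, and the Best Paper Award of the 2010 ACM International Conference on Image and Video Retrieval (CIVR). Member of the China Computer Federation (CCF) and of its Technical Committee on Multimedia.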

Homepage: http://lixirong.net/

Email: xirong_li@126.com

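Details

Education

2007.08 - 2012.03, PhD, Intelligent Systems Lab Amsterdam, University of Amsterdam, the Netherlands

2005.09 - 2007.06, Master's, Department of Computer Science, Tsinghua University

2001.09 - 2005.07, Bachelor's, Department of Computer Science, Tsinghua University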

Work Experience

2016.08 - present, Associate Professor, Renmin University of China

2012.05 - 2016.08, Lecturer, Renmin University of China

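Research Interests

Image/video search

Natural language understanding of images

Cross-media representation learning

Deep learning-based medical image analysis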

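Talks and Lectures:

+ 2018.06 Artificial Intelligence and Image Recognition, School of Culture and Communication, Central University of Finance and Economics

+ 2018.05 Applications of Artificial Intelligence in Ophthalmology, Ophthalmology E20 Summit Forum on New Equipment and New Technology

+ 2018.04 Deep Learning-Based Image Content Recognition, Center for Medical Device Evaluation, CFDA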

+ 2017.11 Multi-Scale Word2VisualVec for Video Caption Retrieval, TRECVID 2017 workshop

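+ 2017.10 Opportunities and Challenges of Artificial Intelligence: The Case of Image Content Understanding, 2017 Summit Forum on Ophthalmic Imaging and Information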

+ 2016.11 Word2VisualVec for Video-To-Text Matching and Ranking, TRECVID 2016 workshop

+ 2016.10 Tag Embeddings for Multimedia Retrieval and Description, SIGMM Rising Stars Symposium 2016

+ 2016.04 Recent Advances in Image Sentence Generation, Peking University Forum on Language, Logic, Cognition and Computation (LLCC)

Courses Taught

1. Pattern Recognition

2. Image Content Analysis

3. Practical Python Programming

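Requirements for Students

Admission requirements:

+ Diligent and down-to-earth

+ Strong intellectual curiosity

+ Proficient in C++ / Python programming

+ Strong English reading, writing, and speaking skills

+ Students applying through postgraduate recommendation (保研) must pass the selection of the Outstanding Undergraduate Summer Camp held by the School of Information, Renmin University of China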

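Research Projects

[7] Renmin University of China Decision Consulting and Pre-research Commissioned Project: Automatic Chinese-Language Description of Multimedia Content, 2018.01-2020.12 (No. 18XNLG19)

[6] National Natural Science Foundation of China (General Program): Research on Several Key Problems in Chinese Image Captioning, 2017.01-2020.12 (No. 61672523)

[5] Open Fund of the Shanghai Key Laboratory of Intelligent Information Processing: Research on Computing Image Tag Relevance Based on Relevant Examples, 2014.01-2015.12 (No. IIPL-2013-002)

[4] National Natural Science Foundation of China (Young Scientists Fund): Research on Personalized Image Annotation Based on Weakly Labeled Web Data, 2014.01-2016.12 (No. 61303184)

[3] Specialized Research Fund for the Doctoral Program of Higher Education, Ministry of Education (New Teacher category): Research on Classification-Based Estimation of Relevance between Social Tags and Images, 2014.01-2016.12 (No. 20130004120006)

[2] Scientific Research Foundation for Returned Overseas Chinese Scholars, Ministry of Education: Research on Several Key Problems in Social Web Image Retrieval, 2014.01-2015.12

[1] Renmin University of China Start-up Fund for New Teachers: Research on New Methods for Social Media-Based Image Retrieval, 2013.01-2015.12 (No. 13XNLF05)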

Research Output

*** Papers ***

[28] Xin Lai, Xirong Li, Rui Qian, Dayong Ding, Jun Wu, Jieping Xu (2019): Four Models for Automatic Recognition of Left and Right Eye in Fundus Images. the 25th International Conference on MultiMedia Modeling (MMM), 2019

[27] Qijie Wei, Xirong Li, Hao Wang, Dayong Ding, Weihong Yu, Youxin Chen (2018): Laser Scar Detection in Fundus Images using Convolutional Neural Networks. Asian Conference on Computer Vision (ACCV), 2018

[26] Jianfeng Dong, Xirong Li, Chaoxi Xu, Gang Yang, Xun Wang (2018): Feature Re-Learning with Data Augmentation for Content-based Video Recommendation. ACM Multimedia, 2018 (Grand challenge paper)

[25] Gang Yang, Jinlu Liu, Jieping Xu, Xirong Li (2018): Dissimilarity Representation Learning for Generalized Zero-Shot Recognition. ACM Multimedia, 2018

[24] Bin Liang, Hongcheng Li, Miaoqiang Su, Pan Bian, Xirong Li, Wenchang Shi (2018): Deep Text Classification Can be Fooled. IJCAI, 2018

[23] Jianfeng Dong, Xirong Li, Cees G. M. Snoek (2018): Predicting Visual Features from Text for Image and Video Caption Retrieval. IEEE Transactions on Multimedia (TMM), 2018

[22] Gang Yang, Jinlu Liu, Xirong Li (2018): Imagination Based Sample Construction for Zero-Shot Learning. SIGIR, 2018

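[21] Weiyu Lan, Xiaoxu Wang, Gang Yang, Xirong Li (2018): Tag-Enhanced Chinese Image Captioning (标签增强的中文看图造句, in Chinese). Chinese Journal of Computers (计算机学报), 2018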

[20] Jianfeng Dong, Xirong Li, Duanqing Xu (2018): Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild. IEEE Transactions on Multimedia (TMM), 2018

[19] Cees G. M. Snoek, Xirong Li, Chaoxi Xu, Dennis C. Koelma (2017): University of Amsterdam and Renmin University at TRECVID 2017: Searching Video, Detecting Events and Describing Video. TRECVID Workshop, 2017

[18] Weiyu Lan, Xirong Li, Jianfeng Dong (2017): Fluency-Guided Cross-Lingual Image Captioning. ACM Multimedia, 2017

[17] Qijie Wei, Xiaoxu Wang, Xirong Li (2017): Harvesting Deep Models for Cross-Lingual Image Annotation. CBMI, 2017

[16] Xirong Li (2017): Tag Relevance Fusion for Social Image Retrieval. In: Multimedia Systems, 23 (1), pp. 29–40, 2017

[15] Cees G. M. Snoek, Jianfeng Dong, Xirong Li, Xiaoxu Wang, Qijie Wei, Weiyu Lan, Efstratios Gavves, Noureldien Hussein, Dennis C. Koelma, Arnold W. M. Smeulders (2016): University of Amsterdam and Renmin University at TRECVID 2016: Searching Video, Detecting Events and Describing Video. TRECVID Workshop, 2016

[14] Jianfeng Dong, Xirong Li, Weiyu Lan, Yujia Huo, Cees G. M. Snoek (2016): Early Embedding and Late Reranking for Video Captioning. ACM Multimedia, 2016

[13] Xirong Li, Yujia Huo, Qin Jin, Jieping Xu (2016): Detecting Violence in Video using Subclasses. ACM Multimedia, 2016

[12] Xirong Li, Qin Jin (2016): Improving Image Captioning by Concept-based Sentence Reranking. PCM, 2016

[11] Masoud Mazloom, Xirong Li, Cees G. M. Snoek (2016): TagBook: A Semantic Video Representation Without Supervision for Event Detection. In: IEEE Transactions on Multimedia (TMM), 18 (7), pp. 1378-1388, 2016

[10] Xirong Li, Weiyu Lan, Jianfeng Dong, Hailong Liu (2016): Adding Chinese Captions to Images. ICMR, 2016

[9] Xirong Li, Tiberio Uricchio, Lamberto Ballan, Marco Bertini, Cees G. M. Snoek, Alberto Del Bimbo (2016): Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment, Refinement, and Retrieval. ACM Computing Surveys (CSUR), 49 (1), pp. 14:1-14:39, 2016

[8] Xirong Li, Tiberio Uricchio, Lamberto Ballan, Marco Bertini, Cees G. M. Snoek, Alberto Del Bimbo (2015): Image Tag Assignment, Refinement and Retrieval. ACM Multimedia, 2015

[7] Jianfeng Dong, Xirong Li, Shuai Liao, Jieping Xu, Duanqing Xu, Xiaoyong Du (2015): Image Retrieval by Cross-Media Relevance Fusion. ACM Multimedia, 2015

[6] Qin Jin, Xirong Li, Haibing Cao, Yujia Huo, Shuai Liao, Gang Yang, Jieping Xu (2015): RUCMM at MediaEval 2015 Affective Impact of Movies Task: Fusion of Audio and Visual Cues. In: Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

[5] Xirong Li, Qin Jin, Shuai Liao, Junwei Liang, Xixi He, Yujia Huo, Weiyu Lan, Bin Xiao, Yanxiong Lu, Jieping Xu (2015): RUC-Tencent at ImageCLEF 2015: Concept Detection, Localization and Sentence Generation. CLEF (Working Notes), 2015

[4] Xirong Li, Shuai Liao, Weiyu Lan, Xiaoyong Du, Gang Yang (2015): Zero-shot Image Tagging by Hierarchical Semantic Embedding. SIGIR, 2015

[3] Shuai Liao, Xirong Li, Heng Tao Shen, Yang Yang, Xiaoyong Du (2015): Tag Features for Geo-Aware Image Classification. In: IEEE Transactions on Multimedia (TMM), 17 (7), pp. 1058-1067, 2015

[2] Junwei Liang, Qin Jin, Xixi He, Gang Yang, Jieping Xu, Xirong Li (2015): Detecting semantic concepts in consumer videos using audio. In: International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2279–2283, 2015

[1] Svetlana Kordumova, Xirong Li, Cees G.M. Snoek (2015): Best Practices for Learning Video Concept Detectors from Social Media Examples. In: Multimedia Tools and Applications (MTAP), 74 (4), pp. 1291–1315, 2015

*** International Benchmark Evaluations ***

[5] Top performer of the TRECVID 2018 Ad-hoc Video Search (AVS) task

[4] Top performer of the TRECVID 2016 Video-to-Text (VTT) task

[3] Top performer of the ImageCLEF 2015 Image Sentence Generation task

[2] Top performer of the MSR Bing Image Retrieval Challenge at ACM Multimedia 2015

[1] Top performer of the TRECVID 2013 Video Semantic Indexing with No Annotation task

Professional Service

http://lixirong.net/prof

+ Area Chair of ACM Multimedia 2018

+ Area Chair of ICPR 2016

Honors and Awards

[8] 2017 China Multimedia Conference Outstanding Paper Award (Tag-Enhanced Chinese Image Captioning, 标签增强的中文看图造句)

[7] ACM Multimedia 2016 Grand Challenge Award (Early Embedding and Late Reranking for Video Captioning)

[6] PCM 2016 Best Paper Runner-Up (Improving Image Captioning by Concept-based Sentence Reranking)

[5] PCM 2014 Outstanding Reviewer Award

[4] SIGMM 2013 Best PhD Thesis Award (Content-based Visual Search Learned from Social Media)

[3] IEEE Transactions on Multimedia 2012 Prize Paper Award (Learning Social Tag Relevance by Neighbor Voting)

[2] 2011 Chinese Government Award for Outstanding Self-Financed Students Abroad

[1] ACM International Conference on Image and Video Retrieval 2010 Best Paper Award (Unsupervised multi-feature tag relevance learning for social image retrieval)