Clicky

Research Interests

Natural Language Processing, Artificial Intelligence, Machine Learning

Project Experience

Deep Keyphrase Generation
We apply a deep generative model (encoder-decoder model) on the task of keyphrase summarization, which provides a novel perspective on this long-studied problem. A copy mechanism is effectively employed to enhance the model with extractive ability. Experiments demonstrate our model not only outperforms baselines on keyphrase extraction benchmarks but also has the capability of predicting semantically related phrases.
Kaggle: Springleaf Market Response
A large-scale classification task aims to locate the target users for advertising. More than 290,000 customer records with nearly 2,000 anonymized features are used in this study. A complete pipeline has been implemented in this study, including data analysis (statistics and visualization), feature engineering (selection, reduction) and classification (ensemble models).
Citation Semantic Classification
Citation performs in different semantic roles in scientific papers. A fine-grined classification is conducted, based on support vector machine with multiple types of semantic features (n-grams, pos-tagging, linguistic pattern, typed dependency, entity etc.).
National Olympiad in Informatics in Provinces
The most influential programming competition for senior high school students in China. Similar to ACM Programming Contest, the NOIP require the contestants to show great IT skills as problem analysis, design of algorithms and data structures, programming and testing. The winners (less 1% of contestants) are recommended and accepted to top universities in China.

Education

University of Pittsburgh
Information Science and Technology - Ph.D
Pittsburgh, PA, USA
Sep 2015 - Current
Wuhan University
Management Science and Engineering - Master
Wuhan, Hubei, China
Sep 2012 - Jul 2015
Wuhan University
Information Management and System - Bachelor
Wuhan, Hubei, China
Sep 2008 - Jul 2012

Experience

Salesforce Research
Research Scientist
Oct 2021 - Current
Google AI
Research Intern
May 2020 - Aug 2020
  • Build lighter BERT models for Mobile Devices
Salesforce Research
Research Intern
May 2019 - Dec 2019
  • Pre-training for text summarization
Google AI
Research Intern
May 2018 - Aug 2018
  • Recognize user action patterns in mobile search
  • Predict user satisfaction and quality of search results
Yahoo Research
Research Intern
May 2017 - Aug 2017
  • Developed an online evaluation method for large-scale dialogue systems

Publications

  • {"title"=>"Unsupervised Deep Keyphrase Generation", "authors"=>"Xianjie Shen, Yinghan Wang, Rui Meng, Jingbo Shang", "publisher"=>"36th AAAI Conference on Artificial Intelligence. (AAAI 2022).", "pdf"=>"https://arxiv.org/pdf/2104.08729.pdf", "code"=>"https://github.com/Jayshen0/Unsupervised-Deep-Keyphrase-Generation"}
  • {"title"=>"Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents", "authors"=>"Rui Meng, Khushboo Thaker, Lei Zhang, Yue Dong, Xingdi Yuan, Tong Wang, Daqing He", "publisher"=>"59th Annual Meeting of the Association for Computational Linguistics. 2021. (ACL 2021).", "pdf"=>"https://arxiv.org/pdf/2106.00130.pdf", "code"=>"https://github.com/hfthair/emerald_crawler", "video"=>"#", "slides"=>"../uploads/acl2021-emerald-faceted-summarization-slides.pdf"}
  • {"title"=>"Predicting User Engagement Status for Online Evaluation of Intelligent Assistants", "authors"=>"Rui Meng, Zhen Yue, Alyssa Glass", "publisher"=>"43rd European Conference on Information Retrieval. 2021. (ECIR 2021).", "pdf"=>"https://arxiv.org/pdf/2010.00656.pdf", "arxiv"=>"https://arxiv.org/pdf/2010.00656.pdf", "slides"=>"../uploads/ecir2021-user-engagement-slides.pdf"}
  • {"title"=>"Integrating transformer and paraphrase rules for sentence simplification", "authors"=>"Sanqiang Zhao, Rui Meng, Daqing He, Andi Saptono, Bambang Parmanto", "publisher"=>"Conference on Empirical Methods in Natural Language Processing. 2018. (EMNLP 2018).", "pdf"=>"https://arxiv.org/pdf/1810.11193.pdf", "arxiv"=>"https://arxiv.org/pdf/1810.11193.pdf", "code"=>"https://github.com/Sanqiang/text_simplification", "video"=>"https://vimeo.com/305927122"}
  • {"title"=>"Deep Keyphrase Generation", "authors"=>"Rui Meng, Sanqiang Zhao, Shuguang Han, Daqing He, Peter Brusilovsky, Yu Chi", "publisher"=>"55th Annual Meeting of Association for Computational Linguistics. 2017. (ACL 2017).", "pdf"=>"https://arxiv.org/pdf/1704.06879.pdf", "arxiv"=>"https://arxiv.org/pdf/1704.06879.pdf", "code"=>"https://github.com/memray/OpenNMT-kpg-release", "data"=>"https://drive.google.com/file/d/1z1JGWMnQkkWw_4tjptgO-dxXD0OeTfuP/view", "model"=>"https://drive.google.com/file/d/18Pfs0ePAMl17kfjYRU_9HxYc0eUXet-_/view", "video"=>"https://vimeo.com/234956524", "slides"=>"../uploads/acl17-deep-keyphrase-generation-slides.pdf"}

Technical Skills

Machine Learning:
TensorFlow, PyTorch, Theano, Scikit-learn, NLTK, Weka, Mallet, Stanford NLP toolkits
Research Tools:
MATLAB, LaTeX
Programming:
Python, Java, Linux, Bash, R, C/C++, Web Development

Selected Awards, Scholarships, & Achievements

Amazon Research Awards
Awarded to dissertation study "Transferable, Controllable, Applicable Keyphrase Generation"
Amazon.com
Jan 2020
First Prize of National Olympiad in Informatics in Provinces
Awarded to top 1% participants for outstanding skills in algorithms and computer programming
China Computer Federation
Jan 2008

Interests

Technology: AI, AI, and AI, start-up
Fun: film, travelling