Book
Preprint
Jiaming Shen , Ran Xu, Yennie Jun, Zhen Qin, Tianqi Liu, Carl Yang, Yi Liang, Simon Baumgartner, and Michael Bendersky. Boosting Reward Model with Preference-Conditional Multi-Aspect Synthetic Data Generation .
Wei Xiong, Chengshuai Shi, Jiaming Shen , Aviv Rosenberg, Zhen Qin, Daniele Calandriello, Misha Khalman, Rishabh Joshi, Bilal Piot, Mohammad Saleh, Chi Jin, Tong Zhang, and Tianqi Liu. Building Math Agents with Multi-Turn Iterative Preference Learning .
Tianqi Liu, Zhen Qin, Junru Wu, Jiaming Shen , Misha Khalman, Rishabh Joshi, Yao Zhao, Mohammad Saleh, Simon Baumgartner, Jialu Liu, Peter J Liu, and Xuanhui Wang. LiPO: Listwise Preference Optimization through Learning-to-Rank .
Tianqi Liu, Wei Xiong, Jie Ren, Lichang Chen, Junru Wu, Rishabh Joshi, Yang Gao, Jiaming Shen , Zhen Qin, Tianhe Yu, Daniel Sohn, Anastasia Makarova, Jeremiah Zhe Liu, Yuan Liu, Bilal Piot, Abe Ittycheriah, Aviral Kumar, and Mohammad Saleh. RRM: Robust Reward Model Training Mitigates Reward Hacking .
Yi Liang, You Wu, Honglei Zhuang, Li Chen, Jiaming Shen , Yiling Jia, Zhen Qin, Sumit Sanghai, Xuanhui Wang, Carl Yang, and Michael Bendersky. Integrating Planning into Single-Turn Long-Form Text Generation .
Yunyi Zhang, Ruozhen Yang, Xueqiang Xu, Jinfeng Xiao, Jiaming Shen , and Jiawei Han. TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision .
2024
Jiaming Shen , Tianqi Liu, Jialu Liu, Zhen Qin, Jay Pavagadhi, Simon Baumgartner, and Michael Bendersky. Multilingual Fine-Grained News Headline Hallucination Detection In Proc. of The Findings of 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings 2024 ).
[Dataset ]
Zhen Qin, Junru Wu, Jiaming Shen , Tianqi Liu, and Xuanhui Wang. LAMPO: Large Language Models as Preference Machines for Few-shot Ordinal Classification . In Proc. of The first Conference on Language Modeling (CoLM 2024 ).
Rongzhi Zhang, Jiaming Shen , Tianqi Liu, Haorui Wang, Zhen Qin, Feng Han, Jialu Liu, Simon Baumgartner, Michael Bendersky, and Chao Zhang. PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs . In Proc. of The Findings of 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024 (Findings) ).
Yue Yu, Jiaming Shen , Tianqi Liu, Zhen Qin, Jing Nathan Yan, Jialu Liu, Chao Zhang, and Michael Bendersky. Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning . In Proc. of The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024 ).
Jing Nathan Yan, Tianqi Liu, Justin T Chiu, Jiaming Shen , Zhen Qin, Yue Yu, Yao Zhao, Charu Lakshmanan, Yair Kurzion, Alexander M Rush, Jialu Liu, and Michael Bendersky. Predicting Text Preference Via Structured Comparative Reasoning . In Proc. of The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024 ).
Rongzhi Zhang, Jiaming Shen , Tianqi Liu, Jialu Liu, Michael Bendersky, Marc Najork, and Chao Zhang. Knowledge Distillation with Perturbed Loss: From a Vanilla Teacher to a Proxy Teacher . In Proc. of The 30th SIGKDD Conference on Knowledge Discovery and Data Mining - Research Track (KDD 2024 ).
Zhen Qin, Rolf Jagerman, Kai Hui, Honglei Zhuang, Junru Wu, Le Yan, Jiaming Shen , Tianqi Liu, Jialu Liu, Donald Metzler, Xuanhui Wang, and Michael Bendersky. Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting , In Proc. of The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024 ).
2023
Jiaming Shen , Jialu Liu, Dan Finnie, Negar Rahmati, Michael Bendersky, and Marc Najork. "Why is this misleading?": Detecting News Headline Hallucinations with Explanations , In Proc. of The ACM 2023 Web Conference (WWW 2023 ).
[Dataset ]
[Slides ]
Yizhu Jiao, Ming Zhong, Jiaming Shen , Yunyi Zhang, Chao Zhang, and Jiawei Han. Unsupervised Event Chain Mining from Multiple Documents , In Proc. of The ACM 2023 Web Conference (WWW 2023 ).
Jiaying Lu, Jiaming Shen , Bo Xiong, Wenjing Ma, Steffen Staab, and Carl Yang. HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting , In Proc. of The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023 ).
Boshi Wang, Sewon Min, Xiang Deng, Jiaming Shen , You Wu, Luke Zettlemoyer, and Huan Sun. Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters , In Proc. of The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023 ).
[ICLR (ME-FoMo Workshop) Version ]
[Code ]
Yue Yu, Rongzhi Zhang, Ran Xu, Jieyu Zhang, Jiaming Shen , and Chao Zhang. Cold-Start Data Selection for Better Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach , In Proc. of The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023 ).
[Code ]
Yue Yu, Yuchen Zhuang, Rongzhi Zhang, Yu Meng, Jiaming Shen , and Chao Zhang. ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval , In Proc. of The Findings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL Findings 2023 ).
[Code and Dataset ]
Yunan Zhang, Le Yan, Zhen Qin, Honglei Zhuang, Jiaming Shen , Xuanhui Wang, Michael Bendersky, and Marc Najork. Towards Disentangling Relevance and Bias in Unbiased Learning to Rank , In Proc. of 29th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 2023 ).
Rongzhi Zhang, Yue Yu, Jiaming Shen , Xiquan Cui, and Chao Zhang. Local Boosting for Weakly-Supervised Learning , In Proc. of 29th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 2023 ).
Sizhe Zhou, Suyu Ge, Jiaming Shen , and Jiawei Han. Corpus-Based Relation Extraction by Identifying and Refining Relation Patterns , In Proc. of The 2023 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2023 ).
Yue Yu, Yuchen Zhuang, Jieyu Zhang, Yu Meng, Alexander Ratner, Ranjay Krishna, Jiaming Shen , and Chao Zhang. Large Language Model as Attributed Training Data Genera- tor: A Tale of Diversity and Bias , In Proc. of The 37th Conference on Neural Information Processing Systems (NeurIPS 2023 ).
2022
Dongha Lee, Jiaming Shen , Seonghyeon Lee, Susik Yoon, Hwanjo Yu, and Jiawei Han. Topic Taxonomy Expansion via Hierarchy-Aware Topic Phrase Generation , In Proc. of The Findings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings 2022 ).
Yunyi Zhang, Fang Guo, Jiaming Shen , and Jiawei Han. Unsupervised Key Event Detection from Massive Text Corpus , In Proc. of 2022 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 2022 ).
[Code ]
Yiqing Xie, Jiaming Shen , Sha Li, Yuning Mao, and Jiawei Han. EIDER: Empowering Document-level Relation Extraction with Efficient Evidence Extraction and Inference-stage Fusion , In Proc. of The Findings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022 (Findings) ).
[Code ]
Xiaotao Gu, Yikang Shen, Jiaming Shen , Jingbo Shang, and Jiawei Han. Phrase-aware Unsupervised Constituency Parsing , In Proc. of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022 ).
Dongha Lee, Jiaming Shen , SeongKu Kang, Susik Yoon, Jiawei Han, and Hwanjo Yu. TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters , In The 33rd International Conference in the Web (WWW 2022 ).
[Code ]
2021
Jiaming Shen , Yunyi Zhang, Heng Ji, and Jiawei Han. Corpus-based Open-Domain Event Type Induction , In Proc. of The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021 ).
[Code ]
[Data ]
Jiaming Shen , Jialu Liu, Tianqi Liu, Cong Yu, and Jiawei Han. Training ELECTRA Augmented with Multi-word Selection , In Proc. of The Findings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021 (Findings) ).
[Code ]
Jiaming Shen , Wenda Qiu, Yu Meng, Jingbo Shang, Xiang Ren, and Jiawei Han. TaxoClass: Hierarchical Multi-Label Text Classification Using Only Class Names , In Proc. of The 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2021 ).
[Data ]
Jiaming Shen , Xiaotao Gu, Yu Meng, and Jiawei Han. Automated Taxonomy Discovery and Exploration , In Proc. of The 2021 IEEE International Conference on Data Mining (ICDM 2021 Tutorial ).
[Slides ]
Xiangchen Song, Jiaming Shen , Jieyu Zhang, and Jiawei Han. Who Should Go First? A Self-Supervised Concept Sorting Model for Improving Taxonomy Expansion , In 2021 WWW workshop on Self-Supervised Learning for the Web (SSL@WWW 2021 ).
Jieyu Zhang, Xiangchen Song, Ying Zeng, Jiaze Chen, Jiaming Shen , Yuning Mao, and Lei Li. Taxonomy Completion via Triplet Matching Network , In Proc. of The 35th AAAI Conference on Artificial Intelligence (AAAI 2021 ).
[Code ]
2020
Jiaming Shen* , Wenda Qiu*, Jingbo Shang, Michelle Vanni, Xiang Ren, and Jiawei Han. SynSetExpan: An Iterative Framework for Joint Entity Set Expansion and Synonym Discovery , In Proc. of The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020 ).
[Data ]
[Slide ]
Jiaming Shen , Heng Ji, and Jiawei Han. Near-imperceptible Neural Linguistic Steganography via Self-Adjusting Arithmetic Coding , In Proc. of The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020 ).
[Code & Data ]
[Slide ]
Jiaming Shen , Zhihong Shen, Chenyan Xiong, Chi Wang, Kuansan Wang, and Jiawei Han. TaxoExpan: Self-supervised Taxonomy Expansion with Position-Enhanced Graph Neural Network , In The 31st International Conference in the Web (WWW 2020 ).
[Code & Data ]
[Blog Post ]
[Recorded Presentation ]
Yunyi Zhang, Jiaming Shen , Jingbo Shang, and Jiawei Han. Empower Entity Set Expansion via Language Model Probing , In The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020 ).
[Code & Data ]
Dominic Seyler, Jiaming Shen , Jinfeng Xiao, Yiren Wang, and ChengXiang Zhai. Leveraging Personalized Sentiment Lexicons for Sentiment Analysis , In The 10th International Conference on the Theory of Information Retrieval (ICTIR 2020 ).
Yue Yu, Yinghao Li, Jiaming Shen , Hao Feng, Jimeng Sun, and Chao Zhang. STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths , In The 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2020 ).
[Code & Data ]
Wanzheng Zhu, Hongyu Gong, Jiaming Shen , Chao Zhang, Jingbo Shang, Suma Bhat, and Jiawei Han. FUSE: Multi-faceted set expansion by coherent clustering of skip-grams , In The 2020 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2020 ).
Jiaxin Huang, Yiqing Xie, Yu Meng, Jiaming Shen , Yunyi Zhang, and Jiawei Han. Guiding Corpus-based Set Expansion by Auxiliary Sets Generation and Co-Expansion , In The 31st International Conference in the Web (WWW 2020 ).
2019
Jiaming Shen , Ruiliang Lyu, Xiang Ren, Michelle Vanni, Brian Sadler, and Jiawei Han. Mining Entity Synonyms with Efficient Neural Set Generation , In The 33rd AAAI Conference on Artificial Intelligence (AAAI 2019 ).
[Code ]
[Full Documentation ]
[Data ]
[Slide ]
Yu Shi* , Jiaming Shen* , Yuchen Li, Naijing Zhang, Xinwei He, Zhengzhi Lou, Qi Zhu, Matthew Walker, Myunghwan Kim, and Jiawei Han. Discovering Hypernymy in Text-Rich Heterogeneous Information Network by Exploiting Context Granularity , In The 28th ACM International Conference on Information and Knowledge Management (CIKM 2019 ).
[Code ]
Yu Meng, Jiaming Shen , Chao Zhang, and Jiawei Han. Weakly-Supervised Hierarchical Text Classification , In The 33rd AAAI Conference on Artificial Intelligence (AAAI 2019 ).
[Code ]
[Slide ]
Jingbo Shang, Jiaming Shen , Liyuan Liu, and Jiawei Han. Constructing and Mining Heterogeneous Information Networks from Massive Text , In The 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2019 Tutorial ).
Junyi Du, He Jiang, Jiaming Shen , and Xiang Ren. Eliciting Knowledge from Experts: Automatic Transcript Parsing for Cognitive Task Analysis , In The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019 ).
[Code & Data ]
2018
Jiaming Shen , Maryam Karimzadehgan, Michael Bendersky, Zhen Qin, and Donald Metzler. Multi-Task Learning for Personal Search Ranking with Auxiliary Query Clustering , In The 27th ACM International Conference on Information and Knowledge Management (CIKM 2018 ).
[Code ]
[Slide ]
Jiaming Shen , Zeqiu Wu, Dongming Lei, Chao Zhang, Xiang Ren, Michelle T. Vanni, Brian M. Sadler, and Jiawei Han. HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion , In The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2018 ).
[Code ]
Jiaming Shen , Jinfeng Xiao, Xinwei He, Jingbo Shang, Saurabh Sinha, and Jiawei Han. Entity Set Search of Scientific Literature: An Unsupervised Ranking Approach , In The 41st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018 ).
[Code ]
Jiaming Shen , Jinfeng Xiao, Yu Zhang, Carl Yang, Jingbo Shang, Jinda Han, Saurabh Sinha, Peipei Ping, Richard Weinshilboum, Zhiyong Lu and Jiawei Han, SetSearch+: Entity-Set-Aware Search and Mining for Scientific Literature , In The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2018 Demo Track ).
[System ]
Yu Meng, Jiaming Shen , Chao Zhang, and Jiawei Han. Weakly-Supervised Neural Text Classification , In The 27th ACM International Conference on Information and Knowledge Management (CIKM 2018 ).
[Code ]
Jingbo Shang, Jiaming Shen , Tianhang Sun, Xingbang Liu, Anja Gruenheid, Flip Korn, Adam Lelkes, Cong Yu, and Jiawei Han. Investigating Rumor News Using Agreement Aware Search , In The 27th ACM International Conference on Information and Knowledge Management (CIKM 2018 ).
Hanwen Zha, Jiaming Shen , Keqian Li, Warren Greiff, Michelle Vanni, Jiawei Han and Xifeng Yan, FTS: Faceted Taxonomy Construction and Search for Scientific Publications , In The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2018 Demo Track ).
[System ]
[Demo Video ]
Jingbo Shang, Qi Zhu, Jiaming Shen , Xuan Wang, Xiaotao Gu, Lance Kaplan, Timothy Harratty and Jiawei Han, AutoNet: Automated Network Construction and Exploration System from Domain-Specific Corpora , In The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2018 Demo Track ).
Yuning Mao, Xiang Ren, Jiaming Shen , and Jiawei Han. End-to-End Reinforcement Learning for Automatic Taxonomy Induction , In The 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018 ).
[Code ]
Jingbo Shang, Chao Zhao, Jiaming Shen , and Jiawei Han. Towards Multidimensional Analysis of Text Corpora", In The 24th ACM SIGKDD International Conference on Knowledge (KDD 2018 ).
[Tutorial Website ]
Chao Zhang, Fangbo Tao, Xiusi Chen, Jiaming Shen , Meng Jiang, Brian M. Sadler, Michelle T. Vanni, and Jiawei Han. TaxoGen: Constructing Topical Concept Taxonomy by Adaptive Term Embedding and Clustering , In The 24th ACM SIGKDD International Conference on Knowledge (KDD 2018 ).
[Code ]
2017
Jiaming Shen* , Zeqiu Wu* , Dongming Lei, Jingbo Shang, Xiang Ren, and Jiawei Han, SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble , In The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2017 ).
[Code ]
[Data ]
Xiang Ren, Jiaming Shen , Meng Qu, Xuan Wang, Zeqiu Wu, Qi Zhu, Meng Jiang, Fangbo Tao, Saurabh Sinha, David Liem, Peipei Ping, Richard Weinshilboum, and Jiawei Han, Life-iNet: A Structured Network-Based Knowledge Exploration and Analytics System for Life Sciences , In The 55th annual meeting of the Association for Computational Linguistics (ACL 2017 System Demo ).
2016
2015
Jiaming Shen , Zhenyu Song, Shitao Li, Zhaowei Tan, Yuning Mao, Luoyi Fu, Li Song, and Xinbing Wang, Modeling Topic-level Academic Influence in Scientific Literatures , In The Thirtieth AAAI Conference on Artificial Intelligence (AAAI 2016 ) Workshop on Scholarly Big Data.
[Slide ]
Jiaming Shen , Zhaowei Tan, Luoyi Fu, and Xinbing Wang, Trend Analysis of Top-tier Conferences in Computer Network Field , In Communications of the China Computer Federation , 11(9), 62-66, 2015 .
Zhaowei Tan, Changfeng Liu, Yuning Mao, Jiaming Shen , Bin Wang, Luoyi Fu, Li Song, and Xinbing Wang, AceMap: A Novel Approach towards Displaying Relationship among Academic Literatures , In The 25th International World Wide Web Conference (WWW 2016 ).
[System ]