{"id":2000,"date":"2025-06-07T16:35:08","date_gmt":"2025-06-07T16:35:08","guid":{"rendered":"https:\/\/kdd2025.kdd.org\/?page_id=2000"},"modified":"2025-07-23T20:55:42","modified_gmt":"2025-07-23T20:55:42","slug":"datasets-and-benchmarks-track-papers-2","status":"publish","type":"page","link":"https:\/\/kdd2025.kdd.org\/datasets-and-benchmarks-track-papers-2\/","title":{"rendered":"Datasets and Benchmarks Track Papers"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">Datasets and Benchmarks Track Papers<\/h2>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p><em>DOI links will be available by August 3<sup>rd<\/sup>, please check back then to access the direct links below<\/em><\/p>\n\n\n\n<figure class=\"wp-block-table is-style-stripes has-small-font-size\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life Prediction<\/strong><br>DOI: 10.1145\/3711896.3737372<\/td><\/tr><tr><td>Ruifeng Tan (The Hong Kong University of Science and Technology  (Guangzhou),The Hong Kong University of Science and Technology); Weixiang Hong (The Hong Kong University of Science and Technology  (Guangzhou),The Hong Kong University of Science and Technology); Jiayue Tang (The Hong Kong University of Science and Technology  (Guangzhou),The Hong Kong University of Science and Technology); Xibin Lu (The Hong Kong University of Science and Technology  (Guangzhou),The Hong Kong University of Science and Technology); Ruijun Ma (CALB Group Co., Ltd.); Xiang Zheng (CALB Group Co., Ltd.); Jia Li (The Hong Kong University of Science and Technology  (Guangzhou)); Jiaqiang Huang (The Hong Kong University of Science and Technology  (Guangzhou)); Tong-Yi Zhang (The Hong Kong University of Science and Technology  (Guangzhou))<\/td><\/tr><tr><td><strong>Bridging the Binary Analysis Gap: A Cross-Compiler Dataset and Neural Framework for Industrial Control Systems<\/strong><br>DOI: 10.1145\/3711896.3737373<\/td><\/tr><tr><td>Yonatan G. Achamyeleh (University of California, Irvine); Shih-Yuan Yu (University of California, Irvine); Gustavo Q. Araya (Siemens Corporate Research); Mohammad A. Al Faruque (University of California, Irvine)<\/td><\/tr><tr><td><strong>ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models<\/strong><br>DOI: 10.1145\/3711896.3737374<\/td><\/tr><tr><td>Haibin Chen (Taobao &amp; Tmall Group of Alibaba); Kangtao Lv (Zhejiang University); Chengwei Hu (Taobao &amp; Tmall Group of Alibaba); Yanshi Li (Taobao &amp; Tmall Group of Alibaba); Yujin Yuan (Taobao &amp; Tmall Group of Alibaba); Yancheng He (Taobao &amp; Tmall Group of Alibaba); Xingyao Zhang (Taobao &amp; Tmall Group of Alibaba); Langming Liu (Taobao &amp; Tmall Group of Alibaba); Shilei Liu (Taobao &amp; Tmall Group of Alibaba); Wenbo Su (Taobao &amp; Tmall Group of Alibaba); Bo Zheng (Taobao &amp; Tmall Group of Alibaba)<\/td><\/tr><tr><td><strong>CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks<\/strong><br>DOI: 10.1145\/3711896.3737375<\/td><\/tr><tr><td>Jie Feng (Department of Electronic Engineering, BNRist, Tsinghua University); Jun Zhang (Department of Electronic Engineering, BNRist, Tsinghua University); Tianhui Liu (School of Electronic and Information Engineering, Beijing Jiaotong University); Xin Zhang (Shenzhen International Graduate School, Tsinghua University); Tianjian Ouyang (Department of Electronic Engineering, BNRist, Tsinghua University); Junbo Yan (Department of Electronic Engineering, BNRist, Tsinghua University); Yuwei Du (Department of Electronic Engineering, BNRist, Tsinghua University); Siqi Guo (Department of Electronic Engineering, Tsinghua University); Yong Li (Department of Electronic Engineering, BNRist, Tsinghua University)<\/td><\/tr><tr><td><strong>Delving into Instance-Dependent Label Noise in Graph Data: A Comprehensive Study and Benchmark<\/strong><br>DOI: 10.1145\/3711896.3737376<\/td><\/tr><tr><td>Suyeon Kim (Pohang University of Science and Technology); SeongKu Kang (Korea University); Dongwoo Kim (Pohang University of Science and Technology); Jungseul Ok (Pohang University of Science and Technology); Hwanjo Yu (Pohang University of Science and Technology)<\/td><\/tr><tr><td><strong>HiBench: Benchmarking LLMs Capability on Hierarchical Structure Reasoning<\/strong><br>DOI: 10.1145\/3711896.3737378<\/td><\/tr><tr><td>Zhuohang Jiang (Department of Computing, The Hong Kong Polytechnic University,School of Computer Science, Sichuan University); Pangjing Wu (Derpartment of Computing, The Hong Kong Polytechnic University,College of Computer and Information, Hohai University); Ziran Liang (Derpartment of Computing, The Hong Kong Polytechnic University,Sun Yat-sen University); Peter Q. Chen (Department of Computing, The Hong Kong Polytechnic University); Xu Yuan (Department of Computing, The Hong Kong Polytechnic University,Harbin Institute of Technology); Ye Jia (Department of Computing, The Hong Kong Polytechnic University); Tu Jiancheng (Department of Computing, The Hong Kong Polytechnic University); Chen Li (Department of Computing, The Hong Kong Polytechnic University,Department of Applied Social Sciences, The Hong Kong Polytechnic University); Peter H. F. Ng (Department of Computing, The Hong Kong Polytechnic University); Qing Li (Department of Computing, The Hong Kong Polytechnic University,Department of Computer Science, City University of Hong Kong)<\/td><\/tr><tr><td><strong>HtFLlib: A Comprehensive Heterogeneous Federated Learning Library and Benchmark<\/strong><br>DOI: 10.1145\/3711896.3737379<\/td><\/tr><tr><td>Jianqing Zhang (Shanghai Jiao Tong University,Institute for AI Industry Research, Tsinghua University); Xinghao Wu (Beijing University); Yanbing Zhou (Chongqing University); Xiaoting Sun (Tongji University); Qiqi Cai (Shanghai Jiao Tong University); Yang Liu (Hong Kong Polytechnic University,Shanghai Artificial Intelligence Laboratory); Yang Hua (The Queen&#8217;s University of Belfast); Zhenzhe Zheng (Shanghai Jiao Tong University); Jian Cao (Shanghai Jiao Tong University,Shanghai Key Laboratory of Trusted Data Circulation and Governance in Web3); Qiang Yang (Hong Kong Polytechnic University)<\/td><\/tr><tr><td><strong>IrrMap: A Large-Scale Comprehensive Dataset for Irrigation Method Mapping<\/strong><br>DOI: 10.1145\/3711896.3737380<\/td><\/tr><tr><td>Nibir Chandra Mandal (Dept. of Computer Science, University of Virginia); Oishee Bintey Hoque (Dept. of Computer Science, University of Virginia); Abhijin Adiga (Biocomplexity Institute, University of Virginia); Samarth Swarup (Biocomplexity Institute, University of Virginia); Mandy L. Wilson (Biocomplexity Institute, University of Virginia); Lu Feng (Dept. of Computer Science, University of Virginia); Yangfeng Ji (Dept. of Computer Science, University of Virginia); Miaomiao Zhang (Dept. of Computer Science, Dept. of Electrical and Computer Engineering, University of Virginia); Geoffrey Fox (Dept. of Computer Science, Biocomplexity Institute, University of Virginia); Madhav Marathe (Dept. of Computer Science, Biocomplexity Institute, University of Virginia)<\/td><\/tr><tr><td><strong>IVMR suite: An Industrial-scale Virtual Machine Rescheduling Dataset and Benchmark for Elastic Cloud Service<\/strong><br>DOI: 10.1145\/3711896.3737381<\/td><\/tr><tr><td>Yupeng Zhang (Alibaba Damo Academy, Alibaba Group); Xu Wan (Zhejiang University,DAMO Academy, Alibaba Group); Xiangyun Kong (Alibaba Cloud Intelligence Group, Alibaba Group); Chao Yang (Alibaba DAMO Academy, Alibaba Group); Binda Ma (Alibaba Cloud Intelligence Group, Alibaba Group); Wotao Yin (DAMO Academy, Alibaba Group); Jian Zhou (Alibaba Cloud Intelligence Group, Alibaba Group)<\/td><\/tr><tr><td><strong>ComputAgeBench: Epigenetic Aging Clocks Benchmark<\/strong><br>DOI: 10.1145\/3711896.3737382<\/td><\/tr><tr><td>Dmitrii Kriukov (Artificial Intelligence Research Institute,Skolkovo Institute of Science and Technology); Evgeniy Efimov (Artificial Intelligence Research Institute,Skolkovo Institute of Science and Technology); Ekaterina Kuzmina (Artificial Intelligence Research Institute,Skolkovo Institute of Science and Technology); Anastasiia Dudkovskaia (Skolkovo Institute of Science and Technology,Higher School of Economics); Ekaterina E. Khrameeva (Skolkovo Institute of Science and Technology); Dmitry V. Dylov (Artificial Intelligence Research Institute,Skolkovo Institute of Science and Technology)<\/td><\/tr><tr><td><strong>Evaluating and Generating Query Workloads for High Dimensional Vector Similarity Search<\/strong><br>DOI: 10.1145\/3711896.3737383<\/td><\/tr><tr><td>Matteo Ceccarello (University of Padua); Alexandra Levchenko (Isep, LISITE); Ioana Ileana (Universit\u00e9 Paris Cit\u00e9); Themis Palpanas (Universite Paris Cit\u00e9)<\/td><\/tr><tr><td><strong>FoodPuzzle: Toward Developing Large Language Models as Autonomous Flavor Scientists<\/strong><br>DOI: 10.1145\/3711896.3737384<\/td><\/tr><tr><td>Dong Hee Lee,Emily Steliotes,Jiatong Shi,John Sweeney,Jonathan May,Matthew Lange,Muhao Chen,Tenghao Huang<\/td><\/tr><tr><td><strong>UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering<\/strong><br>DOI: 10.1145\/3711896.3737385<\/td><\/tr><tr><td>Langming Liu (Taobao &amp; Tmall Group of Alibaba); Shilei Liu (Taobao &amp; Tmall Group of Alibaba); Yujin Yuan (Taobao &amp; Tmall Group of Alibaba); Yizhen Zhang (Taobao &amp; Tmall Group of Alibaba); Bencheng Yan (Taobao &amp; Tmall Group of Alibaba); Zhiyuan Zeng (Taobao &amp; Tmall Group of Alibaba); Zihao Wang (Taobao &amp; Tmall Group of Alibaba); Jiaqi Liu (Taobao &amp; Tmall Group of Alibaba); Di Wang (Taobao &amp; Tmall Group of Alibaba); Wenbo Su (Taobao &amp; Tmall Group of Alibaba); Pengjie Wang (Taobao &amp; Tmall Group of Alibaba); Jian Xu (Taobao &amp; Tmall Group of Alibaba); Bo Zheng (Taobao &amp; Tmall Group of Alibaba)<\/td><\/tr><tr><td><strong>A Framework for Evaluating AI Agents in Open-Ended Conversations via Scripted Simulation<\/strong><br>DOI: 10.1145\/3711896.3737390<\/td><\/tr><tr><td>Clarice Wang (University of Pennsylvania); Yimin Shi (National University of Singapore); Xiaokui Xiao (National University of Singapore)<\/td><\/tr><tr><td><strong>Differentially Private Synthetic Data Release for Topics API Outputs<\/strong><br>DOI: 10.1145\/3711896.3737391<\/td><\/tr><tr><td>Travis Dick (Google Research); Alessandro Epasto (Google Research); Adel Javanmard (University of Southern California,Google Research); Josh Karlin (Google Chrome); Andr\u00e9s Mu\u00f1oz Medina (Google Chrome); Vahab Mirrokni (Google Research); Sergei Vassilvitskii (Google Research); Peilin Zhong (Google Research)<\/td><\/tr><tr><td><strong>Fairness-Aware Graph Learning: A Benchmark<\/strong><br>DOI: 10.1145\/3711896.3737392<\/td><\/tr><tr><td>Yushun Dong (Florida State University); Song Wang (The University of Virginia); Zhenyu Lei (The University of Virginia); Zaiyi Zheng (The University of Virginia); Jing Ma (Case Western Reserve University); Chen Chen (The University of Virginia); Jundong Li (The University of Virginia)<\/td><\/tr><tr><td><strong>MentalChat16K: A Benchmark Dataset for Conversational Mental Health Assistance<\/strong><br>DOI: 10.1145\/3711896.3737393<\/td><\/tr><tr><td>George Demiris,Jia Xu,Joost Wagenaar,Li Shen,Patryk Orzechowski,Rachael Paulbeck,Ruochen Jin,Shu Yang,Tianyi Wei,Bojian Hou<\/td><\/tr><tr><td><strong>MonoDeMB: Comprehensive Monocular DepthMap Benchmark<\/strong><br>DOI: 10.1145\/3711896.3737394<\/td><\/tr><tr><td>Vaagn Chopuryan (Sber AI); Mikhail Kuznetsov (Sber AI,Skolkovo Institute of Science and Technology); Vasilii Latonov (Sber AI); Vladimir Mashurov (Sber AI,ITMO National Research University); Natalia Semenova (Sber AI,Artificial Intelligence Research Institute)<\/td><\/tr><tr><td><strong>ZooplanktonBench: A Geo-Aware Zooplankton Recognition and Classification Dataset from Marine Observations<\/strong><br>DOI: 10.1145\/3711896.3737395<\/td><\/tr><tr><td>Fukun Liu (University of Georgia); Adam T. Greer (University of Georgia); Gengchen Mai (University of Texas at Austin,University of Georgia); Jin Sun (University of Georgia)<\/td><\/tr><tr><td><strong>Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding<\/strong><br>DOI: 10.1145\/3711896.3737396<\/td><\/tr><tr><td>Dawei Zhou (Computer Science, Virginia Polytechnic Institute and State University); Feng Guo (Statistics, Virginia Polytechnic Institute and State University,Virginia Tech Transportation Institute, Virginia Polytechnic Institute and State University); Liang Shi (Statistics, Virginia Polytechnic Institute and State University,Virginia Tech Transportation Institute, Virginia Polytechnic Institute and State University); Longfeng Wu (Computer Science, Virginia Polytechnic Institute and State University); Tong Zeng (Computer Science, Virginia Polytechnic Institute and State University)<\/td><\/tr><tr><td><strong>Neurophysiologically Realistic Environment for Comparing Adaptive Deep Brain Stimulation Algorithms in Parkinson\u2019s Disease<\/strong><br>DOI: 10.1145\/3711896.3737397<\/td><\/tr><tr><td>Ekaterina Kuzmina (Artificial Intelligence Research Institute,Skolkovo Institute of Science and Technology); Dmitrii Kriukov (Artificial Intelligence Research Institute,Skolkovo Institute of Science and Technology); Mikhail Lebedev (Lomonosov Moscow State University); Dmitry V. Dylov (Artificial Intelligence Research Institute,Skolkovo Institute of Science and Technology)<\/td><\/tr><tr><td><strong>Flexible Generation of Preference Data for Recommendation Analysis<\/strong><br>DOI: 10.1145\/3711896.3737398<\/td><\/tr><tr><td>Simone Mungari (University of Calabria,ICAR-CNR); Erica Coppolillo (University of Calabria,ICAR-CNR); Ettore Ritacco (University of Udine); Giuseppe Manco (ICAR-CNR)<\/td><\/tr><tr><td><strong>HiDF: A Human-Indistinguishable Deepfake Dataset<\/strong><br>DOI: 10.1145\/3711896.3737399<\/td><\/tr><tr><td>Chaewon Kang (Sungkyunkwan University); Seoyoon Jeong (Sungkyunkwan University); Jonghyun Lee (Sungkyunkwan University); Daejin Choi (Incheon National University,Ewha Women&#8217;s University); Simon S. Woo (Sungkyunkwan University); Jinyoung Han (Sungkyunkwan University)<\/td><\/tr><tr><td><strong>Simulated Infectious Diseases Datasets with Controlled Data Bias<\/strong><br>DOI: 10.1145\/3711896.3737401<\/td><\/tr><tr><td>Ruochen Kong (Computer Science, Emory University); Taylor Anderson (George Mason University); Matthew Scotch (Arizona State University); David J. Heslop (School of Population Health, University of New South Wales); Yonchanok Khaokaew (University of New South Wales); Hao Xue (University of New South Wales); Li Xiong (Emory University); Chandini Raina MacIntyre (University of New South Wales); Flora D. Salim (University of New South Wales); Andreas Z\u00fcfle (Emory University)<\/td><\/tr><tr><td><strong>SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models<\/strong><br>DOI: 10.1145\/3711896.3737403<\/td><\/tr><tr><td>Chuan Qin (Department of Big Data Technology and Application Development, Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences); Xin Chen (Big Data Technology and Development, Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences); Chengrui Wang (Computer Network Information Center, Chinese Academy of Sciences); Pengmin Wu (Big Data Technology and Development, Computer Network Information Center, Chinese Academy of Sciences); Xi Chen (School of Computer Science and Technology, University of Science and Technology of China,Computer Network Information Center, Chinese Academy of Sciences); Yihang Cheng (Computer Network Information Center, Chinese Academy of Sciences); Jingyi Zhao (Computer Network Information Center, Chinese Academy of Sciences); Meng Xiao (Computer Network Information Center, Chinese Academy of Sciences); Xiangchao Dong (Computer Network Information Center, Chinese Academy of Sciences); Qingqing Long (Computer Network Information Center, Chinese Academy of Sciences); Boya Pan (Computer Network Information Center, Chinese Academy of Sciences); Han Wu (School of Computer Science and Information Engineering, Hefei University of Technology); Chengzan Li (Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences); Yuanchun Zhou (Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences); Hui Xiong (Thrust of Artificial Intelligence, Hong Kong University of Science and Technology  (Guangzhou),Department of Computer Science and Engineering, The Hong Kong University of Science and Technology); Hengshu Zhu (Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences)<\/td><\/tr><tr><td><strong>When Graph Meets Multimodal: Benchmarking and Meditating on Multimodal Attributed Graph Learning<\/strong><br>DOI: 10.1145\/3711896.3737404<\/td><\/tr><tr><td>Hao Yan (Central South University); Chaozhuo Li (Microsoft Research Asia); Jun Yin (Central South University); Zhigang Yu (Central South University); Weihao Han (Microsoft AI); Mingzheng Li (Microsoft AI); Zhengxin Zeng (Microsoft AI); Hao Sun (Microsoft AI); Senzhang Wang (Central South University)<\/td><\/tr><tr><td><strong>Revolutionizing Database QA with Large Language Models: Comprehensive Benchmark and Evaluation<\/strong><br>DOI: 10.1145\/3711896.3737405<\/td><\/tr><tr><td>Yihang Zheng (Xiamen University); Bo Li (Xiamen University); Zhenghao Lin (Xiamen University); Yi Luo (Xiamen University); Xuanhe Zhou (Shanghai Jiaotong University); Chen Lin (Xiamen University); Guoliang Li (Tsinghua University); Jinsong Su (Xiamen University)<\/td><\/tr><tr><td><strong>ClimateIQA: A New Dataset and Benchmark to Advance Vision-Language Models in Meteorology Anomalies Analysis<\/strong><br>DOI: 10.1145\/3711896.3737406<\/td><\/tr><tr><td>Jian Chen (Thrust of Artificial Intelligence, The Hong Kong University of Science and Technology  (Guangzhou),HSBC); Peilin Zhou (Thrust of Data Science and Analytics, The Hong Kong University of Science and Technology  (Guangzhou)); Yining Hua (Harvard University); Dading Chong (Peking University); Meng Cao (Mohamed bin Zayed University of Artificial Intelligence); Yaowei Li (Harvard University); Wei Chen (Thrust of Data Science and Analytics, The Hong Kong University of Science and Technology  (Guangzhou)); Bing Zhu (HSBC); Junwei Liang (Thrust of Artificial Intelligence, The Hong Kong University of Science and Technology  (Guangzhou)); Zixuan Yuan (Thrust of Financial Technology, The Hong Kong University of Science and Technology  (Guangzhou))<\/td><\/tr><tr><td><strong>T3Set: A Multimodal Dataset with Targeted Suggestions for LLM-based Virtual Coach in Table Tennis Training<\/strong><br>DOI: 10.1145\/3711896.3737407<\/td><\/tr><tr><td>Ji Ma (State Key Lab of CAD&amp;CG, Zhejiang University); Jiale Wu (State Key Lab of CAD&amp;CG, Zhejiang University); Haoyu Wang (State Key Lab of CAD&amp;CG, Zhejiang University); Yanze Zhang (State Key Lab of CAD&amp;CG, Zhejiang University); Xiao Xie (Department of Sports Science, Zhejiang University); Zheng Zhou (Department of Sports Science, Zhejiang University); Hui Zhang (Department of Sports Science, Zhejiang University); Jiachen Wang (Department of Sports Science, Zhejiang University); Yingcai Wu (State Key Lab of CAD&amp;CG, Zhejiang University)<\/td><\/tr><tr><td><strong>On the Generalization and Adaptation Ability of Machine-Generated Text Detectors in Academic Writing<\/strong><br>DOI: 10.1145\/3711896.3737408<\/td><\/tr><tr><td>Yule Liu (DSA, The Hong Kong University of Science and Technology); Zhiyuan Zhong (The Hong Kong University of Science and Technology (Guangzhou),Southern University of Science and Technology); Yifan Liao (CS, National University of Singapore,Chongqing University); Zhen Sun (Infomation Hub, The Hong Kong University of Science and Technology (Guangzhou)); Jingyi Zheng (IOT, The Hong Kong University of Science and Technology (Guangzhou)); Jiaheng Wei (Data Science and Analytics, The Hong Kong University of Science and Technology (Guangzhou)); Qingyuan Gong (Research Institute of Intelligent Complex Systems, Fudan University,School of Computer Science, Fudan University); Fenghua Tong (Qilu University of Technology); Yang Chen (Fudan University);  Yang Zhang (Group Zhang, CISPA Helmholtz Center for Information Security);  Xinlei He (DSA &amp; IoT Thrust, The Hong Kong University of Science and Technology (Guangzhou))<\/td><\/tr><tr><td><strong>Judge Anything: MLLM as a Judge Across Any Modality<\/strong><br>DOI: 10.1145\/3711896.3737409<\/td><\/tr><tr><td>Shu Pu (Huazhong University of Science and Technology); Yaochen Wang (Huazhong University of Science and Technology); Dongping Chen (Huazhong University of Science and Technology); Yuhang Chen (Huazhong University of Science and Technology); Guohao Wang (Huazhong University of Science and Technology); Qi Qin (Huazhong University of Science and Technology); Zhongyi Zhang (Huazhong University of Science and Technology); Zhiyuan Zhang (Huazhong University of Science and Technology); Zetong Zhou (Huazhong University of Science and Technology); Shuang Gong (Huazhong University of Science and Technology); Yi Gui (Huazhong University of Science and Technology); Yao Wan (Huazhong University of Science and Technology); Philip S. Yu (University of Illinois Chicago)<\/td><\/tr><tr><td><strong>Benchmarking Graph Foundation Models<\/strong><br>DOI: 10.1145\/3711896.3737410<\/td><\/tr><tr><td>Jinyu Yang (Beijing University of Posts and Telecommunications); Liangwei Yang (University of Illinois Chicago); Zeyuan Guo (Beijing University of Posts and Telecommunications); Jiayi Gao (Beijing University of Posts and Telecommunications); Jing Wu (Beijing University of Posts and Telecommunications); Tianhao Chai (Beijing University of Posts and Telecommunications); Hai Huang (Beijing University of Posts and Telecommunications); Cheng Yang (Beijing University of Posts and Telecommunications); Chuan Shi (Beijing University of Posts and Telecommunications)<\/td><\/tr><tr><td><strong>VFLAIR-LLM: A Comprehensive Framework and Benchmark for Split Learning of LLMs<\/strong><br>DOI: 10.1145\/3711896.3737411<\/td><\/tr><tr><td>Zixuan Gu (School of Software, Tsinghua University); Qiufeng Fan (Privacy Computing, Wuxi Innovation Center of Tsinghua AIR); Long Sun (Privacy Computing, Wuxi Innovation Center of Tsinghua AIR); Yang Liu (the Hong Kong Polytechnic University,the Shanghai Artificial Intelligence Laboratory); Xiaojun Ye (School of Software, Tsinghua University)<\/td><\/tr><tr><td><strong>BurstGPT: A Real-World Workload Dataset to Optimize LLM Serving Systems<\/strong><br>DOI: 10.1145\/3711896.3737413<\/td><\/tr><tr><td>Yuxin Wang (Huawei Hong Kong Research Center); Yuhan Chen (The Hong Kong University of Science and Technology  (Guangzhou)); Zeyu Li (The Hong Kong University of Science and Technology  (Guangzhou)); Xueze Kang (Hong Kong University of Science and Technology  (Guangzhou)); Yuchu Fang (Huawei Technologies Ltd.); Yeju Zhou (Huawei Technologies Ltd.); Yang Zheng (Huawei Technologies Ltd.); Zhenheng Tang (The Hong Kong University of Science and Technology); Xin He (Hong Kong Baptist University); Rui Guo (Tsinghua University); Xin Wang (Tsinghua University); Qiang Wang (Harbin Institute of Technology, Shenzhen); Amelie Chi Zhou (Hong Kong Baptist University); Xiaowen Chu (Hong Kong University of Science and Technology  (Guangzhou))<\/td><\/tr><tr><td><strong>Saliency-Bench: A Comprehensive Benchmark for Evaluating Visual Explanations<\/strong><br>DOI: 10.1145\/3711896.3737414<\/td><\/tr><tr><td>Yifei Zhang (Emory University); James Song (Emory University); Siyi Gu (Stanford University); Tianxu Jiang (University of Michigan &#8211; Ann Arbor); Bo Pan (Emory University); Guangji Bai (Emory University); Liang Zhao (Emory University)<\/td><\/tr><tr><td><strong>MethaneS2CM: A Dataset for Multispectral Deep Methane Emission Detection<\/strong><br>DOI: 10.1145\/3711896.3737415<\/td><\/tr><tr><td>Hongxuan Liu (Department of Electrical and Computer Engineering, University of Alberta); Juliana Y. Leung (Department of Civil and Environmental Engineering, University of Alberta); Di Niu (Department of Electrical and Computer Engineering, University of Alberta)<\/td><\/tr><tr><td><strong>MetamatBench: Integrating Heterogeneous Data, Computational Tools, and Visual Interface for Metamaterial Discovery<\/strong><br>DOI: 10.1145\/3711896.3737416<\/td><\/tr><tr><td>Jianpeng Chen (Computer Science, Virginia Polytechnic Institute and State University); Wangzhi Zhan (Computer Science, Virginia Polytechnic Institute and State University); Haohui Wang (Computer Science, Virginia Polytechnic Institute and State University); Zian Jia (Materials Science and Engineering, University of Pennsylvania,Ecology and Evolutionary Biology, Princeton University); Jingru Gan (Computer Science, University of California, Los Angeles); Junkai Zhang (Computer Science, University of California, Los Angeles); Jingyuan Qi (Computer Science, Virginia Polytechnic Institute and State University); Tingwei Chen (EECS, University of Tennessee, Knoxville); Lifu Huang (Computer Science, University of California, Davis); Muhao Chen (Computer Science, University of California, Davis); Ling Li (University of Pennsylvania); Wei Wang (Computer Science, University of California, Los Angeles); Dawei Zhou (Computer Science, Virginia Polytechnic Institute and State University)<\/td><\/tr><tr><td><strong>VideoConviction: A Multimodal Benchmark for Human Conviction and Stock Market Recommendations<\/strong><br>DOI: 10.1145\/3711896.3737417<\/td><\/tr><tr><td>Michael Galarnyk (Georgia Institute of Technology); Veer Kejriwal (Georgia Institute of Technology); Agam Shah (Georgia Institute of Technology); Yash Bhardwaj (Georgia Institute of Technology); Nicholas Watney Meyer (Georgia Institute of Technology); Anand Krishnan (Stanford University); Sudheer Chava (Georgia Institute of Technology)<\/td><\/tr><tr><td><strong>TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors<\/strong><br>DOI: 10.1145\/3711896.3737418<\/td><\/tr><tr><td>Jingyi Zheng (IOT, The Hong Kong University of Science and Technology  (Guangzhou)); Junfeng Wang (The Hong Kong University of Science and Technology  (Guangzhou)); Wenhan Dong (AI, The Hong Kong University of Science and Technology  (Guangzhou)); Xinlei He (DSA &amp; IoT Thrust, The Hong Kong University of Science and Technology  (Guangzhou)); Yule Liu (DSA, The Hong Kong University of Science and Technology  (Guangzhou)); Zhen Sun (Infomation Hub, The Hong Kong University of Science and Technology  (Guangzhou))<\/td><\/tr><tr><td><strong>IdeaBench: Benchmarking Large Language Models for Research Idea Generation<\/strong><br>DOI: 10.1145\/3711896.3737419<\/td><\/tr><tr><td>Aidong Zhang (Computer Science, University of Virginia); Albert Huang (Department of Computer Science, University of Virginia, Charlottesville); Amir Hassan Shariatmadari (Computer Science, University of Virginia, Charlottesville); Corey M. Williams (Immunology &amp; Biomedical Engineering, University of Virginia, Charlottesville); Guangzhi Xiong (Computer Science, University of Virginia, Charlottesville,English Language, Tsinghua University); Myles Kim (University of Virginia, Charlottesville); Sikun Guo (Computer Science, University of Virginia, Charlottesville,College of Electronic Information and Optical Engineering , Nankai University); Stefan Bekiranov (Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville,Gaasterland Lab, Rockefeller University)<\/td><\/tr><tr><td><strong>Exploring the Potential of Foundation Models as Reliable AI Contact Centers<\/strong><br>DOI: 10.1145\/3711896.3737420<\/td><\/tr><tr><td>Hoyoon Byun (Department of Applied Statistics and Data Science, Yonsei University); Minhoi Park (Institute of Data Science, Yonsei University); Seolah Kim (120 Dasan Call Foundation); EunBi Kim (120 Dasan Call Foundation); Kyungwoo Song (Department of Applied Statistics and Data Science, Yonsei University)<\/td><\/tr><tr><td><strong>When Heterophily Meets Heterogeneity: Challenges and a New Large-Scale Graph Benchmark<\/strong><br>DOI: 10.1145\/3711896.3737421<\/td><\/tr><tr><td>Junhong Lin (Massachusetts Institute of Technology,Huazhong University of Science and Technology); Xiaojie Guo (IBM Research,George Mason University); Shuaicheng Zhang (Virginia Tech,University of Maryland, College Park); Yada Zhu (IBM Research,Rutgers University); Julian Shun (Massachusetts Institute of Technology)<\/td><\/tr><tr><td><strong>DCA-Bench: A Benchmark for Dataset Curation Agents<\/strong><br>DOI: 10.1145\/3711896.3737422<\/td><\/tr><tr><td>Benhao Huang (Carnegie Mellon University); Yingzhuo Yu (University of Illinois at Urbana-Champaign); Jin Huang (University of Michigan &#8211; Ann Arbor); Xingjian Zhang (University of Michigan &#8211; Ann Arbor); Jiaqi W. Ma (University of Illinois Urbana-Champaign)<\/td><\/tr><tr><td><strong>UP-Bench: A Benchmark for Underwater Path Planning Algorithms<\/strong><br>DOI: 10.1145\/3711896.3737424<\/td><\/tr><tr><td>Di Yang (Data Science, College of William and Mary); Yanhai Xiong (Data Science, College of William and Mary)<\/td><\/tr><tr><td><strong>Capillary Dataset: A Dataset of Nail-fold Capillaries Captured by Microscopy for Diabetes Detection<\/strong><br>DOI: 10.1145\/3711896.3737425<\/td><\/tr><tr><td>Hang Thi Phuong Nguyen (AI Convergence, Chonnam National University); Hieyong Jeong (Artificial Intelligence Convergence, Chonnam National University,Graduate School of Medicine, Osaka University)<\/td><\/tr><tr><td><strong>$EFO_{k}$-CQA: Towards Knowledge Graph Complex Query Answering beyond Set Operation<\/strong><br>DOI: 10.1145\/3711896.3737426<\/td><\/tr><tr><td>Hang Yin (Department of Mathematical Science, Tsinghua University); Zihao Wang (Department of Computer Science and Engineering, The Hong Kong University of Science and Technology); Weizhi Fei (Department of Mathematical Science, Tsinghua University); Yangqiu Song (Department of Computer Science and Engineering, Hong Kong University of Science and Technology)<\/td><\/tr><tr><td><strong>NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation<\/strong><br>DOI: 10.1145\/3711896.3737427<\/td><\/tr><tr><td>Xinyu Liu (The Hong Kong University of Science and Technology  (Guangzhou)); Shuyu Shen (The Hong Kong University of Science and Technology  (Guangzhou)); Boyan Li (The Hong Kong University of Science and Technology  (Guangzhou)); Nan Tang (The Hong Kong University of Science and Technology  (Guangzhou)); Yuyu Luo (The Hong Kong University of Science and Technology  (Guangzhou))<\/td><\/tr><tr><td><strong>EBES: Easy Benchmarking for Event Sequences<\/strong><br>DOI: 10.1145\/3711896.3737428<\/td><\/tr><tr><td>Dmitry Osin (Skolkovo Institute of Science and Technology); Egor Shvetsov (Skolkovo Institute of Science and Technology); Evgeny Burnaev (Skolkovo Institute of Science and Technology,Artificial Intelligence Research Institute); Igor Udovichenko (Skolkovo Institute of Science and Technology,Vega Institute Foundation); Viktor Moskvoretskii (Skolkovo Institute of Science and Technology,Higher School of Economics)<\/td><\/tr><tr><td><strong>TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks<\/strong><br>DOI: 10.1145\/3711896.3737429<\/td><\/tr><tr><td>Qihai Zhang (Center for Data Science, New York University); Xinyue Sheng (Data Science, New York University Shanghai); Yuanfu Sun (Courant Institute, New York University,School of Artificial Intelligence, Jilin University); Qiaoyu Tan (Computer Science, New York University Shanghai,Texas A&amp;M University)<\/td><\/tr><tr><td><strong>POVE: A Preoptimized Vault of Expressions for Symbolic Regression Research and Benchmarking<\/strong><br>DOI: 10.1145\/3711896.3737430<\/td><\/tr><tr><td>Kei Sen Fong (Department of Electrical and Computer Engineering, National University of Singapore); Mehul Motani (Department of Electrical and Computer Engineering, Institute of Data Science, N.1 Institute for Health, Institute for Digital Medicine  (WisDM), National University of Singapore)<\/td><\/tr><tr><td><strong>EMBER2024 &#8212; A Benchmark Dataset for Holistic Evaluation of Malware Classifiers<\/strong><br>DOI: 10.1145\/3711896.3737431<\/td><\/tr><tr><td>Robert J. Joyce (Booz Allen Hamilton); Gideon Miller (Laboratory for Physical Sciences); Phil Roth (Data Science, CrowdStrike); Richard J. Zak (Booz Allen Hamilton); Elliott Zaresky-Williams (Booz Allen Hamilton); Hyrum Anderson (Cisco Systems); Edward Raff (Booz Allen Hamilton); James Holt (Laboratory for Physical Sciences)<\/td><\/tr><tr><td><strong>ScIRGen: Synthesize Realistic and Large-Scale RAG Dataset for Scientific Research<\/strong><br>DOI: 10.1145\/3711896.3737432<\/td><\/tr><tr><td>Junyong Lin (The Hong Kong University of Science and Technology  (Guangzhou)); Lu Dai (The Hong Kong University of Science and Technology  (Guangzhou),The Hong Kong University of Science and Technology); Ruiqian Han (The Hong Kong University of Science and Technology  (Guangzhou)); Yijie Sui (Institute of Tibetan Plateau Research, Chinese Academy of Sciences); Ruilin Wang (Lanzhou University); Sun Xingliang (Lanzhou University); Qinglin Wu (Institute of Tibetan Plateau Research, Chinese Academy of Sciences); Min Feng (Institute of Tibetan Plateau Research, Chinese Academy of Sciences,College of Resources and Environment, University of Chinese Academy of Sciences); Hao Liu (The Hong Kong University of Science and Technology  (Guangzhou),The Hong Kong University of Science and Technology); Hui Xiong (The Hong Kong University of Science and Technology  (Guangzhou),The Hong Kong University of Science and Technology)<\/td><\/tr><tr><td><strong>RL4CO: An Extensive Reinforcement Learning for Combinatorial Optimization Benchmark<\/strong><br>DOI: 10.1145\/3711896.3737433<\/td><\/tr><tr><td>Federico Berto (KAIST,OMELET); Chuanbo Hua (KAIST,OMELET); Junyoung Park (KAIST); Laurin Luttmann (Leuphana University); Yining Ma (MIT); Fanchen Bu (KAIST); Jiarui Wang (Southeast University); Haoran Ye (Peking University); Minsu Kim (Mila,KAIST); Sanghyeok Choi (KAIST); Nayeli Gast Zepeda (Bielefeld University); Andr\u00e9 Hottung (Bielefeld University); Jianan Zhou (Nanyang Technological University); Jieyi Bi (Nanyang Technological University); Yu Hu (Soochow University); Fei Liu (City University of Hong Kong); Hyeonah Kim (Mila,Universit\u00e9 de Montr\u00e9al); Jiwoo Son (OMELET); Haeyeon Kim (KAIST); Davide Angioni (University of Brescia); Wouter Kool (ORTEC); Zhiguang Cao (Singapore Management University); Qingfu Zhang (City University of Hong Kong); Joungho Kim (KAIST); Jie Zhang (Nanyang Technological University); Kijung Shin (KAIST); Cathy Wu (MIT); Sungsoo Ahn (KAIST); Guojie Song (Peking University); Changhyun Kwon (KAIST,OMELET); Kevin Tierney (Bielefeld University); Lin Xie (Brandenburg University of Technology); Jinkyoo Park (KAIST,OMELET)<\/td><\/tr><tr><td><strong>Towards Understanding Link Predictor Generalizability Under Distribution Shifts<\/strong><br>DOI: 10.1145\/3711896.3737434<\/td><\/tr><tr><td>Jay Revolinsky (Michigan State University); Harry Shomer (Michigan State University); Jiliang Tang (Michigan State University)<\/td><\/tr><tr><td><strong>CURE: A dataset for Clinical Understanding &amp; Retrieval Evaluation<\/strong><br>DOI: 10.1145\/3711896.3737435<\/td><\/tr><tr><td>Nadia Athar Sheikh (Clinia); Daniel Buades Marcos (Clinia); Anne-Laure Jousse (Clinia); Akintunde Oladipo (Clinia); Olivier Rousseau (Clinia); Jimmy Lin (University of Waterloo)<\/td><\/tr><tr><td><strong>MathWriting: A Dataset For Handwritten Mathematical Expression Recognition<\/strong><br>DOI: 10.1145\/3711896.3737436<\/td><\/tr><tr><td>Philippe Gervais (Inceptive); Anastasiia Fadeeva (Google DeepMind); Andrii Maksai (Google DeepMind)<\/td><\/tr><tr><td><strong>A Guide to Misinformation Detection Data and Evaluation<\/strong><br>DOI: 10.1145\/3711896.3737437<\/td><\/tr><tr><td>Camille Thibault (Universit\u00e9 de Montr\u00e9al); Jacob-Junqi Tian (Vector Institute,Mila &#8211; Quebec Artificial Intelligence Institute); Gabrielle P\u00e9loquin-Skulski (Massachusetts Institute of Technology); Taylor Lynn Curtis (Mila &#8211; Quebec Artificial Intelligence Institute); James Zhou (University of California, Berkeley); Florence Laflamme (Universit\u00e9 de Montr\u00e9al); Yuxiang Guan (McMaster University); Reihaneh Rabbany (McGill University,Mila &#8211; Quebec Artificial Intelligence Institute); Jean-Fran\u00e7ois Godbout (Universit\u00e9 de Montr\u00e9al); Kellin Pelrine (McGill University,Mila &#8211; Quebec Artificial Intelligence Institute)<\/td><\/tr><tr><td><strong>Towards Better Benchmark Datasets for Inductive Knowledge Graph Completion<\/strong><br>DOI: 10.1145\/3711896.3737438<\/td><\/tr><tr><td>Harry Shomer (Michigan State University); Jay Revolinsky (Michigan State University); Jiliang Tang (Michigan State University)<\/td><\/tr><tr><td><strong>TimeGraph: Synthetic Benchmark Datasets for Robust Time-Series Causal Discovery<\/strong><br>DOI: 10.1145\/3711896.3737439<\/td><\/tr><tr><td>Muhammad Hasan Ferdous (Information Systems, University of Maryland, Baltimore County); Emam Hossain (Information Systems, University of Maryland, Baltimore County); Md Osman Gani (Information Systems, University of Maryland, Baltimore County)<\/td><\/tr><tr><td><strong>SatHealth: A Multimodal Public Health Dataset with Satellite-based Environmental Factors<\/strong><br>DOI: 10.1145\/3711896.3737440<\/td><\/tr><tr><td>Yuanlong Wang (The Ohio State University); Pengqi Wang (The Ohio State University); Changchang Yin (The Ohio State University); Ping Zhang (The Ohio State University)<\/td><\/tr><tr><td><strong>BTS: A Comprehensive Benchmark for Tie Strength Prediction<\/strong><br>DOI: 10.1145\/3711896.3737441<\/td><\/tr><tr><td>Xueqi Cheng (Vanderbilt University); Catherine Yang (Vanderbilt University); Yuying Zhao (Vanderbilt University); Yu Wang (University of Oregon); Hamid Karimi (Utah State University); Tyler Derr (Vanderbilt University)<\/td><\/tr><tr><td><strong>TSFM-Bench: A Comprehensive and Unified Benchmark of Foundation Models for Time Series Forecasting<\/strong><br>DOI: 10.1145\/3711896.3737442<\/td><\/tr><tr><td>Zhe Li (School of Data Science and Engineering, East China Normal University); Xiangfei Qiu (School of Data Science and Engineering, East China Normal University); Peng Chen (School of Data Science and Engineering, East China Normal University); Yihang Wang (School of Data Science and Engineering, East China Normal University); Hanyin Cheng (School of Data Science and Engineering, East China Normal University); Yang Shu (School of Data Science and Engineering, East China Normal University); Jilin Hu (School of Data Science and Engineering, East China Normal University); Chenjuan Guo (School of data science and engineering, East China Normal University); Aoying Zhou (School of Data Science and Engineering, East China Normal University); Christian S. Jensen (Department of Computer Science, Aalborg University); Bin Yang (School of Data Science and Engineering, East China Normal University,Department of Computer Science, Aalborg University)<\/td><\/tr><tr><td><strong>FULTR: A Large-scale Fusion Learning to Rank Dataset and its application for Satisfaction-Oriented Ranking<\/strong><br>DOI: 10.1145\/3711896.3737443<\/td><\/tr><tr><td>Yuchen Li (Baidu Inc.); Hao Zhang (Baidu Inc.); Haojie Zhang (Baidu Inc.); Hengyi Cai (Baidu Inc.); Xinyu Ma (Baidu Inc.); Shuaiqiang Wang (Baidu Inc.); Haoyi Xiong (Baidu Inc.); Zhaochun Ren (Leiden University); Maarten de Rijke (University of Amsterdam); Dawei Yin (Baidu Inc.)<\/td><\/tr><tr><td><strong>WikiRAG:  Revisiting Wikidata KGC Datasets with Community Updates and Retrieval-Augmented Generation<\/strong><br>DOI: 10.1145\/3711896.3737444<\/td><\/tr><tr><td>Djellel Difallah (New York University Abu Dhabi)<\/td><\/tr><\/tbody><\/table><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>Datasets and Benchmarks Track Papers DOI links will be available by August 3rd, please check back then to access the direct links below BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life PredictionDOI: 10.1145\/3711896.3737372 Ruifeng Tan (The Hong Kong University of Science and Technology (Guangzhou),The Hong Kong University of Science and Technology); Weixiang Hong (The [&hellip;]<\/p>\n","protected":false},"author":6,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"ngg_post_thumbnail":0,"footnotes":""},"class_list":["post-2000","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/kdd2025.kdd.org\/wp-json\/wp\/v2\/pages\/2000","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/kdd2025.kdd.org\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/kdd2025.kdd.org\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/kdd2025.kdd.org\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/kdd2025.kdd.org\/wp-json\/wp\/v2\/comments?post=2000"}],"version-history":[{"count":8,"href":"https:\/\/kdd2025.kdd.org\/wp-json\/wp\/v2\/pages\/2000\/revisions"}],"predecessor-version":[{"id":2418,"href":"https:\/\/kdd2025.kdd.org\/wp-json\/wp\/v2\/pages\/2000\/revisions\/2418"}],"wp:attachment":[{"href":"https:\/\/kdd2025.kdd.org\/wp-json\/wp\/v2\/media?parent=2000"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}