Poster (Scientific congresses, symposiums and conference proceedings)
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
Li, Ziming; ZANG, Qianbo; Ma, David et al.
2024ICLR 2025 Worshop Emergent Possibilities and Challenges in Deep Learning for Code
Peer reviewed
 

Files


Full Text
AutoKaggle.pdf
Publisher postprint (1.46 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Computer Science - Artificial Intelligence; Computer Science - Computation and Language
Abstract :
[en] Data science tasks involving tabular data present complex challenges that require sophisticated problem-solving approaches. We propose AutoKaggle, a powerful and user-centric framework that assists data scientists in completing daily data pipelines through a collaborative multi-agent system. AutoKaggle implements an iterative development process that combines code execution, debugging, and comprehensive unit testing to ensure code correctness and logic consistency. The framework offers highly customizable workflows, allowing users to intervene at each phase, thus integrating automated intelligence with human expertise. Our universal data science toolkit, comprising validated functions for data cleaning, feature engineering, and modeling, forms the foundation of this solution, enhancing productivity by streamlining common tasks. We selected 8 Kaggle competitions to simulate data processing workflows in real-world application scenarios. Evaluation results demonstrate that AutoKaggle achieves a validation submission rate of 0.85 and a comprehensive score of 0.82 in typical data science pipelines, fully proving its effectiveness and practicality in handling complex data science tasks.
Disciplines :
Computer science
Author, co-author :
Li, Ziming
ZANG, Qianbo  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > FINATRAX
Ma, David
Guo, Jiawei
Zheng, Tuney
Liu, Minghao
Niu, Xinyao
Wang, Yue
Yang, Jian
Liu, Jiaheng
Zhong, Wanjun
Zhou, Wangchunshu
Huang, Wenhao
Zhang, Ge
More authors (4 more) Less
External co-authors :
yes
Language :
English
Title :
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
Publication date :
05 March 2024
Event name :
ICLR 2025 Worshop Emergent Possibilities and Challenges in Deep Learning for Code
Event place :
Singapore
Event date :
28/04/2025
By request :
Yes
Peer reviewed :
Peer reviewed
Commentary :
44 pages, 10 figures
Available on ORBilu :
since 06 May 2025

Statistics


Number of views
109 (10 by Unilu)
Number of downloads
45 (0 by Unilu)

OpenCitations
 
0
OpenAlex citations
 
1

Bibliography


Similar publications



Contact ORBilu