About Me

Hello! I’m SHEN Jiyuan (沈纪元), a Data Scientist at SAP AI Lab. My work focuses on developing automated evaluation pipelines for model performance assessment and providing model and method discovery for performance optimization. Prior to joining SAP, I earned my Master of Engineering (M.Eng.) degree at Nanyang Technological University with the SCRIPTS scholarship. I am fortunate to be supervised by Prof. LAM, Kwok Yan during my studies, where I primarily focused on privacy-preserving federated learning and data-efficient learning techniques. I obtained my B.Eng. degree with a major in Intelligent Science and Technology (Department of Computer Science and Engineering) from Shanghai University in 2022, under the supervision of Prof. Xing Wu. During my undergraduate studies, I had the opportunity to intern at Bosch Corporate Research, where I was mentored by Dr. Dong Chen and mainly conducted an individual research project on long-video action recognition.

My primary research interests lie in the following directions:

  1. Agent Evaluation Service: Design, build and maintain an end-to-end agent evaluation platform—spanning synthetic data pipelines (in collaboration), static/dynamic user proxies, metric specification and validation, automated error handling, and stability/consistency analysis.
  2. LLM-Related Technique Benchmarking: Develop a holistic evaluation framework for large language models, encompassing multi-modal LLM assessment, application-level AI performance evaluation, prompt optimization algorithms, and safety and security evaluation.
  3. Dataset Compression: Data-efficient Learning, Dataset Distillation, One-shot FL
  4. Federated Learning: Secure Aggregation, Fast Recovery from Poison Attacks, Model Inversion Attacks
  5. Computer Vision: Object Detection, Semantic Segmentation, Data Generation, Portrait Beautification

Selected Publications

Talk, Evaluate, Diagnose: User-aware Agent Evaluation with Automated Error Analysis

Penny Chong, Harshavardhan Abichandani, Jiyuan Shen, Atin Ghosh, Min Moe, Yifan Mai, Daniel Dahlmeier

International Conference on Learning Representations (ICLR), 2026

OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets

Jiyuan Shen, Peiyue Yuan, Atin Ghosh, Yifan Mai, Daniel Dahlmeier

European Chapter of the Association for Computational Linguistics (EACL) Industry Track, 2026

Towards Efficient and Certified Recovery from Poisoning Attacks in Federated Learning

Jiyuan Shen, Yu Jiang, Ziyao Liu, Chee Wei Tan, Kwok-Yan Lam

IEEE Transactions on Information Forensics & Security, 2025

Effective Intrusion Detection in Heterogeneous Internet-of-Things Networks via Ensemble Knowledge Distillation-based Federated Learning

Jiyuan Shen, Wenzhuo Yang, Zhaowei Chu, Jiani Fan, Dusit Niyato, Kwok-Yan Lam

IEEE International Conference on Communications (ICC), 2024

View all publications →


Recent News

Older News
  • Apr 15 2024. 📃 Our paper "Privacy-Preserving Federated Unlearning with Certified Client Removal" is uploaded. [arxiv]
  • March 25, 2024. 👨‍💻 I made a presentation about the development and potential application of Dataset Distillation. [slide]
  • March 18, 2024. 🤔 I wrote a reflection about what I've mainly done over the past half year, including my struggles, choices, and rethinking throughout the job pursuit process. [zhihu]
  • January 30 - May 15, 2024. 💼 I began my second internship as an IoT Research Intern at the Corporate Research Center of Robert Bosch (SEA) Pte Ltd.
  • January 17, 2024. 🎉 My first author research paper Effective Intrusion Detection in Heterogeneous Internet-of-Things Networks via Ensemble Knowledge Distillation-based Federated Learning is accepted by ICC 2024 (oral). [slide]
  • January 16, 2024. 📃 Our paper "Towards Efficient and Certified Recovery from Poisoning Attacks in Federated Learning" is uploaded. [arxiv]
  • October 31, 2023. 📃 Our survey paper about the promising field "Federated Unlearning" is uploaded. [arxiv]
  • October 16, 2023. 📃 My first paper about learning dataset structure (Dataset Distillation) is uploaded. [arxiv]
  • September, 2023. 🏆 I won the 15th prize in the Baidu Disk AI Competition: Beauty, Freckles and Acne Removal. [report]
  • August 11, 2023. 🎉 I passed the qualification examination (QE).
  • November, 2022. 🎉 I finished the courseworks and got an average score of 4.17/5.0.
  • August 8, 2022. 🎉 I became a Master by research student at NTU and received a fully sponsored scholarship from the college.
  • July 5, 2022. 🎉 I successfully graduated from Shanghai University and won the honor title of outstanding graduate.
  • January 2022 - July 2022. 💼 I interned as a Visual Algorithm Researcher at the Corporate Research Center of Bosch China Investment Co., Ltd.
  • October 2021. 🎉 One paper is accepted by AIAHPC 2022. [paper]
  • June 2021 - July 2021. 💼 I interned as an Algorithm Engineer at SAIC General Motors Co., Ltd.
  • January 2021. 🏆 We won the 2nd national prize in the First Ocean Target Intelligent Perception International Challenge Competition (Top 3/200). [report] [slide]
  • November 2020. 🏆 We won the 3rd national prize in the Artificial Intelligence Application Competition (Top 9/2100, 无人艇队) and were invited to share project experience in Zhuhai. [report]
  • June 2020. 🏆 We won the 2nd prize in the East China Division of the WeChat Applet Application Development Competition of the China University Computer Competition. [demo]