Li (Lilly) Wu

lilly_postdoc.jpg

I am a Postdoctoral Research Associate at the Manning College of Information & Computer Sciences, University of Massachusetts Amherst, working with Prof. Prashant Shenoy.

My research interests lie broadly in distributed systems. I design and build systems and algorithms that enhance the reliability and sustainability of large-scale computing through fault tolerance, performance diagnosis, and resource management. I also work on systems for AI, AI for systems, and IoT security. My current research areas include:

  • Fault-Tolerant Edge AI Systems: Low-latency, resource-aware model serving on constrained edge environments (SoCC 2025, ICCCN 2024, MILCOM 2024).
  • Reliable Cloud Microservices: Intelligent, online diagnosis and recovery from cascading failures at cloud scale (NOMS 2020, AIOps 2021, ACSOS 2021).
  • Sustainable Data Centers: Energy- and carbon-efficient resource management for cloud and edge data centers (HPDC 2025, CCGrid 2024).

I completed a Ph.D. in Computer Science at Technische Universität Berlin (TU Berlin), Germany, advised by Prof. Odej Kao, Dr. Johan Tordsson, and Prof. Erik Elmroth. My dissertation on automatic performance diagnosis and recovery in cloud microservices led to the MicroX series, with open-source projects such as MicroRCA and a fault-injected microservices benchmark.

In the past, I worked as an SRE at IBM Cloud, a Systems Scientist at Elastisys, and a Senior Research Scientist at Bosch Research, where I gained first-hand exposure to reliability challenges in cloud and edge systems.

You can find my publications on Google Scholar and my CV here (Jan. 2026).

I am currently on the 2025–2026 academic job market. Please feel free to reach out!

See my Research Statement and Teaching Statement.

Recent News

Jan 12, 2026 I will present “FailLite: Failure-Resilient Model Serving for Resource-Constrained Edge Environments” at the 2026 New England Systems Day, hosted by Harvard SEAS.
Jan 04, 2026 Serving on the Technical Program Committee (TPC) of GreenSys 2026, colocated with EuroSys 2026.
Dec 02, 2025 Guest lectured in COMPSCI 377: Operating Systems. Thanks to Nikko for the kind invitation.
Nov 21, 2025 Presented FailLite at ACM SoCC 2025 (virtual), USA.
Nov 18, 2025 Serving on the External Review Committee (ERC)of MLSys 2026.

Selected Publications

  1. SoCC
    FailLite.png
    FailLite: Failure-Resilient Model Serving for Resource-Constrained Edge Environments
    Li Wu, Walid Hanafy, Tarek Abdelzaher, and 3 more authors
    In Proceedings of the 2025 ACM Symposium on Cloud Computing, 2025
  2. NOMS
    MicroRCA.png
    MicroRCA: Root cause localization of performance issues in microservices
    Li Wu, Johan Tordsson, Erik Elmroth, and 1 more author
    In NOMS 2020-2020 IEEE/IFIP Network Operations and Management Symposium, 2020
  3. HPDC
    CarbonEdge.png
    Carbonedge: Leveraging mesoscale spatial carbon-intensity variations for low carbon edge computing
    Li Wu, Walid A Hanafy, Abel Souza, and 5 more authors
    In Proceedings of the 34th International Symposium on High-Performance Parallel and Distributed Computing, 2025

Awards & Honors

Selected national, international, and industry recognitions.

  • Marie Skłodowska-Curie PhD Fellowship (European Commission)
  • National Scholarship (2×; Top 0.2% nationwide; Chinese Ministry of Education)
  • Magna cum laude, PhD Dissertation - Technische Universität Berlin
  • Invited Interview on Green Internet (UN Climate Change Conference, COP29)
  • Best of IBMer: Best New SRE (IBM Cloud Foundation Services)