publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. EuroSys
    FlexPipe: Adapting Dynamic LLM Serving Through Inflight Pipeline Refactoring in Fragmented Serverless Clusters
    Yanying Lin, Shijie Peng, Chengzhi Lu, and 2 more authors
    In Proceedings of the 21st European Conference on Computer Systems, 2026

2025

  1. SoCC
    Understanding Diffusion Model Serving in Production: A Top-Down Analysis of Workload, Scheduling, and Resource Efficiency
    Yanying Lin, Shuaipeng Wu, Shutian Luo, and 8 more authors
    In Proceedings of the 2025 ACM Symposium on Cloud Computing, 2025
  2. Cluster
    ROCK: Serving Multimodal Model in Cloud with Heterogeneous-Aware Resource Orchestration for Thousands of LoRA Adapters
    Shuaipeng Wu, Yanying* Lin, Shijie Peng, and 4 more authors
    In Proceedings of the 2025 IEEE International Conference on Cluster Computing, 2025
  3. IEEE TSC
    Serving LLM in Distributed GPU Cluster with Fine-Grain Pipeline Constraints
    Yanying Lin, Shijie Peng, Shuaipeng Wu, and 4 more authors
    IEEE Transactions on Services Computing, 2025

2024

  1. ICDCS
    Quart: Latency-Aware FaaS System for Pipelining Large Model Inference
    Yanying Lin, Yanbo Li, Shijie Peng, and 5 more authors
    In Proceedings of the 44th IEEE International Conference on Distributed Computing Systems, 2024
  2. ICWS
    Plank: Optimizing LLM Inference Performance in Pipeline Parallelism with Fine-Grained SLO Constraint
    Yanying Lin, Shijie Peng, Shuaipeng Wu, and 4 more authors
    In Proceedings of the 31st IEEE International Conference on Web Services, 2024
  3. IEEE TCC
    Understanding Serverless Inference in Mobile-Edge Networks: a Benchmark Approach
    Junhong Chen, Yanying* Lin, Shijie Peng, and 5 more authors
    IEEE Transactions on Cloud Computing, 2024

2023

  1. ICPADS
    FLASH: Low-Latency Serverless Model Inference with Multi-Core Parallelism in Edge
    Yanbo Li, Yanying* Lin, Shijie Peng, and 4 more authors
    In Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

2022

  1. HPCC
    ESBench: Understanding Deep Learning Inference Overheads for Edge Serverless
    Yanying* Lin, Junhong Chen, Yang Wang, and 1 more author
    In Proceedings of the 24th IEEE International Conference on High Performance Computing and Communications, 2022
  2. ISPA
    System-level Implications of Serverless: Workload Characterizing and Performance Understanding
    Deshi Deng, Yanying* Lin, and Kejiang Ye
    In Proceedings of the 20th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2022
  3. IEEE TSC
    Serverless Computing: State-of-the-Art, Challenges and Opportunities
    Yongkang Li, Yanying Lin, Kejiang Ye, and 2 more authors
    IEEE Transactions on Services Computing, 2022

2021

  1. CLOUD
    BBServerless: Burst Traffic Benchmark for Serverless
    Yanying Lin, Kejiang Ye, and Chengzhong Xu
    In Proceedings of the 14th IEEE International Conference on Cloud Computing, 2021
  2. HPCC
    PEAN: A Packet-Level End-To-End Attentive Network for Encrypted Traffic Identification
    Peng Lin, Yishen Hu, Yanying Lin, and 2 more authors
    In Proceedings of the 23rd IEEE International Conference on High Performance Computing and Communications, 2021
  3. IEEE/ACM ToN
    A Novel End-to-end Deep Learning Framework for Encrypted Traffic Identification
    Peng Lin, Kejiang Ye, Yishen Hu, and 2 more authors
    IEEE/ACM Transactions on Networking, 2021

2020

  1. ICPADS
    LBNN: Perceives the Change of Core Telecommunications Network State via Linear Bayesian Neural Network
    Yanying Lin, Kejiang Ye, and Chengzhong Xu
    In Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020