publications
Publications by category, in reverse chronological order.
2026
2025
2024
- ICWS. Plank: Optimizing LLM Inference Performance in Pipeline Parallelism with Fine-Grained SLO Constraint. In Proceedings of the 31st IEEE International Conference on Web Services, 2024
- IEEE TCC. Understanding Serverless Inference in Mobile-Edge Networks: a Benchmark Approach. IEEE Transactions on Cloud Computing, 2024
2023
- ICPADS. FLASH: Low-Latency Serverless Model Inference with Multi-Core Parallelism in Edge. In Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023
2022
- HPCC. ESBench: Understanding Deep Learning Inference Overheads for Edge Serverless. In Proceedings of the 24th IEEE International Conference on High Performance Computing and Communications, 2022
- ISPA. System-level Implications of Serverless: Workload Characterizing and Performance Understanding. In Proceedings of the 20th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2022
- IEEE TSC. Serverless Computing: State-of-the-Art, Challenges and Opportunities. IEEE Transactions on Services Computing, 2022
2021
- CLOUD. BBServerless: Burst Traffic Benchmark for Serverless. In Proceedings of the 14th IEEE International Conference on Cloud Computing, 2021
- HPCC. PEAN: A Packet-Level End-To-End Attentive Network for Encrypted Traffic Identification. In Proceedings of the 23rd IEEE International Conference on High Performance Computing and Communications, 2021
- IEEE/ACM ToN. A Novel End-to-end Deep Learning Framework for Encrypted Traffic Identification. IEEE/ACM Transactions on Networking, 2021
2020
- ICPADS. LBNN: Perceives the Change of Core Telecommunications Network State via Linear Bayesian Neural Network. In Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020