*Result*: MAIS: Multi-Model Adaptive Inference Scheduling for Web-Based Intelligent Service Platforms.

Title:
MAIS: Multi-Model Adaptive Inference Scheduling for Web-Based Intelligent Service Platforms.
Authors:
Li, Ruiqi1 (AUTHOR) qiqiamy1018@126.com, Yang, Yongjiao1 (AUTHOR) yongjiao124@163.com, Lin, Jiaxin1 (AUTHOR) jiangziwei1024@163.com, Huang, Zhuolin2 (AUTHOR) huangzhuolinresearch@outlook.com, Huang, Yuetian1 (AUTHOR) huangyuetian24@163.com
Source:
International Journal of Pattern Recognition & Artificial Intelligence. Mar2026, p1. 19p. 5 Illustrations.
Database:
Business Source Premier

*Further Information*

*With the rapid development of digital-intelligence service platforms, web-based applications are increasingly required to provide real-time and reliable AI-driven services under diverse and dynamic workloads. However, the coexistence of multiple heterogeneous models and the variability of user requests pose critical challenges for efficient model scheduling and inference optimization. To address these issues, we propose Multi-model Adaptive Inference Scheduling (MAIS), a unified framework that integrates workload-aware multi-model scheduling with adaptive inference strategies. Specifically, MAIS dynamically allocates computational resources and selects appropriate inference modes by jointly considering user latency requirements, platform resource availability, and model complexity. Its core innovation lies in combining these factors in a unified optimization framework while also supporting hierarchical inference optimization, which allows dynamic adaptation to varying workloads and model heterogeneity. Furthermore, it supports hierarchical inference optimization through model compression, selective execution, and edge-cloud collaboration, achieving a balance between accuracy, efficiency, and service quality. Extensive experiments demonstrate that MAIS significantly improves response latency, resource utilization, and service robustness compared to existing scheduling and inference methods, making it well-suited for next-generation digital-intelligence service platforms. [ABSTRACT FROM AUTHOR]

Copyright of International Journal of Pattern Recognition & Artificial Intelligence is the property of World Scientific Publishing Company and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)*