Because the lump incomes obtained from the two VM allocations for a paid service request differ only slightly, when the arrival rate of paid service requests increases, our model prefers the action that allocates fewer VMs over the action that allocates more. The former can accommodate more paid services and thus gain higher rewards for the MCC system, whereas the latter consumes more Cloud resources of the service provisioning domain.