TY - GEN
T1 - On-line fair allocations based on bottlenecks and global priorities
AU - Zeldes, Yoel
AU - Feitelson, Dror G.
PY - 2013
Y1 - 2013
N2 - System bottlenecks, namely those resources which are subjected to high contention, constrain system performance. Hence effective resource management should be done by focusing on the bottleneck resources and allocating them to the most deserving clients. It has been shown that for any combination of entitlements and requests a fair allocation of bottleneck resources can be found, using an off-line algorithm that is given full information in advance regarding the needs of each client. We extend this result to the on-line case with no prior information. To this end we introduce a simple greedy algorithm. In essence, when a scheduling decision needs to be made, this algorithm selects the client that has the largest minimal gap between its entitlement and its current allocation among all the bottleneck resources. Importantly, this algorithm takes a global view of the system, and assigns each client a single priority based on his usage of all the resources; this single priority is then used to make coordinated scheduling decisions on all the resources. Extensive simulations show that this algorithm achieves fair allocations according to the desired entitlements for a wide range of conditions, without using any prior information regarding resource requirements. It also follows shifting usage patterns, including situations where the bottlenecks change with time.
AB - System bottlenecks, namely those resources which are subjected to high contention, constrain system performance. Hence effective resource management should be done by focusing on the bottleneck resources and allocating them to the most deserving clients. It has been shown that for any combination of entitlements and requests a fair allocation of bottleneck resources can be found, using an off-line algorithm that is given full information in advance regarding the needs of each client. We extend this result to the on-line case with no prior information. To this end we introduce a simple greedy algorithm. In essence, when a scheduling decision needs to be made, this algorithm selects the client that has the largest minimal gap between its entitlement and its current allocation among all the bottleneck resources. Importantly, this algorithm takes a global view of the system, and assigns each client a single priority based on his usage of all the resources; this single priority is then used to make coordinated scheduling decisions on all the resources. Extensive simulations show that this algorithm achieves fair allocations according to the desired entitlements for a wide range of conditions, without using any prior information regarding resource requirements. It also follows shifting usage patterns, including situations where the bottlenecks change with time.
KW - bottlenecks
KW - entitlements
KW - fairness
KW - online algorithm
KW - priority inversion
KW - resource allocation
UR - http://www.scopus.com/inward/record.url?scp=84878206440&partnerID=8YFLogxK
U2 - 10.1145/2479871.2479904
DO - 10.1145/2479871.2479904
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84878206440
SN - 9781450316361
T3 - ICPE 2013 - Proceedings of the 2013 ACM/SPEC International Conference on Performance Engineering
SP - 229
EP - 240
BT - ICPE 2013 - Proceedings of the 2013 ACM/SPEC International Conference on Performance Engineering
T2 - 2013 4th ACM/SPEC International Conference on Performance Engineering, ICPE 2013
Y2 - 21 April 2013 through 24 April 2013
ER -