queuing analysis
DESCRIPTION
Queuing Analysis. Based on noted from Appendix A of Stallings Operating System text. Queuing Model and Analysis. Queue1. Queuing theory deals with modeling and analyzing systems with queues of items and servers that process the items. Queue2. server. Queue3. Goals of Queuing Analysis. - PowerPoint PPT PresentationTRANSCRIPT
Queuing Analysis
Based on noted from Appendix A of Stallings Operating System text
04/19/23 1
Queuing Model and Analysis• Queuing theory deals with modeling and analyzing systems with queues of items and servers that process the items.
04/19/23 2
Queue1
Queue2
Queue3
server
Goals of Queuing Analysis
• Typically used in analysis of networking system; examples,– increase in disk access time– Increase in process load– Increase in rate of arrival of packets, processes
• Especially useful of analysis of performance when either the load on a system is expected to increase or a design change is contemplated.
• While it is a popular method in network analysis, it has gained popularity within a system esp. with the advent of multi-core processors.
04/19/23 3
Analysis methods
• After the fact analysis: let the system run some n number times, collect the “real” data and analyze – problems?
• Predict some simple trends /projections based on experience – problems?
• Develop analytical model based on queuing theory – problems?
• Run simulation (not real systems) and collect data to analyze –problems?
04/19/23 4
04/19/23 5
Single server queue
w = mean # items waitingTw = mean waiting time
queuearrivals
λ= arrival rateserver
Dispatching discipline
departures
Ts = mean service timeρ = utilization
r mean # items residing in the systemTr = mean residence time
Multi-server /single queue
04/19/23 6
queuearrivals
λ= arrival rate
Dispatching discipline
server0
server1
Servern-1
……….
Multi-server /Multiple queues
04/19/23 7
server0
server1
Servern-1
……….
queue
arrivals queue
queue
Parameters
• Items arrive at the facility at some average rate (items arriving per second) l.
• At any given time, a certain number of items will be waiting in the queue (zero or more);
• The average number waiting is w, and the mean time that an item must wait is Tw.
• The server handles incoming items with an average service time Ts;
04/19/23 8
More parameters• Utilization, ρ, is the fraction of time that the
server is busy, measured over some interval of time.
• Finally, two parameters apply to the system as a whole.
• The average number of items resident in the system, including the item being served (if any) and the items waiting (if any), is r;
• and the average time that an item spends in the system, waiting and being served, is Tr; we refer to this as the mean residence time
04/19/23 9
Analysis• As the arrival rate, which is the rate of traffic passing through the
system, increases, the utilization increases and with it, congestion. The queue becomes longer, increasing waiting time. At ρ = 1, the server becomes saturated, working 100% of the time.
• Thus, the theoretical maximum input rate that can be handled by the system is:
λmax = 1/Ts• However, queues become very large near system saturation,
growing without bound when ρ = 1. Practical considerations, such as response time requirements or buffer sizes, usually limit the input rate for a single server to 70-90% of the theoretical maximum.
• For multi server queue for N servers: λmax = N/Ts
04/19/23 10
Specific Metrics• The fundamental task of a queuing analysis is as follows: Given the
following information as input:· Arrival rate· Service time• Provide as output information concerning:· Items waiting· Waiting time· Items in residence· Residence time.• We would like to know their average values (w, Tw, r, Tr) and the
respective variability the σ’s• We are also interested in some probabilities: what is probability
that items waiting in line < M is 0.99?
04/19/23 11
Important Assumptions• The arrival rate obeys the Poisson distribution, which is equivalent
to saying that the inter-arrival times are exponential;• On other words, the arrivals occur randomly and independent of
one another.• A convenient notation has been developed for summarizing the
principal assumptions that are made in developing a queuing model.
• The notation is X/Y/N, where X refers to the distribution of the inter-arrival times, Y refers to the distribution of service times, and N refers to the number of servers.
• M/M/1 refers to a single-server queuing model with Poisson arrivals and exponential service times.
• M/G/1 and M/M/1 and M/D/1
04/19/23 12
Little’s Law
• Very simple law that works from a Case Western Reserve University professor Dr. Little
• Average number of customers in a system = average arrival rate * average time spent in the system
• r = Tr * λ• w = Tw * λ• Tr = Tw + Ts• Extend it to the M/M/1 queuing model
04/19/23 13
Examples
• Page 21-22-23• Database server (can be substituted for any
server).• Tightly-coupled multi-processor system• Necessary formulae are in pages: 14, 18 (Table
3 and Table 4)
04/19/23 14