Professional Documents
Culture Documents
Abstract. This paper describes the generation of data bursts using real IPv4 data as input and compares the performance of the three aggregation algorithms. The profiles of bursts are studied, in particular the mean packet delay per burst and burst inter-arrival time followed by mean burst size and mean number of packets per burst. Observations are made regarding the identification of relations between bursts and packets for the studied data traces and assessed the performance of the aggregation algorithms under study. The conclusions are generalized to IPv6 traffic.
1 Introduction
Data packet aggregation process was initially proposed by Amstutz in 1983 [1] as a way to benefit from the statistical multiplexing effect, or as initially described, improved bandwidth efficiencies. This concept was later re-introduced in Optical Networks, contributing to the Optical Burst Switching Network paradigm, initially proposed by Qiao and Yoo around 1999 [2]. When referring to Optical Burst Switching (OBS), three major aggregation algorithms are used: time constrained; size constrained; both time / size constrained, also termed the hybrid algorithm. With the perspective increasing importance of burst switching, e.g. OBS, it is of great importance to assess the burst assembly algorithms performance under real traffic conditions. The performance of aggregation algorithms has already been studied for simulated traffic [3-5], namely for research in the area of burst assembly algorithms, QoS and burst traffic statistics. To the best of our knowledge, this is the first report regarding the evaluation of burst aggregation algorithms using real traffic input data. The algorithms under study were implemented in the Aggregation Simulator package designed in Java, and the simulator was fed with real IPv4 traffic data captured by NLARN [6] in several network sites in the United States of America. These collection sites are mostly universities and research centers, and thus, traffic nature is very heterogeneous. This data is stored in a time stamped header (.tsh)
This work has been partially funded by Fundao para a Cincia e a Tecnologia, Portugal, through the grant contract SFRH/BDE/15527/2004 and through CONDENSA Project contract POSI/EEA-CPS/60247/2004 and by Siemens S.A., Portugal.
Y. Koucheryavy, J. Harju, V.B. Iversen (Eds.): NEW2AN 2006, LNCS 4003, pp. 223 234, 2006. Springer-Verlag Berlin Heidelberg 2006
224
format, which contains the payload stripped IPv4 datagram (or packet, as we will refer to it in this document), time stamped in microseconds at their acquisition. The standard format of the .tsh data packet header is available in [7]. Although the algorithms were applied to all files (in a batch process running in several laboratory machines) only the relevant results are shown here. The performance of each algorithm was evaluated following a set of previously defined metrics. The paper also extrapolates the conclusions on the studied IPv4 to IPv6 data conversion, following the results in [8]. The remainder of this paper is organized as follows: Section II describes the burst assembly algorithms; Section III discusses the metrics devised for assessment of the burst assembly algorithms; Section IV presents the assessment of the burst assembly algorithms and Section V presents main conclusions.
225
If the traffic flow rate is too high or the incoming packets are big, the MTD algorithm may end up creating bursts that are too big. In order to prevent such a situation, a Hybrid Assembly (HA) algorithm was devised. In this assembly scheme, both thresholds time and size are considered simultaneously. Incoming packets are assembled into a new burst until one of the threshold conditions is met.
226
3.2 Burst Assembly Variables Burst Assembly was performed with several thresholds. Time thresholds used were 100s, 500s, 1ms, 10ms and 100ms and size thresholds used were 9KB, 64KB and 1MB. The first two constitute the maximum sizes of respectively the Ethernet Jumboframe and the IP packet. The third threshold was originally set to 4GB as consistent with the maximum size for an IPv6 Jumbogram [16], but it was soon discovered that even 1MB burst size required burst assembly times in excess of 1s (for MEM-1111612868-1.tsh), making the 4GB size threshold of little research interest. Hybrid aggregation was performed for the following six scenarios: size thresholds where 9KB and 64KB and time thresholds were 500s, 103s and 104s.
4 Results
The results presented were obtained by simulation for the burst assembly process executed for the following three randomly chosen traces: AMP-1107250616-1.tsh, ANL-1109366607-1.tsh and MEM-1111612868-1.tsh. AMP, ANL and MEM stand for AMPATH, Miami, Florida (OC12c), Argonne National Laboratory to STARTAP (OC3c link) and University of Memphis (OC3c link), respectively [17]. Other data trace files were also subject to simulation process, producing results coherent with the ones presented in this paper. Each of the examined trace files records the activity in the selected network point for about continuous 91 seconds. Table 1 shows the activity of each of these network points.
Table 1. Network Activity for the selected trace files
Packet load (in Bytes) Time Span (in s) Offered Load (in MB/s) 4.1 Results for MBS
Fig. 1 depicts the relative frequency of burst inter-arrival time for each of the studied data traces with the burst size threshold set to 9KB. As anticipated, the AMP data results in the lowest inter-arrival time plot, since 90% of the bursts arrive within 1555s or less and this trace contains the highest traffic load. For the other traces, the same limit is observed at 616 s and 12903s for ANL and MEM, respectively. It is worth noting that the plots do not share the same shape around 2500s ANL and MEM switch tendencies this suggests that burst inter-arrival time depends on the nature of the traffic on the network point where bursts are to be assembled. Fig. 1 does not present the whole burst inter-arrival scale: AMP reaches its maximum at 10001, ANL at 30038, and MEM at 55702s (for MEM, 99.95% of the burst arrive up until 35103s), respectively. The same behaviour is also visible for other size thresholds in all the studied traces.
227
1.E+00
1.E-01
1.E-04
40000
50000
Fig. 1. Complementary Cumulative Distribution Function for Burst Inter-arrival Time for MBS threshold = 9KB
1.E+06
1.E+05
1.E+04
1.E+03
Fig. 2. Weighted average burst inter-arrival time vs. network load (for MBS)
The comparison between burst inter-arrival times for different size thresholds lead to the conclusion that a 2 orders of magnitude increase in the burst size (from 9KB to 1MB) results in an increase of more than two orders of magnitude for the burst interarrival time. This situation is depicted in Fig. 2. It also shows the influence of network offered load on burst inter-arrival time. Here, it is assumed that the nature of the traffic in the three network points is comparable, and the offered throughput was calculated using values from Table 1. When the network load increases by a factor of 5, the average burst inter-arrival time decreases by an order of magnitude. This behaviour is also verified for several burst size thresholds.
228
Fig. 3 depicts the average packet delay per burst for the MBS assembly scheme. When the size threshold is set to 9KB, the average packet delay per burst parameters are almost the same, for the studied data traces, diverging only when the size threshold increases. As anticipated, the higher load network point shows a lower packet average delay. The increase in the average packet delay when size threshold is increased from 9KB to 64KB is different for the data in AMP when compared to the data in ANL and MEM. ANL and MEM also here seem to follow a close behaviour, expectedly suggesting that when the link is more loaded, the MBS aggregation algorithm performs better in terms of average packet delay per burst. Accordingly with the definition of the burst assembly algorithm, burst size is very close to burst size threshold, and no graph is shown.
1.000.000,00
100.000,00
10.000,00
1.000,00
65536
Burst Size
1048576
4.2 Results for MTD As expected, the average inter-arrival time between bursts is always higher than the time threshold defined for the burst assembly. Fig. 4 shows the relation between the weighted averages for the burst inter-arrival times for MTD plotted against burst assembly time threshold. It is clear that inter-arrival and threshold times converge when the later increase. AMP holds the lowest relation between inter-arrival and threshold times, thus performing better than ANL or MEM, which is a consequence of its higher traffic load. There is a quasi linear relation observed in all data traces between the number of packets present in bursts and the utilised time threshold, as shown in Fig. 5. As anticipated, the higher load traffic data yields a higher number of packets per burst, and the number of packets in bursts per data traces for AMP and ANL / MEM diverge as the time threshold increases. At a higher aggregation threshold value, there is
229
almost an order of magnitude difference in the number of packets per burst between AMP and ANL / MEM. Burst Size depends on the MTD aggregation ratio since the faster the incoming packets arrive, the bigger the bursts are. Fig. 6 shows how data is aggregated into bursts of variable sizes when the time threshold is set to 104 s. In line with well known results [3, 18], we can see that the burst size distribution resembles a Gaussian shape when the number of bursts increases, which happens when we consider the MEM trace, following the Law of Large Numbers. The plot character for each data trace is different, even for sources with comparable traffic load such as ANL and MEM. This result suggests that the bigger the load of the aggregated traffic, the more the burst size distribution tends to the Normal distribution. The Normal distribution nature of some of the characteristics of the aggregated bursts is also visible when the packet count per burst is plotted. For burst assembly time of 105 s, bursts contain from 88 to 1887 packets following a distribution close to the one depicted in Fig. 6. Fig. 7 shows the results when time was set to 100 s (the lower end of the chosen burst assembly threshold scale). At this burst assembly threshold, only high intensity traffic sources achieve bursts with more than 16 packets (the maximum was one burst in AMP which contained 21 packets). This figure shows the complementary cumulative distribution function of packets in bursts for this particular burst assembly scenario. Here, bursts containing up to three packets account for 78.5% of the traffic with AMP, 95.1% with ANL and 98.3% with MEM trace files. The average packet delay for MTD is shown in Fig. 8. Note that the average packet delay converges to the burst assembly threshold when it is higher than 103s. Additionally, low load traces (ANL and MEM) show better performance for MTD in terms of average packet delay than AMP; in case of low time thresholds, MEM performs almost twice better than AMP, possibly because bursts contain fewer packets.
100000 s
W ig e d A ra e B rst In ra e h te ve g u te rriva T e l im
10000
1000
100 100
1000
10000
100000 s
Aggregation Threshold
Fig. 4. Burst Inter-arrival Time vs. Burst Assembly (Aggregation) Time for MTD
230
100
1000
10000
100000
Fig. 5. Number of Packets in Bursts vs. Burst Assembly (Aggregation) Thresholds for MTD
4000 3500 3000 Burst Count 2500 2000 1500 1000 500 0
10001-20000 20001-30000 30001-40000 40001-50000 50001-60000 60001-70000 70001-80000 80001-90000 90001-100000 100001-110000 110001-120000 120001-130000 130001-140000 140001-150000 150001-160000 160001-170000 170001-180000 180001-190000 190001-200000 200001-210000 210001-220000 220001-230000 230001-240000 250001-260000 1-10000
10.0000%
MEM
1.0000%
0.1000%
0.0100%
0.0010%
0.0001% 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21
Number of Packets in Bursts
231
1.E+05 (s)
1.E+03
1.E+02
1.E+01
4.3 Results for HA The Hybrid Assembly algorithm combines two thresholds - time and size. Fig. 9 shows the packet inter-arrival time for three data traces, for each HA scenario. The tendency depicted in Fig. 9 remains when regarding packet delay time per burst. It is clear that ANL and MEM behave similarly, and for burst assembly times below 103s inclusive, and the performance of the HA is almost constant regardless of whether the burst size threshold is set to 9KB or 64KB. AMP performs better probably because of its higher packet load and performs best with time threshold of 104s and burst size 9KB than with time of 103s. This behaviour may be related to the change in the burst assembly threshold (from size to time or vice-versa). This hypothesis is confirmed by the plot of Fig. 10. Here, the nice dragon-like shape of the plot exhibits several interesting features of the HA algorithm. The data plotted for values of bursts 7500 bytes (approximately) show the effect of the size constrained branch of the algorithm these were the bursts that contain fast incoming packets (they were very close together in time so the time threshold was never reached). It is also visible that these bursts are responsible for a major share of the output stream (bursts whose size is bigger than 7500 bytes account for 74.70% of the total bursts count). The lump shapes on Fig. 10 are a well know phenomena related to the size distribution of IPv4 packets [8]. 1500 bytes long IPv4 packets are very common in data traces, supposedly because of path and/or link MTU issues, and also, as a consequence of the popularity of IP over Ethernet encapsulation scheme. This graph shows local modes near the multiples of 1500 bytes, namely 1500, 3000, 4500, 6000, 7500 and finally 9000 bytes. Note that 75% of the bursts in this scenario have 7500
232
10100
(s )
8100
Interarrival Tim e
6100
4100
2100
100 T=500, S=9K T=500, S=64K T=1000, S=9K T=1000, S=64K T=10000, S=9K T=10000, S=64K (T=s, S=byte) Scenarios
1000
Fig. 10. Burst Size Histogram for HA (T=103 s, S= 9 KB) on AMP data
bytes or more, which is in line with results, in particular the ones depicted in Fig. 5 and Fig. 6 considering MTD. This behaviour is also visible in other data traces and in other aggregation scenarios.
233
5 Conclusions
The assessment of the performance of burst assembly algorithms using real IPv4 data allows for the following conclusions: 1. globally, from all the studied scenarios for the burst assembly algorithms, both MTD and HA outperform MBS in terms of delay added to the burst and/or to the packet in the burst. Therefore, as the assembly of large size bursts results in high inter-arrival times and packet delay per burst, these large bursts should only be considered for transmitting data that is not time critical, such as news server synchronization or data backup between servers. The size threshold is dependent of the traffic load on the input link, typically being bigger than 1MB for these data traces; 2. the performance of the burst assembly algorithms is also a function of the network point where the data is to be assembled, as different network sites show different performances. This feature is visible in all examined burst assembly algorithms, and several threshold values may have to be studied to reach a sustained optimum performance for a specific network point. Thus, for an optimum operation regarding the compromise between large bursts and low mean packet delay in a burst, burst assembly algorithms thresholds or combinations of thresholds have to be dynamically set to adjust to time changing traffic conditions; 3. as IPv6 packet sizes and flow rates are expected to follow current IPv4 models, following the research presented in [8] we can expect similar results for the assembly of IPv6 packets into bursts. Acknowledgments. The authors would like to thank Marek Hajduczenia and Pedro Incio for their support and many valuable discussions.
References
1. S. R. Amstutz, "Burst Switching - An Introduction," in IEEE Communications Magazine, vol., issue 0163-6804/83/1100-0036, pp. 36-42, 1983. 2. C. Qiao and M. Yoo, "Optical burst switching (OBS) - A new paradigm for an optical Internet," Journal of High Speed Networks, vol. 8, issue 1, pp. 69-84, 1999. 3. S. Malik and U. Killat, "Impact of Burst Aggregation Time on Performance in Optical Burst Switching Networks," in proceedings of Optical Fibre Communications Conference, 2004, OFC 2004, Los Angeles, California, USA, 2004, vol. 2, pp. 2. 4. T. Ferrari, "End-To-End performance analysis with Traffic Aggregation," Computer Networks, vol. 34, issue 6, pp. 905-914, 2000. 5. A. Zapata and P. Bayvel, "Impact of burst aggregation schemes on delay in optical burst switched networks," in proceedings of LEOS 2003, Tucson, Arizona, USA, 2003. 6. National Laboratory for Applied Network Research, "National Laboratory for Applied Network Research," in http://pma.nlanr.net/, 2005, accessed at 2005-01-13. 7. National Laboratory for Applied Network Research. "tsh file format (Time Stamped Header)," in http://pma.nlanr.net/Traces/tsh.format.html, 2005, accessed at 2005-01-13.
234
8. N. M. Garcia, M. Hajduczenia, P. Monteiro, H. Silva, and M. Freire, "Modeling and Simulation of IPv6 Traffic," in proceedings of 7th Internet Global Congress, Global IPv6 Summit, Barcelona, 2005. 9. V. M. Vokkarane, K. Haridoss, and J. P. Jue, "Threshold-Based Burst Assembly Policies for QoS Support on Optical Burst-Switched Networks," in proceedings of Optical Networking and Communications Conference OPTICOMM 2002, Lowell, Massachussets, USA, 2002. 10. A. Ge and F. Callegati, "On Optical Burst Switching and Self-Similar Traffic," IEEE Communications Magazine, vol. 4, issue 3, 2000. 11. X. Yu, Y. Chen, and C. Qiao, "Performance Evaluation of Optical Burst Switching with Assembled Burst Traffic Input," in proceedings of GLOBECOM 2002, Taipei, 2002. 12. M. C. Yuang, J. Shil, and P. L. Tien, "QoS Burstification for Optical Burst Switched WDM Networks," in proceedings of Optical Fiber Communication Conference OFC 2002, Anaheim, California, USA, 2002. 13. G. Almes, S. Kalidindi, and M. Zekauskas, "One Way Delay Metrics for IP Performance Metrics (IPPM)," RFC 2679, IETF, 1999. 14. C. Demichelis and P. Chimento, "IP Packet Delay Variation Metric for IP Performance Metrics (IPPM)," RFC 3393, IETF, 2002. 15. IP Performance Metrics IETF workgroup, in http://www.ietf.org/html.charters/ippmcharter.html, accessed at June 15, 2005. 16. D. Borman, S. Deering, and R. Hinden, "IPv6 Jumbograms," RFC 2675, Internet Engineering Task Force (IETF), 1999. 17. National Laboratory for Applied Network Research, "NLANR Sites," in http://pma.nlanr.net/Sites/, 2005, accessed at 2005-02-15. 18. X. Yu, Y. Chen, and C. Qiao, "A Study of Traffic Statistics of Assembled Burst Traffic in Optical Burst Switched Networks," in proceedings of Optical Networking and Communications, Boston, Massachusetts, USA, 2002, vol. 4874, pp. 149-159.