CPS 114 Introduction to Computer Networks: Lab 1

CPS 114 Introduction to Computer Networks: Labs

Lab 4: Congestion control

Introduction

Your task is to complete the design and implementation of a reliable transport protocol using the sliding window protocol. You will need to implement both reliability and congestion control, but you may use your code from Lab 1 for the reliability function.

We provide you with a link emulator (relayer), a file generator (generator), and a library (rlib.[h|c]). You will implement some functions and data structures in reliable.c. rlib.[h|c] provides some useful helper functions. Your tasks include:

Implement flow control so that a fast sender will not overrun a slow receivers' buffer;
Detect congestion;
Implement TCP-style congestion control and retransmission algorithms;
Analyze the performance of your implementation.

You will implement both the sender and receiver sides of a transport layer. The sender will read in a file (the generator can generate an file of arbitrary size), break it into fixed-sized packets (1000 bytes), add a control header to the data, and send the file to the receiver through an congested link. The receiver will read these packets, and write the corresponding data to an output file. A correct implementation must guarantee that the received file is identical to the sent file.

Figure 1 shows an experiment setup. A₁ sends file₁ to A₂. B₁ sends file₂ to B₂. Both files, file₁ and file₂, can be generated by the generator. These connections share the bottleneck link L, which is emulated by the relayer code we provide. You are required to implement a congestion control algorithm that efficiently and fairly utilizes L's capacity.

Figure 1: dumbbell topology

In lab 1, you have implemented a duplex sliding window protocol. For this lab, you only need to implement one-direction data transfer. For instance, in Figure 1, A₁ runs in the sender mode and sends data to A₂. A₂ runs in the receiver mode, and will only send ACK packets (not data packets) back to A₁.

Getting started

You should be able to finish your assignment on any machine in the department's linux cluster: linux.cs.duke.edu. linux21-30 is recommended for the relayer. If you prefer to finish the assignment on your own machine, you may download and install ubuntu. You can run Ubuntu in a virtual machine such as the free VirtualBox software. We do not support MAC or Windows. Sorry.

Download and untar the lab4.tar.gz.

xwy@linux21$ wget http://www.cs.duke.edu/courses/spring10/cps114/labs/lab4/lab4.tar.gz
xwy@linux21$ tar xzvf lab4.tar.gz

There are three components in the tarball: (1) generator, (2) relayer, and (3) the reliable directory. You can use following command to generate arbitrary size of file

xwy@linux21$ ./generator <file size in byte> <file name>

The relayer emulates link L in Figure 1. You can set the relayer's parameters in the configuration file config.xml. Presently, this config file shows the configuration in Figure 1. The relayer supports the following configuration parameters:

<number_of_pairs>: The number of sender-receiver pairs this link can support in the dumbbell topology.
<propagation_delay>: The propagation delay of link L.
<bandwidth>: The bandwidth of link L. Due to some system limitations, the relayer only supports up to a 50Mb/s link. For more accurate emulation results, you should configure the link capacity to be less than 50Mb/s.
<buffer_size>: The router₁'s buffer size. Packets from a sender to a receiver will first wait in this buffer and then get transmitted on link L. When the buffer is full, the router begins to discard packets using the FIFO policy. The buffer size is usually configured to be the delay bandwidth product.
<pair>: This section specifies the incoming IP/port numbers of a sender/receiver, and the relaying ports. For instance, in Figure 1, the sender A₁ is running on linux21, port 10000, and the receiver A₂ is running on linux 22, port 20000. The relayer is running on linux25, using port 50001 to receive A₁'s packets. It may delay or discard A₁'s packets based on its available buffer space, and then send these packets via port 50002 to the receiver A₂. Similarly, ACK packets from A₂ will be relayed via the relayer back to A₁. Note that the relayer will not drop acknowledgment's packets. You can modify the configuration in this section to test different numbers of sender-receiver pairs.
<CPU_frequency>: The CPU frequency on which this relayer is running. You can type "cat /porc/cpuinfo" to find out this configuration. To achieve a high emulation accuracy, the relayer is counting CPU cycles for timing.
<enable_log>: 0 to disable packet log, and 1 to enable packet log. The relayer uses fprintf(stderr, "...") to print out packet logs, including "buffer PKT", "drop PKT", "transmit PKT" and "relay PKT". Logging is time consuming and may interfere with packet transmissions.

After finishing these configurations, you can start the relayer on linux25 with the following command:

xwy@linux25$ ./relayer config.xml

Next, you need to set up receiver A₂ on linux22 and receiver B₂ on linux24. First run make to compile the reliable. Then run

xwy@linux22$ ./reliable -r output₁ -w 200 20000 linux25:50002
xwy@linux24$ ./reliable -r output₂ -w 200 40000 linux25:50004

The "-r" option puts this reliable instance in the receiver mode where it should only send ACK packets back to the sender but not data packets. The "output₁" and "output₂" options are the output file names. The "-w 200" option sets the receiver's max receiving window size to 200 packets. The rest of the options are the local port, the relayer's DNS name, and the corresponding relaying port that connects to the reliable instance.

The last step is to set up the sender A₁ on linux21 and sender B₁ on linux23. Run

xwy@linux21$ ./reliable -s file₁ 10000 linux25:50001
xwy@linux23$ ./reliable -s file₂ 30000 linux25:50003

Similarly, the "-s" option puts this reliable instance in the sender mode, in which it will read in file₁, break it into fixed-sized packets (1000 bytes), add a header, and send packets to the receiver. The rest is the reliable's local port, the relayer's DNS name and port number. The "-w" option is optional. You can either explicitly tell the sender the receiver's max receiving window size by the "-w" option, or in your protocol, the receiver tells its max receiving window size to the sender.

By now you have established Figure 1's topology. After you implementing the sender and receiver correctly, A₁(on linux21) will send file₁ to A₂(on linux22), and A₂ will write the received data to output₁. B₁(on linux23) will send file₂ to B₂(on linux24), and B₂ will write the received data to output₂. To simplify the configuration, you may run all the above components on one machine.

Understanding the code

Packet format

There are two types of packets, Data packets and Ack-only packets. You can tell the type of a packet by length. The length of a data packet varies from 16 to 1016 bytes, and ACK packets are of 12 bytes. The packet format is defined in rlib.h:

        struct packet {
          uint16_t cksum; /* Ack and Data */
          uint16_t len;   /* Ack and Data */
          uint32_t ackno; /* Ack and Data */
          uint32_t rwnd;  /* Ack and Data */
          uint32_t seqno; /* Data only */
          char data[1000]; /* Data only; Can be less than 500 bytes*/
        };typedef struct packet packet_t;

The length, seqno, ackno and rwnd fields are always in big-endian order (use htonl/htons to write and ntohl/ntohs to read):

Most fields are exactly the same as in Lab 1. Here we add another field "rwnd" in both data and ack packet header. Receiver can use this field to tell the sender the size of its receiving window, and the sender can adjust its sending window according to this rwnd.

Requirements

To complete this lab, you need to first modify a few places of your lab1 code to implement reliability with the new packet header format. You can use diff to check if the received file is exactly the same as the original file. Then you will need to implement congestion control.

If a reliable instance runs in the receiver mode, it should send an EOF to the sender at the very beginning to tell the sender that the receiver has no data to send.
Both the sender and receiver should be able to close the connection. Use the conditions in which rel_destroy is called in Lab1. Or you can implement part of the TCP's state machine for closing a connection.
The sender should print out the amount of time to transfer a file. You may get the start time at rel_init(), and the finish time at rel_destroy(). Print out their differences. We will use this timer to evaluate your implementation.
Your algorithm should be able to use available bandwidth efficiently.
Your algorithm should be able to use available bandwidth fairly.
You should turn in a short document to describe the design of your protocol, and the analysis you use to show that your design uses bandwidth fairly and efficiently.

In your document, you should describe your congestion control algorithms clearly. You can borrow ideas from RFCs and research papers. But you may still need to adjust some parameters, or modify the existing algorithms. In your writeup, you should justify your design choices. You are also required to analyze the performance of your algorithm. For example, in Figure 1, A₁ sends a 100MB file to A₂. B₁ send another 100MB file to B₂ at the same time. (1) How much time does each transmission take? (2) What is each sender's throughput? (3) Does your algorithm fully utilize the link's bandwidth? (4) Is the bottleneck link shared fairly by the two senders? For example, Figure 2 shows the cwnd size and throughput under the config.xml configuration. What can you tell from these Figures? (5) You should draw your own figures and analyze them.

Figure 2: cwnd and throughput

You may find the following references helpful for your design and analysis.

Window size calculate: RFC2581
Retransmit time estimation: RFC2988
Handling multiple packets loss: RFC2582
TCP Connection Termination: TCP/IP guide
Reasons behind these RFCs: Congestion Avoidance and Control; Van Jacobson, Michael J. Karels; SIGCOMM 1988; LBL, UC Berkeley
Fairness index: Analysis of the Increase and Decrease Algorithms for Congestion Avoidance in Computer Networks, D. Chiu and R. Jain, Journal of Computer Networks and ISDN, Vol. 17, No. 1, June 1989, pp. 1-14.
Throughput: to calculate your implementation's throughput, you can simply count bits b (For each packet, only count the packet header 16 bytes plus the data size 1000 bytes) transmitted during a time period d (for example 10s). Than calculate b / d. By connecting all the discrete points you get the throughput curve in Figure 2. For this lab, We do not count other headers (UDP, IP, Etherheader) into the transmitted bits. You can read more about throughput on wikipedia.
Bandwidth measurement: CPU frequency may change due to some reasons beyond our control, for example the server's workload. Our relayer is counting CPU cycles for timing. This CPU frequency change will make the relayer's bandwidth not exactly the number you configured in config.xml. To get a more accurate relayer's bandwidth, you can use this simple measurement tool. You may need this tool when you are analyzing throughput and bandwidth utilization. linux21-30 is recommended for the relayer.

Functions and data structures you need to implement

For this lab, you are also required to fill the 6 functions you have done in Lab 1. There are some small changes.

rel_read: If the reliable is running in the receiver mode (see c.sender_receiver in rlib.c, you can get its value in rel_create), this receiver should send an EOF to the sender when rel_read is first called. After this first call, the function rel_read can simply return for later calls. Note that this EOF will wait in the receiver's sending window. When timeout happens, the receiver have to retransmit this EOF as well until an ACK is received. If the reliable is running in the sender mode, the rel_read's behavior is the same as that is described in Lab 1.
rel_timer: The function rel_timer is called periodically, currently at a rate of 10ms.

Submitting

There is no test case for this lab. You are required to write up a lab document to explain your protocol design and performance analysis. To submit the assignment, do the following:

Run the command make submit, this should create a file called reliable.tar.gz.
Upload reliable.tar.gz as well as the document to the Lab 4 assignment on Blackboard.

Collaboration policy

For this lab, you can form a team of two students to complete the assignment. If your code does not pass all lab1 tests, finding a partner whose code did will be very useful. You can discuss with anyone, but you must not look at other teams' code, or copy code from other teams.

Acknowledgment

The skeleton code of this lab is taken from this assignment.

Last updated: 04/2010