Network Servers and Server Performance

Starting from:

$35

This assignment gives you a chance to become familiar with concurrent network clients, servers, covering topics including threads, synchronization, wait/notify (monitor), asynchronous I/O, and benchmarking. by uploading to classes Protocol The server that we will design is a simpliﬁed version of HTTP 1.0. The most basic application message, encoded in ASCII, from the client to the server is: GET <URL HTTP/1.0 Host: <ServerName CRLF where CRLF is carriage return and line feed, representing an empty line. This request asks for the ﬁle stored at location <DocRootServerName/<URL, where DocRootServerName is the root directory for the requested server name. For example, if <DocRooServerNamet=/tmp/mydoc, and <URL is /ﬁle1.html, the server will return the ﬁle /tmp/mydoc/ﬁle1.html, if it exists. If the request does not specify the Host header, the server returns the ﬁrst server (virtual host) conﬁgured; see below. The basic reply message from the server to the client, encoded in ASCII, is: HTTP/1.0 <StatusCode <message Date: <date Server: <your server name Content-Type: text/html Content-Length: <LengthOfFile CRLF <file content CRLF again represents an empty line. If the ﬁle is found and readable, the returned <status code is 200 and you can give a message such as OK. Otherwise, please give an error code of 400. If you are curious about HTTP error codes, you can see http://www.ietf.org/rfc/rfc1945.txt. You can use Java File class to obtain ﬁle size. Part 1: Simple Client Your test client should be multi-threaded. The client can generate test requests to the server with the following command line: %java SHTTPTestClient -server <server -port <server port -parallel <# of threads -files <file name -T <time of test in seconds In particular, the <ﬁle name is the name of a ﬁle that contains a list of ﬁles to be requested. For example, a ﬁle may look like the following: file1.html file2.html file3.html file1.html Then each thread of the client will request ﬁle1.html, then ﬁle2.html, then ﬁle3.html, and then ﬁle1.html. The thread then repeats the sequence. The client simply discards the received reply. The client stops after <time of test in seconds. The client should print out the total transaction throughput (# ﬁles ﬁnished downloading by all threads, averaged over per second), data rate throughput (number bytes received, averaged over per second), and the average of wait time (i.e., time from issuing request to getting ﬁrst data). Think about how to collect statistics from multiple threads. Part 2: Sequential and Multi-threaded Servers In class we have covered multiple approaches to implementing network servers: sequential; per request thread; thread pool with service threads competing on welcome socket; thread pool with a shared queue and busy wait; thread pool with a shared queue and suspension; asynchronous i/o. In this part, you will implement the ﬁrst 5 approaches and compare their performance using your test client or a browser.

example code provided in class. Following Apache conﬁguration style (http://httpd.apache.org/docs/2.4/vhosts/examples.html; note that we implement a single server name, not multiple, as the example conﬁguration shows), we program each server by reading a conﬁguration ﬁle: %java <servername -config <config_file_name The basic conﬁguration parameter is listening port: Listen <port such as 6789 A conﬁguration ﬁle should also contain one or more virtual hosts shown below. We use the same format as the Apache, but your server ignores the *:6789 part. <VirtualHost *:6789 DocumentRoot <root dir ServerName <server name <VirtualHost For a thread pool based server, the conﬁguration ﬁle allows speciﬁcation of the number of threads: ThreadPoolSize <number of threads Each server uses a cache to speedup handling of requests for static ﬁles. The cache is a simple Java Map, with key being the ﬁle and content the whole ﬁle in an array. Before reading a ﬁle from disk, the server checks whether it is already cached. Think: how to handle multiple threads reading and adding to the Map. The cache size can be speciﬁed in the conﬁguration ﬁle: CacheSize <cache size in KBytes To simplify your server, there is no cache replacement; i.e., when the cache is full, no addition to the cache. You can always specify some conﬁguration parameters in the command line, e.g., java <servername -ThreadPoolSize <size. A commandline speciﬁcation will overwrite the conﬁguration ﬁle speciﬁcation. We recommend that you consider a hash map in your program to implement conﬁgurations. Your server must support the following: Methods: The server must support HTTP 1.0 (http://www.w3.org/Protocols/HTTP/1.0/spec.html) GET method. Headers: The server must send the Last-Modiﬁed header and understand the If-Modiﬁed-Since header from client. This means that you will need to parse date format. For this assignment, we use the rfc1123-date format. Your server also needs to understand the User-Agent header. For other headers, your server can skip. Remember that POST has a message body. URL Mapping: If the URL ends with / without specifying a ﬁle name, your server should return index.html if it exists; otherwise it will return Not Found. If the request is for DocumentRoot without specifying a ﬁle name and the User-Agent header indicates that the request is from a mobile handset (e.g., it should at least detect iphone by detecting iPhone in the User-Agent string), it should return index_m.html, if it exists; index.html next, and then Not Found. Your server needs to check if a mapped ﬁle is executable. If so, it should execute the ﬁle and relay the results back to clients. Our assignment only handles the case that the input to the external program is from GET. Please see Java ProcessBuilder on how to start set environment variables and start a dynamic process. The example of the doc can be helpful. You will need to read RFC 3875 to set the right environment variables. You will need to write a dynamic CGI program to test your invocation. Your server also needs to implement a heartbeat monitoring URL service to integrate with a load balancer (e.g., Amazon Load Balancer we covered in class). In particular, a load balancer may query a virtual URL (i.e., no mapped ﬁle) named load (i.e., with request GET /load HTTP/1.0). If the server is willing to accept new connections, it should return status code 200; otherwise, it returns code 503 to indicate overloading. Your software design should allow "plugin", at run time, of different algorithms to compute overloading conditions. Please describe a particular design and implement it. Part 3: Asynchronous Server Part 3.1: Your ASync Server In this part, you implement an asynchronous server with functions as speciﬁed in Part 2. We have the following requirements: We recommend that the software structure of your asynchronous server be based on v3 of the EchoServer that we discussed in class. You need to write a handler for the particular protocol. You can feel free to modify the structure if you see any way to improve it (ﬁx error handling, etc). You need to document your changes. The server should have a timeout thread. Upon accepting a new connection, the accept handler should register a timeout event with the timeout thread with a callback function. The timeout value is speciﬁed by IncompleteTimeout <timeout in seconds. The default timeout value is 3 seconds. If the connection does not give a complete request to the server approximately within timeout from the time of being accepted, the server should disconnect the connection. Note that the timeout monitoring thread should not directly close a channel that the dispatcher thread is still monitoring (why?). You need to think very carefully about the exact details of the interaction between these two threads, propose a software design, and implement it. Part 3.2: Comparison of Designs
A great way to learn about your design is to compare with other designs. You need to read the docuemnts or code of two related frameworks: xsocket and netty. Part 3.2(a): Comparison with xsocket Although xsocket is no longer under active development, it provides a design alternative. Please read the source code and document of x-Socket, a high performance software library for reusable, asynchronous I/O servers. Please discuss in your report the following questions (please refer to the speciﬁc location when you refer to its document or source code: How many dispatchers does x-Socket allow? If multiple, how do the dispatchers share workload? What is the basic ﬂow of a dispatcher thread? What is the calling sequence until the onData method of EchoHandler (see EchoHandler, EchoServer, and EchoServerTest) is invoked? Please check this link for testing code: http://sourceforge.net/p/xsocket/code/HEAD/tree/xsocket/core/trunk/src/test/java/org/xsocket/connection/ How does x-Socket implement Idle timeout of a connection? Please give an example of how the library does testing (see http://sourceforge.net/p/xsocket/code/HEAD/tree/xsocket/core/trunk/src/test/java/org/xsocket/connection/EchoServerTest.java for an example). Please describe how you may test your server with idle timeout? Part 3.2(b): Comparison with Netty Netty is another Java async IO framework used by many; see for example use cases. Please read Netty user's guide and answer the following questions: Netty provides multiple event loop implementations. In a typical server channel setting, two event loop groups are created, with one typically called the boss group and the second worker group. What are they? How does Netty achieve synchronization among them? Method calls such as bind return ChannelFuture. Please describe how one may implement the sync method of a future. Instead of using ByteBuffer, Netty introduces a data structure called ByteBuf. Please give one key difference between ByteBuffer and ByteBuf. A major novel, interesting feature of Netty which we did not cover in class is ChannelPipeline. A pipeline may consist of a list of ChannelHander. Compare HTTP Hello World Server and HTTP Snoop Server, what are the handlers that each insert? Please scan Netty implementation and give a high-level description of how ChannelPipeline is implemented. Part 4: Performance Benchmarking One important computer systems skill is to evaluate the performance of design alternatives. In this assignment, we conduct performance evaluation of the alternatives: To conduct the testing, you will need to setup the DocumentRoot at the server. It is highly recommended that you generate a number of ﬁles of different sizes under DocumentRoot named such as ﬁle1.html, ﬁle2.html, ..., ﬁle1000.html. If you download gen.tar, and untar it (tar -xvf gen.tar), you will see a directory named doc-root and a directory named request-patterns. To compare the performance with Apache, we will use the department zoo Apache server. We will use /home/httpd/html/zoo/classes/cs433/web/wwwroot to store testing ﬁles. Suppose we want to fetch /home/httpd/html/zoo/classes/cs433/web/www-root/html-small/doc1.html. To use the department Apache server, since the department server has set DocumentRoot as /home/httpd/html/zoo, the URL should be: http://zoo.cs.yale.edu/classes/cs433/web/www-root/html-small/doc1.html To use your server, suppose you set the DocumentRoot as /home/httpd/html/zoo/classes/cs433/web/www-root, and you run your server on cicada.cs.yale.edu at port 9876. Then the URL is: http://cicada.cs.yale.edu:9876/html-small/doc1.html For the test, you will need to generate a request ﬁle for the client. The request pattern can have a major impact on your server performance (how requests repeat). The TA will use a Pareto distribution to generate request patterns to test your server. You can write a simple Java program or script to generate the request. You should vary the client parallel (see Client command line above) with a reasonable increment schedule (e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, ...). A reasonable test time is 60 to 120 seconds. You can write a simple script to automate this task. For multithreaded server, please try two thread pool sizes: one small and one large. Part 5: Report You should submit a report on your server design. Please answer any question we speciﬁed above. Please report the measured performance of both Apache and your best server for these performance metrics: throughput and (mean) delay. You can use open ofﬁce or gnuplot to generate ﬁgures. Below is an example ﬁgure showing the performance of multiple servers.
The TA will benchmark all servers and pick the one with the highest throughput. This server will receive a bonus of 25%. Submission Please submit using class server. Please include README to tell the TA the directory structure, e.g., which ﬁle is the report. Please generate a single jar ﬁle containing all of your ﬁles. Suggestions During your async i/o design, think how you implement a ﬁnite state machine to handle each request (e.g., initial state after accepting a connection, what other states). Java async i/o does not allow you to select events on a ﬁle channel. There are can be multiple design options to handle ﬁle i/o: Use standard ﬁle i/o by assuming that ﬁle system is fast and will not become bottleneck; Try out mapped ﬁle i/o: FileInputStream fin = new FileInputStream(args[0]); FileChannel in = fin.getChannel(); ByteBuffer input = in.map(FileChannel.MapMode.READ_ONLY, 0, in.size()); Try out direct transfer: See FileChannel.transferTo; Use standard ﬁle i/o and use a thread pool to help with reading ﬁles.

More products

Lab 8: PHP Database Connectivity

$21

Add to cart

Network Servers and Server Performance _ SOLUTION

More products