
Retrieving HTTP content in .NET

17 Dec 2004 | by Rick Strahl



New HTTP tools in .NET

The .NET Framework provides new tools for retrieving HTTP content that are powerful and scalable in a single package. If you've ever tried to retrieve HTTP content in pre-.NET applications, you probably know that there are a number of different tools available: WinInet (Win32 API), XMLHTTP (part of MSXML) and, more recently, the WinHTTP COM library. Each of these tools worked in some situations, but none of them fit the bill in all of them. WinInet has no multi-threading support, so it can't scale on the server. XMLHTTP was too simple and didn't support all aspects of the HTTP model. WinHTTP, the latest Microsoft tool for COM, solves many of these problems, but it doesn't work at all on Win9x. That makes it a bad choice for a client tool integrated into broadly distributed applications, at least until XP takes a strong hold.

The .NET Framework greatly simplifies HTTP access with a pair of classes, HttpWebRequest and HttpWebResponse. These classes expose just about all of the functionality of the HTTP protocol in a straightforward manner. The basics of retrieving content from the Web require very little code (see Listing 1).

Listing 1: Simple retrieval of Web data over HTTP.


using System.IO;
using System.Net;
using System.Text;

string lcUrl = "http://www.west-wind.com/TestPage.wwd";

// *** Establish the request
HttpWebRequest loHttp =
    (HttpWebRequest) WebRequest.Create(lcUrl);

// *** Set properties
loHttp.Timeout = 10000;    // 10 secs
loHttp.UserAgent = "Code Sample Web Client";

// *** Send the request and retrieve the response headers
HttpWebResponse loWebResponse = (HttpWebResponse) loHttp.GetResponse();

Encoding enc = Encoding.GetEncoding(1252);  // Windows default Code Page

StreamReader loResponseStream =
    new StreamReader(loWebResponse.GetResponseStream(), enc);

// *** Reading the stream pulls down the rest of the response
string lcHtml = loResponseStream.ReadToEnd();

// *** Close the reader first, then the response that owns the stream
loResponseStream.Close();
loWebResponse.Close();

Pretty simple, right? But beneath this simplicity lies a lot of power too. Let's start with looking at how this works.

Start by creating the HttpWebRequest object which is the base object used to initiate a Web request. A call to the static WebRequest.Create() method is used to parse the URL and pass the resolved URL into the request object. This call will throw an exception if the URL passed has invalid URL syntax.
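To illustrate the failure mode just described, here is a minimal sketch that wraps WebRequest.Create() so that a malformed URL yields null rather than an exception. The RequestFactory class and its TryCreate method are hypothetical names made up for this example and are not part of the .NET Framework.

```csharp
using System;
using System.Net;

public static class RequestFactory
{
    // Hypothetical helper: returns null when the URL cannot be parsed
    // or uses a scheme that no registered WebRequest handler supports.
    public static HttpWebRequest TryCreate(string url)
    {
        try
        {
            // The 'as' cast yields null for non-HTTP requests such as file://
            return WebRequest.Create(url) as HttpWebRequest;
        }
        catch (UriFormatException)
        {
            return null;   // invalid URL syntax
        }
        catch (NotSupportedException)
        {
            return null;   // unknown scheme prefix
        }
    }
}
```

Creating the request object does not contact the server, so a check like this is cheap to run before any network activity takes place.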

The request portion controls how the outbound HTTP request is structured. As such it handles configuration of the HTTP headers, the most common of which are expressed as properties of the HttpWebRequest object. A few examples are UserAgent, ContentType, Referer and even a CookieContainer for cookies, which map directly to header values that get set when the request is sent. Headers can also be set explicitly using the Headers collection, to which you can add either a whole header string or a key value pair. Generally the properties address all common headers, so you'll rarely need to set headers explicitly, most likely only to support special protocols (for example, SOAPAction for SOAP requests).
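The two ways of setting headers can be sketched roughly as follows. HeaderDemo and BuildRequest are hypothetical names, and the SOAPAction value and the X-Sample-Header header are placeholders chosen only for illustration.

```csharp
using System;
using System.Net;

public static class HeaderDemo
{
    // Builds a request without sending it; nothing goes over the wire
    // until GetResponse() or GetRequestStream() is called.
    public static HttpWebRequest BuildRequest(string url)
    {
        HttpWebRequest request = (HttpWebRequest) WebRequest.Create(url);

        // Common headers are exposed as typed properties
        request.UserAgent = "Code Sample Web Client";
        request.Accept = "text/html";

        // Less common headers go through the Headers collection,
        // either as a name/value pair or as one "Name: value" string
        request.Headers.Add("SOAPAction", "http://example.com/HelloWorld");
        request.Headers.Add("X-Sample-Header: demo");

        return request;
    }
}
```

Headers that have a dedicated property, such as User-Agent, are generally meant to be set through that property rather than through the collection.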

In the example, I do nothing much with the request other than setting a couple of the optional properties – the UserAgent (the client 'browser' which is blank otherwise) and the Timeout for the request. If you need to POST data to the server you'll need to do a little more work – I'll talk about this a little later.

Streaming good deals

Once the HTTP Request is configured for sending the data, a call to GetResponse() actually goes out and sends the HTTP request to the Web Server. At this point the request sends the headers and retrieves the first HTTP result buffer from the Web Server.

When the code above performs the GetResponse() call only a small chunk of data is returned from the Web server. The first chunk contains the HTTP header and the very first part of the data, which is simply buffered internally until read from the stream itself. The data from this initial request is used to set the properties of the HttpWebResponse object, so you can look at things like ContentType, ContentLength, StatusCode, Cookies and much more.

Next a stream is returned using the GetResponseStream() method. The stream points at the actual binary HTTP response from the Web server. Streams give you a lot of flexibility in handling how data is retrieved from the Web server.

As mentioned, the call to GetResponse() only returned an initial internal buffer – to retrieve the actual data and read the rest of the result document from the Web server you have to read the stream.

Streams and StreamReader in .NET

If you've worked at all with .NET you've probably found out about streams by now. Streams are very flexible abstractions for dealing with blocks of data that are, well, streaming: built from data that is not necessarily complete by the time you start reading it. Streams are efficient because for the most part they read and write data sequentially (some streams, such as files, also support random access). In most cases streams are mapped to things like files or network I/O inputs and outputs. Streams can also be applied to strings, memory mapped files and any number of other things that require reading and writing large blocks of data. Streams manage the underlying access to ensure the integrity of the data, so you can start reading before all of the data is available.

.NET uses streams for most of its network I/O, so access to HTTP, FTP and even sockets goes through a fairly consistent interface across protocols. In these situations you usually end up with an input stream and an output stream. Both WebRequest and WebResponse (the base classes of HttpWebRequest and HttpWebResponse) have methods that return the respective streams, which you write to and read from.

In the example above I use a StreamReader object to return a string from the data in a single operation. But because a stream is returned, I could also access the stream directly and read smaller chunks, for example to provide status information on the progress of the HTTP download.

Notice also that when the StreamReader is created I had to explicitly provide an encoding – in this case CodePage 1252, which is the Windows default code page. This is important because the data is transferred as a byte stream, and decoding it with the wrong encoding results in invalid character translations for any extended characters. CodePage 1252 works fairly well for English or Western European content. Ideally though you will need to decide at runtime how to handle the response – for example a binary file should be written out to a file or other location rather than converted to a string, while a page from Japan should be decoded with whatever encoding it declares, such as Shift-JIS or UTF-8.

String and Character Encoding in .NET

String encoding and dealing with data returned over a Web connection is arguably one of the most confusing subjects I've run into while working in .NET. All strings in .NET are Unicode (stored as UTF-16, two bytes per code unit), so bytes arriving from outside have to be decoded with a specific encoding to display properly. When retrieving data over the Web the data arrives as a binary stream, and in order to use it as a string it must be decoded. Different content might require different encodings, and you have to control which one is used. This basically involves telling the stream reader which code page to convert from.
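A small sketch of why the choice of encoding matters: the same two bytes decode to different strings depending on the encoding handed to the decoder. EncodingDemo and its members are hypothetical names. The example uses ISO-8859-1 rather than CodePage 1252 because, as far as I know, newer .NET runtimes only offer 1252 after the code pages encoding provider has been registered; for these two bytes both encodings give the same result.

```csharp
using System;
using System.Text;

public static class EncodingDemo
{
    // The UTF-8 byte sequence for the single character U+00E9 (e with acute)
    public static readonly byte[] EAcuteUtf8 = { 0xC3, 0xA9 };

    // Decodes raw bytes into a .NET string using the named encoding.
    public static string Decode(byte[] data, string encodingName)
    {
        return Encoding.GetEncoding(encodingName).GetString(data);
    }
}
```

Decode(EAcuteUtf8, "utf-8") returns the one-character string "é", while Decode(EAcuteUtf8, "iso-8859-1") returns the two-character string "Ã©". That second result is the familiar garbage you see when UTF-8 content is read with a Western code page.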

.NET provides a number of tools to facilitate the encoding process including the Encoding class, which allows you to easily switch encoding formats for specific operations. Many classes and conversion tools then use this Encoding class as a parameter or property to provide their encoding and decoding.

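To help with finding out what encoding is used, the HttpWebResponse object exposes a CharacterSet property, which carries the charset given in the Content-Type header. (The similarly named ContentEncoding property reports the Content-Encoding header, such as gzip, and is not a character set.) Unfortunately many Web servers don't return a charset in their headers, so it's difficult to dynamically discover what format to decode from. CodePage 1252 is the best all around choice for Western content and I tend to use that as the default if no usable character set can be determined. The following code is useful when creating an Encoding instance:

Encoding enc;

try {
  // CharacterSet carries the charset from the Content-Type header
  enc = Encoding.GetEncoding(loWebResponse.CharacterSet);
}
catch {
  // Unrecognized or unavailable charset name: use the Windows default
  enc = Encoding.GetEncoding(1252);
}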

If you are returning binary data, store this data in a byte array (byte[]) or stream the data directly to whatever output you need to deal with. For example, if you download a file, don't store it in a string first but stream it straight into a file on disk.
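Streaming binary content straight to disk might look something like the sketch below. BinaryDownload and SaveToFile are hypothetical names, and the 8 KB buffer size is an arbitrary choice.

```csharp
using System;
using System.IO;

public static class BinaryDownload
{
    // Copies a stream to disk in fixed-size chunks without ever
    // converting the bytes to a string. Returns the bytes written.
    public static long SaveToFile(Stream source, string path)
    {
        byte[] buffer = new byte[8192];
        long total = 0;

        using (FileStream output = File.Create(path))
        {
            int read;
            // Read() returns 0 only at the end of the stream
            while ((read = source.Read(buffer, 0, buffer.Length)) > 0)
            {
                output.Write(buffer, 0, read);
                total += read;
            }
        }
        return total;
    }
}
```

With the response from Listing 1 in hand, the source argument would be loWebResponse.GetResponseStream(); no StreamReader or Encoding is involved because the bytes are never interpreted as text.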

I use the StreamReader object, which provides an easy mechanism to retrieve the contents of a stream into strings or arrays of characters. It also provides the handy ReadToEnd() method, which retrieves the entire stream in a single batch. The operation of reading the stream is what actually retrieves the data from the Web server (except for the initial block that was read to retrieve the headers). In this case a single read operation is called and retrieves the data, with the request blocking until the data has been returned. If you want to provide feedback you can also read the data in chunks using the StreamReader's Read() method, which lets you specify how many characters to read. You'd run this in a loop and provide whatever status info you need on each read.
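The chunked read loop described above could be sketched like this. ChunkedReader, ReadWithProgress and the onProgress callback are hypothetical names; the callback stands in for whatever status display your application uses.

```csharp
using System;
using System.IO;
using System.Text;

public static class ChunkedReader
{
    // Reads text in blocks of up to chunkSize characters and reports
    // the running character count after each block via onProgress.
    public static string ReadWithProgress(StreamReader reader, int chunkSize,
                                          Action<int> onProgress)
    {
        StringBuilder result = new StringBuilder();
        char[] buffer = new char[chunkSize];
        int count;

        // Read() returns 0 once the stream is exhausted
        while ((count = reader.Read(buffer, 0, buffer.Length)) > 0)
        {
            result.Append(buffer, 0, count);
            if (onProgress != null)
                onProgress(result.Length);
        }
        return result.ToString();
    }
}
```

A single Read() call may return fewer characters than requested, so the loop relies on the returned count rather than on the chunk size.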

StreamReader also exposes the underlying raw stream using the BaseStream property, so StreamReader is a good object to use to pass streamed data around.

POSTing data

The example above only retrieves data which is essentially an HTTP GET request. If you want to send data to the server you can use an HTTP POST operation. POSTing data refers to the process of taking data and sending it to the Web server as part of the request payload. A POST operation both sends data to the server and retrieves a response.

Posting uses a stream to send the data to the server, so the process of posting data is pretty much the reverse of retrieving the data (see Listing 2).

Listing 2: POSTing data to the Web Server

using System.IO;
using System.Net;
using System.Text;
using System.Web;

string lcUrl = "http://www.west-wind.com/testpage.wwd";

HttpWebRequest loHttp =
    (HttpWebRequest) WebRequest.Create(lcUrl);

// *** Build the URL encoded POST data
string lcPostData =
    "Name=" + HttpUtility.UrlEncode("Rick Strahl") +
    "&Company=" + HttpUtility.UrlEncode("West Wind");

loHttp.Method = "POST";

// *** Tell the server the body holds URL encoded form data
loHttp.ContentType = "application/x-www-form-urlencoded";

byte[] lbPostBuffer = Encoding.GetEncoding(1252).GetBytes(lcPostData);
loHttp.ContentLength = lbPostBuffer.Length;

// *** Send the POST data
Stream loPostData = loHttp.GetRequestStream();
loPostData.Write(lbPostBuffer, 0, lbPostBuffer.Length);
loPostData.Close();

// *** Retrieve the response
HttpWebResponse loWebResponse = (HttpWebResponse) loHttp.GetResponse();

Encoding enc = Encoding.GetEncoding(1252);

StreamReader loResponseStream =
    new StreamReader(loWebResponse.GetResponseStream(), enc);

string lcHtml = loResponseStream.ReadToEnd();

loResponseStream.Close();
loWebResponse.Close();

Make sure you use this POST code immediately before the HttpWebRequest.GetResponse() call. Any other manipulation of the Request object after this point has no effect, because the headers get sent with the POST buffer. The rest of the code is identical to what was shown before – you retrieve the Response and then read the stream to grab the result data.

POST data needs to be properly encoded when sent to the server. If you're posting information to a Web page you'll have to encode your POST buffer into key value pairs and use URL encoding for the values. You can use the static method System.Web.HttpUtility.UrlEncode() to encode the data; in this case make sure your project references the System.Web assembly. Note this is necessary only if you're posting to a typical HTML page – if you're posting XML or other application content you can just post the raw data as is. This is all much easier to do using a custom class like the one included with this article. This class has an AddPostKey method and, depending on the POST mode, it will take any parameters and properly encode them into an internally managed stream which is then POSTed to the server.
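As a rough idea of what building a URL encoded POST buffer involves, here is a minimal sketch. FormEncoder and BuildBody are hypothetical names invented for this example; this is not the class that ships with the article and it does not reproduce its AddPostKey method.

```csharp
using System;
using System.Collections.Generic;
using System.Text;
using System.Web;

public static class FormEncoder
{
    // Hypothetical helper: joins key/value pairs into an
    // application/x-www-form-urlencoded request body.
    public static string BuildBody(IEnumerable<KeyValuePair<string, string>> fields)
    {
        StringBuilder body = new StringBuilder();
        foreach (KeyValuePair<string, string> field in fields)
        {
            if (body.Length > 0)
                body.Append('&');   // separator between pairs

            body.Append(HttpUtility.UrlEncode(field.Key));
            body.Append('=');
            body.Append(HttpUtility.UrlEncode(field.Value));
        }
        return body.ToString();
    }
}
```

Passing the pairs Name = "Rick Strahl" and Company = "West Wind & Co" produces Name=Rick+Strahl&Company=West+Wind+%26+Co. The ampersand inside the value is escaped, so it can't be mistaken for a pair separator.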

To send the actual data in the POST buffer, the data has to be converted to a byte array first. Again we need to properly encode the string: calling GetBytes() on Encoding.GetEncoding(1252) returns a byte array using the Windows standard ANSI code page. You should then set the ContentLength property so the server knows the size of the data stream coming in. Finally you can write the POST data to the server using the output stream returned from HttpWebRequest.GetRequestStream(). Simply write the entire byte array out to the stream in one Write() method call, passing the size of the byte array. This writes the data and waits for completion. As with the retrieval operation, the stream operations are what actually cause data to be sent to the server, so if you want to provide progress information you can send smaller chunks and provide feedback to the user as you go.
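Sending the buffer in smaller pieces might be sketched as follows. ChunkedWriter, WriteWithProgress and the onProgress callback are hypothetical names, and the chunk size is whatever suits your feedback needs.

```csharp
using System;
using System.IO;

public static class ChunkedWriter
{
    // Writes data to output in blocks of at most chunkSize bytes and
    // reports the running byte count after each block.
    public static void WriteWithProgress(Stream output, byte[] data,
                                         int chunkSize, Action<int> onProgress)
    {
        int offset = 0;
        while (offset < data.Length)
        {
            // The last block may be shorter than chunkSize
            int count = Math.Min(chunkSize, data.Length - offset);
            output.Write(data, offset, count);
            offset += count;

            if (onProgress != null)
                onProgress(offset);
        }
    }
}
```

In Listing 2 this would take the place of the single loPostData.Write() call, with loPostData as the output stream and lbPostBuffer as the data.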
