SQL Injection Attacks by Example

17 Jan 2005 | by Stephen J. Friedl

Mitigations

We believe that web application developers often simply do not think about "surprise inputs", but security people do (including the bad guys), so there are several broad approaches that can be applied here.

Sanitize the input

It's absolutely vital to sanitize user inputs to ensure that they do not contain dangerous codes, whether to the SQL server or to HTML itself. One's first idea is to strip out "bad stuff", such as quotes or semicolons or escapes, but this is a misguided attempt. Though it's easy to point out some dangerous characters, it's harder to point to all of them. The language of the web is full of special characters and strange markup (including alternate ways of representing the same characters), and efforts to authoritatively identify all "bad stuff" are unlikely to be successful.

Instead, rather than "remove known bad data", it's better to "remove everything but known good data": this distinction is crucial. Since - in our example - an email address can contain only these characters:

  abcdefghijklmnopqrstuvwxyz
  ABCDEFGHIJKLMNOPQRSTUVWXYZ
  0123456789
  @.-_+

there is really no benefit in allowing characters that could not be valid, and rejecting them early - presumably with an error message - not only helps forestall SQL Injection, but also catches mere typos early rather than storing them in the database.
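As a rough illustration of "known good data only", the check below accepts a value only if every character appears in the list above. It is a minimal sketch in Python: the name is_plausible_email and the requirement of exactly one "@" are assumptions made for this example, and it is in no way a complete address validator.

```python
import re

# Only the characters listed above: letters, digits, and @ . - _ +
ALLOWED_EMAIL = re.compile(r"[A-Za-z0-9@._+-]+")


def is_plausible_email(value: str) -> bool:
    """Return True only if value consists solely of whitelisted characters.

    Requiring exactly one '@' is an extra assumption of this sketch.
    """
    return ALLOWED_EMAIL.fullmatch(value) is not None and value.count("@") == 1


if __name__ == "__main__":
    for candidate in ["bob@example.com", "x' OR 'x'='x", "no-at-sign.example.com"]:
        print(candidate, "->", is_plausible_email(candidate))
```

Anything that fails the check would be rejected with an error message before it gets anywhere near a query.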

Sidebar on email addresses

It's important to note here that email addresses in particular are troublesome to validate programmatically, because everybody seems to have his own idea about what makes one "valid", and it's a shame to exclude a good email address because it contains a character you didn't think about.

The only real authority is RFC2822 (which encompasses the more familiar RFC822), and it includes a fairly expansive definition of what's allowed. The truly pedantic may well wish to accept email addresses with ampersands and asterisks (among other things) as valid, but others - including this author - are satisfied with a reasonable subset that includes "most" email addresses.

Those taking a more restrictive approach ought to be fully aware of the consequences of excluding these addresses, especially considering that better techniques (prepare/execute, stored procedures) obviate the security concerns which those "odd" characters present.

Be aware that "sanitizing the input" doesn't mean merely "remove the quotes", because even "regular" characters can be troublesome. In an example where an integer ID value is being compared against the user input (say, a numeric PIN), an input of 23 OR 1=1 contains no quotes at all:


SELECT fieldlist
  FROM table
 WHERE id = 23 OR 1=1;  -- Boom! Always matches!

In practice, however, this approach is highly limited because there are so few fields for which it's possible to outright exclude many of the dangerous characters. For "dates" or "email addresses" or "integers" it may have merit, but for any kind of real application, one simply cannot avoid the other mitigations.
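For a field that really is an integer, the check can be as strict as the field itself. The following is a small sketch in Python; parse_pin is a hypothetical helper name, and the digits-only rule is an assumption about what a "numeric PIN" permits.

```python
def parse_pin(value: str) -> int:
    """Return value as an int, or raise ValueError unless it is all ASCII digits."""
    if not (value.isascii() and value.isdigit()):
        raise ValueError("PIN must contain digits only")
    return int(value)


if __name__ == "__main__":
    print(parse_pin("23"))
    try:
        parse_pin("23 OR 1=1")
    except ValueError as exc:
        print("rejected:", exc)
```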

Escape/Quotesafe the input

Even if one might be able to sanitize a phone number or email address, one cannot take this approach with a "name" field lest one wishes to exclude the likes of Bill O'Reilly from one's application: a quote is simply a valid character for this field.

One includes an actual single quote in an SQL string by putting two of them together, so this suggests the obvious - but wrong! - technique of preprocessing every string to replicate the single quotes:

SELECT fieldlist
  FROM customers
 WHERE name = 'Bill O''Reilly';  -- works OK

However, this naïve approach can be beaten because most databases support other string escape mechanisms. MySQL, for instance, also permits \' to escape a quote, so after input of \'; DROP TABLE users; -- is "protected" by doubling the quotes, we get:

SELECT fieldlist
  FROM customers
 WHERE name = '\''; DROP TABLE users; --';  -- Boom!

The expression '\'' is a complete string (containing just one single quote), and the usual SQL shenanigans follow. It doesn't stop with backslashes either: there is Unicode, other encodings, and parsing oddities all hiding in the weeds to trip up the application designer.
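The failure is easy to reproduce without a database at all. The sketch below (Python, with hypothetical helper names naive_quote and build_query) only builds the query text, so one can see what the "protected" string looks like before a server that honors backslash escapes ever parses it.

```python
def naive_quote(value: str) -> str:
    """Double every single quote and wrap the result in quotes (the flawed approach)."""
    return "'" + value.replace("'", "''") + "'"


def build_query(name: str) -> str:
    """Splice the 'protected' value into the SQL text, as the naive application would."""
    return "SELECT fieldlist FROM customers WHERE name = " + naive_quote(name) + ";"


if __name__ == "__main__":
    # Harmless input: the doubled quote is read back as one literal quote.
    print(build_query("Bill O'Reilly"))
    # Hostile input: the backslash turns the first of the doubled quotes into
    # an escaped character, so the second one closes the string early.
    print(build_query("\\'; DROP TABLE users; --"))
```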

Getting quotes right is notoriously difficult, which is why many database interface languages provide a function that does it for you. When the same internal code is used for "string quoting" and "string parsing", it's much more likely that the process will be done properly and safely.

Some examples are the MySQL function mysql_real_escape_string() and the Perl DBI method $dbh->quote($value).

These methods must be used.

Use bound parameters (the PREPARE statement)

Though quotesafing is a good mechanism, we're still in the area of "considering user input as SQL", and a much better approach exists: bound parameters, which are supported by essentially all database programming interfaces. In this technique, an SQL statement string is created with placeholders - typically a question mark or a named marker such as @email for each parameter - and it's compiled ("prepared", in SQL parlance) into an internal form.

Later, this prepared query is "executed" with a list of parameters. Here's an example in ASP.NET:

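' Requires: Imports System.Data and Imports System.Data.SqlClient
Dim thisCommand As SqlCommand = New SqlCommand("SELECT email, userid " & _
    "FROM members WHERE email = @email", Connection)

' Bind the user's input as data; it is never parsed as SQL
thisCommand.Parameters.Add("@email", SqlDbType.VarChar).Value = Request.QueryString("email")

Dim dr As SqlDataReader = thisCommand.ExecuteReader()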

Here, the value from Request.QueryString("email") is the data obtained from the user's form, and it is bound to the named parameter @email; at no point do the contents of this value have anything to do with SQL statement parsing. Quotes, semicolons, backslashes, SQL comment notation - none of this has any impact, because it's "just data". There simply is nothing to subvert, so the application will be largely immune to SQL injection attacks.

There also may be some performance benefits if this prepared query is reused multiple times (it only has to be parsed once), but this is minor compared to the enormous security benefits. This is probably the single most important step one can take to secure a web application.
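The same idea looks much the same in other interfaces. Here is a minimal sketch using Python's standard sqlite3 module, which uses question-mark placeholders; the in-memory database and the find_member helper are assumptions for illustration, with the table and column names borrowed from the example above.

```python
import sqlite3


def find_member(conn: sqlite3.Connection, email: str):
    """Look up a member by email using a bound parameter (the ? placeholder)."""
    cur = conn.execute(
        "SELECT email, userid FROM members WHERE email = ?",
        (email,),  # passed as data, never spliced into the SQL text
    )
    return cur.fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE members (email TEXT, userid INTEGER)")
    conn.execute("INSERT INTO members VALUES (?, ?)", ("bob@example.com", 1))
    print(find_member(conn, "bob@example.com"))  # one matching row
    print(find_member(conn, "x' OR 'x'='x"))     # no rows: compared as a plain string
```

The classic injection string is simply compared against the email column as an ordinary value, and it matches nothing.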

Limit database permissions and segregate users

In the case at hand, we observed just two interactions that are made not in the context of a logged-in user: "log in" and "send me password". The web application ought to use a database connection with the most limited rights possible: query-only access to the members table, and no access to any other table.

The effect here is that even a "successful" SQL injection attack is going to have much more limited success. Here, we'd not have been able to do the UPDATE request that ultimately granted us access, so we'd have had to resort to other avenues.

Once the web application determined that a set of valid credentials had been passed via the login form, it would then switch that session to a database connection with more rights.

It should go almost without saying that "sa" (system administrator) rights should never be used for any web-based application.
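On a server database this separation would normally be done with distinct accounts and GRANT statements. As a stand-in, the sketch below uses SQLite's read-only open mode from Python to show the effect: the restricted connection can query but cannot UPDATE. The helper names and the throwaway database are assumptions for this example only.

```python
import os
import sqlite3
import tempfile
from pathlib import Path


def open_login_connection(db_path: str) -> sqlite3.Connection:
    """Open a connection that can read but not modify the database."""
    return sqlite3.connect(Path(db_path).as_uri() + "?mode=ro", uri=True)


def demo() -> str:
    """Create a throwaway database, then try to UPDATE through the read-only handle."""
    with tempfile.TemporaryDirectory() as tmp:
        db_path = os.path.join(tmp, "members.db")
        admin = sqlite3.connect(db_path)  # full rights, used for setup only
        admin.execute("CREATE TABLE members (email TEXT, userid INTEGER)")
        admin.execute("INSERT INTO members VALUES (?, ?)", ("bob@example.com", 1))
        admin.commit()
        admin.close()

        limited = open_login_connection(db_path)
        try:
            limited.execute("UPDATE members SET email = ?", ("attacker@example.com",))
            return "update succeeded"
        except sqlite3.OperationalError:
            return "update rejected"
        finally:
            limited.close()


if __name__ == "__main__":
    print(demo())
```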

Use stored procedures for database access

When the database server supports them, use stored procedures for performing access on the application's behalf, which can eliminate SQL from the application code entirely (assuming the stored procedures themselves are written properly).

By encapsulating the rules for a certain action - query, update, delete, etc. - into a single procedure, it can be tested and documented on a standalone basis and business rules enforced (for instance, the "add new order" procedure might reject that order if the customer were over his credit limit).

For simple queries this might be only a minor benefit, but as the operations become more complicated (or are used in more than one place), having a single definition for the operation means it's going to be more robust and easier to maintain.

Note: it's always possible to write a stored procedure that itself constructs a query dynamically: this provides no protection against SQL Injection - it's only proper binding with prepare/execute or direct SQL statements with bound variables that provides this protection.

Isolate the webserver

Even having taken all these mitigation steps, it's nevertheless still possible to miss something and leave the server open to compromise. One ought to design the network infrastructure to assume that the bad guy will have full administrator access to the machine, and then attempt to limit how that can be leveraged to compromise other things.

For instance, putting the machine in a DMZ with extremely limited pinholes "inside" the network means that even getting complete control of the webserver doesn't automatically grant full access to everything else. This won't stop everything, of course, but it makes it a lot harder.

Configure error reporting

The default error reporting for some frameworks includes developer debugging information, and this must not be shown to outside users. Imagine how much easier it makes things for an attacker if the full query is shown, pointing to the syntax error involved.

This information is useful to developers, but it should be restricted - if possible - to just internal users.
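One way to arrange this is to catch database errors at the boundary, log the details for developers, and hand the user only a generic message. The following is a sketch in Python with the standard sqlite3 and logging modules; the lookup_email function, the "webapp" logger name, and the message wording are all assumptions for illustration.

```python
import logging
import sqlite3

log = logging.getLogger("webapp")


def lookup_email(conn: sqlite3.Connection, email: str) -> str:
    """Return a user-facing message; database details go to the log only."""
    try:
        row = conn.execute(
            "SELECT email FROM members WHERE email = ?", (email,)
        ).fetchone()
    except sqlite3.Error:
        # The traceback and driver message are recorded for internal use.
        log.exception("member lookup failed")
        return "Sorry, something went wrong. Please try again later."
    return "Address found." if row else "Address not found."
```

The user sees the same bland text whether the failure was a syntax error, a missing table, or a lost connection.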

Note that not all databases are configured the same way, and not all even support the same dialect of SQL (the "S" stands for "Structured", not "Standard"). For instance, versions of MySQL before 4.1 do not support subselects, and MySQL interfaces usually do not allow multiple statements in a single call: these are substantially complicating factors when attempting to penetrate a network.

About the author

Stephen J. Friedl

UNIX Wizard and Microsoft MVP

www.unixwiz.net
