Performing raw SQL queries
Explore the ORM before using raw SQL!
The Django ORM provides many tools to express queries without writing rawSQL. For example:
- The QuerySet API is extensive.
- You can and aggregate using many built-in . Beyond those, you can createcustom query expressions.
Before using raw SQL, explore . Ask ondjango-users or the to see if the ORM supports your use case.
警告
You should be very careful whenever you write raw SQL. Every time you useit, you should properly escape any parameters that the user can controlby using params
in order to protect against SQL injection attacks.Please read more about SQL injection protection.
The raw()
manager method can be used to perform raw SQL queries thatreturn model instances:
Manager.
raw
(raw_query, params=None, translations=None)- This method takes a raw SQL query, executes it, and returns a
django.db.models.query.RawQuerySet
instance. ThisRawQuerySet
instancecan be iterated over just like a normalQuerySet
to provide object instances.
This is best illustrated with an example. Suppose you have the following model:
You could then execute custom SQL like so:
- >>> for p in Person.objects.raw('SELECT * FROM myapp_person'):
- ... print(p)
- John Smith
- Jane Jones
Of course, this example isn't very exciting — it's exactly the same asrunning Person.objects.all()
. However, raw()
has a bunch of otheroptions that make it very powerful.
Model table names
Where did the name of the Person
table come from in that example?
By default, Django figures out a database table name by joining themodel's "app label" — the name you used in manage.py startapp
— tothe model's class name, with an underscore between them. In the examplewe've assumed that the Person
model lives in an app named myapp
,so its table would be myapp_person
.
For more details check out the documentation for the option, which also lets you manually set thedatabase table name.
警告
No checking is done on the SQL statement that is passed in to .raw()
.Django expects that the statement will return a set of rows from thedatabase, but does nothing to enforce that. If the query does notreturn rows, a (possibly cryptic) error will result.
警告
If you are performing queries on MySQL, note that MySQL's silent type coercionmay cause unexpected results when mixing types. If you query on a stringtype column, but with an integer value, MySQL will coerce the types of all valuesin the table to an integer before performing the comparison. For example, if yourtable contains the values 'abc'
, 'def'
and you query for WHERE mycolumn=0
,both rows will match. To prevent this, perform the correct typecastingbefore using the value in a query.
raw()
automatically maps fields in the query to fields on the model.
The order of fields in your query doesn't matter. In other words, bothof the following queries work identically:
- >>> Person.objects.raw('SELECT id, first_name, last_name, birth_date FROM myapp_person')
- ...
- >>> Person.objects.raw('SELECT last_name, birth_date, first_name, id FROM myapp_person')
- ...
- >>> Person.objects.raw('''SELECT first AS first_name,
- ... last AS last_name,
- ... bd AS birth_date,
- ... pk AS id,
- ... FROM some_other_table''')
As long as the names match, the model instances will be created correctly.
Alternatively, you can map fields in the query to model fields using thetranslations
argument to raw()
. This is a dictionary mapping names offields in the query to names of fields on the model. For example, the abovequery could also be written:
- >>> name_map = {'first': 'first_name', 'last': 'last_name', 'bd': 'birth_date', 'pk': 'id'}
- >>> Person.objects.raw('SELECT * FROM some_other_table', translations=name_map)
Index lookups
raw()
supports indexing, so if you need only the first result you canwrite:
- >>> first_person = Person.objects.raw('SELECT * FROM myapp_person')[0]
However, the indexing and slicing are not performed at the database level. Ifyou have a large number of Person
objects in your database, it is moreefficient to limit the query at the SQL level:
- >>> first_person = Person.objects.raw('SELECT * FROM myapp_person LIMIT 1')[0]
Fields may also be left out:
The Person
objects returned by this query will be deferred model instances(see ). This means that thefields that are omitted from the query will be loaded on demand. For example:
- >>> for p in Person.objects.raw('SELECT id, first_name FROM myapp_person'):
- ... print(p.first_name, # This will be retrieved by the original query
- ...
- John Smith
- Jane Jones
From outward appearances, this looks like the query has retrieved boththe first name and last name. However, this example actually issued 3queries. Only the first names were retrieved by the raw() query — thelast names were both retrieved on demand when they were printed.
There is only one field that you can't leave out - the primary keyfield. Django uses the primary key to identify model instances, so itmust always be included in a raw query. An InvalidQuery
exceptionwill be raised if you forget to include the primary key.
Adding annotations
You can also execute queries containing fields that aren't defined on themodel. For example, we could use to get a listof people with their ages calculated by the database:
- >>> people = Person.objects.raw('SELECT *, age(birth_date) AS age FROM myapp_person')
- >>> for p in people:
- ... print("%s is %s." % (p.first_name, p.age))
- John is 37.
- Jane is 42.
- ...
You can often avoid using raw SQL to compute annotations by instead using aFunc() expression.
If you need to perform parameterized queries, you can use the params
argument to raw()
:
- >>> lname = 'Doe'
- >>> Person.objects.raw('SELECT * FROM myapp_person WHERE last_name = %s', [lname])
params
is a list or dictionary of parameters. You'll use %s
placeholders in the query string for a list, or %(key)s
placeholders for a dictionary (where key
is replaced by adictionary key, of course), regardless of your database engine. Suchplaceholders will be replaced with parameters from the params
argument.
注解
Dictionary params are not supported with the SQLite backend; withthis backend, you must pass parameters as a list.
警告
Do not use string formatting on raw queries or quote placeholders in yourSQL strings!
It's tempting to write the above query as:
- >>> query = 'SELECT * FROM myapp_person WHERE last_name = %s' % lname
- >>> Person.objects.raw(query)
You might also think you should write your query like this (with quotesaround %s
):
- >>> query = "SELECT * FROM myapp_person WHERE last_name = '%s'"
As discussed in , using the params
argument and leaving the placeholders unquoted protects you from SQLinjection attacks, a common exploit where attackers inject arbitrarySQL into your database. If you use string interpolation or quote theplaceholder, you're at risk for SQL injection.
Executing custom SQL directly
Sometimes even Manager.raw()
isn't quite enough: you might need toperform queries that don't map cleanly to models, or directly executeUPDATE
, INSERT
, or DELETE
queries.
In these cases, you can always access the database directly, routing aroundthe model layer entirely.
The object django.db.connection
represents the default databaseconnection. To use the database connection, call toget a cursor object. Then, call cursor.execute(sql, [params])
to executethe SQL and cursor.fetchone()
or cursor.fetchall()
to return theresulting rows.
例如:
- from django.db import connection
- def my_custom_sql(self):
- with connection.cursor() as cursor:
- cursor.execute("UPDATE bar SET foo = 1 WHERE baz = %s", [self.baz])
- cursor.execute("SELECT foo FROM bar WHERE baz = %s", [self.baz])
- row = cursor.fetchone()
- return row
To protect against SQL injection, you must not include quotes around the %s
placeholders in the SQL string.
Note that if you want to include literal percent signs in the query, you have todouble them in the case you are passing parameters:
If you are using , you canuse django.db.connections
to obtain the connection (and cursor) for aspecific database. django.db.connections
is a dictionary-likeobject that allows you to retrieve a specific connection using itsalias:
- from django.db import connections
- with connections['my_db_alias'].cursor() as cursor:
- # Your code here...
By default, the Python DB API will return results without their field names,which means you end up with a list
of values, rather than a dict
. At asmall performance and memory cost, you can return results as a dict
byusing something like this:
- def dictfetchall(cursor):
- "Return all rows from a cursor as a dict"
- columns = [col[0] for col in cursor.description]
- return [
- dict(zip(columns, row))
- for row in cursor.fetchall()
Another option is to use collections.namedtuple()
from the Pythonstandard library. A namedtuple
is a tuple-like object that has fieldsaccessible by attribute lookup; it's also indexable and iterable. Results areimmutable and accessible by field names or indices, which might be useful:
- from collections import namedtuple
- def namedtuplefetchall(cursor):
- "Return all rows from a cursor as a namedtuple"
- desc = cursor.description
- nt_result = namedtuple('Result', [col[0] for col in desc])
- return [nt_result(*row) for row in cursor.fetchall()]
Here is an example of the difference between the three:
- >>> cursor.execute("SELECT id, parent_id FROM test LIMIT 2");
- >>> cursor.fetchall()
- ((54360982, None), (54360880, None))
- >>> cursor.execute("SELECT id, parent_id FROM test LIMIT 2");
- >>> dictfetchall(cursor)
- [{'parent_id': None, 'id': 54360982}, {'parent_id': None, 'id': 54360880}]
- >>> cursor.execute("SELECT id, parent_id FROM test LIMIT 2");
- >>> results = namedtuplefetchall(cursor)
- >>> results
- [Result(id=54360982, parent_id=None), Result(id=54360880, parent_id=None)]
- >>> results[0].id
- 54360982
- >>> results[0][0]
- 54360982
Connections and cursors
connection
and cursor
mostly implement the standard Python DB-APIdescribed in PEP 249 — except when it comes to .
If you're not familiar with the Python DB-API, note that the SQL statement incursor.execute()
uses placeholders, "%s"
, rather than addingparameters directly within the SQL. If you use this technique, the underlyingdatabase library will automatically escape your parameters as necessary.
Also note that Django expects the "%s"
placeholder, not the "?"
placeholder, which is used by the SQLite Python bindings. This is for the sakeof consistency and sanity.
Using a cursor as a context manager:
- with connection.cursor() as c:
- c.execute(...)
相当于:
- c = connection.cursor()
- try:
- c.execute(...)
- finally:
- c.close()
Calling stored procedures
CursorWrapper.
callproc
(procname, params=None, kparams=None)- Calls a database stored procedure with the given name. A sequence(
params
) or dictionary (kparams
) of input parameters may beprovided. Most databases don't supportkparams
. Of Django's built-inbackends, only Oracle supports it.
For example, given this stored procedure in an Oracle database:
This will call it:
- cursor.callproc('test_procedure', [1, 'test'])