Postgres Tips and Tricks

Содержание

1. Postgres Tips and Tricks
2. Why does my query run slowSELECT max(l1.upload_timestamp)
3. Why does my query run slowSELECT max(l1.upload_timestamp)
4. Why does my query run slowSELECT ucase(final_lab_upload_log.lab)
5. How much will this really benefit me?The
6. How to generate this tableThis table is
7. Reduce CodeDon’t write the same query in
8. How much is the savings worth
9. Where did my NULL’s go?SELECT * FROM
10. The normal way to get current job
11. Fast way to get current job SELECT
12. Query Plan DifferenceOriginal QueryDISTINCT ON QueryQuery plan
13. How to compare two queriesThere is a
14. How to compare two queriesThere is a
15. How to compare two queriesNormally you will
16. How to compare two queriesIf you want
17. Finding the slow line in your queryEXPLAIN
18. Number of months between datesdate_trunc(‘month’, date) --
19. Creating a Table from a ViewCREATE TABLE
20. Update a Table from a ViewBEGIN;
21. What order is my data in?When you
22. What order is my data in?UPDATE test
23. ORDER BY fieldNormally you use ORDER BY
24. TRUNCATE vers DELETETRUNCATE is normally the best
25. pg_stat_activity 9.1-SELECT * FROM pg_stat_activity;This will show
26. POSTGRES 9.2+Postgres Tips & Tricks
27. pg_stat_activity 9.2+SELECT * FROM pg_stat_activity;This will show
28. Killing your own backend’sSELECT pg_cancel_backend(pid);SELECT pg_terminate_backend(pid);This cancels
29. POSTGRES 9.3+Postgres Tips & Tricks
30. Materialized ViewsCREATE MATERIALIZED VIEW schema.materialized_view ASSELECT *
31. Foreign Data TablesCREATE EXTENSION postgres_fdw; CREATE SERVER db_main
32. Скачать презентанцию

Why does my query run slowSELECT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network) as network, l1.filename, count(l1.file_id) AS NumberUploads FROM lab_upload_log AS l1 GROUP BY lcase(l1.network), l1.filenameDISTINCT and GROUP BY give

Слайды и текст этой презентации

Слайд 1Postgres Tips and Tricks
By Lloyd Albin
5/1/2013, 6/11/2013

Слайд 2Why does my query run slow
SELECT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network)

as network, l1.filename, count(l1.file_id) AS NumberUploads FROM lab_upload_log AS l1 GROUP

BY lcase(l1.network), l1.filename

DISTINCT and GROUP BY give the same results, but internally DISTINCT is faster.

SELECT DISTINCT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network) as network, l1.filename, count(l1.file_id) AS NumberUploads FROM lab_upload_log AS l1 GROUP BY lcase(l1.network), l1.filename

Why does my query run slowSELECT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network) as network, l1.filename, count(l1.file_id) AS

Слайд 3Why does my query run slow
SELECT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network)

as network, l1.filename, count(l1.file_id) AS NumberUploads FROM lab_upload_log AS l1 GROUP

BY lcase(l1.network), l1.filename

CREATE INDEX lab_upload_log_idx ON lab_data_ops.lab_upload_log
USING btree (network, filename);

Create a functional index for the field

CREATE INDEX lab_upload_log_idx2 ON lab_data_ops.lab_upload_log
USING btree (lower(network), filename);

Create an index for the field if you are going to use the field in a WHERE clause or a GROUP BY clause.

Слайд 4Why does my query run slow
SELECT ucase(final_lab_upload_log.lab) AS lab, count(final_lab_upload_log.filename) AS

"# Unique Files", max(x.NumUploads) AS "Total # Uploads“ FROM final_lab_upload_log LEFT JOIN (
SELECT

count(lab_upload_log.filename) AS NumUploads, ucase(lab_upload_log.lab) AS lab FROM lab_upload_log WHERE lcase(lab_upload_log.network)='vtn‘ AND lab_upload_log.upload_timestamp > (curdate() - 30) GROUP BY ucase(lab_upload_log.lab)) AS x ON x.lab = final_lab_upload_log.lab WHERE final_lab_upload_log.LastUploadTimestamp > (curdate() - 30) GROUP BY ucase(final_lab_upload_log.lab)

Remove doubling of the ucase. The doubling stops the index from being used.

SELECT ucase(final_lab_upload_log.lab) AS lab, …
GROUP BY ucase(final_lab_upload_log.lab)

-- 833 rows returned (execution time: 32 ms; total time: 93 ms) -- 14,719 rows in finance_feeds.ps_job

DISTINCT ON with ORDER BY and GROUP BY with MIN or MAX can give the same results, but internally DISTINCT ON is faster and is easier to read the code.

The normal way to get current job SELECT d.* FROM ( SELECT b.emplid,

Слайд 11Fast way to get current job
SELECT * FROM
(

SELECT DISTINCT ON (emplid, empl_rcd) *
FROM finance_feeds.ps_job
ORDER BY

emplid, empl_rcd, effdt DESC, effseq DESC
) a
WHERE a.empl_status::text <> 'T' ::text;

-- 833 rows returned (execution time: 31 ms; total time: 31 ms)
-- 14,719 rows in finance_feeds.ps_job

Much simpler code using DISTINCT ON with ORDER BY instead of GROUP BY with MAX.

Fast way to get current job SELECT * FROM ( SELECT DISTINCT ON (emplid, empl_rcd) * FROM

Слайд 12Query Plan Difference
Original Query
DISTINCT ON Query
Query plan is almost ½

the speed and only scans the table once instead of

three times.

LabKey does not support DISTINCT ON at this time.

https://www.labkey.org/wiki/home/Documentation/page.view?name=labkeySql

Query Plan DifferenceOriginal QueryDISTINCT ON QueryQuery plan is almost ½ the speed and only scans the table

Слайд 13How to compare two queries
There is a great command called

EXCEPT. This will compare the results of two queries and

tell you what is in the first query that is not in the second query.

SELECT * FROM view_a
EXCEPT
SELECT * FROM view_b

SELECT * FROM (SELECT * FROM view_a)) a
EXCEPT
SELECT * FROM (SELECT * FROM view_b)) b

This will show you all lines in view_a that are not in view_b. To find out all lines on view_b that are not in view_a, reverse the two queries. If you are comparing two complex query statements, wrap them in a simple SELECT statement so that the EXCEPT will not get confused.

How to compare two queriesThere is a great command called EXCEPT. This will compare the results of

Слайд 14How to compare two queries
There is a great command called

EXCEPT. This will compare the results of two queries and

tell you what is in the first query that is not in the second query.

SELECT * FROM (
SELECT 1
UNION
SELECT 2
) a

EXCEPT

SELECT * FROM (
SELECT 1
UNION
SELECT 3
) b

Слайд 15How to compare two queries
Normally you will also want to

reverse the two queries so that you can check the

results going the other direction. This way you have two sets of results, what is extra in each query.

SELECT * FROM (
SELECT 1
UNION
SELECT 3
) b

EXCEPT

SELECT * FROM (
SELECT 1
UNION
SELECT 2
) a

How to compare two queriesNormally you will also want to reverse the two queries so that you

Слайд 16How to compare two queries
If you want only the records

that match, then you want to use INTERSECT.

SELECT * FROM

(
SELECT 1
UNION
SELECT 3
) b

INTERSECT

SELECT * FROM (
SELECT 1
UNION
SELECT 2
) a

How to compare two queriesIf you want only the records that match, then you want to use

Слайд 17Finding the slow line in your query
EXPLAIN will show you

the query plan, and this by itself is helpful, but

even more helpful is the EXPLAIN (ANALYZE, BUFFERS) which compares the query plan to what actually happened when the query was run. Also use http://explain.depesz.com/

Nested Loop Left Join (cost=305819.49..449850.69 rows=1 width=572) (actual time=1871.328..9512784.289 rows=10983 loops=1)
Filter: ((NOT (hashed SubPlan 42)) AND ((SubPlan 38) IS NOT NULL))
Rows Removed by Filter: 74
Buffers: shared hit=2412568803 read=99

Estimate rows=1 vers Actual rows=10983. When you have big difference between these numbers, this is a sign of a problem. This can be caused by not having enough statistics, not having an index, etc.
Actual time=1871.328..9912784.289. This means that this row started 1.87 seconds into the query and took the difference of the two times, 2.64 hours, to complete

Finding the slow line in your queryEXPLAIN will show you the query plan, and this by itself

Слайд 18Number of months between dates
date_trunc(‘month’, date) -- First day of

the month
date + ‘1 day’::interval -- Converts last day of

the month to the first day of next month
age(date, date) -- The difference between two timestamps as an interval
date_part(‘year’, date/interval) -- returns only the year portion of the date/interval

SELECT
(date_part('year', age(max(b.earnenddate)::timestamp + interval '1 day',
date_trunc('month',min(b.earnenddate)::date)::timestamp))*12 +
date_part('month', age(max(b.earnenddate)::timestamp + interval '1 day',
date_trunc('month',min(b.earnenddate)::date)::timestamp)))::integer
AS elapsed_months
FROM (
SELECT '06/30/2012' AS earnenddate
UNION
SELECT '12/31/2013' AS earnenddate
) b;

Number of months between datesdate_trunc(‘month’, date) -- First day of the monthdate + ‘1 day’::interval -- Converts

Слайд 19Creating a Table from a View
CREATE TABLE schema.table AS

SELECT * FROM schema.view;

SELECT * INTO schema.table
FROM

schema.view;

The PostgreSQL usage of SELECT INTO to represent table creation is historical. It is best to use CREATE TABLE AS for this purpose in new code.

This allows you to create a table without having to look up all the field names and types to first generate a table and then fill it with the results of the view.

Creating a Table from a ViewCREATE TABLE schema.table AS SELECT * FROM schema.view;SELECT * INTO schema.table

Слайд 20Update a Table from a View
BEGIN;

TRUNCATE schema.table;

INSERT INTO schema.table SELECT * FROM schema.view;

COMMIT;
This allows you to

take the results of a view and append them to an existing table.

You may wish to TRUNCATE schema.table before adding the new data.

Update a Table from a ViewBEGIN; TRUNCATE schema.table; INSERT INTO schema.table SELECT * FROM schema.view;COMMIT;This

Слайд 21What order is my data in?
When you don’t use an

ORDER BY clause, your data is in physical order of

insert and update.

CREATE TEMP TABLE test (
key SERIAL,
test BOOLEAN
);

INSERT INTO test VALUES (1,FALSE);
INSERT INTO test VALUES (2,FALSE);
INSERT INTO test VALUES (3,FALSE);

SELECT * FROM test;

What order is my data in?When you don’t use an ORDER BY clause, your data is in

Слайд 22What order is my data in?
UPDATE test SET test =

TRUE WHERE key = 2;

SELECT * FROM test;

What order is my data in?UPDATE test SET test = TRUE WHERE key = 2;SELECT * FROM

Слайд 23ORDER BY field
Normally you use ORDER BY field_name, but you

can also use ORDER BY field_number. I have found this

to sometimes be useful when unioning one or more sets of data together.

SELECT 'test5' AS test1
UNION ALL
SELECT 'test2' AS test2
ORDER BY 1

Notes:
ORDER BY test1 will work. ORDER BY test2 will not work. UNION ALL gives you all rows from each SELECT and runs much faster. UNION only gives you DISTINCT rows between the two tables and ordered

ORDER BY fieldNormally you use ORDER BY field_name, but you can also use ORDER BY field_number. I

Слайд 24TRUNCATE vers DELETE
TRUNCATE is normally the best way to go

because it removes all the data within the table(s) quickly

and by specifying more than one table, deals automatically with foreign key dependencies. Delete can take a long time depending on the foreign key dependencies, etc.

TRUNCATE is not MVCC-save, so after truncation, the table will appear empty to all concurrent transactions, even if they are using a snapshot taken before the truncation occurred. DELETE does not have this issue.

RESTART IDENTITY
Automatically restart sequences owned by columns of the truncated table(s).

TRUNCATE vers DELETETRUNCATE is normally the best way to go because it removes all the data within

Слайд 25pg_stat_activity 9.1-
SELECT * FROM pg_stat_activity;

This will show you what queries

that you currently have running on a server. As user

Postgres, you will see all queries running on a server. If there is no query running, you will see .

pg_stat_activity 9.1-SELECT * FROM pg_stat_activity;This will show you what queries that you currently have running on a

Слайд 26POSTGRES 9.2+
Postgres Tips & Tricks

Слайд 27pg_stat_activity 9.2+
SELECT * FROM pg_stat_activity;

This will show you what queries

that you currently have running on a server. As user

Postgres, you will see all queries running on a server. They will also say or . The connections show you the last query executed.

pg_stat_activity 9.2+SELECT * FROM pg_stat_activity;This will show you what queries that you currently have running on a

Слайд 28Killing your own backend’s
SELECT pg_cancel_backend(pid);

SELECT pg_terminate_backend(pid);

This cancels your current command

and closes your connection. If you are in the middle

of a transaction, the transaction will be aborted instantly.

This cancels your current command and leaves your connection open for your next command. If you are in the middle of a transaction, the transaction will be aborted once you try and COMMIT your transaction. It will also complain about every line failing until you try and COMMIT.

If all the backends you see are gone and you are still getting the open connections when trying to drop your database, contact a dba.

Killing your own backend’sSELECT pg_cancel_backend(pid);SELECT pg_terminate_backend(pid);This cancels your current command and closes your connection. If you are

Слайд 29POSTGRES 9.3+
Postgres Tips & Tricks

Слайд 30Materialized Views
CREATE MATERIALIZED VIEW schema.materialized_view AS
SELECT * FROM schema.table;

SELECT *

FROM schema.materialized_view;

REFRESH MATERIALIZED VIEW schema.materialized_view;
This is basically a simple melding

of a TABLE and VIEW into a single entity. When you create the MATERIALIZED VIEW is populates the underlying TABLE. Every time you use the MATERIALIZED VIEW it returns you the data in the TABLE. To update the data in the TABLE, you need to run the REFRESH MATERIALIZED VIEW command.

Materialized ViewsCREATE MATERIALIZED VIEW schema.materialized_view ASSELECT * FROM schema.table;SELECT * FROM schema.materialized_view;REFRESH MATERIALIZED VIEW schema.materialized_view;This is basically

Слайд 31Foreign Data Tables
CREATE EXTENSION postgres_fdw;

CREATE SERVER db_main FOREIGN DATA WRAPPER

postgres_fdw
OPTIONS (host 'db.scharp.org', dbname 'main', port '5432');

CREATE USER

MAPPING FOR postgres SERVER db_main
OPTIONS (user 'webservices', password 'password');

CREATE FOREIGN TABLE ist.webservices_token (
"time" TIMESTAMP WITHOUT TIME ZONE DEFAULT now() NOT NULL,
token TEXT NOT NULL
)
SERVER db_main;

SELECT * FROM ist.webservices_token;

The Postgres DBA’s should take care of the EXTENSION, SERVER and USER MAPPING’s. The developer can then create the FOREIGN TABLES. For each user that wants to use the FOREIGN TABLE, there must be a USER MAPPING created by a DBA.

Foreign Data TablesCREATE EXTENSION postgres_fdw; CREATE SERVER db_main FOREIGN DATA WRAPPER postgres_fdw OPTIONS (host 'db.scharp.org', dbname 'main',

Скачать презентацию

Разделы презентаций

Postgres Tips and Tricks

Содержание

Слайды и текст этой презентации

Слайд 1Postgres Tips and TricksBy Lloyd Albin5/1/2013, 6/11/2013

Слайд 2Why does my query run slowSELECT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network)

as network, l1.filename, count(l1.file_id) AS NumberUploads FROM lab_upload_log AS l1 GROUP

Слайд 3Why does my query run slowSELECT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network)

as network, l1.filename, count(l1.file_id) AS NumberUploads FROM lab_upload_log AS l1 GROUP

Слайд 4Why does my query run slowSELECT ucase(final_lab_upload_log.lab) AS lab, count(final_lab_upload_log.filename) AS

"# Unique Files", max(x.NumUploads) AS "Total # Uploads“ FROM final_lab_upload_log LEFT JOIN ( SELECT

Слайд 11Fast way to get current job SELECT * FROM (

SELECT DISTINCT ON (emplid, empl_rcd) * FROM finance_feeds.ps_job ORDER BY

Слайд 12Query Plan DifferenceOriginal QueryDISTINCT ON QueryQuery plan is almost ½

the speed and only scans the table once instead of

Слайд 13How to compare two queriesThere is a great command called

EXCEPT. This will compare the results of two queries and

Слайд 14How to compare two queriesThere is a great command called

EXCEPT. This will compare the results of two queries and

Слайд 15How to compare two queriesNormally you will also want to

reverse the two queries so that you can check the

Слайд 16How to compare two queriesIf you want only the records

that match, then you want to use INTERSECT.SELECT * FROM

Слайд 17Finding the slow line in your queryEXPLAIN will show you

the query plan, and this by itself is helpful, but

Слайд 18Number of months between datesdate_trunc(‘month’, date) -- First day of

the monthdate + ‘1 day’::interval -- Converts last day of

Слайд 19Creating a Table from a ViewCREATE TABLE schema.table AS

SELECT * FROM schema.view;SELECT * INTO schema.table FROM

Слайд 20Update a Table from a ViewBEGIN; TRUNCATE schema.table;

INSERT INTO schema.table SELECT * FROM schema.view;COMMIT;This allows you to

Слайд 21What order is my data in?When you don’t use an

ORDER BY clause, your data is in physical order of

Слайд 22What order is my data in?UPDATE test SET test =

TRUE WHERE key = 2;SELECT * FROM test;

Слайд 23ORDER BY fieldNormally you use ORDER BY field_name, but you

can also use ORDER BY field_number. I have found this

Слайд 24TRUNCATE vers DELETETRUNCATE is normally the best way to go

because it removes all the data within the table(s) quickly

Слайд 25pg_stat_activity 9.1-SELECT * FROM pg_stat_activity;This will show you what queries

that you currently have running on a server. As user

Слайд 26POSTGRES 9.2+Postgres Tips & Tricks

Слайд 27pg_stat_activity 9.2+SELECT * FROM pg_stat_activity;This will show you what queries

that you currently have running on a server. As user

Слайд 28Killing your own backend’sSELECT pg_cancel_backend(pid);SELECT pg_terminate_backend(pid);This cancels your current command

and closes your connection. If you are in the middle

Слайд 29POSTGRES 9.3+Postgres Tips & Tricks

Слайд 30Materialized ViewsCREATE MATERIALIZED VIEW schema.materialized_view ASSELECT * FROM schema.table;SELECT *

FROM schema.materialized_view;REFRESH MATERIALIZED VIEW schema.materialized_view;This is basically a simple melding

Слайд 31Foreign Data TablesCREATE EXTENSION postgres_fdw; CREATE SERVER db_main FOREIGN DATA WRAPPER

postgres_fdw OPTIONS (host 'db.scharp.org', dbname 'main', port '5432'); CREATE USER

Похожие презентации

Обратная связь

Что такое TheSlide.ru?

Слайд 1Postgres Tips and Tricks
By Lloyd Albin
5/1/2013, 6/11/2013

Слайд 2Why does my query run slow
SELECT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network)

Слайд 3Why does my query run slow
SELECT max(l1.upload_timestamp) AS LastUploadTimestamp, lcase(l1.network)

Слайд 4Why does my query run slow
SELECT ucase(final_lab_upload_log.lab) AS lab, count(final_lab_upload_log.filename) AS

"# Unique Files", max(x.NumUploads) AS "Total # Uploads“ FROM final_lab_upload_log LEFT JOIN (
SELECT

Слайд 11Fast way to get current job
SELECT * FROM
(

SELECT DISTINCT ON (emplid, empl_rcd) *
FROM finance_feeds.ps_job
ORDER BY

Слайд 12Query Plan Difference
Original Query
DISTINCT ON Query
Query plan is almost ½

Слайд 13How to compare two queries
There is a great command called

Слайд 14How to compare two queries
There is a great command called

Слайд 15How to compare two queries
Normally you will also want to

Слайд 16How to compare two queries
If you want only the records

that match, then you want to use INTERSECT.

SELECT * FROM

Слайд 17Finding the slow line in your query
EXPLAIN will show you

Слайд 18Number of months between dates
date_trunc(‘month’, date) -- First day of

the month
date + ‘1 day’::interval -- Converts last day of

Слайд 19Creating a Table from a View
CREATE TABLE schema.table AS

SELECT * FROM schema.view;

SELECT * INTO schema.table
FROM

Слайд 20Update a Table from a View
BEGIN;

TRUNCATE schema.table;

INSERT INTO schema.table SELECT * FROM schema.view;

COMMIT;
This allows you to

Слайд 21What order is my data in?
When you don’t use an

Слайд 22What order is my data in?
UPDATE test SET test =

TRUE WHERE key = 2;

SELECT * FROM test;

Слайд 23ORDER BY field
Normally you use ORDER BY field_name, but you

Слайд 24TRUNCATE vers DELETE
TRUNCATE is normally the best way to go

Слайд 25pg_stat_activity 9.1-
SELECT * FROM pg_stat_activity;

This will show you what queries

Слайд 26POSTGRES 9.2+
Postgres Tips & Tricks

Слайд 27pg_stat_activity 9.2+
SELECT * FROM pg_stat_activity;

This will show you what queries

Слайд 28Killing your own backend’s
SELECT pg_cancel_backend(pid);

SELECT pg_terminate_backend(pid);

This cancels your current command

Слайд 29POSTGRES 9.3+
Postgres Tips & Tricks

Слайд 30Materialized Views
CREATE MATERIALIZED VIEW schema.materialized_view AS
SELECT * FROM schema.table;

SELECT *

FROM schema.materialized_view;

REFRESH MATERIALIZED VIEW schema.materialized_view;
This is basically a simple melding

Слайд 31Foreign Data Tables
CREATE EXTENSION postgres_fdw;

CREATE SERVER db_main FOREIGN DATA WRAPPER

postgres_fdw
OPTIONS (host 'db.scharp.org', dbname 'main', port '5432');

CREATE USER