SlideShare a Scribd company logo
4
Most read
5
Most read
12
Most read
Getting Started with
Postgresql
For Charlotte SQL Saturday on 10/20/2018
By Ramu Pulipati
Sponsors
About Me
Cofounder / CTO at Botsplash
Software-as-a-service omnichannel
messaging platform to engage businesses
and customers.
botsplash.com
About Postgresql
• Free – Open Source RDMS alternative to Microsoft SQL Server,
MySQL, Oracle, Mongo DB or any RDBMS and NoSQL databases
• Battle tested for over 30 years with continuous improvements to
support modern applications
• Rich ANSI/ISO Complaint SQL, JSON and Multiple Languages support
• Built-in and extensions for high scalability, geolocation and time series
based applications
• Hosted Providers and Commercial Support
• Used for embedded database, web applications and data warehouses
History & What’s new Version Major Feature
V11
10/18/2018
Partition & Index Improvements
Query Parallelism Improvements
Just-in-time compilation
V10
10/05/2017
Declarative Table Partitions
Logical Replication
Improved Monitoring and Control
V9.6
09/29/2016
Parallel Query
Foreign Data Wrappers
Replication
V9.5
07/01/2016
JSONB Modifying Operators
Row level security
UPSERT statement
V9.4
12/18/2014
Introduce JSONB
Materialized View Improvements
Source: Postgresql Releases
Capabilities
• Full featured Database
• ACID Transactions
• Mature Server Side Programming
• Hot Standby and High Availability
• Online backups
• Point-in-time recovery
• Native Table Partitioning
• Spatial Functionality
• Full Text Search
• JSON and HStore support
Source: Introduction to Postgresql Slideshare
80%
Commercial
Databases
Postgresql
Capabilities (contd …)
• Replication and Read Replicas
• Built-in pub-sub queue
• Multiple schemas with security
• Row level security
• Native SSL Support
• Data Level Encryption
• Extensions
• Database sharding
• Timeseries data
• Multi-tenancy
• Extensible and Customizable
Licensing & Community
• BSD License open for Open
Source, Private and
Commercial use
• No vendor Lock-in
• Predictable releases
• Faster Bug fixes
• Active Community
Source: postgresql.org
Compare to RDBMS
• MySQL
• Has better write performance but
Postgres is ACID compliant and has
better features, reliability and
consistency.
• SQL Server
• Featureful with recently introduced
JSON support. Postgres has
superset of features and scalable
with extensions.
• Oracle
• Most comprehensive database but
very expensive.
• Mongo Db
• Postgresql outshines read/write for
unstructured with JSONB.
Source: NoSQL Performance Benchmark 02/27/2018
Getting started with postgresql
Features
• Affordability
• Technology
• Security
• Flexibility
• Stability
• Extensibility
• Reliability
• Predictability
• Community
• Auditability
Source: Introduction to Postgresql Slideshare
Getting Started
• Download and Install
• Windows packaged from EnterpriseDb
• Linux from apt or source
• Mac installations using homebrew.
• Production recommended on *nix
systems. Postgres 11 supports Windows
2012 R2 deployment.
• Cloud hosted databases from Compose,
Citus Data, AWS RDS, Goolge CloudSql,
Azure Databases are excellent choice.
• Be careful with Docker/Kubernetes
deployments.
Postgresql Client
• Instance accessed from
TCP/IP Port: 5432 default
• CLI/IDE Clients
• PSQL, PgAdmin 4, PGCLI,
TablePlus, Navicat
• Application Clients
• Python SqlAlchemy, NodeJs
Pg, C# Client, Golang pq
Sample Connection String: postgres://username:password@127.0.0.1:5432/northwind
Postgresql Architecture
• Multi-process
architecture
• Primary postmaster
• Per connection backend
process
• Background maintenance
processes
• Use pgBouncer to load
balance large number of
connections
Source: The Internals of Postgresql
Schema and SQL Support
• Basic and Complex data types
• Complex Queries
• Functions, Operators and Aggregate
functions
• Transactional Integrity
• Indexes
• Procedural Language
• Optimized for data storage
• Extend and create custom data types
• Error handling
Data Type Commonly Used Names
Number Smallint, integer, Bigint, decimal,
Money
Number
(auto)
Serial, BigSerial
Character Char, nchar, Text
Binary Bytea
Date Time Timestamp with timezone,
Timestamp without Timezone,
interval
Collections Array, JSON, JSONB
Misc Boolean, UUID, Geo Types, Network
Address Mac Address, Custom Types
Full Text
Search
TsVector, TsQuery
See details from tutorials point
Demo Sample Table and DDL functions
Table DDL
Sample Function
More SQL / NoSQL Support
• Case Sensitive Schema Names
• Case Sensitive Data Storage
• Type casting
• Date Time formatting
• Common Table Expressions
• Array Data Types
• Upserts
• Pivot Tables crosstab function
• Materialized Views
• HStore
• JSONB Structure
• Rich Functions Support
• EXPLAIN ANALYZE
• Pg_stat
Psql tricks and tips
Command Description
l List of all databases
c <db> Change database name
d Show all objects in database
d <item> Describe database item
x [auto] Show one record at a time
timing Turn on/off sql timing
e Edit in an editor (vi or preferred editor)
copy … Save results to CSV file
h Help with SQL
Ctrl + R Search previous SQL command
ir Reference SQL file
Source: pgdash.io
Concurrency - MVCC
• Postgres uses “Multi-version concurrency control”. More details from
postgres internals or interdb.
• All reads are from the snapshot of the database until committed.
• Advantage of this method is reading does not block writing and vice-
versa. Isolation at “read-committed” level
• This method supports table level and row level locking as well.
• Other databases that use this technique are Oracle, CouchDB, etc
Indexes
• Index Support
• Single Column
• Multi Column
• Unique
• Partial Indexes
• Expression Indexes
• Implicit Indexes (PK, UK)
• Index Types
• B-Tree: Balanced Trees used in most databases
• Generalized Inverted Index (GIN): Useful for full text search
• Generalized Search Tree (GIST): Useful for geometric and full text search
B-Tree
Disadvantages / Limits
• Database only solution
• Few available tools compared to
commercial. PgAdmin is catching
up but relatively new.
• CPU bound queries and table
inheritance based partitioning
before v10.
• Limited Talent pool
Source: https://www.youtube.com/watch?v=6p2TNPabt6M
Source: Postgres limits from sreenstepslive.com
Best Practices
• Performance Optimization
• Do not read from Database at all
• Use indexes efficiently
• Use pg_stats collector to monitor usage and performance
• Vacuum regularly to clean up storage
• Bench mark hardware and optimize pg parameters
• Always use SSL for connectivity in untrusted networks
References and Follow up
• Introduction to Postgresql slideshare
• The Internals of Postgresql
• Postgresql excercises
• Postgres Blogs: 2nd Quadrant, Percona and Citus Data, Citus - Craig
Kerstiens
Questions? Contact me at ramu@botsplash.com

More Related Content

PPTX
PPTX
PostgreSQL Database Slides
PDF
PostgreSQL Tutorial For Beginners | Edureka
PPTX
PostgreSQL- An Introduction
PDF
Postgresql tutorial
PDF
Mastering PostgreSQL Administration
 
PPTX
Git 101 for Beginners
PDF
2019.06.27 Intro to Ceph
PostgreSQL Database Slides
PostgreSQL Tutorial For Beginners | Edureka
PostgreSQL- An Introduction
Postgresql tutorial
Mastering PostgreSQL Administration
 
Git 101 for Beginners
2019.06.27 Intro to Ceph

What's hot (20)

PDF
Postgresql database administration volume 1
PDF
PostgreSQL Deep Internal
PDF
PostgreSQL Performance Tuning
PDF
Deep dive into PostgreSQL statistics.
PPT
Oracle Transparent Data Encryption (TDE) 12c
PPTX
Logical Replication in PostgreSQL
 
ODP
Introduction to PostgreSQL
PPTX
Introduction to PostgreSQL
PDF
Linux tuning to improve PostgreSQL performance
PDF
Optimizing MariaDB for maximum performance
PDF
[pgday.Seoul 2022] PostgreSQL구조 - 윤성재
PPTX
Mongodb basics and architecture
PDF
What is new in PostgreSQL 14?
PDF
PostgreSQL HA
ZIP
NoSQL databases
PPTX
MySQL8.0_performance_schema.pptx
PDF
Oracle Performance Tuning Fundamentals
PDF
Migration From Oracle to PostgreSQL
PPTX
Introduction to NoSQL Databases
PDF
Get to know PostgreSQL!
Postgresql database administration volume 1
PostgreSQL Deep Internal
PostgreSQL Performance Tuning
Deep dive into PostgreSQL statistics.
Oracle Transparent Data Encryption (TDE) 12c
Logical Replication in PostgreSQL
 
Introduction to PostgreSQL
Introduction to PostgreSQL
Linux tuning to improve PostgreSQL performance
Optimizing MariaDB for maximum performance
[pgday.Seoul 2022] PostgreSQL구조 - 윤성재
Mongodb basics and architecture
What is new in PostgreSQL 14?
PostgreSQL HA
NoSQL databases
MySQL8.0_performance_schema.pptx
Oracle Performance Tuning Fundamentals
Migration From Oracle to PostgreSQL
Introduction to NoSQL Databases
Get to know PostgreSQL!
Ad

Similar to Getting started with postgresql (20)

PPTX
Postgres for Digital Transformation: NoSQL Features, Replication, FDW & More
PDF
Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)
PPTX
Brk3288 sql server v.next with support on linux, windows and containers was...
PDF
SQL Server 2019 CTP2.4
PDF
PostgreSQL High Availability in a Containerized World
PDF
MongoDB 4.0 새로운 기능 소개
PDF
Hpc lunch and learn
PDF
Optymalizacja środowiska Open Source w celu zwiększenia oszczędności i kontroli
 
PPTX
Nordic infrastructure Conference 2017 - SQL Server on Linux Overview
PPTX
SQL Server 2017 on Linux Introduction
PDF
Big data conference europe real-time streaming in any and all clouds, hybri...
PDF
Save money with Postgres on IBM PowerLinux
 
PPTX
Azure Cosmos DB: Features, Practical Use and Optimization "
PPTX
EDB Database Servers and Tools
PDF
Introduction to Postrges-XC
PPTX
Summer 2017 undergraduate research powerpoint
PPT
Postgres for the Future
 
PPTX
Experience sql server on l inux and docker
PPTX
Doing More with Postgres - Yesterday's Vision Becomes Today's Reality
 
PDF
"Clouds on the Horizon Get Ready for Drizzle" by David Axmark @ eLiberatica 2009
Postgres for Digital Transformation: NoSQL Features, Replication, FDW & More
Agile Oracle to PostgreSQL migrations (PGConf.EU 2013)
Brk3288 sql server v.next with support on linux, windows and containers was...
SQL Server 2019 CTP2.4
PostgreSQL High Availability in a Containerized World
MongoDB 4.0 새로운 기능 소개
Hpc lunch and learn
Optymalizacja środowiska Open Source w celu zwiększenia oszczędności i kontroli
 
Nordic infrastructure Conference 2017 - SQL Server on Linux Overview
SQL Server 2017 on Linux Introduction
Big data conference europe real-time streaming in any and all clouds, hybri...
Save money with Postgres on IBM PowerLinux
 
Azure Cosmos DB: Features, Practical Use and Optimization "
EDB Database Servers and Tools
Introduction to Postrges-XC
Summer 2017 undergraduate research powerpoint
Postgres for the Future
 
Experience sql server on l inux and docker
Doing More with Postgres - Yesterday's Vision Becomes Today's Reality
 
"Clouds on the Horizon Get Ready for Drizzle" by David Axmark @ eLiberatica 2009
Ad

More from botsplash.com (15)

PDF
Migrating to postgresql
PPTX
Bootstrap SaaS startup using Open Source Tools
PPTX
Devops Days, 2019 - Charlotte
PPTX
Building NLP solutions for Davidson ML Group
PPTX
Building NLP solutions using Python
PPTX
Chat interfaces, Extension to Digital Marketing
PPTX
Cloud computing options
PPTX
Data Science meets Digital Marketing
PPTX
botsplash deep dive
PPTX
Building Twitter bot using Python
PPTX
Python for data science
PPTX
Live development & tools
PPTX
AI Use Cases discussion
PPTX
Career advice for beginner software engineers
PPTX
Node.js Getting Started &amd Best Practices
Migrating to postgresql
Bootstrap SaaS startup using Open Source Tools
Devops Days, 2019 - Charlotte
Building NLP solutions for Davidson ML Group
Building NLP solutions using Python
Chat interfaces, Extension to Digital Marketing
Cloud computing options
Data Science meets Digital Marketing
botsplash deep dive
Building Twitter bot using Python
Python for data science
Live development & tools
AI Use Cases discussion
Career advice for beginner software engineers
Node.js Getting Started &amd Best Practices

Recently uploaded (20)

PPTX
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
PDF
Test Bank, Solutions for Java How to Program, An Objects-Natural Approach, 12...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
PDF
CIFDAQ's Token Spotlight: SKY - A Forgotten Giant's Comeback?
PDF
Software Development Methodologies in 2025
PPTX
ABU RAUP TUGAS TIK kelas 8 hjhgjhgg.pptx
PDF
Google’s NotebookLM Unveils Video Overviews
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
PDF
Why Endpoint Security Is Critical in a Remote Work Era?
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
SparkLabs Primer on Artificial Intelligence 2025
PDF
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
PDF
REPORT: Heating appliances market in Poland 2024
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PPTX
CroxyProxy Instagram Access id login.pptx
PDF
Top Generative AI Tools for Patent Drafting in 2025.pdf
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
Test Bank, Solutions for Java How to Program, An Objects-Natural Approach, 12...
Understanding_Digital_Forensics_Presentation.pptx
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
CIFDAQ's Token Spotlight: SKY - A Forgotten Giant's Comeback?
Software Development Methodologies in 2025
ABU RAUP TUGAS TIK kelas 8 hjhgjhgg.pptx
Google’s NotebookLM Unveils Video Overviews
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Why Endpoint Security Is Critical in a Remote Work Era?
NewMind AI Weekly Chronicles - August'25 Week I
SparkLabs Primer on Artificial Intelligence 2025
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
REPORT: Heating appliances market in Poland 2024
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
GamePlan Trading System Review: Professional Trader's Honest Take
CroxyProxy Instagram Access id login.pptx
Top Generative AI Tools for Patent Drafting in 2025.pdf
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...

Getting started with postgresql

  • 1. Getting Started with Postgresql For Charlotte SQL Saturday on 10/20/2018 By Ramu Pulipati
  • 3. About Me Cofounder / CTO at Botsplash Software-as-a-service omnichannel messaging platform to engage businesses and customers. botsplash.com
  • 4. About Postgresql • Free – Open Source RDMS alternative to Microsoft SQL Server, MySQL, Oracle, Mongo DB or any RDBMS and NoSQL databases • Battle tested for over 30 years with continuous improvements to support modern applications • Rich ANSI/ISO Complaint SQL, JSON and Multiple Languages support • Built-in and extensions for high scalability, geolocation and time series based applications • Hosted Providers and Commercial Support • Used for embedded database, web applications and data warehouses
  • 5. History & What’s new Version Major Feature V11 10/18/2018 Partition & Index Improvements Query Parallelism Improvements Just-in-time compilation V10 10/05/2017 Declarative Table Partitions Logical Replication Improved Monitoring and Control V9.6 09/29/2016 Parallel Query Foreign Data Wrappers Replication V9.5 07/01/2016 JSONB Modifying Operators Row level security UPSERT statement V9.4 12/18/2014 Introduce JSONB Materialized View Improvements Source: Postgresql Releases
  • 6. Capabilities • Full featured Database • ACID Transactions • Mature Server Side Programming • Hot Standby and High Availability • Online backups • Point-in-time recovery • Native Table Partitioning • Spatial Functionality • Full Text Search • JSON and HStore support Source: Introduction to Postgresql Slideshare 80% Commercial Databases Postgresql
  • 7. Capabilities (contd …) • Replication and Read Replicas • Built-in pub-sub queue • Multiple schemas with security • Row level security • Native SSL Support • Data Level Encryption • Extensions • Database sharding • Timeseries data • Multi-tenancy • Extensible and Customizable
  • 8. Licensing & Community • BSD License open for Open Source, Private and Commercial use • No vendor Lock-in • Predictable releases • Faster Bug fixes • Active Community Source: postgresql.org
  • 9. Compare to RDBMS • MySQL • Has better write performance but Postgres is ACID compliant and has better features, reliability and consistency. • SQL Server • Featureful with recently introduced JSON support. Postgres has superset of features and scalable with extensions. • Oracle • Most comprehensive database but very expensive. • Mongo Db • Postgresql outshines read/write for unstructured with JSONB. Source: NoSQL Performance Benchmark 02/27/2018
  • 11. Features • Affordability • Technology • Security • Flexibility • Stability • Extensibility • Reliability • Predictability • Community • Auditability Source: Introduction to Postgresql Slideshare
  • 12. Getting Started • Download and Install • Windows packaged from EnterpriseDb • Linux from apt or source • Mac installations using homebrew. • Production recommended on *nix systems. Postgres 11 supports Windows 2012 R2 deployment. • Cloud hosted databases from Compose, Citus Data, AWS RDS, Goolge CloudSql, Azure Databases are excellent choice. • Be careful with Docker/Kubernetes deployments.
  • 13. Postgresql Client • Instance accessed from TCP/IP Port: 5432 default • CLI/IDE Clients • PSQL, PgAdmin 4, PGCLI, TablePlus, Navicat • Application Clients • Python SqlAlchemy, NodeJs Pg, C# Client, Golang pq Sample Connection String: postgres://username:[email protected]:5432/northwind
  • 14. Postgresql Architecture • Multi-process architecture • Primary postmaster • Per connection backend process • Background maintenance processes • Use pgBouncer to load balance large number of connections Source: The Internals of Postgresql
  • 15. Schema and SQL Support • Basic and Complex data types • Complex Queries • Functions, Operators and Aggregate functions • Transactional Integrity • Indexes • Procedural Language • Optimized for data storage • Extend and create custom data types • Error handling Data Type Commonly Used Names Number Smallint, integer, Bigint, decimal, Money Number (auto) Serial, BigSerial Character Char, nchar, Text Binary Bytea Date Time Timestamp with timezone, Timestamp without Timezone, interval Collections Array, JSON, JSONB Misc Boolean, UUID, Geo Types, Network Address Mac Address, Custom Types Full Text Search TsVector, TsQuery See details from tutorials point
  • 16. Demo Sample Table and DDL functions Table DDL Sample Function
  • 17. More SQL / NoSQL Support • Case Sensitive Schema Names • Case Sensitive Data Storage • Type casting • Date Time formatting • Common Table Expressions • Array Data Types • Upserts • Pivot Tables crosstab function • Materialized Views • HStore • JSONB Structure • Rich Functions Support • EXPLAIN ANALYZE • Pg_stat
  • 18. Psql tricks and tips Command Description l List of all databases c <db> Change database name d Show all objects in database d <item> Describe database item x [auto] Show one record at a time timing Turn on/off sql timing e Edit in an editor (vi or preferred editor) copy … Save results to CSV file h Help with SQL Ctrl + R Search previous SQL command ir Reference SQL file Source: pgdash.io
  • 19. Concurrency - MVCC • Postgres uses “Multi-version concurrency control”. More details from postgres internals or interdb. • All reads are from the snapshot of the database until committed. • Advantage of this method is reading does not block writing and vice- versa. Isolation at “read-committed” level • This method supports table level and row level locking as well. • Other databases that use this technique are Oracle, CouchDB, etc
  • 20. Indexes • Index Support • Single Column • Multi Column • Unique • Partial Indexes • Expression Indexes • Implicit Indexes (PK, UK) • Index Types • B-Tree: Balanced Trees used in most databases • Generalized Inverted Index (GIN): Useful for full text search • Generalized Search Tree (GIST): Useful for geometric and full text search B-Tree
  • 21. Disadvantages / Limits • Database only solution • Few available tools compared to commercial. PgAdmin is catching up but relatively new. • CPU bound queries and table inheritance based partitioning before v10. • Limited Talent pool Source: https://www.youtube.com/watch?v=6p2TNPabt6M Source: Postgres limits from sreenstepslive.com
  • 22. Best Practices • Performance Optimization • Do not read from Database at all • Use indexes efficiently • Use pg_stats collector to monitor usage and performance • Vacuum regularly to clean up storage • Bench mark hardware and optimize pg parameters • Always use SSL for connectivity in untrusted networks
  • 23. References and Follow up • Introduction to Postgresql slideshare • The Internals of Postgresql • Postgresql excercises • Postgres Blogs: 2nd Quadrant, Percona and Citus Data, Citus - Craig Kerstiens Questions? Contact me at [email protected]

Editor's Notes

  • #3: A quick comment about sponsors. SQL Saturdays cannot take place without the funding provided by sponsors. The speakers are not paid. The organizers and other folks running around making sure this event runs smoothly are all volunteers. However, his facility, the food, and other expenses that go into putting on an event of this magnitude requires money. Sponsors provide that money. So, show your appreciation by saying hi and thank you when you stop by the sponsor tables to stuff your raffle ticket into the box. You might even take a couple of minutes to ask about their product and services. You may learn something valuable that you can bring back to your work, or that might become a career opportunity. It's all part of the very important networking you should be doing while you are here.
  • #5: PostgreSQL is an object-relational database management system (ORDBMS) based on POSTGRES, Version 4.2, developed at the University of California at Berkeley Computer Science Department. POSTGRES pioneered many concepts that only became available in some commercial database systems much later.
  • #10: ACID (Atomicity, Consistency, Isolation, Durability) https://www.youtube.com/watch?v=6p2TNPabt6M
  • #24: https://www.slideshare.net/ScaleGrid/understanding-high-availability-options-in-postgresql?qid=d9d63ef4-7140-44a0-b9c6-5a5fafca02a0&v=&b=&from_search=18