Scaling PostgreSQL with GridSQL

Talk Type: 45 Minute Talk
Technical Level: Intermediate
License: Creative Commons - Attribution Only

GridSQL is commonly thought of as a replication solution along the lines of Slony and Bucardo, but the open source GridSQL project actually parallelizes PostgreSQL queries across many servers, allowing performance to scale nearly linearly. In this session, we will discuss the advantages of using GridSQL for large, multi-terabyte data warehouses and how to design your PostgreSQL schemas and queries to take advantage of it. We will dig into how GridSQL plans a query that spans multiple PostgreSQL servers and executes it across those nodes. Some gotchas, such as cross-node joins, will also be discussed.