External tables provide access to data stored in data sources outside of Greenplum Database as if the data were stored in regular database tables. Data can be read from or written to external tables.
An external table is a Greenplum Database table backed by data that resides outside of the database. An external table is either readable or writable. It can be used like a regular database table in SQL commands such as SELECT and INSERT, and it can be joined with other tables. External tables are most often used to load and unload database data.
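As a minimal sketch of the load and unload pattern described above, the statements below define one readable and one writable external table. The host name `etlhost-1`, the port, the file names, and the tables `expenses` and `categories` are hypothetical. The example assumes a gpfdist file server is already running on that host and that the regular tables already exist.

```sql
-- Hypothetical names throughout; adjust hosts, ports, paths, and columns.

-- Readable external table: the rows live in files outside the database.
CREATE EXTERNAL TABLE ext_expenses (
    name         text,
    expense_date date,
    amount       float4,
    category     text
)
LOCATION ('gpfdist://etlhost-1:8081/expenses*.txt')
FORMAT 'TEXT' (DELIMITER '|');

-- Query it like a regular table.
SELECT category, sum(amount) AS total
FROM ext_expenses
GROUP BY category;

-- Load: copy the external rows into a regular table
-- (expenses is assumed to exist with matching columns).
INSERT INTO expenses SELECT * FROM ext_expenses;

-- Join the external table with a regular table.
SELECT e.name, e.amount, c.description
FROM ext_expenses e
JOIN categories c ON c.name = e.category;

-- Writable external table: rows inserted here are written out
-- to the external location instead of being stored in the database.
CREATE WRITABLE EXTERNAL TABLE ext_expenses_out (LIKE expenses)
LOCATION ('gpfdist://etlhost-1:8081/expenses_out.txt')
FORMAT 'TEXT' (DELIMITER '|')
DISTRIBUTED BY (name);

-- Unload: write database rows to the external file.
INSERT INTO ext_expenses_out SELECT * FROM expenses;
```

A readable external table accepts only SELECT, and a writable one accepts only INSERT, which is why the load and unload directions use two separate table definitions.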
Web-based external tables provide access to data served by an HTTP server or an operating system process. See Creating and Using External Web Tables for more about web-based tables.
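The two kinds of web-based table mentioned above might be declared roughly as follows. This is an illustrative sketch: the URL, the script path, and the column lists are assumptions, not values from this guide.

```sql
-- Hypothetical URL, script path, and columns.

-- URL-based web table: data is fetched from an HTTP server.
CREATE EXTERNAL WEB TABLE ext_expenses_web (
    name         text,
    expense_date date,
    amount       float4,
    category     text
)
LOCATION ('http://intranet.example.com/finance/expenses.csv')
FORMAT 'CSV' (HEADER);

-- Command-based web table: data is the output of an operating
-- system command or script, run here once on each segment host.
CREATE EXTERNAL WEB TABLE log_output (
    linenum int,
    message text
)
EXECUTE '/var/load_scripts/get_log_data.sh' ON HOST
FORMAT 'TEXT' (DELIMITER '|');
```

Because web table data can change between one read and the next, each query should be treated as a fresh read of the source.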
- Defining External Tables
  External tables enable accessing external data as if it were a regular database table. They are often used to move data into and out of a Greenplum database.
- Accessing External Data with PXF
  Data managed by your organization may already reside in external sources such as Hadoop, object stores, and other SQL databases. The Greenplum Platform Extension Framework (PXF) provides access to this external data via built-in connectors that map an external data source to a Greenplum Database table definition.
- Accessing HDFS Data with gphdfs (Deprecated)
  Greenplum Database leverages the parallel architecture of a Hadoop Distributed File System to read and write data files efficiently using the gphdfs protocol.
- Using the Greenplum Parallel File Server (gpfdist)
  The gpfdist protocol is used in a CREATE EXTERNAL TABLE SQL command to access external data served by the Greenplum Database gpfdist file server utility. When external data is served by gpfdist, all segments in the Greenplum Database system can read or write external table data in parallel.
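To make the gpfdist and PXF entries above more concrete, here is a hedged sketch of how each protocol can appear in a table definition. The host names, ports, paths, and columns are hypothetical. The PXF profile name `hdfs:text` depends on the installed PXF version and its server configuration, so treat it as an assumption.

```sql
-- Hypothetical hosts, paths, and columns.

-- gpfdist: assumes a gpfdist instance is already running on each
-- ETL host, for example started from the shell with:
--   gpfdist -d /var/load_files -p 8081 &
-- Listing several gpfdist locations spreads the read across file
-- servers, and the segments pull data from them in parallel.
CREATE EXTERNAL TABLE ext_sales (
    region     text,
    sale_month text,
    num_orders int,
    total      float8
)
LOCATION ('gpfdist://etlhost-1:8081/sales/*.csv',
          'gpfdist://etlhost-2:8081/sales/*.csv')
FORMAT 'CSV' (HEADER);

-- PXF: assumes the pxf extension is enabled in this database and a
-- default PXF server is configured for the Hadoop cluster.
CREATE EXTERNAL TABLE pxf_sales (
    region     text,
    sale_month text,
    num_orders int,
    total      float8
)
LOCATION ('pxf://data/sales/2019.csv?PROFILE=hdfs:text')
FORMAT 'TEXT' (DELIMITER ',');
```

In both cases the table definition only describes where the data is and how it is formatted; no data moves until a query reads from the table.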
Parent topic: Greenplum Database Administrator Guide