Guides
This section contains compact how-to guides, each focused on achieving a single goal. For API references and examples, see the rest of the documentation.
Data Import and Export
CSV Files
Parquet Files
- How to load a Parquet file into a table
- How to export a table to a Parquet file
- How to run a query directly on a Parquet file
HTTP(S), S3 and GCP
- How to load a Parquet file directly from HTTP(S)
- How to load a Parquet file directly from S3
- How to export a Parquet file to S3
- How to load a Parquet file from S3 Express One
- How to load a Parquet file directly from GCS
- How to load a Parquet file directly from Cloudflare R2
- How to load an Iceberg table directly from S3
JSON Files
Excel Files with the Spatial Extension
Directly Reading Files
Performance
- My workload is slow (troubleshooting guide)
- How to design the schema for optimal performance
- What is the ideal hardware environment for Goose
- What performance implications do Parquet files and (compressed) CSV files have
- How to tune workloads
- Benchmarks
Meta Queries
- How to list all tables
- How to view the schema of the result of a query
- How to quickly get a feel for a dataset using summarize
- How to view the query plan of a query
- How to profile a query
Python Client
- How to install the Python client
- How to execute SQL queries
- How to easily query Goose in Jupyter Notebooks
- How to easily query Goose in marimo Notebooks
- How to use multiple Python threads with Goose
- How to use fsspec filesystems with Goose
Pandas
- How to execute SQL on a Pandas DataFrame
- How to create a table from a Pandas DataFrame
- How to export data to a Pandas DataFrame
Apache Arrow
- How to execute SQL on Apache Arrow
- How to create a Goose table from Apache Arrow
- How to export data to Apache Arrow