Summary of Data Loading Features

This topic provides a quick reference to the features supported when using the COPY INTO <table> command to bulk load data from files into Snowflake tables.

Data File Details

| Feature | Supported | Notes |
| --- | --- | --- |
| Location of files | Local environment | Files are first staged in a Snowflake internal location, then loaded into a table. |
| | AWS S3 | Files can be loaded directly from any user-supplied S3 bucket. |
| | Microsoft Azure | Files can be loaded directly from any user-supplied Azure container. |
| File formats | Delimited (CSV, TSV, etc.) | Any single-character delimiter is supported; default is comma (i.e. CSV). |
| | JSON | |
| | Avro | Includes automatic detection and processing of Snappy-compressed Avro files. |
| | ORC | Includes automatic detection and processing of Snappy- or zlib-compressed ORC files. |
| | Parquet | |
| | XML | Supported as a preview feature. |
| File encoding | File-format specific | For delimited files (CSV, TSV, etc.), see Supported Character Sets (for Delimited Data Files) (in this topic). UTF-8 for all other supported file formats (JSON, Avro, etc.). |
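
As a rough illustration of how the file format and delimiter choices above end up in a load command, the following Python sketch assembles a COPY INTO statement as a string. The helper name `build_copy_statement` and the table and stage names are hypothetical; `TYPE` and `FIELD_DELIMITER` are assumed to be the relevant file format options and should be checked against the COPY INTO reference before use. The sketch only builds text and does not connect to Snowflake.

```python
def build_copy_statement(table, stage, file_type="CSV", field_delimiter=None):
    """Build a COPY INTO statement string (illustrative helper, not a Snowflake API)."""
    options = [f"TYPE = {file_type}"]
    if field_delimiter is not None:
        # The feature summary above lists single-character delimiters for delimited files.
        if len(field_delimiter) != 1:
            raise ValueError("field_delimiter must be a single character")
        # Escape backslashes and single quotes for a single-quoted SQL literal.
        escaped = field_delimiter.replace("\\", "\\\\").replace("'", "''")
        options.append(f"FIELD_DELIMITER = '{escaped}'")
    return f"COPY INTO {table} FROM @{stage} FILE_FORMAT = ({' '.join(options)})"


# Hypothetical table and stage names, pipe-delimited files.
statement = build_copy_statement("my_table", "my_stage", field_delimiter="|")
print(statement)
```

Omitting `field_delimiter` leaves the default in place, which for delimited files is the comma.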

Compression of Staged Files

| Feature | Supported | Notes |
| --- | --- | --- |
| Uncompressed files | gzip | When staging uncompressed files in a Snowflake internal location, the files are automatically compressed using gzip, unless compression is explicitly disabled. |
| Already-compressed files | gzip | When loading staged files that have already been compressed, Snowflake can automatically detect any of these compression methods or you can explicitly specify the method that was used to compress the files. |
| | bzip2 | |
| | deflate | |
| | raw_deflate | |
| | Brotli | Loading staged files compressed using Brotli or Zstandard is currently a preview feature. Auto-detection is not yet supported for these methods; when staging or loading files compressed with either of these methods, the compression method must be explicitly specified. |
| | Zstandard | See the note for Brotli. |
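
Before staging, it can help to check locally how a file was compressed so that the method can be stated explicitly when needed. The sketch below guesses the method from the leading bytes of a file. It is a local pre-check under stated assumptions, not a description of how Snowflake performs its own detection: the function name `guess_compression` is hypothetical, only gzip, bzip2, and zlib-wrapped deflate are recognized, and anything else (including raw_deflate, Brotli, and Zstandard) returns `None`.

```python
import bz2
import gzip
import zlib


def guess_compression(header: bytes):
    """Guess a compression method from the first bytes of a file, or return None."""
    if header.startswith(b"\x1f\x8b"):
        return "gzip"
    if header.startswith(b"BZh"):
        return "bzip2"
    # zlib-wrapped deflate: low nibble of the first byte is 8 and the first
    # two bytes, read as a big-endian integer, are a multiple of 31.
    # This is a heuristic; some uncompressed data can match it by chance.
    if len(header) >= 2 and (header[0] & 0x0F) == 8:
        if ((header[0] << 8) | header[1]) % 31 == 0:
            return "deflate"
    return None


# Try the guess on small in-memory samples.
samples = {
    "gzip sample": gzip.compress(b"a,b,c\n"),
    "bzip2 sample": bz2.compress(b"a,b,c\n"),
    "zlib sample": zlib.compress(b"a,b,c\n"),
    "plain sample": b"a,b,c\n",
}
for label, data in samples.items():
    print(label, "->", guess_compression(data[:4]))
```

When the guess is `None`, the safer course is to specify the compression method explicitly rather than rely on detection.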

Encryption of Staged Files

| Feature | Supported | Notes |
| --- | --- | --- |
| Unencrypted files | 128-bit or 256-bit keys | When staging unencrypted files in a Snowflake internal location, the files are automatically encrypted using 128-bit keys. 256-bit keys can be enabled (for stronger encryption); however, additional configuration is required. |
| Already-encrypted files | User-supplied key | Files that are already encrypted can be loaded into Snowflake from an S3 bucket or Azure container; the key used to encrypt the files must be provided to Snowflake. |

Supported Character Sets (for Delimited Data Files)

| Character Set | Supported Languages |
| --- | --- |
| Big5 | Traditional Chinese |
| EUC-JP | Japanese |
| EUC-KR | Korean |
| GB18030 | Chinese |
| IBM420 | Arabic |
| IBM424 | Hebrew |
| ISO-2022-CN | Simplified Chinese |
| ISO-2022-JP | Japanese |
| ISO-2022-KR | Korean |
| ISO-8859-1 | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |
| ISO-8859-2 | Czech, Hungarian, Polish, Romanian |
| ISO-8859-5 | Russian |
| ISO-8859-6 | Arabic |
| ISO-8859-7 | Greek |
| ISO-8859-8 | Hebrew |
| ISO-8859-9 | Turkish |
| KOI8-R | Russian |
| Shift_JIS | Japanese |
| UTF-8 | All languages |
| UTF-16BE | All languages |
| UTF-16LE | All languages |
| UTF-32BE | All languages |
| UTF-32LE | All languages |
| windows-1250 | Czech, Hungarian, Polish, Romanian |
| windows-1251 | Russian |
| windows-1252 | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |
| windows-1253 | Greek |
| windows-1254 | Turkish |
| windows-1255 | Hebrew |
| windows-1256 | Arabic |
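
The character set of a delimited file matters because the same bytes decode to different text under different character sets. The following Python sketch, which uses only Python's built-in codecs and makes no claim about Snowflake internals, shows a value written as UTF-8 being read back correctly and then being garbled when read as ISO-8859-1. The helper name `decode_field` is hypothetical.

```python
def decode_field(raw: bytes, character_set: str) -> str:
    """Decode the raw bytes of one field using the named character set."""
    return raw.decode(character_set)


# The word "café" as it would be stored in a UTF-8 encoded file.
raw = "café".encode("utf-8")

# Decoding with the character set the file was written in recovers the text.
print(decode_field(raw, "utf-8"))

# Decoding with a different character set succeeds but yields different text,
# because each byte of the two-byte UTF-8 sequence is read as its own character.
print(decode_field(raw, "iso-8859-1"))
```

Because a mismatch like this produces no error, stating the character set that the file was actually written in is the only reliable safeguard.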