Driver Options

TileDB Config Options

Option

Required

Description

uri

Yes

URI of a TileDB sparse or dense array (required)

tiledb.*

Set a , e.g., option("tiledb.vfs.num_threads", 4).

Writing Options

Option

Required

Description

write_buffer_size

Set the TileDB read buffer size in bytes per attribute/coordinates. Defaults to 10MB

schema.dim.<D>.name

Yes

Specify which of the Spark dataframe columns will be dimension D.

schema.dim.<D>.min

Specify the lower bound for the TileDB domain on dimension D.

schema.dim.<D>.max

Specify the upper bound for the TileDB domain on dimension D.

schema.dim.<D>.extent

Specify the tile extent on dimension D.

schema.attr.<A>.filter_list

Specify a filter list for attribute A. The filter list is a list of tuples of the form (name, option), ex: "(byteshuffle, -1), (gzip, 9)".

schema.capacity

Specify the tile capacity for sparse fragments.

schema.tile_order

Specify the tile order.

schema.cell_order

Specify the cell order.

schema.coords_filter_list

Specify the coordinates filter list. The filter list is a list of tuples of the form (name, option), ex: "(byteshuffle, -1), (gzip, 9)".

schema.offsets_filter_list

Specify the offsets filter list. The filter list is a list of tuples of the form (name, option), ex: "(byteshuffle, -1), (gzip, 9)".

Reading Options

Option

Required

Description

order

Result layout order. It can be "row-major"/ "TILEDB_ROW_MAJOR", "col-major" / "TILEDB_COL_MAJOR", "global-order" / "TILEDB_GLOBAL_ORDER", or "unordered"/ "TILEDB_UNORDERED" (default "unordered").

read_buffer_size

Set the TileDB read buffer size in bytes per attribute/coordinates. Defaults to 10MB

allow_read_buffer_realloc

If the read buffer size is too small allow reallocation. Default: True

partition_count

Number of partitions.

PreviousUsage NextSupported Datatypes

Last updated 3 years ago

Was this helpful?