to_parquet

NestedFrame.to_parquet(path, large_list=False, **kwargs) -> None

Writes the data of a NestedFrame to parquet, either as a single file in which each nested dataset is packed into its own column, or as one file per layer.

Parquet files are always written with the pyarrow engine.

Parameters:
  • path (str) – The path to the parquet file

  • large_list (bool) – If False (default), use regular list_ (int32 offsets). Set to True to use large_list (int64 offsets), which is required when the total number of nested elements across all rows exceeds 2**31 - 1.

  • kwargs (keyword arguments, optional) – Keyword arguments to pass to pyarrow.parquet.write_table

Return type:

None

Examples

>>> from nested_pandas.datasets.generation import generate_data
>>> nf = generate_data(5, 5, seed=1)
>>> nf.to_parquet("nestedframe.parquet")
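The large_list threshold described under Parameters can be checked ahead of time. The following is a minimal sketch, not part of nested_pandas: needs_large_list and row_lengths are hypothetical names, and the sketch assumes you already know how many nested elements each row holds.

```python
INT32_MAX = 2**31 - 1  # largest offset a regular list_ (int32 offsets) can store


def needs_large_list(row_lengths):
    """Return True if the summed nested lengths exceed the int32 offset range.

    `row_lengths` is a hypothetical iterable holding the number of nested
    elements in each row of one nested column (an assumption of this sketch).
    """
    return sum(row_lengths) > INT32_MAX


# Three rows of 10**9 elements each total 3 * 10**9, which is above the limit.
print(needs_large_list([10**9, 10**9, 10**9]))  # True
# Five rows of five elements each are far below it.
print(needs_large_list([5] * 5))  # False
```

The boolean result could then be passed as the large_list argument of to_parquet, so that int64 offsets are only used when the data actually requires them.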