generate_data

Contents

generate_data#

generate_data(n_base, n_layer, seed=None) NestedFrame[source]#

Generates a toy dataset.

Parameters:
  • n_base (int) – The number of rows to generate for the base layer

  • n_layer (int, or dict) – The number of rows per n_base row to generate for a nested layer. Alternatively, a dictionary of layer label, layer_size pairs may be specified to created multiple nested columns with custom sizing.

  • seed (int) – A seed to use for random generation of data

Returns:

The constructed NestedFrame.

Return type:

NestedFrame

Examples

>>> from nested_pandas.datasets import generate_data
>>> nf1 = generate_data(10,100)
>>> nf2 = generate_data(10, {"nested_a": 100, "nested_b": 200})