select

Columns.select(population_table, peripheral_tables=None, share_selected_columns=0.5)

Marks all categorical or numerical columns that are not sufficiently important as unused.

Args:
population_table (getml.data.DataFrame):

Main table containing the target variable(s) and corresponding to the population Placeholder instance variable.

peripheral_tables (List[getml.data.DataFrame] or dict):

Additional tables corresponding to the peripheral Placeholder instance variable.

share_selected_columns (numerical):

The share of columns to keep. Must be between 0.0 and 1.0.
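
As a rough illustration of what share_selected_columns controls, the following sketch ranks columns by an importance score and splits them into a kept and an unused group. It is not getML's implementation: the helper split_by_importance, the example importance values, and the rounding rule (rounding the number of kept columns up) are all assumptions made for this example.

```python
import math


def split_by_importance(importances, share_selected_columns=0.5):
    """Hypothetical helper (not part of getML).

    Splits column names into (kept, unused) based on their importance.
    """
    if not 0.0 <= share_selected_columns <= 1.0:
        raise ValueError("share_selected_columns must be between 0.0 and 1.0")

    # Rank column names from most to least important.
    ranked = sorted(importances, key=importances.get, reverse=True)

    # Assumption: the number of kept columns is rounded up.
    n_keep = math.ceil(share_selected_columns * len(ranked))

    return ranked[:n_keep], ranked[n_keep:]


# Made-up importance values, for illustration only.
importances = {"age": 0.40, "income": 0.35, "zip_code": 0.15, "gender": 0.10}

kept, unused = split_by_importance(importances, share_selected_columns=0.5)
print(kept)    # ['age', 'income']
print(unused)  # ['zip_code', 'gender']
```

With the default share of 0.5 and four columns, the two most important columns are kept and the other two are treated as unused. A share of 1.0 keeps every column, and a share of 0.0 keeps none.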