from_db¶
-
classmethod
DataFrame.
from_db
(table_name, name=None, roles=None, ignore=False, dry=False, conn=None)[source]¶ Create a DataFrame from a table in a database.
It will construct a data frame object in the engine, fill it with the data read from table table_name in the connected database (see
database
), and return a correspondingDataFrame
handle.- Args:
- table_name (str):
Name of the table to be read.
- name (str):
Name of the data frame to be created. If not passed, then the table_name will be used.
- roles(dict[str, List[str]] or
Roles
, optional): Maps the
roles
to the column names (seecolnames()
).The roles dictionary is expected to have the following format:
roles = {getml.data.role.numeric: ["colname1", "colname2"], getml.data.role.target: ["colname3"]}
Otherwise, you can use the
Roles
class.- ignore (bool, optional):
Only relevant when roles is not None. Determines what you want to do with any colnames not mentioned in roles. Do you want to ignore them (True) or read them in as unused columns (False)?
- dry (bool, optional):
If set to True, then the data will not actually be read. Instead, the method will only return the roles it would have used. This can be used to hard-code roles when setting up a pipeline.
- conn (
Connection
, optional): The database connection to be used. If you don’t explicitly pass a connection, the engine will use the default connection.
- Returns:
-
Handler of the underlying data.
- Example:
getml.database.connect_mysql( host="relational.fit.cvut.cz", port=3306, dbname="financial", user="guest", password="relational" ) loan = getml.DataFrame.from_db( table_name='loan', name='data_frame_loan')