read_csv

getml.database.read_csv(name, fnames, quotechar='"', sep=',', num_lines_read=0, skip=0, colnames=None, conn=None)

Reads a CSV file into the database.

Parameters
  • name (str) – Name of the table in which the data is to be inserted.

  • fnames (List[str]) – The list of CSV file names to be read.

  • quotechar (str, optional) – The character used to wrap strings. Default:

  • sep (str, optional) – The separator used for separating fields. Default:,

  • num_lines_read (int, optional) – Number of lines read from each file. Set to 0 to read in the entire file.

  • skip (int, optional) – Number of lines to skip at the beginning of each file (Default: 0).

  • colnames (List[str] or None, optional) – The first line of a CSV file usually contains the column names. When this is not the case, you need to explicitly pass them.

  • conn (Connection, optional) – The database connection to be used. If you don’t explicitly pass a connection, the engine will use the default connection.

Example

Let’s assume you have two CSV files - file1.csv and file2.csv . You can import their data into the database using the following commands:

>>> stmt = data.database.sniff_csv(
...         fnames=["file1.csv", "file2.csv"],
...         name="MY_TABLE",
...         sep=';'
... )
>>>
>>> getml.database.execute(stmt)
>>>
>>> stmt = data.database.read_csv(
...         fnames=["file1.csv", "file2.csv"],
...         name="MY_TABLE",
...         sep=';'
... )