Binary classification dataset on occupancy detection
The occupancy detection dataset is a very simple multivariate time series from the UCI Machine Learning Repository. It poses a binary classification problem: the task is to predict room occupancy from Temperature, Humidity, Light and CO2.
The original publication is: Candanedo, L. M., & Feldheim, V. (2016). Accurate occupancy detection of an office room from light, temperature, humidity and CO2 measurements using statistical learning models. Energy and Buildings, 112, 28-39.
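To make the shape of the task concrete, here is a minimal sketch in plain Python. The readings are synthetic values made up for illustration, not rows from the real dataset. The target column name `Occupancy`, the helper `predict_occupied`, and the light threshold of 200 are assumptions chosen for this example, not part of the getML API or the original publication.

```python
# Synthetic, made-up sensor readings for illustration only;
# these are NOT rows from the real occupancy dataset.
rows = [
    {"Temperature": 23.2, "Humidity": 27.3, "Light": 426.0, "CO2": 721.0, "Occupancy": 1},
    {"Temperature": 23.1, "Humidity": 27.2, "Light": 419.0, "CO2": 714.0, "Occupancy": 1},
    {"Temperature": 21.0, "Humidity": 25.0, "Light": 0.0, "CO2": 440.0, "Occupancy": 0},
    {"Temperature": 20.9, "Humidity": 24.9, "Light": 12.0, "CO2": 435.0, "Occupancy": 0},
]

FEATURES = ["Temperature", "Humidity", "Light", "CO2"]  # predictors named in the description
TARGET = "Occupancy"  # assumed name of the binary target column


def predict_occupied(row, light_threshold=200.0):
    """Hypothetical baseline: predict 1 (occupied) when Light exceeds the threshold."""
    return 1 if row["Light"] > light_threshold else 0


def accuracy(rows, predict):
    """Fraction of rows where the prediction matches the binary target."""
    correct = sum(predict(r) == r[TARGET] for r in rows)
    return correct / len(rows)


predictions = [predict_occupied(r) for r in rows]
print(predictions)
print(accuracy(rows, predict_occupied))
```

A single-feature threshold like this is only a baseline for framing the problem; a real model would use all four measurements and be evaluated on held-out data.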
Return data as pandas.DataFrames
Return data with roles set
>>> df_getml = getml.datasets.load_occupancy()
>>> type(df_getml["train"])
... getml.data.data_frame.DataFrame
For a full analysis of the occupancy dataset, including all necessary preprocessing steps, please refer to getml-examples.