Project Title: Occupancy Classification Data


You are given sensor data from an office, including light, temperature, humidity, and CO2 measurements. Each record also carries an attribute indicating whether the room is occupied. We can therefore analyze the data for patterns and apply a linear regression model to predict whether the room is occupied.
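A linear regression returns a continuous score rather than a 0/1 label, so one common way to use it for an occupied/not-occupied prediction is to threshold the fitted value. The sketch below illustrates that idea with a single feature (light) and a cutoff of 0.5. The function names, the cutoff, and the toy readings are assumptions made for illustration; they are not taken from the data set or from the project specification.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def predict_occupied(x, slope, intercept, threshold=0.5):
    """Turn the continuous regression output into a 0/1 label.

    The 0.5 threshold is an assumed cutoff, not something the project fixes.
    """
    return 1 if slope * x + intercept >= threshold else 0


# Made-up toy readings (light in Lux) for illustration only.
light = [0.0, 10.0, 20.0, 400.0, 450.0, 500.0]
occupied = [0, 0, 0, 1, 1, 1]

slope, intercept = fit_line(light, occupied)
predictions = [predict_occupied(x, slope, intercept) for x in light]
```

With several features the same thresholding step applies to the output of a multi-variable least-squares fit.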


The objectives of this project are:

  1. To pre-process the data so that it is clean and ready for the next stage of analysis.
  2. To perform data transformation to gain insights into the data.
  3. To perform data visualization to discover patterns and trends.
  4. To fit a linear regression model to the data for classification.
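The first two objectives could be approached along the following lines: parse each record, drop rows with missing or malformed values, and derive a simple extra feature (hour of day) from the timestamp. This is only a sketch using the Python standard library. The column names (`date`, `Temperature`, `Humidity`, `Light`, `CO2`, `HumidityRatio`, `Occupancy`), the choice of derived feature, and the sample rows are assumptions for illustration, not the actual file contents.

```python
import csv
import io
from datetime import datetime

# Made-up sample rows for illustration; the second row has a missing value.
RAW = """index,date,Temperature,Humidity,Light,CO2,HumidityRatio,Occupancy
1,2015-01-01 17:51:00,23.0,27.0,420.0,700.0,0.0047,1
2,2015-01-01 17:52:00,,27.0,425.0,705.0,0.0047,1
3,2015-01-01 23:10:00,21.0,25.0,0.0,450.0,0.0039,0
"""

NUMERIC = ["Temperature", "Humidity", "Light", "CO2", "HumidityRatio"]


def clean_rows(text):
    """Parse CSV text and keep only rows whose fields all parse cleanly."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        try:
            row = {name: float(raw[name]) for name in NUMERIC}
            row["date"] = datetime.strptime(raw["date"], "%Y-%m-%d %H:%M:%S")
            row["Occupancy"] = int(raw["Occupancy"])
        except (ValueError, TypeError):
            continue  # skip rows with missing or malformed fields
        if row["Occupancy"] not in (0, 1):
            continue  # the label must be 0 or 1
        row["hour"] = row["date"].hour  # simple derived feature
        rows.append(row)
    return rows


rows = clean_rows(RAW)
```

Dropping incomplete rows is only one possible cleaning policy; imputing missing values is an alternative when too many records would otherwise be lost.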

Data Set Information:

Each record in the data set consists of 8 attributes: 


  Attribute         Description
  Index             Record index
  Date time         Record date and time, year-month-day hour:minute:second
  Temperature       In Celsius
  Humidity          Relative humidity, in %
  Light             In Lux
  CO2               In ppm
  Humidity Ratio    Derived quantity from temperature and relative humidity, in kg water-vapor/kg-air
  Occupancy         0 or 1; 0 for not occupied, 1 for occupied
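Because the humidity ratio is derived from temperature and relative humidity, it can be recomputed as a consistency check. The sketch below uses the standard psychrometric relation W = 0.622 * p_w / (p - p_w), with the saturation vapor pressure approximated by a Magnus-type formula. The data set description does not state which formula or air pressure was used, so the Magnus coefficients and the default of standard atmospheric pressure (101325 Pa) are assumptions, and the function names are hypothetical.

```python
import math


def saturation_vapor_pressure_pa(temp_c):
    """Magnus-type approximation of saturation vapor pressure over water, in Pa.

    The coefficients are one commonly used set; the data set may have used others.
    """
    return 610.94 * math.exp(17.625 * temp_c / (temp_c + 243.04))


def humidity_ratio(temp_c, rel_humidity_pct, pressure_pa=101325.0):
    """Humidity ratio in kg water vapor per kg dry air.

    pressure_pa defaults to standard atmospheric pressure (an assumption).
    """
    p_w = rel_humidity_pct / 100.0 * saturation_vapor_pressure_pa(temp_c)
    return 0.622 * p_w / (pressure_pa - p_w)


# Example call with made-up readings: 23 degrees Celsius, 27 % relative humidity.
w = humidity_ratio(23.0, 27.0)
```

Small differences from the stored values are to be expected if a different pressure or saturation formula was used to build the data set.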



