Skip to content

Time series forecasting for energy consumption, ASHRAE dataset on Kaggle. Data wrangling and FE (Pandas), regression (RF, LightGBM, DNN).

Notifications You must be signed in to change notification settings

mz256/ashrae_energy_forecast

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

61 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ASHRAE Great Energy Predictor III

Collection of files for Kaggle competition by ASHRAE. Check my Kaggle profile for all kernels. The objective was stated in the competition info as:

In this competition, you’ll develop accurate models of metered building energy usage in the following areas: chilled water, electric, hot water, and steam meters. The data comes from over 1,000 buildings over a three-year timeframe. With better estimates of these energy-saving investments, large scale investors and financial institutions will be more inclined to invest in this area to enable progress in building efficiencies.

Directory structure

  • notebooks contains some of the kernels used on Kaggle.
  • script_version contains a version of the pipeline in python scripts, to be run either locally or in the cloud.

Please refer to the README in each folder for an in-depth description.

Summary of results

  • LightGBM models:

    1. Ensemble of k LightGBM models trained on different CV folds of the (preprocessed) training set. This was the highest scoring solution with a private score of 1.292.

    2. Ensemble of k LightGBM models per meter type, for a total of 4k models.

  • Neural network models:

    1. DNN with entity embedding for categorical features. These are then concatenated with continuous predictors and fed to the first dense layer. We trained this learner making exclusive use of fastai, from the data loading API all the way to training and testing.

Releases

No releases published

Packages

No packages published