Seattle Energy: automatising an administrative task

About this Project

In this project, we are working for the city of Seattle and with its open data. The goal is to facilitate the building energy statement generation for the non-residential buildings.

Complex and complete energy statements have been made by the city, but they represent a big effort and our task is to simplify the process using our Data Science skills.

In this project, we have multiple objectives:

  • Predict the energy consumption and green house gas emissions of the buildings,
  • Investigate the EnergyStar score and determinate if it is relevant for the greenhouse gas emissions predictions.

In order to fullfill these objectives:

  • We are going to implement custom Sklearn transformers to create a pipeline for our data.
  • Then, we train and evaluate multiple machine learning models to select the best estimator for our problematic.

Features

Artificial Intelligence

Get some help! Unlock the power of Artificial Intelligence directly from your admin panel, assisting you generating content and translation!

Data Analysis

Keep improving your results! With Data-Analysis we transform the raw data from your app into meaningful insights (and beautiful graphs) which will allow to understand your users and market.

Technologies

Data/Ai

NumPy


NumPy is a library for the Python programming language, adding support for large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

source: wikipedia.org

Pandas


Pandas  is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the  Python  programming language.

source: pandas.pydata.org

Plotly logo

Plotly


Plotly's Python graphing library makes interactive, publication-quality graphs. Examples of how to make line plots, scatter plots, area charts, bar charts, error bars, box plots, histograms, heatmaps, subplots, multiple-axes, polar charts, and bubble charts.

source: plotly.com

Sickit learn logo

Sickit-Learn


scikit-learn (formerly scikits.learn and also known as sklearn) is a free and open-source machine learning library for the Python programming language.

It features various classification, regression and clustering algorithms including support-vector machines, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate with the Python numerical and scientific libraries NumPy and SciPy. Scikit-learn is a NumFOCUS fiscally sponsored project.

source: wikipedia.org

XGBoost logo

XGBoost


XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible and portable. It implements machine learning algorithms under the Gradient Boosting framework. XGBoost provides a parallel tree boosting (also known as GBDT, GBM) that solve many data science problems in a fast and accurate way.

source: xgboost.readthedocs.io

Posts

No posts yet.

Q&A

If you are still wondering...

Hikonic is a web development agency founded by Thibgl, a passionate data scientist and full-stack engineer based in Bordeaux, France. With a commitment to delivering cutting-edge solutions, Hikonic specializes in blending innovative technologies with elegant designs to create powerful digital products. Whether you're a small business owner or a large enterprise, Hikonic tailors its services to meet your unique needs and goals.

Get in Touch !

A question? An inquiry? We are happy to help!

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.