HomeHelp

Hotel clustering

Introducción

The Clustering application is a tool that groups establishments in the Basque Country using daily prices. These prices were collected via web-scraping methods on platforms offering room reservations at hotel establishments

To obtain the daily prices, 120 consultations per establishment were conducted every day, that is, the prices for the following 120 days were requested for every day and hotel/guesthouse. After obtaining the prices, the average of all of these 120 possible daily prices was selected for analysis.

Clusterings Graph

Selection of cluster and filters

The 4 selectors on the top of the application are mainly divided into 2 parts:

Seleccionar las categorias

  1. Cluster: via this selector you can choose to show one cluster in particular or all at once. Selecting the option All will show all of the groups in the first graph; the other graphs will show the absolute data of the establishments disaggregated by category and stratum. Also, selecting one cluster in particular will depict that cluster and the percentage of hotels corresponding to that group, again disaggregated by category and stratum.
  2. Category, Historical Territory and Stratum: these 3 selectors are simply filters which can be applied to the establishments.

Type of Clustering

The graph offers the option to select the type of clustering conducted for this application:

Botones de Tipo de Clustering

  1. Absolute Prices: the establishments have been grouped using the original prices. The objective is to group the hotels and guesthouses by different price ranges and different trends.
  2. Normalised Prices: for this clustering the hotel prices have been normalised from 0-100 in order to capture different seasonal price trends. On the one hand are those with constant or nearly constant prices, assigned a value of 100, and on the other are those who change their prices depending on the time of year or public holidays.
  3. Volatility: in this case the daily change in price has been studied. This was done using the close-to-close estimator, based on the manual from the TTR library on R.

In the first 2 cases, the clustering of the seasonal price series was conducted using the Euclidean distance, after trying different distances in libraries such as TSdist, TSclust y dtw

Distribution of Hotels by Category

The bar graph appearing with this heading disaggregates the hotels by category. Depending on the option selected in the Cluster selector, its meaning will change slightly.

  1. All is selected in the Cluster selector, the graph will show all of the establishments that were found via web-scraping, disaggregated by category. That is, the absolute data are portrayed.

Distribution of hotels in absolute terms

  1. On the other hand, if a specific cluster is selected in the Cluster selector, the diagram shows the percentage data. What appears in this case is the percentage of hotels and guesthouses belonging to the selected cluster.

Distribution of hotels in percentage terms

Distribution of Hotels by Stratum

The map and bar graph appearing with this heading disaggregate the hotels into 11 different strata. Depending on the option selected in the Cluster selector, its meaning will change slightly.

  1. If the option All is selected in the Cluster selector, the graph and map will show all of the establishments that were found via web-scraping, disaggregated by geographical stratum. That is, the absolute data are portrayed.

Distribution of hotels in absolute terms

  1. On the other hand, if a specific cluster is selected in the Cluster selector, the diagram and map show the percentage data. What appears in this case is the percentage of hotels and guesthouses belonging to the selected cluster.

Distribution of hotels in percentage terms

Technique

This tool uses the libraries Leaflet (to create the map) and Plotly (for the seasonal series and bar graphs). The use of JavaScript enables it to be interactive, and you must therefore have the JavaScript option active in your browser.

Your feedback.  Help us to make our web better

How would you rate the information on the site?
Very useful
Useful
Barely useful
Not useful at all
Would you like to make a suggestion?
Yes, I would
Send