A Map-Based Recommendation System and House Price Prediction Model For Real Estate
A Map-Based Recommendation System and House Price Prediction Model For Real Estate
A Map-Based Recommendation System and House Price Prediction Model For Real Estate
Geo-Information
Article
A Map-Based Recommendation System and House Price
Prediction Model for Real Estate
Maryam Mubarak 1 , Ali Tahir 1, * , Fizza Waqar 2 , Ibraheem Haneef 3 , Gavin McArdle 4 , Michela Bertolotto 4
and Muhammad Tariq Saeed 5
Simple Summary: The accessibility of spatial big data help real estate investors to make better
judgement calls and earn additional profit. Since location is considered necessary for real estate and
consequent decision-making, digital maps have become a prime resource for real estate purchases,
planning and development. Personalisation can support in making judgments by identifying user re-
quirements and inclinations, which a user interacts with digital map, it records all the user’s activities.
A personalised real estate portal can use this information to suggest properties, assist homeowners
and provide valuable real estate analytics. By monitoring user interactions through an online real
estate portal, the framework provided in this article can make personalised recommendations of real
Citation: Mubarak, M.; Tahir, A.; estate based on content, collaboration and location. The effectiveness of the recommendations was
Waqar, F.; Haneef, I.; McArdle, G.;
tested by the user feedback mechanism through a method of mean absolute precision, and the results
Bertolotto, M.; Saeed, M.T. A
show that 79% precise suggestions were generated. Out of 5 recommendations produced, users were
Map-Based Recommendation System
interested in at least 3. A separate house price prediction model was also developed base on neural
and House Price Prediction Model for
networks and classical regression technique. This model implemented to assist users in making an
Real Estate. ISPRS Int. J. Geo-Inf. 2022,
11, 178. https://doi.org/10.3390/
informed decision regarding prospects of real estate purchase.
ijgi11030178
Abstract: In 2015, global real estate was worth $217 trillion, which is approximately 2.7 times the
Academic Editor: Wolfgang Kainz
global GDP; it also accounts for roughly 60% of all conventional global resources, making it one of
Received: 3 January 2022 the key factors behind any country’s economic growth and stability. The accessibility of spatial big
Accepted: 3 March 2022 data will help real estate investors make better judgement calls and earn additional profit. Since
Published: 7 March 2022 location is deemed necessary for real estate and consequent decision-making, digital maps have
Publisher’s Note: MDPI stays neutral
become a prime resource for real estate purchases, planning and development. Personalisation
with regard to jurisdictional claims in can assist in making judgments by identifying user desires and inclinations, which can then be
published maps and institutional affil- recorded or captured as a user performs some interactions with a digital map. A personalised
iations. real estate portal can use this information to suggest properties, assist homeowners and provide
valuable real estate analytics. This article presents a novel framework for recommending real estate
to users. By monitoring user interactions through an online real estate portal, the framework can
make personalised recommendations of real estate based on content, collaboration and location. The
Copyright: © 2022 by the authors.
effectiveness of the recommendations was tested by the user feedback mechanism through a method
Licensee MDPI, Basel, Switzerland.
of mean absolute precision, and the results show that 79% precise suggestions were generated,
This article is an open access article
i.e., out of 5 recommendations produced, users were interested in at least 3. Along with that, a
distributed under the terms and
separate house price prediction model based on neural networks and classical regression techniques
conditions of the Creative Commons
Attribution (CC BY) license (https://
was also implemented to assist users in making an informed decision regarding prospects of real
creativecommons.org/licenses/by/ estate purchase.
4.0/).
Keywords: real estate; map personalisation; map recommendation; house price prediction; estatech
maps; real estate analytics
1. Introduction
Driven by advertising technologies and goals to produce targeted ads, the personalisa-
tion and customisation of websites and services have become the new norm in our society.
The need for personalisation has been driven by the increase in data and information
available. Information overload, which makes it challenging to find relevant information,
has been a phenomenon for the past two decades. For example, a study from 2003 found
that unique information creation was estimated to be between 1 to 2 exabytes. This implied
that each human being must be processing 250 megabytes of information. Almost 20 years
later, this demonstrates the mounting need for efficient and accurate user recommendation
systems to help find pertinent data and information. Personalised content delivery to any
set of users may consist of multiple aspects.
A factor that plays a vital role in most personalised web interfaces is the interactivity
and the “user-friendly” nature of the User Interface (UI). Every web user, be it a novice,
or an expert, wants the interface to provide meaningful content delivered without having
much prior expertise about its functionality. This process involves a lot of work from a web
developer’s perspective but should be invisible and seamless to the end-user. Therefore,
various tools and techniques have been developed to implicitly collect data from users.
Implicit data collection, in simpler terms, is just the collection of a user’s data through
“interface interactions” without the user having to provide the data in a specific manner.
The data is then used to determine interests and make recommendations. At the same
time, another aspect growing in popularity is having the location information of a user to
make recommendations.
Such recommender systems are widely deployed in many consumer domains, such as
online shopping, although our research focuses on real estate recommendations. Real estate
recommendation is often about the location of a property item, so we have incorporated
online map interactions as a tool to understand a user’s interests. This paper presents
four principle recommendation approaches for effectively identifying property items in
our real estate portal. (1) Analysis and implementation of content-based filtering for
suggesting real estate items. (2) Collaborative filtering approach reduces the computational
cost by suggesting similar items to a similar group of users. (3) Location-based approach
for predicting the area of interest to the user based on geographical location and user
preferences. (4) Building a price prediction model to assist users in making an informed
decision. The reason for selecting the first two approaches is based on the fact that the
features of a real estate database closely resemble a movie database. Both content-based
filtering and collaborative filtering have proven to provide precise recommendations to
users [1]. Introducing a location-based approach is essential since property items have an
inherent location aspect.
We have used data from the Estatech map’s portal: https://www.the-estatech.com
(accessed on 15 September 2021) for the recommendation part of the study. We also obtained
data in explicit and implicit formats. In addition, historical data of properties and price
listings were obtained from Zameen.com (accessed on 15 September 2021), a real estate
portal for online property listings. The techniques and methods used for recommendation
algorithms were the score tree processes, TF-IDF and K-nearest neighbours. For house
prediction, we cross-compared two techniques, namely multiple linear regression and
Keras regression based on neural networks.
The remainder of the article is organised as follows: Section 2 presents the related
literature review. The methodological approach is given in Section 3. Section 4 presents
a discussion and the results, while Section 5 concludes the study and provides future
recommendations.
ISPRS Int. J. Geo-Inf. 2022, 11, 178 3 of 19
2. Related Work
Today’s modern recommendation engines have emerged from the domain of infor-
mation filtering, a term created by [2] outlines one solution for the issue of retrieving the
correct information against a pool of massive online data, called content filters. To ascertain
a user’s choice correctly, multiple visualisation tools have also been developed to accurately
distinguish a user’s interests and inclinations. These tools can also be considered as a form
of content filter. This domain has been progressing ever since. [3] demonstrate various
options for integrating a recommendation engine into a real estate portal’s user journey.
Furthermore, in the same manner, the work validated how additional real estate details can
provide more accurate recommendation results when integrated into the proposed model
of deep learning and factorisation machines.
Another study by [4] aims to determine if consumer loyalty will help a recommender
system be more accurate. Other techniques implemented by [5] such as using intelligent
data analysis methods to create a recommender framework to solve the problem of recom-
mending the most appropriate components for each user at any given time. They have fur-
ther addressed the problem of converting an original dataset from a real component-based
application to an optimised dataset. After gathering the interaction data and developing a
dataset to produce optimised recommendation results, machine learning algorithms using
feature engineering techniques and feature selection methods were also applied. Users
and developers alike want information processing and its display to be swift. The system
developed by [6] is based on an implicit profiling system for tracking the user’s interests
through mouse movements.
A gap analysis approach by [7] identifies the differences between theory and reality in
presenting information on location choice by developing a seven-factor classification tool
for evaluating property websites. To capture the relations between the latent feature vectors
of real estate items, Ref. [8] utilised the average-based and individual-based geographical
regularisation terms. Both terms are integrated with the weighted regularised matrix
factorisation framework to model users’ implicit feedback behaviours to provide them with
personalised property recommendations.
A probabilistic model for collaborative filtering by [9] calculates the predicted values
for items against active users, given that there is information already available about those
active users. The same research divides collaborative filtering methods into two primary
modules, memory-based collaborative filtering and model-based collaborative filtering.
Additional probabilistic approaches have been presented, some more sophisticated than
others, including the work of [10]. The recommended procedure is taken as a sequential
decision-making process, and the use of Markov decision chains have been suggested to
create a model. However, they do not state any improved accuracy over Breese’s projected
models. Another recommendation system by [11] applies content-based filtering, a fuzzy
technique for identifying similar and different content and a prediction algorithm for
identifying the right set of movie content for the user. At the same time, Ref. [12] developed
item to item centred algorithms. It has been done to provide improved outcomes than
user-based algorithms by comparing the approach with K-nearest neighbour.
In the domain of GIS, a complete map personalisation system is developed by [13]
in which the users’ interests are implicitly recorded and given specific rankings based on
certain criteria fulfilment upon user’s mouse clicks or movements. As already mentioned,
map personalisation has become an area of interest since data overload has become a
common scenario in spatial information systems. In the model developed by [14], the
entire focus is to understand map usage patterns of the end-users. The goal is again
focused on developing personalised maps for users on a web interface. Working on
similar lines, RecoMap [13], is a web-based platform through which each user receives
customised spatial recommendations based on their likings. The results are presented in a
map interface highlighting the user’s personalised spatial recommendations. The adaptive
map also shows the user’s preferences and the context in which they are used. A different
approach by [15] is to build a recommendation system and map interface, represented in a
ISPRS Int. J. Geo-Inf. 2022, 11, 178 4 of 19
personalised format for the user to acquire quick results. Further inferences are made by
studying the user’s behaviour for system improvement.
Another recommender system designed by [16] is for real estate users who do not
have a user profile for any real estate portal. The session-based interaction of the user is
made more effective by utilising a user’s search context and ranking criteria for any suitable
property item. A portal developed by [8] specifically designed for real estate uses two basic
approaches for user profiling, an ontological structure and case-based reasoning. The pur-
pose is to save the end-user from the stress of massive online searching and deliver results
where the user gets quick recommendations based on their interests. A recommendation
system that is being used by the US-based real estate website “Trulia” utilises a “square
counting method” [17] The method works well with large scale datasets and delivers swift
results per the user’s preferences based on love and hate edge configurations.
Things have changed significantly in the real estate industry during the COVID-19
era. In some regions, house prices have shown signs of stagnancy and even, in some cases,
decreasing trends as people lost their livelihoods. These conditions have urged people
to tread more carefully while making investments in this sector. In such a scenario, a
price prediction model can help users make an informed decision. A method by [18] for
predicting house prices utilises a Mallows model averaging estimator, which is vigorous
in terms of spatial dependence. Another study on ML models for house price prediction
by concludes that the random forest regressor model provides the best results amongst
all other compared models like linear regression, decision tree, k-means regression [19].
Another similar study carried out by [20] applies regression as a predictive model. They
use MSE, MAE and RMSE as their evaluation metrics for their model’s accuracy. Another
interesting study by [21] used Multiple Regression Analysis (MRA) to estimate property
prices for mass evaluation. The structural qualities and the property’s location were
viewed as two primary micro factors of house pricing. MRA was utilised to determine the
structural characteristics and locational attributes that statistically influence house price
using a sample of 106 house sale transactions from 2011 to 2015. An alternative approach
by [22] focuses on traditional solutions based on widely known methods and procedures
and faith in the infallibility and objectivity of a human analysing the real estate market.
Since modern technologies are also boldly entering the arena. Hence, the study’s key focus
is that organisations should stop viewing automated solutions (such as AVM, CAMA, and
AAVM) as functioning in opposition to traditional approaches and instead embrace them
as supplemental tools.
Our previous work in map personalisation discusses the initial concept of personal-
isation using real estate analytics [23]. It also evaluates background research relating to
the building blocks that lead to a recommendation engine for real-time analytics. Exten-
sive research in this field has revealed gaps between real-estate analytics and map-based
personalisation, recommendation and prediction; thus, we have tried to bridge this gap in
our research and initial development work. We also found motivation for our study and
consequent development since map-based personalised real estate portals do not widely
exist in the online real estate market. Having to sift through a plethora of online data is
no longer suitable for most users, and personalisation has become a key concept in every
aspect of data search. In our scenario, real estate test users have been interacting with a real
estate portal, “Estatech Maps”, to search and post property items. Our recommendation
system is based on three techniques. This includes content, collaboration and location-
based filtering. The interaction of users is captured via the map-based interface of the
real estate application, Estatech Maps, and stored in a database. Based on this data and
analysis, a user gets recommendations as per their area of interest. Along with that, we
have incorporated a module based on traditional regression techniques and Keras API for
predicting the future price trends of property items.
The subsequent section discusses the detailed insight of the research process regarding
data collection, its pre-processing, run time environment creation, and model conception.
Finally, the section will discuss the following crucial areas of the research process in
database. Based on this data and analysis, a user gets recommendations as per their area
of interest. Along with that, we have incorporated a module based on traditional
regression techniques and Keras API for predicting the future price trends of property
items.
ISPRS Int. J. Geo-Inf. 2022, 11, 178 The subsequent section discusses the detailed insight of the research process 5 of 19
regarding data collection, its pre-processing, run time environment creation, and model
conception. Finally, the section will discuss the following crucial areas of the research
process(1)
detail. in detail. (1) Data collection
Data collection and Technology.
and Technology. (2) Property
(2) Property Recommendation.
Recommendation. (3) Price(3)
Price prediction
prediction model.model.
3.3.Methodology
Methodology
“Estatech
“EstatechMaps”
Maps”main mainfocus
focusisisto
toprovide
providepersonalised
personalisedreal
realestate
estatelistings
listingsto toits
itsusers
users
on
on a map-based interface by making accurate recommendations and providinginsight
a map-based interface by making accurate recommendations and providing insight
about
aboutprice
pricetrends
trendsof ofaauser’s
user’sarea
areaofofinterest.
interest. Recommendation
Recommendationand andprice
priceprediction
predictionwere were
the
thekey
key focus
focus areas
areas toto deliver
deliver map-based personalisation to
map-based personalisation to the
the users.
users.InInthe
thefirst
firststage,
stage,a
adetailed
detailedstudy
studyon on the
the mathematical
mathematical interpretation
interpretation ofof recommendation
recommendation algorithms
algorithms was was
carried
carried out. The second stage focused on the algorithm’s designs, and in the
out. The second stage focused on the algorithm’s designs, and in the third
third stage,
stage,
development
developmentbased basedon those algorithms
on those was carried
algorithms was out, and the
carried out,models
and were
the implemented.
models were
The validation and testing of these models were carried out in
implemented. The validation and testing of these models were carried out the final stage of the
in research.
the final
The sequence of the study is illustrated in Figure 1.
stage of the research. The sequence of the study is illustrated in Figure 1.
Figure1.1.Methodology
Figure MethodologySequence.
Sequence.
Regardingprice
Regarding priceprediction,
prediction, after
after researching
researching various
various prediction
prediction techniques,
techniques, two mod-two
models
els were were selected.
selected. One isOne is based
based on a on a classical
classical regression
regression technique,
technique, and and the other
the other relies
relies on
on neural
neural networks.
networks.
3.1.
3.1.Data
DataCollection
Collectionand andTechnology
Technology
User interaction data
User interaction data waswasextracted from
extracted the portal
from over over
the portal a yeara (May
year 2020–March
(May 2020–March2021).
Data
2021). Data were extracted in JSON format from a MongoDB database, which to
were extracted in JSON format from a MongoDB database, which was converted a
was
CSV format. It consisted of 1600 recorded user interactions with the portal.
converted to a CSV format. It consisted of 1600 recorded user interactions with the portal. The data for
house price
The data forprediction
house price was acquired was
prediction fromacquired
a Pakistani based
from real estate
a Pakistani portal
based realZameen.com
estate portal
(accessed
Zameen.com on 15forSeptember
two years2021) for two
between years between
2019–2020 2019–2020
for Islamabad City.for Islamabad City.
Both
Boththe
the datasets
datasets from
from Estatech maps and
Estatech maps and Zameen.com
Zameen.comwere (accessed on 15into
converted September
test and
2021) were converted into test and training datasets. Zameen.com (accessed
training datasets. Zameen.com data, used for a house price prediction model, was on 15 Septem-
further
ber 2021) data,
converted into used for a house
a validation priceThe
dataset. prediction model, was
data consisted further converted
of multiple files: Userinto a
login
validation dataset. The data consisted of multiple files: User login information
information (User demographics), Interaction Data (Most viewed properties list) and Item (User
demographics),
Data (Properties). Interaction Data (Most viewed properties list) and Item Data (Properties).
TuriCreate was used to build the recommendation engine for content-based and
TuriCreate was used to build the recommendation engine for content-based and
collaborative filtering, whereas a K-means clustering technique was employed for the
collaborative filtering, whereas a K-means clustering technique was employed for the
location-based recommendation. TuriCreate is an open-source toolkit for building Core ML
location-based recommendation. TuriCreate is an open-source toolkit for building Core
models for tasks like image recognition, object detection, style transfers, and recommenda-
tion generation, among others.
Tensor Flow and Keras API were used as baseline technologies to build the house price
prediction model and a proper validation for model loss and model accuracy, which was
done through evaluation techniques of MSE, MAE and RMSE. TensorFlow is a machine
learning software library that is free and open-source. It can be used for various activities,
but it focuses on deep neural network training and inference. The Google Brain team created
TensorFlow for internal Google use. In 2015, it was published under the Apache License 2.0.
ISPRS Int. J. Geo-Inf. 2022, 11, 178 6 of 19
The reason for using TensorFlow is that it is an open-source artificial intelligence library
that builds models using data flow graphs. It enables programmers to create large-scale
neural networks with multiple layers. Keras is a deep learning API written in Python that
runs on top of the TensorFlow machine learning system. It was built with the objective of
allowing fast experimentation.
The cosine value or similarity in Equation (1) can range between −1 and 1. Based on
this value, the articles are organised in descending order, and the top recommendations are
made to the user.
The approach for content-based filtering is further explained in Figure 2, which shows
how a tree-based criterion for item selection works. The concept is based on how much
interactivity a user has with a specific item or category. Interest ratios are calculated
between corresponding categories based on “incrementing the value of frequency”. For
example, buyers’ interactions with rent or purchase categories define the interest ratio
between the two categories. The flow of the function which performs frequency calculation
is elaborated in Figure 3, which details another content-based filtering process, namely
TF-IDF. For example, suppose a user searches for “the rise of analytics” on Google. In that
case, it is inevitable that the word “the” will occur more frequently than “analytics”, but
the relative importance of analytics is higher than the search query point of view. In such
cases, TF-IDF weighting negates the effect of high-frequency words in determining the
significance of an item (document).
Figure2.2.Score
Figure ScoreTree
Treeprocess
processfor
forProperty
PropertySelection.
Selection.
Figure 2. Score Tree process for Property Selection.
PDu,i is the prediction term for user ‘u’ against an item “i”.
Σv (iv,i ∗ su,v )
PDu,i = (5)
Σv su,v
PDu,i is the prediction term for user ‘u’ against an item “i”, iv,i is the interaction by
the user say “v”. with an item “i”, su,v is the likeness among the two users, i.e., user “u”.
and user “v”.
As per Table 1, the interactions between users and properties is recorded, and sugges-
tions to a new user “u1” are generated. At the same time, the symbol “x” represents any
interaction between a user and a property item. It is evident that there is more similarity
between user 1 and user 2 than user 3. Based on this, user 1 and user 2 will be grouped
together for future recommendations. Algorithm 1 depicts a generalised algorithm that
has been designed for grouping user 1 and user 2 together so that the same properties get
recommended to them.
P1 P2 P3 P4
U1 x x - -
U2 x x x -
U3 - - - x
In Tables 4 and 5, it is observed that the scores obtained against each user are not easy
to interpret. It is not clear against which property ID the user is getting the suggested items
of interest. To make our results clearer, we have utilised the Turicreate library. This made the
results obtained easier to understand. Table 4 represents content-based recommendation
model results. The model was assessed for five users of the portal, and recommendations
were generated for them.
In Table 4 the set of 5 users are recommended the same 5 property items due to the
popularity of those items as being the most interacted with.
Table 5 shows the properties recommended to users based on grouping with other
users having similar interests. Property IDs having a higher score are ranked higher. Each
user is recommended a different set of properties, which clearly shows that personalisation
exists for each user.
ISPRS Int. J. Geo-Inf. 2022, 11, 178 11 of 19
Table 5. Cont.
ISPRS Int. J. Geo-Inf. 2022, 11, x FOR PEER REVIEW 12 of 18
User ID Property ID SCORE Rank
3 381 0.536 4
3 392 0.522 5
3 392 0.522 5
4 055 1.113 1
4 055 1.113 1
44 248
248 1.039 1.039 2 2
44 121
121 0.930 0.930 3 3
44 342
342 0.904 0.904 4 4
44 151
151 0.899 0.899 5 5
55 175
175 1.033 1.033 1 1
5 287 0.943 2
5 287 0.943 2
5 067 0.837 3
5 067 0.837 3
5 099 0.834 4
099 0.834 4
55 057 0.786 5
5 057 0.786 5
4.2. Location-Based Recommendation Model Building through K-Means Clustering
4.2. Location-Based
K-means clusteringRecommendation
ascertainsModel the “k”Building
number through K-Meanswithin
of centroids Clustering
a dataset. After
that, K-means
it assigns clustering
every dataascertains
point withthe the“k”closest cluster.
number These datawithin
of centroids pointsaeventually end
dataset. After
that,
up it assigns
being in theevery
clusterdata
withpoint
the with
nearestthemean.
closestIncluster. These data
our approach, thepoints
purposeeventually end
of applying
up being in
K-Means the cluster
clustering is towith
groupthe similar
nearest users
mean.based
In ouron approach, the purpose
their respective of applying
locations. As the
K-Means
users clusteringthe
get clustered, is to
topgroup
most similar
searched users based onitem
or interacted theiramong
respective
that locations.
user groupAs the
starts
users
to get get clustered, the
recommended to top
eachmost
user.searched
Figure 4or interacted
shows item among
auto-generated that user
locations forgroup
users starts
from
to get recommended
different to each user.
places in Islamabad, alongFigure
with4 the
shows auto-generated
corresponding locations
cluster IDs. Asforone
users from
hovers
different
above anyplaces in Islamabad,
cluster, it shows along with the
the most corresponding
searched item in cluster IDs. Aswhich
that cluster, one hovers
gets
above any cluster,
recommended it shows
to users of the
thatmost searched
cluster. item in that
For example, cluster,
in one which
of the gets recommended
clusters, the count of
to users ofproperty
searched that cluster.
ID 689For isexample, in oneSince
the highest. of theitclusters,
is equalthe to count of searched
the highest countproperty
for that
ID 689 is the highest. Since it is equal to the highest count for that cluster,
cluster, all users falling within that cluster will get property ID 689 as the recommended all users falling
within that
property forcluster
view. will get property ID 689 as the recommended property for view.
MAP@k (Mean Average Precision at k) is an evaluation metric that considers the order
of the recommended items as well. In our case, we have recommended 5 items to the set
of 5 users, so in our case k = 5. We set an experimental environment for our group of five
portal users; they were provided with a list of recommended items in the order generated
by our recommender engine. The users interacted with certain items and provided verbal
feedback on whether the generated recommendations were of interest. For example, the
statistical accuracy of the recommendation engine is as follows for a given User 1.
• [1, 1, 1, 0, 0] where 1 stands for a correct recommendation such that the user interacted
with it and 0 stands for a recommendation with which the user did not interact.
• [1/1, 2/2, 3/3, 2/3, 1/3] is the precision at k.
• (1/5) [1/1 + 2/2 + 3/3 + 2/3 + 1/3] = 0.7999 is the average precision at k.
The precision is higher for the first three items which were interacted with, but for the
last two items with which the user did not interact, the precision falls. Therefore, for user
1, the average precision is almost 80%. Whereas for all sets of users, this will be the mean
average precision and can be calculated by taking the average precisions’ mean.
Figure6.6.Top
Figure TopRecommended Properties
Recommended Based
Properties on on
Based Location Recommendation
Location in different
Recommendation localities
in different of
localities
of Islamabad
Islamabad City.
City.
Figure 6. Top Recommended Properties Based on Location Recommendation in different localities
of Islamabad City.
ISPRS Int.
ISPRS Int. J.
J. Geo-Inf.
Geo-Inf. 2022, 11, x178
2022, 11, FOR PEER REVIEW 15 of
15 of 18
19
Figure
Figure 7.
7. Actual
Actual vs
vs.Predicted
PredictedPrice
PriceVisualisation
Visualisationof
ofMLR.
MLR.
Figures 88 and
Figures and 99 represent
represent different
different ways
ways prices
prices have
have been
been changing
changing in in the
the city
city of
of
Islamabad. In
Islamabad. In Figure
Figure 8,
8, mean
mean sector
sector (neighbourhood
(neighbourhood area) area) prices
prices are
are highlighted.
highlighted. ThisThis
visualisation provides
visualisation providesaaclear
clearoverview
overviewofofwhat
whatareas could
areas show
could drastic
show price
drastic changes
price and
changes
what areas will remain stagnant. This data has been analysed for the past 2 years,
and what areas will remain stagnant. This data has been analysed for the past 2 years, and and the
predictions show how prices will change or remain the same in the coming years.
the predictions show how prices will change or remain the same in the coming years. The The blue
area area
blue in theinfigure depicts
the figure howhow
depicts prices have
prices increased
have increasedsignificantly in the
significantly coming
in the coming years in
years
those
in neighbourhoods.
those neighbourhoods. In contrast, redred
In contrast, areas indicate
areas stagnancy
indicate in prices,
stagnancy providing
in prices, a user
providing a
with a clearer picture to assist in decision making.
user with a clearer picture to assist in decision making.
700000
600000
500000
400000
300000
200000
100000
0
Price
1 2 3 4 5 6 7 8 9 10
Price
1 2 3 4 5 6 7 8 9 10
Figure 9.
Figure 9. A
A visualisation of sector
visualisation of sector wise
wise Predicted
Predicted Price
Price Visualisation
Visualisation in
in Islamabad
Islamabad City.
City.
5. Conclusion
Conclusions
Three different
differentrecommendation
recommendation algorithms for for
algorithms the real
the estate portalportal
real estate “Estatech Maps”
“Estatech
were
Maps” developed along with
were developed alongtwowithdifferent models
two different for house
models priceprice
for house prediction. First,
prediction. we
First,
set
we our
set goals to analyse
our goals and implement
to analyse and implementcontent-based filtering
content-based for suggesting
filtering real estate
for suggesting real
items.
estate The collaborative
items. filtering approach
The collaborative filteringwasapproach
used for reducing
was used the computational
for reducing cost the
by suggesting similar items to a similar group of users. Then, we applied the location-based
approach for predicting the areas of interest to the user based on the user’s geographical
location. All this was achieved with a minimum precision of 79%. Prediction models were
created, and results were visualised by price increase, decrease or stagnancy in multiple
sectors of Islamabad city to better assist people planning future land asset purchases. Our
model was able to precisely predict the changes in house prices trends with a minimum
accuracy of 80%, which was through our neural network-based prediction model. This
work can be effectively utilised in any real-estate sale and purchase domain and will
improve the overall user experience of real estate portals. This proves the viability of
our map-based system in providing data and recommendations to users based on the
popularity of an item, user similarity and geographical location.
While nowadays recommendation and predictive analysis are becoming a common
trend in even the smallest of businesses, in Pakistan, the real estate industry is lacking
when it comes to implementing these techniques not only in terms of a map-based interface
but also in terms of presenting these items to the user in an effective way. Therefore, our
approach for displaying an item of interest to the user on a map-based interface would be
one of the pioneers in real estate portals in Pakistan.
We have used sequential NN models for our recommendation and prediction in this
research. One area of improvement and basis for future work could be exploring and
implementing these as parallel models to improve response time and efficiency. Another
approach could be combining multiple techniques to create a hybrid model. The same
approach was used in the study where the Cobb-Douglas and linear regression models
were combined to form a mathematical model [24]. GIS was an additional tool to organise
the regional data of the area under study. In turn, this can cover a broader spectrum of
users’ behaviours and avoid high computational costs at the server end.
Author Contributions: Conceptualization, Maryam Mubarak, Ali Tahir and Fizza Waqar; method-
ology, Maryam Mubarak and Ali Tahir; software, Maryam Mubarak, Ali Tahir and Fizza Waqar;
validation, Ali Tahir, Ibraheem Haneef, Gavin McArdle and Michela Bertolotto; formal analysis,
Maryam Mubarak, Ali Tahir and Muhammad Tariq Saeed; investigation, Gavin McArdle, Maryam
ISPRS Int. J. Geo-Inf. 2022, 11, 178 18 of 19
Mubarak, Ibraheem Haneef, Ali Tahir and Muhammad Tariq Saeed; resources, Ali Tahir and Muham-
mad Tariq Saeed; data curation, Maryam Mubarak, Ali Tahir and Fizza Waqar; writing—original
draft preparation, Maryam Mubarak, Ali Tahir and Fizza Waqar; writing—review and editing, Gavin
McArdle, Michela Bertolotto and Muhammad Tariq Saeed; visualization, Maryam Mubarak, Fizza
Waqar and Ali Tahir; supervision, Ali Tahir and Muhammad Tariq Saeed; project administration, Ali
Tahir, Ibraheem Haneef and Muhammad Tariq Saeed; funding acquisition, Ali Tahir and Ibraheem
Haneef. All authors have read and agreed to the published version of the manuscript.
Funding: This research received the funding from Higher Education Commission (HEC), Pakistan,
under grant no. TDF03-249.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Acknowledgments: This research was supported by the Higher Education Commission (HEC),
Pakistan, under grant no. TDF03-249. The authors gratefully acknowledge their support.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Geetha, G.; Safa, M.; Fancy, C.; Saranya, D. A hybrid approach using collaborative filtering and content based filtering for
recommender system. J. Phys. Conf. Ser. 2018, 1000, 012101. [CrossRef]
2. Peter, D. Electronic junk. Commun. ACM 1982, 25, 163.
3. Knoll, J.; Groß, R.; Schwanke, A.; Rinn, B.; Schreyer, M. Applying Recommender Approaches to the Real Estate E-Commerce
Market. In International Conference on Innovations for Community Services; Springer: Cham, Switzerland, 2018; pp. 111–126.
4. Bai, Y.; Jia, S.; Wang, S.; Tan, B. Customer Loyalty Improves the Effectiveness of Recommender Systems Based on Complex
Network. Information 2020, 11, 171. [CrossRef]
5. Fernández-García, A.J.; Iribarne, L.; Corral, A.; Criado, J.; Wang, J.Z. A recommender system for component-based applications
using machine learning techniques. Knowl. Based Syst. 2019, 164, 68–84. [CrossRef]
6. Rabiei-Dastjerdi, H.; McArdle, G.; Matthews, S.A.; Keenan, P. Gap analysis in decision support systems for real-estate in the era
of the digital earth. Int. J. Digit. Earth 2021, 14, 121–138. [CrossRef]
7. Mac Aoidh, E.; Bertolotto, M.; Wilson, D.C. Understanding geospatial interests by visualizing map interaction behavior. Inf.
Vis. 2008. [CrossRef]
8. Yu, Y.; Wang, C.; Zhang, L.; Gao, R.; Wang, H. Geographical Proximity Boosted Recommendation Algorithms for Real Estate. In
International Conference on Web Information Systems Engineering; Springer: Cham, Switzerland, 2018; pp. 61–66.
9. Breese, J.S.; Heckerman, D.; Kadie, C. Empirical Analysis of Predictive Algorithms for Collaborative Filtering. In Proceedings of
the Fourteenth Conference on Uncertainty in Artificial Intelligence, Madison, WI, USA, 24–26 July 1998; pp. 43–52.
10. Shani, G.; Heckerman, D.; Brafman, R.I. An MDP-based recommender system. J. Mach. Learn. Res. 2005, 6, 1265–1295.
11. Ayyaz, S.; Qamar, U.; Nawaz, R. HCF-CRS: A Hybrid Content based Fuzzy Conformal Recommender System for providing
recommendations with confidence. PLoS ONE 2018, 13, e0204849. [CrossRef] [PubMed]
12. Sarwar, B.; Karypis, G.; Konstan, J.; Riedl, J. Item-based Collaborative Filtering Recommendation Algorithms. In Proceedings of
the 10th International Conference on World Wide Web, Hong Kong, China, 1–5 May 2001; pp. 285–295.
13. Ballatore, A.; McArdle, G.; Kelly, C.; Bertolotto, M. Recomap: An interactive and adaptive map-based recommender. In
Proceedings of the 2010 ACM Symposium on Applied Computing, Sierre, Switzerland, 22–26 March 2010; pp. 887–891.
14. Wilson, D.C.; Lipford, H.R.; Carroll, E.; Karr, P.; Najjar, N. Charting New Ground: Modeling User Behavior in Interactive
GeoVisualization. In Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information
Systems, Irvine, CA, USA, 5–7 November 2008; pp. 1–4.
15. Tezuka, T.; Tanaka, K. Presentation of Dynamic Maps by Estimating User Intentions from Operation History. In International
Conference on Multimedia Modeling; Springer: Berlin/Heidelberg, Germany, 2007; pp. 156–165.
16. Rehman, F.; Masood, H.; Ul-Hasan, A.; Nawaz, R.; Shafait, F. An Intelligent Context Aware Recommender System for Real Estate.
In Mediterranean Conference on Pattern Recognition and Artificial Intelligence; Springer: Cham, Switzerland, 2019; pp. 177–191.
17. Kong, J.S.; Teague, K.; Kessler, J. The Love-Hate Square Counting Method for Recommender Systems. Proc. KDD Cup 2011, 18,
249–261.
18. Greenaway-McGrevy, R.; Sorensen, K. A spatial model averaging approach to measuring house prices. J. Spat. Econom. 2021, 2,
1–32. [CrossRef]
19. Rawool, A.G.; Rogye, D.V.; Rane, S.G. House Price Prediction Using Machine Learning. Int. J. Res. Appl. Sci. Eng. Technol. 2021, 9,
686–692. [CrossRef]
20. Chaturvedi, S.; Ahlawat, L.; Patel, T.; Talha, M. Real Estate Price Prediction. EasyChair 2021, 4926. Available online: https:
//easychair.org/publications/preprint/HbD8 (accessed on 2 January 2022).
ISPRS Int. J. Geo-Inf. 2022, 11, 178 19 of 19
21. Abdullahi, A.; Usman, H.; Ibrahim, I. Determining house price for mass appraisal using multiple regression analysis modeling in
Kaduna North, Nigeria. ATBU J. Environ. Technol. 2018, 11, 26–40.
22. Renigier-Biłozor, M.; Źróbek, S.; Walacik, M.; Borst, R.; Grover, R.; d’Amato, M. International acceptance of automated modern
tools use must-have for sustainable real estate market development. Land Use Policy 2022, 113, 105876. [CrossRef]
23. Mubarak, M.; Khalid, K.; Waqar, F.; Tahir, A.; Haneef, I.; McArdle, G.; Bertolotto, M. Towards Real Estate Analytics using Map
Personalisation. In Proceedings of the 6th International Conference on Geographical Information Systems Theory, Applications
and Management, GISTAM 2020, Prague, Czech Republic, 7–9 May 2020; pp. 184–190.
24. Sisman, S.; Akar, A.U.; Yalpir, S. The novelty hybrid model development proposal for mass appraisal of real estates in sustainable
land management. Surv. Rev. 2021, 1–20. [CrossRef]