Brussels, the Best Neighborhoods for Expats
By Elizabeth Shen
Welcome on board! We will take you on a nice tour of the Brussels region and its neighborhoods, with delicious Belgian chocolates and waffles nearby, or even some world-famous Belgian beer, if you would like to.
And let’s invite the famous little boy Julianske to be our tour guide. Here we go!
Part 1: Introduction/Business Problem
1.1 Background
Let’s briefly introduce Brussels: The beloved “Capital of Europe’, and the new homes for thousands of expats from all over the world.
Brussels is officially the Brussels-Capital Region, which comprising 19 municipalities, including the City of Brussels, which is the capital of Belgium. The Brussels-Capital Region is located in the central portion of the country and is a part of both the French Community of Belgium and the Flemish Community. Brussels has about 1.2 million inhabitants. [1]
Brussels has become the administrative center of many international organizations.
The European Union (EU), the North Atlantic Treaty Organisation (NATO) , the World Customs Organization and EUROCONTROL, as well as many international corporations.
The presence of the EU and the other international bodies has led to there being more ambassadors and journalists in Brussels than in Washington D.C. The “international community” in Brussels numbers at least 70,000 people. [1]
1.2 Business Problem
Brussels is the capital of the Europe, a very busy city and have attracted thousands of internationals people, and especially, expats to stay.
Those expats are from different countries and have strong requests to find a comfortable, safe and affordable neighborhood to stay in.
1.3 Interests/Stakeholders
International People, especially expats.
Stakeholders also include international organizations and companies, which have strong demands for housing in the Brussels region for their expats.
Part 2: Data
2.1 Data Sources:
Background / Municipalities/Postal code Data:
To achieve a general understanding of Brussels: the municipalities, the population size, and the postal code of Brussels etc.
Data Sources: Wiki [1] [2] and postal-codes.cybo.com [3]
Json file: Secondary and Fourth-Level Administrative Divisions, Belgium, 2015
To get the administrative divisions of Belgium.
Level 2 divisions include the capital regions and the provinces. Level 4 divisions include Brussels’ municipalities.
Data Sources: Second-level Administrative Divisions, Belgium, 2015, from Spatial Data Repository of NYU [4] Fourth-level Administrative Divisions, Belgium, 2015 from Spatial Data Repository of NYU [5]
FourSquare API:
To explore neighborhoods of Brussels. The data will be used to compare, segment and cluster neighborhoods.
Data Sources: FourSquare API [6]
Brussels House Price, School Distributions, Crime Rates
House prices, school distributions and crime rats of the Belgium region. These data will be used to address key factors influencing expats’ decision-making when choosing a neighborhood to live.
Data Sources: The Cost of Housing in Brussels, from KBCBrussels [7] Expats: Where should you live in Brussels? from KBCBrussels [8] Brussels Schools, from Enseignement.be[9] Brussels Crime Rats, from IBSA[10]
2.2 Data Selection
We dropped the Dutch names and keep the French name of the 19 municipalities of the Brussels region, to avoid the possible name confusion. This selection also based that the Brussels region is a largely French speaking region. The only exception is the Crime Rate data, for which we used the Flemish name as the json file used the Flemish name
We choose only to display the first rows of data due to the length of this report. For fully detailed data, please refer to the github page.
Part 3. Methodology and Analysis
3.1 Methodology
In this project, we will analyze the different neighborhoods of Brussels, and try to find which neighborhoods are the best matches for the expats in Brussels.
We need identify the key concerns from expats point of view and to establish the most important criteria they apply when choosing a neighborhood to live in.
In the first step, we collected the Location data of the 19 municipalities in the Brussels Region.
We also collected location and category data [the most common avenues of different neighborhoods] from FourSquare.
The Second step is the Exploratory Analysis, including to cluster and segment the neighborhoods.
We used different Statistics, One-Hot Coding, K-Means machine leaning, and different Visualization Tools, like Folium Maps with clusters and Choropleth Map, to help us to cluster, segment, and visualize the neighborhoods.
The final step is to analyze and categorize the different neighborhoods in the Brussels region. We joined different data, and segmented the neighborhoods into different categories based on the key criteria related to choosing a neighborhood to live in.
Application examples:
Json File: Since all the relevant data is in the [feature] key of [ Fourth level Administrative Divisions of the Brussel], we cleaned and reduced the data and to define a new variable that includes this data.
Folium Map: We used Folium map intensively to show the different neighborhoods segments, and we also used the Choropleth map to produce the Brussels’ neighborhood crime rate map.
Four Square API: To get [the most common venues] in different neighborhoods in the Brussels Region.
Machine Leaning: K-Means. We run the best K analysis with [elbow method] and it shows the K=3 is the optimum k of the K-Means
Others: We used [geopy library] to get the latitude and longitude values of the 19 municipalities of Brussels.
3.2 Data Analysis
3.2.1 A national view and the Capital Region
Since Brussels is the capital of Belgium, before we start our Brussels neighborhood tour, let’ get an overall impression about Belgium as a country.
We get [second level of administrative data] from NYU Spatial Data Repository.[4]
After we reduced the json file to [feature] level, and define to a new variable, We can see Belgium has 3 regions: Brussles, Vlaanderen(Flemish speaking part), and Wallonie(French speaking part) (Figure 1)
After the national impression, let’s focus on Brussels now.
We can see there are 19 municipalities/neighborhoods in the Brussels region. (Figure 2)
Each neighborhood of Brussels Region has its Flemish and French names. Since Brussels is a largely French speaking region, we will only keep the French name for this project. Here is a reference to the neighborhoods names, both in French and Flemish. (Figure 3)
Let’s check the largest groups of foreign residents in Brussels. ( Figure 4 )
People of foreign origin make up nearly 70% of the population of Brussels, and About 32% of city residents are of non-Belgian European origin and 36% are of another background.[1]
Next, Let’s get the latitude and longitude of the 19 municipalities of Brussels from geocoder ( Figure 5)
We can visualize all neighborhoods with clusters (neighborhood name) over the Map of Brussels. (Figure 6)
3.2.2 FourSquare:
With FourSquare API, we can explore each neighborhood to get the top 100 venues with a radius of 500 meters.
We know that all the information we need is in the items [key]. Here is a head of the list Venues name, category, latitude and longitude information from FourSquare API. (Figure 7)
We can see there are total 521 venues returned and there are 145 unique categories. So Brussels is indeed a fun city!
Next, let’s apply the One Hot Encoding technique to do further analyze. We will group rows by neighborhood and by taking the mean of the frequency of occurrence of each category. ( Figure 8)
We created the new dataframe to display the top 10 venues for each neighborhood.(Figure.10)
3.2.3 Cluster Neighborhoods
K-Means Machine Learning
K-Means algorithm is one of the most common cluster method of unsupervised learning.
First, let’s run the best K analysis with elbow method and it shows the K=3 is the optimum k of the K-Means. (Figure. 11)
We are able to create a new dataframe that includes the cluster as well as the top 10 venues for each neighborhood. (Figure.12)
Now we can finally visualize each neighborhood with clusters over the map: (Figure.13)
Since there are totally 3 cluster groups [0, 1, 2], We examine each cluster group one by one, to prepare for the future Results Analysis. (Figure. 14)
Part 4. Results
After the tour of Brussels region, we have made all of the data collocation, clusters, folium maps, visualizations, and machining leaning analysis, let’s check the results.
Target Analysis:
Based on the research we have done, [house prices, neighborhood safety, school distributions, along with green space and sports facilities] are the key criteria used by expats, when choosing a neighborhood to live in.
4.1 Green Neighborhoods
Many expats enjoy the neighborhoods with green areas, so let’s check which neighborhoods enjoy the parks most.
By selecting the related venues from [the most common venues] from FourSquare.We are able to find which neighborhoods enjoy Park most.
We decided to choose the top 6 most common avenues as our check points. (Figure.15)
Let’s visualize our results of park neighborhoods: (Figure.16) (Figure.17)
It clearly shows Cluster 2 Neighborhoods enjoy the park most!
4.2 Sports Neighborhoods
Since many expats spend their leisure time in sports, let’s check which neighborhoods enjoy Gym, Soccer, Golf etc most.
Again, we are able to find which neighborhoods enjoy Sports Fields most, from the data of the [most common venues] from FourSquare. We decided to choose the top 6 most common avenues as our check points. (Figure. 18)
Let’s visualize our results of sports neighborhoods: (Figure.19)
It shows Cluster 1 Neighborhoods enjoy sports most!
4.3 Gourmet Neighborhoods
Everyone loves a good restaurant, bar, or a nice coffee shop, so do expats!
Let’s check which neighborhoods enjoy bars, restaurants, and coffee shops most.
We are able to find which neighborhoods enjoy Gourmet Fields most, from the data of [the most common venues] from FourSquare. We decided to choose the top 6 most common avenues as our check points. (Figure. 20)
Let’s visualize our results of gourmet neighborhoods of gourmet neighborhoods: (Figure.21)(Figure.22)
We can clearly see that “Cluster 1” groups also enjoy the Restaurant and Bar most.
The Gourmets winner is Cluster 1 again!
4.4 Culture Neighborhoods
A nice neighborhood with good museums, book stores, and concert halls is A plus!
Let’s check which neighborhoods enjoy those culture avenues most.
We are able to find which neighborhoods enjoy Culture Spots most, from the data of [the most common venues] from FourSquare. We decided to choose the Top 6 most common avenues as our check points. (Figure. 23)
Let’s visualize our results of culture neighborhoods: (Figure.24)
Unbelievable, Cluster 1 is the winner of the Culture Neighborhood again!
4.5 House Prices
House price is one of the most important criteria when people choose a neighborhood to live in, and certainly is also one of most important criteria for expats to consider also.
We got the house price 2018 and 2019 in the Brussels region from The Cost of Housing in Brussels, from KBC Brussels.[7]
We cleaned the data and sorted the house price and identified the top 10 expensive neighborhoods in Brussels Regions. We can see these top expensive neighborhoods are also corresponds with those preferred by expats, according to KBC Brussels.[8]
So, let’s focus on the top 10 most expensive neighborhoods. (Figure.25)
To get a deeper insight, let’s visualize our results.
This bar chart shows the house prices of different neighborhoods in 2018 and 2019. (Figure.26)
A box plot is a way of statistically representing the distribution of the data through five main dimensions: (Figure. 27)
We can immediately make a few key observations from the house price box plot above:
2018:
The minimum number of house price in 2019 is around 280 (min), maximum number is around 550, (max), and median number is around 430 (median).
25% of the years for period 2018 had house prices of ~360 or fewer (First quartile).
75% of the years for period 2018 had house prices of ~490 or fewer (Third quartile).
2019:
The minimum number of house price in 2019 is around 310 (min), maximum number is around 640, (max), and median number of house prices is around 460 (median).
25% of the years for period 2018 had house prices of ~410 or fewer (First quartile).
75% of the years for period 2018 had house prices of ~530 or fewer (Third quartile).
We can also easily to see the house prices in 2019 increased compared to 2018: 2018 min is 280, 2019 min is 310; 2018 max is 550, 2019 max jumped to 640; 2018 median is 430, 2019 median is 460.
4.6 School Distribution
For the expats with young kids, schools are important criteria for them to choose a neighborhood to live in.
We got the Brussels Region primary school data from http://www.enseignement.be/ [9]
First, let’s check the numbers of schools in each neighborhood. [Figure.28]
We can further visualize it with a histogram chart and a bar chat. (Figure. 29) (Figure. 30)
The y-axis of this histogram is the frequency or the number of neighborhoods in each bin.
We can see that most neighborhoods have 5–12 schools. [Figure. 29]
Anderlecht, Schaerbeek, Bruxelles, Uccle and Molenbeek-Saint-Jean have more than 20 schools. [Figure. 30]
4.7 Neighborhoods Safety
The safety of neighborhoods is an ultra-important criterion when choosing to live in. And especially in a big international city as Brussels.
We downloaded the Brussels crime data 2019 from BISA.brussels.[10]
Here is the Top 5 crime rate per neighborhoods. [Figure 31]
Visualizing the crime rates in Brussels Region: ( Figure.32)
We can easily spot Brussels city center has the highest crime rate from above bar chart.
Let’s further visualize Brussels crime rate in 2019 by the choropleth map.
This choropleth map provides an easy way to show the level of crime rate variability within the Brussels region. ( Figure. 33)
Again, we can spot Brussels city center is the neighborhood with the most intensive high crime rates.
Part 5 Discussion
After a nice tour all over the 19 Brussels Municipalities neighborhood areas. What do we know now?
A. We can see all featured neighborhoods, Green/Sporty/Gourmet/Culture neighborhoods as following: (Figure.34)
B. We know the Top Expensive neighborhoods in house prices. (Figure.35)
C. We know the Top Crime neighborhoods. (Figure 36)
D. We know those neighborhoods have more schools than other neighborhoods. (Figure. 37)
Recommendations:
So we hope that you already get some important insights into the neighborhoods of Brussels.
The Green Neighborhoods are recommended for expats who are looking for green areas Sports Neighborhoods are recommended for expats who enjoy sports activities most, and the Gourmet Neighborhoods are recommended for expats who enjoy food. Culture neighborhoods are best-suited for expats who love cultural events.
Overall, Cluster 2 neighborhoods are recommended for expats based on the analysis here-above.
Further Discussion:
Machine Leaning:
There are always different approaches applying Machine Leaning, we can apply more Machine Leaning methods to compare/combine with the results we got from K-Means Machine Leaning.
For example, since House Price is a key criterion when it comes to choosing a neighborhood, we can build a model to predict the evolution of house prices in the coming years.
Neighborhoods Segments
As Brussels is a big region, we can further segment the 19 municipalities/neighborhoods into smaller neighborhoods for a more detailed neighborhood analysis.
To even better suit the international expats interests, we can also segment schools into French Schools, Dutch Schools, and International Schools. For international schools, we can also segment into English Schools, German Schools, Japanese Schools etc.
Part 6 Conclusion
In this project, we analyzed the Location Data, the House Prices, the School Distribution and the Crime Rates data of the Brussels Region.
- We Clustered and Segmented all neighborhoods of Brussels Region into different Clusters. We analyzed the most common venues of different neighborhoods, along with the different venues categories.
- We created different Folium maps with neighborhoods’ clusters over the maps for visualization.
- We identified the Green Neighborhoods, Sports Neighborhoods, Gourmet Neighborhoods, Culture Neighborhoods that the expats may interested in.
- We also identified the Most Expensive Neighborhoods as a reference for expats’ consideration, and we identified the Top Crime Rate Neighborhoods for safety concerns.
- We also provide Brussels Region School Distributions in different neighborhoods.
Analysis Tools
- We downloaded and reduced the json flies in to key [feature], to get the data we need.
- We used Folium map intensively for visualization and we used the Fourth level Administrative Divisions of the Brussels[5] to create Choropleth map, to visualize the crime rate levels of Brussels region.
- Machine Leaning:
we used unsupervised K-Means, One Hot Encoding technique to analyze the neighborhoods data.
Our analysis provides expats a very helpful insight. Expats can benefit from our analysis, when choosing a neighborhood to live in.
The international organizations/companies can also benefit from above analysis, e.g., the HR offices can use above info to search locations for their expats to live in.
Of course, besides the expats, all people can benefit the above analysis, to decide which neighborhoods to live in.
Let’s end of this report by a Data Scientist way! An Alice WordCloud by Brussels Wiki [1]
References:
[1] https://en.wikipedia.org/wiki/Brussels
[2] https://en.wikipedia.org/wiki/List_of_municipalities_of_the_Brussels-Capital_Region
[3] https://postal-codes.cybo.com/belgium/brussels/#listcodes
[4] https://geo.nyu.edu/catalog/stanford-jp810pg7549
[5] https://geo.nyu.edu/catalog/stanford-xq068cy0042
[7] https://www.kbcbrussels.be/retail/en/home/the-cost-of-housing-in-brussels.html
[10] https://ibsa.brussels/themes/securite/interventions-du-siamu
To the Future and A Happy New Year of 2021!
Bonus:
As a big chocolate lover, I would like to share some tips for Belgian chocolates.
You might know Godiva, the world famous Belgian chocolate brand. In Belgium, Marcolini is rising up as the new legend, the top of the top! Pierre Marcolini was awarded prize for “Best Pastry Chef in the World” in 2020.
My personal favor is Delrey, which only has one chocolate shop in Antewerp; another one is a traditional brand: Neuhaus. Both are delicious and right flavors for all most everyone! Many of my friends tried my recommendations, they all love them and some even claimed these chocolates are the best chocolates they have ever tried!
A special thanks goes to the super TA, Lakshmi Holla at the IBM Data Science Certificate Program! Thanks for your sharp and professional assistance for the whole journey!