Data Science - Capstone Project : Pizza Eat-Out place data analysis of Bengaluru city
A. Introduction
A.1. Description & Discussion of the Background
Bangalore, officially known as Bengaluru is the capital of the Indian state of Karnataka. It has a population of about 10 million and a metropolitan population of about 8.52 million, making it the third most populous city and fifth most populous urban agglomeration in India.[12] Located in southern India on the Deccan Plateau, at a height of over 900 m (3,000 ft) above sea level, Bangalore is known for its pleasant climate throughout the year. Its elevation is the highest among the major cities of India.Bangalore is widely regarded as the "Silicon Valley of India" (or "IT capital of India") because of its role as the nation's leading information technology (IT) exporter.Bengaluru is a city with a high population and population density. Being such a crowded city leads the fast-food chain business and social sharing places in the city where the population is dense. When we think of it by the investor, we expect from them to prefer the areas or suburban places where there is a lower competition and the type of business they want to install is less intense. If we think of the city residents, they may want to choose the regions where options are more and prices are economical. However, it is difficult to obtain information that will guide investors in this direction, nowadays.
Considering below hypothetical scenario, we can take help of data-science to nail down this problem to some extent.
Some of my entrepreneur friends wants to open new pizza eat-out place in Bangalore city. One day when we met for casual meet, they explained me about their plans. There basic requirements includes,
1. To open pizza eat-out in a place, where there are less numbers of pizza places, so that competition will be less and profit will be more.
2. The place should be economical. In sense investment price should be less and profit should be more.
In this project we will try to find an optimal location to set-up pizza eat-out. Specifically, this report will be targeted to stakeholders interested in opening Pizza Eat-out in Bangalore, Karnataka.
Since there are lots of pizza eat-out in Bangalore, we will try to detect locations that are not already crowded with Pizza places.
We will use our data science powers to generate a few most promising neighbourhoods based on this criteria. Advantages of each area will then be clearly expressed so that best possible final location can be chosen by stakeholders.
When we consider all these requirement points, we can create a map and information chart where the pizza place index is placed on Bengaluru and each area is clustered according to the number of counts.
A.2. Data Description
Based on definition of our problem, factors that will influence our decision are:
- Number of existing pizza places in the neighbourhood Following data sources will be needed to extract/generate the required information: I have prepared .xls file which has main details including area pin-code, latitude and longitude. This data set I have committed in kaggle.
- Number of pizza places and location in every neighbourhood will be obtained using Foursquare API
B. Methodology
As a database, I used CSV file which I have prepared. My master data which has the main components Area, pin-code, Latitude and Longitude informations of the city.I used python folium library to visualise geographic details of Bengaluru city and its suburban areas and I created a map of Bengaluru with areas superimposed on top. I used latitude and longitude values to get the visual as below:
I Used the Foursquare API to explore neighbourhood areas and segment them. Here as an example I used 'Basavanagudi' area and tried to list down restaurants, eat-out places using Foursquare API.
Next, Prepare neighbourhood list that contains pizza places.
We can see that, there 76 places which are related to pizza.
Let's plot the bar graph to get to know which area has more number of pizza places and which has less.(comparatively)
From above graph, it clear that 'Koramangala 6th Block' has highest number of pizza places.
Now, Let's calculate average count for each area.
we will consider all the areas with average count lesser than 2.0 to visualise on map.
Let's get list of areas in which pizza places counts are less than 2.
Now above we have the pizza places which has count lesser 2.0 and their Geo-Co-ordinates
Lets Visualise it!

C. Results
Let’s merge those new variables with related informations in our main master table

As we can see below are the list of areas which has less pizza places and it would be good for business, if we consider these areas to open new pizza eat-out.
1. Adugodi
2. Banashankari 3rd Stage
3. Vyalikaval
4. VivekNagar
5. Vidyaranyapura
6. Ulsoor
7. Peenya
8. Mathikere
9. Kumbalgodu
10. Kadugodi
Limitations :
The Count is highly dependent on Foursquare API details.
There might be some other pizza eat-outs which are not listed in foursquare database.
D. Discussion
As I mentioned before, Bengaluru is a big city with a high population density with many of fast-food options.
For more detailed and accurate guidance, the data set can be expanded and the details of the areas or street can also be drilled.
I also performed data analysis through this information by adding the coordinates of areas. In future studies, these data can also be accessed dynamically from specific platforms or packages.
F. Conclusion
Being pizza lover, I would like to explore more places which serves varieties of flavours. I believe this small hypothetical project would help some of us.(Either of the cases, If you are planning to open new pizza restaurant or just want to try out some new pizza places).
This project can be modified for more options.
For eg : I have plan to extend this project to get details about Udupi south Indian restaurant chains.
G. References:
[1] Bengaluru — Wikipedia - Bengaluru-wikipedia
[2] Kaggle - Kaggle Dataset
[3] Forsquare API - Foursquare API
[4] Google Maps
[5] Distancefrom.in - Latitude-Longitude
Update : I have completed IBM Data Science Course successfully and you can find my badge in below link.
https://www.youracclaim.com/badges/284be8a2-0e01-4763-8857-ef74b38de4db/public_url








Comments
Post a Comment