Teclov Project Description PDF
Teclov Project Description PDF
You work for Teclov, an asset management company. Teclov wants to make
investments in a few companies. The CEO of Teclov wants to understand the global
trends in investments so that she can take the investment decisions effectively.
! You may use this list: Click here for a list of countries where English is an
official language.
These conditions will give you sufficient information for your initial analysis. Before
getting to specific questions, let’s understand the problem and the data first.
Teclov wants to invest where most other investors are investing. This pattern is
We have taken real investment data from crunchbase.com, so the insights you get
may be incredibly useful. For this group project, we have divided the data into the
following files:
You have to use three main data tables for the entire analysis
The business objectives and goals of data analysis are pretty straightforward.
and a suitable investment type for making investments. The overall strategy is
to invest where others are investing, implying that the 'best' sectors and
2. Goals of data analysis: Your goals are divided into three sub-goals:
in the venture, seed, angel, private equity etc. so that Teclov can
○ Country analysis: Identifying the countries which have been the most
the eight main sectors. (Note that we are interested in the eight 'main
sectors' provided in the mapping file. The two files — companies and
rounds2 — have numerous sub-sector names; hence, you will need to
1. Company details
3. Sector Classification:
mapping.csv: This file maps the numerous category names in the companies table
(such 3D printing, aerospace, agriculture, etc.) to eight broad sector names. The
purpose is to simplify the analysis into eight sector buckets, rather than trying to
1. Load the companies and rounds data into two data frames and name them
2. Table 1.1:
Are there any companies in the rounds2 file which are not present in
Merge the two data frames so that all variables (columns) in the
companies frame are added to the rounds2 data frame. Name the
master_frame?
After this, you will need to work only with the master frame.
This is the first of the three goals of data analysis – investment type analysis.
The funding types such as seed, venture, angel, etc. depend on the type of the
company (startup, corporate, etc.), its stage (early stage startup, funded startup,
etc.), the amount of funding (a few million USD to a billion USD), and so on. For
example, seed, angel and venture are three common stages of startup funding.
occurs after seed or angel stage/s and involves a relatively higher amount of
investment.
! Private equity type investments are associated with much larger companies
and involve much higher investments than venture type. Startups which have
grown in scale may also receive private equity funding. This means that if a
company has reached the venture stage, it would have already passed
Teclov wants to choose one of these four investment types for each potential
Considering the constraints of Teclov, you have to decide one funding type which is
each of the four funding types (venture, angel, seed, and private equity) and
Considering that Teclov wants to invest between 5 to 15 million USD per investment
round, which investment type is the most suitable for it? Identify the investment type
and, for further analysis, filter the data so it only contains the chosen investment
type.
Problems - Part 2
countries.
Teclov wants to invest in countries with the highest amount of funding for the chosen
investment type. This is a part of its broader strategy to invest where most
1. Teclov wants to see the top nine countries which have received the highest
total funding (across ALL sectors for the chosen investment type)
2. For the chosen investment type, make a data frame named top9 with the top
nine countries (based on the total investment amount each country has
received)
Identify the top three English-speaking countries in the data frame top9.
Results Expected: All codes for data frame top9. Fill out Table 3.1.
2. Second English-speaking
country
3. Third English-speaking
country
Now you also know the three most investment-friendly countries and the most suited
funding type for Teclov. Let us now focus on finding the best sectors in these
countries.
When we say sector analysis, we refer to one of the eight main sectors listed in the
mapping file (note that ‘Other’ is one of the eight main sectors; also, there are eight
sectors if you consider the category 'Blanks' as a missing value). This is to simplify
the analysis by grouping the numerous category lists (named ‘category_list’) in the
mapping file. For example, in the mapping file, category_lists such as ‘3D’, ‘3D
Printing’, ‘3D Technology’, etc. are mapped to the main sector ‘Manufacturing’.
Also, for some companies, the category list is a list of multiple sub-sectors separated
by a pipe (vertical bar |). For example, one of the companies’ category_list is
You discuss with the CEO and come up with the business rule that the first string
before the vertical bar will be considered the primary sector. In the example above,
1. Extract the primary sector of each category list from the category_list
column
2. Use the mapping file 'mapping.csv' to map each primary sector to one of the
eight main sectors (Note that ‘Others’ is also considered one of the main
sectors)
Expected Results: Code for a merged data frame with each primary sector mapped
to its main sector (the primary sector should be present in a separate column).
Now you have a data frame with each company’s main sector (main_sector) mapped
to it. When we say sector analysis, we refer to one of the eight main sectors.
Also, you know the top three English speaking countries and the most suitable
funding type for Teclov. Let’s call the three countries 'Country 1', 'Country 2' and
Now, the aim is to find out the most heavily invested main sectors in each of the
three countries (for funding type FT and investments range of 5-15 M USD).
1. Create three separate data frames D1, D2 and D3 for each of the three
countries containing the observations of funding type FT falling within the 5-15
! All the columns of the master_frame along with the primary sector and the
main sector
! The total number (or count) of investments for each main sector in a separate
column
Using the three data frames, you can calculate the total number and amount of
2. Table 5.1: Based on the analysis of the sectors, which main sectors and
countries would you recommend Teclov to invest in? Present your conclusions
in the presentation. The conclusions are subjective (i.e. there may be no ‘one
right answer’), but it should be based on the basic strategy — invest in sectors
Note: In the following table, all the observations refer to investments of the
(count)
2. Total amount of investment
(USD)
3. Top sector (based on count of
investments)
4. Second-best sector (based on
count of investments)
count of investments)
4)
8. Number of investments in the
investment?
Problem 6: Plots
As a final step, you have to present your findings to the CEO of Teclov. Specifically,
private equity, and the average amount of investment in each funding type.
This chart should make it clear that a certain funding type (FT) is best suited
for Teclov.
2. A plot showing the top 9 countries against the total amount of investments of
funding type FT. This should make the top 3 countries (Country 1, Country 2,
3. A plot showing the number of investments in the top 3 sectors of the top 3
This plot should clearly display the top 3 sectors each in Country 1, Country 2, and
Country 3.