0% found this document useful (0 votes)
140 views1 page

Python Project Naming Trend

This Python certification course project involves analyzing naming trends using a dataset of popular baby names from the US Social Security Administration. Students will download a zipped file of the dataset, extract the relevant csv files using Pandas, and visualize the number of male and female babies born each year. They will also analyze and sort baby names by birth count to identify the most popular names. The project tests skills in data manipulation, visualization, and solving real-world problems using Pandas.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
140 views1 page

Python Project Naming Trend

This Python certification course project involves analyzing naming trends using a dataset of popular baby names from the US Social Security Administration. Students will download a zipped file of the dataset, extract the relevant csv files using Pandas, and visualize the number of male and female babies born each year. They will also analyze and sort baby names by birth count to identify the most popular names. The project tests skills in data manipulation, visualization, and solving real-world problems using Pandas.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Python Certification Course

Python Project – 1: Analyzing Naming Trends using Python


Project: Analyzing the naming trends using Python

Industry: General

Problem Statement:

The dataset is in Zipped format, we have to extract the dataset in the program, visualize the number of
male and female babies born in a particular year, and find out popular baby names.

Description: This project not only focusses on implementing data manipulation and data visualization
using Pandas library, but also tests your ability to deal with real word problem statements.

Dataset: Popular baby names data provided by Social Security Administration (SSA) of United States

How to download the dataset:

 Go to https://www.ssa.gov/oact/babynames/limits.html
 Click on ‘National data’
 Get the zipped file

Here’s what the zipped folder looks like,

Hints:

 First, use Pandas, zipfile, and BytesIO library to extract the data. Find out a way to extract only
files that consists useful data.
 Hint: pd.read_csv(BytesIO(z.read(file_name)), encoding='utf-8', engine='python', header=None)
 Then, visualize the number of male and female babies born in a particular year with the help of
pandas.DataFrame.plot, then Analyse baby names by sorting out all birth counts.
 Then, analyse baby names by sorting out top 100 birth counts and group them by names to find
out popular baby names

[email protected] - +91-7022374614 - US: 1-800-216-8930 (Toll Free)

You might also like