Self Introduction 1 Project


My name is Malsur. I am from Suryapet, Telangana. Professionally, I have been
working as a Python backend developer for the last 3.5 years, and I am
currently working at Accenture. My current project is:

1) Current project: Data Exploration and Insights Generation.


In this project, my role focuses on exploring and understanding data. This
involves extracting data, cleaning it, and preparing it for analysis.

I use tools like SQL, NumPy, and Pandas to handle and organize the data,
making sure it is accurate and reliable. This means handling missing values,
outliers, and inconsistencies.

Once the data is clean, I perform exploratory data analysis (EDA). EDA is about
looking at the data from different angles to find patterns, trends, and insights.
To do this, I use visualization tools like Matplotlib and Seaborn. These tools
help create charts and graphs that make it easier to understand and share our
findings.

Overall, my goal is to ensure the data we work with is of high quality and to
generate valuable insights that can be communicated effectively to
stakeholders.
Project 2) Bank of America

Previously, I worked on upgrading the Core Banking System (CBS) for Citibank
to enhance performance, security, and compliance with banking regulations.
This project involved adding new features, optimizing system performance,
and ensuring seamless data migration while maintaining high security and
scalability.

Roles and Responsibilities


 Developed and optimized backend applications using Django and Python
for banking operations.
 Designed and managed PostgreSQL databases, ensuring data integrity and
performance.
 Built and maintained RESTful APIs using Django REST Framework (DRF) for
secure communication.
 Implemented authentication & authorization using JWT, OAuth2, and
Django’s permission framework.
 Improved application performance using Django ORM optimizations,
caching (Redis, Memcached), and Celery for background tasks.
 Conducted unit testing, debugging, and API testing to ensure software
reliability and compliance.

My current project is: Data Exploration and Insights Generation.



During the exploratory data analysis (EDA) phase, we used statistical
techniques to find patterns, trends, and connections in the data. We then used
Matplotlib and Seaborn to create visualizations (like charts and graphs) to
show these findings. These visual tools made it easier to understand the data
and share our discoveries with others. Stakeholders could also interact with
these visualizations to explore the data themselves.

In my current role, I use Python and its libraries to handle, visualize, and
analyze data. I also work with stakeholders to share insights and support
machine learning projects.
Project Logic and Scenarios:

Data Extraction and Cleaning (a short sketch follows this list):

 Pandas: Utilized pd.read_sql() to extract data from SQL databases into a
Pandas DataFrame for further processing.
 NumPy and Pandas: Employed methods like np.isnan() and df.isnull() to
identify missing values.
 Pandas: Used df.dropna() or df.fillna() to handle missing values by either
dropping or imputing them.
 NumPy: Detected outliers using statistical methods such as Z-scores
(np.abs((x - x.mean()) / x.std())) or the IQR (Interquartile Range) method
(np.percentile()).
 Pandas: Removed outliers using boolean indexing with conditions like
df[(df['column'] > lower_bound) & (df['column'] < upper_bound)].
 Pandas: Handled inconsistencies in the data using string methods like
str.lower(), str.strip(), and str.replace().
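
A minimal sketch of how these steps might fit together, assuming a
hypothetical 'transactions' table with an 'amount' and a 'city' column (the
table, column names, and bounds are illustrative, not from the actual
project):

import sqlite3
import numpy as np
import pandas as pd

# Hypothetical setup: a small 'transactions' table in an in-memory SQLite database
conn = sqlite3.connect(":memory:")
pd.DataFrame({
    'amount': [120.0, 95.5, None, 4000.0, 88.0, 101.2, 99.9],
    'city': [' Hyderabad', 'hyderabad ', 'Suryapet', 'SURYAPET', None, 'Bengaluru', 'bengaluru'],
}).to_sql('transactions', conn, index=False)

# Extraction: pull the table into a Pandas DataFrame
df = pd.read_sql("SELECT * FROM transactions", conn)

# Identify and handle missing values
print(df.isnull().sum())                                    # missing values per column
df['amount'] = df['amount'].fillna(df['amount'].median())   # impute a numeric column
df = df.dropna(subset=['city'])                             # drop rows missing a key field

# Detect and remove outliers in 'amount' using Z-scores
z = np.abs((df['amount'] - df['amount'].mean()) / df['amount'].std())
df = df[z < 3]

# Equivalent idea with the IQR method
q1, q3 = np.percentile(df['amount'], [25, 75])
iqr = q3 - q1
lower_bound, upper_bound = q1 - 1.5 * iqr, q3 + 1.5 * iqr
df = df[(df['amount'] > lower_bound) & (df['amount'] < upper_bound)]

# Fix string inconsistencies
df['city'] = df['city'].str.strip().str.lower()
print(df)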

Exploratory Data Analysis (EDA) (a short sketch follows this list):

 Pandas: Calculated descriptive statistics such as mean, median,
standard deviation, etc., using methods like df.describe().
 Pandas and NumPy: Generated summary statistics like correlation
coefficients (df.corr()) and covariance (np.cov()).
 Matplotlib and Seaborn: Created various types of plots including
histograms, box plots, scatter plots, and heatmaps to visualize
distributions, relationships, and trends in the data.
 Matplotlib and Seaborn: Customized plot aesthetics and styles using
parameters and functions provided by these libraries.
 NumPy and Pandas: Applied mathematical functions and operations
to transform data, calculate new features, or derive insights.
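
A minimal sketch of this kind of EDA, assuming a hypothetical DataFrame with
two numeric columns, 'sales' and 'profit' (the names and data are
illustrative):

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Hypothetical data: two related numeric columns
rng = np.random.default_rng(0)
sales = rng.normal(100, 20, 200)
df = pd.DataFrame({'sales': sales, 'profit': sales * 0.3 + rng.normal(0, 5, 200)})

# Descriptive statistics and summary measures
print(df.describe())                       # mean, std, quartiles, etc.
print(df.corr())                           # correlation coefficients
print(np.cov(df['sales'], df['profit']))   # covariance matrix

# Visualizing distributions, relationships, and trends
fig, axes = plt.subplots(2, 2, figsize=(10, 8))
sns.histplot(df['sales'], ax=axes[0, 0])                        # distribution
sns.boxplot(y=df['profit'], ax=axes[0, 1])                      # spread and outliers
sns.scatterplot(x='sales', y='profit', data=df, ax=axes[1, 0])  # relationship
sns.heatmap(df.corr(), annot=True, ax=axes[1, 1])               # correlation heatmap
plt.tight_layout()
plt.show()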

Data Preprocessing (a short sketch follows this list):

 Pandas: Processed datetime data using methods like pd.to_datetime() and
extracted date components (df['date'].dt.year).
 scikit-learn: Conducted feature scaling or normalization using classes like
StandardScaler or MinMaxScaler.
 Pandas: Handled categorical variables through techniques like one-hot
encoding (pd.get_dummies()).
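
A minimal sketch of these preprocessing steps on a hypothetical DataFrame with
a date column, a numeric column, and a categorical column (column names are
illustrative; StandardScaler and MinMaxScaler come from scikit-learn):

import pandas as pd
from sklearn.preprocessing import StandardScaler, MinMaxScaler

# Hypothetical data
df = pd.DataFrame({
    'visit_date': ['2024-01-05', '2024-02-17', '2024-03-09'],
    'amount': [250.0, 1200.0, 640.0],
    'department': ['cardiology', 'radiology', 'cardiology'],
})

# Datetime processing: parse strings and extract date components
df['visit_date'] = pd.to_datetime(df['visit_date'])
df['visit_year'] = df['visit_date'].dt.year
df['visit_month'] = df['visit_date'].dt.month

# Feature scaling / normalization with scikit-learn
df['amount_std'] = StandardScaler().fit_transform(df[['amount']]).ravel()
df['amount_minmax'] = MinMaxScaler().fit_transform(df[['amount']]).ravel()

# One-hot encoding for categorical variables
df = pd.get_dummies(df, columns=['department'])
print(df)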

Optimization and Efficiency (a short sketch follows this list):

 NumPy: Leveraged vectorized operations for faster computations, avoiding
explicit looping over DataFrame rows.
 Pandas: Utilized efficient DataFrame methods instead of iterative processes,
such as df.apply() or df.groupby().
 NumPy and Pandas: Employed broadcasting to perform operations on entire
arrays or columns efficiently.
 Pandas: Took advantage of method chaining to streamline data processing
steps and enhance code readability.
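
A minimal sketch contrasting a row-wise loop with the vectorized, broadcast,
and chained style described above (the data and column names are made up):

import pandas as pd

# Hypothetical data
df = pd.DataFrame({
    'region': ['north', 'south', 'north', 'south', 'east'],
    'units': [10, 15, 7, 22, 5],
    'price': [100.0, 90.0, 110.0, 95.0, 120.0],
})

# Slow pattern: explicit Python loop over rows
revenue_loop = [row['units'] * row['price'] for _, row in df.iterrows()]

# Vectorized / broadcast equivalent: operates on whole columns at once
df['revenue'] = df['units'] * df['price']
df['revenue_after_tax'] = df['revenue'] * 0.82       # scalar broadcast over the column

# Efficient grouping instead of manual accumulation
region_totals = df.groupby('region')['revenue'].sum()

# Method chaining to streamline processing steps and keep code readable
summary = (
    df.assign(margin=lambda d: d['revenue'] - d['units'] * 60)
      .query('units > 5')
      .groupby('region', as_index=False)['margin']
      .mean()
      .sort_values('margin', ascending=False)
)
print(region_totals)
print(summary)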

Did your project involve any machine learning aspects?

Yes. I used Python to develop advanced features that boosted the predictive
capabilities of machine learning models. This involved collaborating closely
with a dedicated ML team to improve analysis precision and support their
model-building efforts.

How did you ensure project efficiency?

We optimized our code and processes using Python's programming constructs and
libraries. By doing so, we improved data processing speed and overall project
efficiency, allowing us to extract insights from large datasets more
effectively.

Pandas:
Pandas is an open-source Python library used for data manipulation and
analysis. It provides easy-to-use data structures, such as DataFrames and
Series, along with a wide range of functions for tasks like filtering,
sorting, grouping, and aggregating data. Pandas is particularly valuable for
structured data, making it a go-to tool for data cleaning, transformation, and
exploration in fields like data science, finance, and business analytics.
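
A small, made-up example of the filtering, sorting, grouping, and aggregating
described above (the DataFrame and column names are invented for
illustration):

import pandas as pd

# Made-up structured data
df = pd.DataFrame({
    'customer': ['A', 'B', 'A', 'C', 'B'],
    'city': ['Hyderabad', 'Chennai', 'Hyderabad', 'Pune', 'Chennai'],
    'order_value': [1200, 450, 800, 1500, 300],
})

filtered = df[df['order_value'] > 500]                             # filtering rows
sorted_df = df.sort_values('order_value', ascending=False)         # sorting
grouped = df.groupby('city')['order_value'].agg(['sum', 'mean'])   # grouping and aggregating
print(filtered, sorted_df, grouped, sep='\n\n')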

NumPy:
NumPy is a powerful Python library used for numerical computing. It provides
support for large, multi-dimensional arrays and matrices, along with a
collection of mathematical functions to operate on these arrays efficiently.
NumPy is widely used in scientific computing, data analysis, and machine
learning due to its speed and convenience in handling large datasets and
performing complex mathematical operations.
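
A small illustrative example of the array handling and vectorized mathematics
described above (the values are made up):

import numpy as np

# A 2-D array (matrix) of made-up measurements
data = np.array([[1.0, 2.0, 3.0],
                 [4.0, 5.0, 6.0]])

print(data.shape)            # (2, 3): rows and columns
print(data.mean(axis=0))     # column-wise means
print(data * 10)             # element-wise (vectorized) arithmetic
print(np.sqrt(data))         # mathematical functions applied to whole arrays
print(data @ data.T)         # matrix multiplication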

Matplotlib:
Matplotlib is a popular Python plotting library. It provides static, animated,
and interactive visualizations and is highly customizable. It offers a wide
variety of plots (bar, pie, histogram, violin, scatter), charts, and graphs
for analyzing data and presenting it in a visually appealing manner. It is
often used in combination with NumPy and Pandas (imported as: from matplotlib
import pyplot as plt).
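
A small illustrative example using the import mentioned above; the plotted
data is made up:

from matplotlib import pyplot as plt
import numpy as np

# Made-up data for two simple plots
x = np.linspace(0, 10, 100)
values = np.random.default_rng(1).normal(50, 10, 500)

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.plot(x, np.sin(x), label='sin(x)')    # line plot
ax1.set_title('Line plot')
ax1.legend()
ax2.hist(values, bins=20)                 # histogram of a distribution
ax2.set_title('Histogram')
plt.tight_layout()
plt.show()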

Seaborn:
Seaborn is a Python data visualization library built on top of Matplotlib. It
provides a high-level interface for creating attractive and informative
statistical graphics. In simpler terms, Seaborn makes it easier to generate
visually appealing plots and charts for analyzing data, allowing users to
explore patterns, trends, and relationships in their datasets.

Box plot or Whisker plot:

In Seaborn, a box plot is a statistical visualization that represents the
distribution of a dataset through five key statistics: minimum, first quartile
(Q1), median (second quartile, Q2), third quartile (Q3), and maximum.
Syntax:

import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd

# Sample DataFrame
data = {'category': ['A', 'B', 'A', 'C', 'B', 'C', 'A', 'B', 'C'],
        'value': [10, 15, 20, 25, 30, 35, 40, 45, 50]}
df = pd.DataFrame(data)

# Creating a box plot of 'value' for each 'category'
sns.boxplot(x='category', y='value', data=df)

# Display the plot
plt.show()
Project-3: Data Exploration and Insights Generation
Accenture: 3 Jan 2024 till date.
Payroll: Traegen systems
Address: No.1/5,2nd Floor, The Twin Oaks Building, Nallurahalli Main Rd,
Nallurhalli, Whitefield, Bengaluru, Karnataka 560066

This description outlines a data analyst or data scientist's role and
responsibilities in a project. Here's a breakdown of the key points:

What is the full form of "Led" in "Led Data Extraction"?

In the context provided, "LED" doesn't seem to stand for any specific
abbreviation related to data extraction. "Led" in this context likely refers to the
individual taking the lead or being in charge of the data extraction process. It
indicates that they were responsible for overseeing and managing the
extraction of data from various sources for further analysis.

Data Extraction, Cleaning, and Pre-processing: This indicates that the
individual took charge of gathering data, ensuring its quality, and preparing
it for analysis. They likely used SQL for querying databases and Pandas and
NumPy for data manipulation in Python.

Ensuring Data Integrity: Data integrity is crucial for accurate analysis. This
person likely implemented measures to maintain the accuracy, consistency,
and reliability of the data throughout the process.

Exploratory Data Analysis (EDA): They conducted EDA using statistical methods
and advanced visualization libraries such as Matplotlib and Seaborn. EDA
involves summarizing the main characteristics of the data, often with the help
of visualizations, to understand its nature and uncover initial insights.

Developed Reports, Dashboards, and Visualizations: The individual used tools
like Tableau and Python to create reports, dashboards, and visualizations to
effectively communicate insights derived from the data analysis. This step is
critical for stakeholders to understand the findings and make informed
decisions.

Expertise in Data Analysis and Visualization: This highlights the individual's
proficiency in both analyzing data and creating compelling visualizations.
These skills are essential for translating complex data into actionable
insights.

Facilitating Informed Decision-Making: The ultimate goal of the project was to
empower decision-makers with actionable insights derived from the data. By
effectively communicating findings through reports and visualizations, this
person helped stakeholders make informed decisions.

Showcased Proficiency in Extracting Actionable Insights from Large Datasets:
The project demonstrated the individual's ability to extract valuable insights
from large volumes of data, which is crucial for strategic planning and
decision-making in various industries.

Overall, this description highlights a comprehensive approach to data
analysis, from data extraction to visualization, with a focus on driving
strategic decisions based on actionable insights.

Difficulties faced during the project

One challenge was dealing with inconsistent data formats across different
sources. I overcame this by implementing a robust data cleaning process using
Pandas, which standardized the formats and ensured data integrity, allowing
for accurate analysis.
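
A minimal sketch of this kind of standardization, assuming hypothetical
sources with mixed date formats and inconsistent text casing (the data and
column names are illustrative, not from the project):

import pandas as pd

# Two hypothetical sources with inconsistent formats
source_a = pd.DataFrame({'customer': [' Ravi ', 'PRIYA'], 'joined': ['2024-01-05', '2024-02-11']})
source_b = pd.DataFrame({'customer': ['ravi', 'Anil '], 'joined': ['05/03/2024', '17/04/2024']})

# Standardize text: strip whitespace and normalize case
for src in (source_a, source_b):
    src['customer'] = src['customer'].str.strip().str.title()

# Standardize dates: parse each source with its own format into one datetime type
source_a['joined'] = pd.to_datetime(source_a['joined'], format='%Y-%m-%d')
source_b['joined'] = pd.to_datetime(source_b['joined'], format='%d/%m/%Y')

# Combine into a single consistent DataFrame
combined = pd.concat([source_a, source_b], ignore_index=True)
print(combined)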

3) Sample response: a recently worked dataset and the steps I follow

During my recent project, I worked with electronic health records (EHRs) and
medical imaging data. The dataset included around 100,000 patient records with
details such as medical history and treatment outcomes, and thousands of MRI
scans, each averaging 200 MB.
Steps followed:
1. Data Extraction: I used SQL queries to extract patient data into Pandas
DataFrames. For imaging data, I utilized APIs to download the images.
2. Data Cleaning: I handled missing values by imputing or removing them
and addressed outliers using statistical methods like Z-scores. I also
standardized and corrected inconsistencies in textual data.
3. Data Preprocessing: Converted date columns to datetime formats,
applied feature scaling, and one-hot encoded categorical variables.
4. Exploratory Data Analysis (EDA): I performed EDA using Pandas for
statistical summaries and created visualizations with Matplotlib and
Seaborn to identify patterns and trends.
5. Reporting and Visualization: Developed interactive dashboards and
visualizations to effectively communicate insights to stakeholders.
6. Optimization: Improved data processing efficiency by leveraging
vectorized operations and method chaining.

Typical Columns in EHR Datasets


1. Patient Demographics:
o Patient ID: Unique identifier for each patient.
o Name: First and last name.
o Date of Birth: Patient’s birthdate.
o Gender: Patient’s gender.
o Address: Residential address.
o Phone Number: Contact information.
2. Medical History:
o Medical Record Number: Identifier for the medical record.
o Diagnosis Codes: ICD codes or other classification codes for
diagnoses.
o Previous Conditions: History of past medical conditions.
o Allergies: Known allergies of the patient.
3. Visit Information:
o Visit ID: Unique identifier for each visit.
o Visit Date: Date and time of the visit.
o Provider: Name or ID of the healthcare provider.
o Visit Type: Type of visit (e.g., routine check-up, emergency visit).
4. Treatment and Medications:
o Medication Prescribed: Details of medications prescribed.
o Dosage: Dosage information for prescribed medications.
o Treatment Plan: Outline of the treatment plan.
5. Lab Results:
o Test ID: Identifier for each lab test.
o Test Name: Name of the test performed.
o Test Results: Results of the lab tests.
o Test Date: Date the test was conducted.
6. Vital Signs:
o Blood Pressure: Recorded blood pressure readings.
o Heart Rate: Recorded heart rate readings.
o Temperature: Recorded body temperature.
7. Other Information:
o Insurance Information: Details about insurance coverage.
o Emergency Contact: Information for emergency contact.

Questions and Answers


Q1: Can you describe your role in the "Data Exploration and Insights Generation" project?
 A1: In the "Data Exploration and Insights Generation" project, my role focused on
exploring and understanding data. I was responsible for extracting, cleaning, and
preparing data for analysis using tools like SQL, NumPy, and Pandas. Additionally, I
performed exploratory data analysis (EDA) using Matplotlib and Seaborn to identify
patterns, trends, and insights. My goal was to ensure the data's quality and generate
valuable insights that could be effectively communicated to stakeholders.
Q2: What tools and technologies did you use for data extraction and cleaning in your
project?
 A2: For data extraction and cleaning, I primarily used SQL to query databases and
Pandas to manipulate and organize the data in Python. I employed functions like
pd.read_sql() to load data into Pandas DataFrames, df.dropna() or df.fillna() for
handling missing values, and np.isnan() from NumPy to detect missing data. I also
used string operations like str.lower() and str.replace() to address inconsistencies in
the data.
Q3: How did you handle missing values and outliers during data preprocessing?
 A3: I handled missing values by either dropping them using df.dropna() or imputing
them with appropriate values using df.fillna() in Pandas. For outliers, I used statistical
methods like Z-scores and the Interquartile Range (IQR) method to detect them. I
then removed outliers using boolean indexing in Pandas, ensuring that the data
remained consistent and accurate for analysis.
Q4: What methods did you use for exploratory data analysis (EDA)?
 A4: During EDA, I calculated descriptive statistics such as mean, median, and
standard deviation using Pandas methods like df.describe(). I also generated
summary statistics, including correlation coefficients with df.corr() and covariance
with np.cov() from NumPy. For visualizing the data, I created histograms, box plots,
scatter plots, and heatmaps using Matplotlib and Seaborn to uncover relationships,
trends, and insights.
Q5: Did your project involve any machine learning aspects? If so, what was your
contribution?
 A5: Yes, my project did involve machine learning aspects. I collaborated closely with
a dedicated ML team to develop advanced features that boosted the predictive
capabilities of machine learning models. My contribution involved ensuring the
accuracy and quality of the data used for model training, which in turn enhanced the
precision of the analysis and supported the ML team’s model-building efforts.
Q6: How did you ensure the efficiency of your code and processes during the project?
 A6: To ensure efficiency, I optimized my code using Python's programming
paradigms and libraries. I leveraged vectorized operations in NumPy to speed up
computations and used efficient Pandas methods like df.apply() and df.groupby() to
avoid explicit looping. Additionally, I employed method chaining in Pandas to
streamline data processing steps, which enhanced code readability and overall
project efficiency.
Q7: What challenges did you face during the data cleaning process, and how did you
overcome them?
 A7: One significant challenge was dealing with inconsistent data formats across
different sources. I overcame this by implementing a robust data cleaning process
using Pandas. This involved standardizing formats, addressing missing values, and
correcting inconsistencies, which ensured the integrity of the data and allowed for
accurate analysis.
Q8: Can you explain a scenario where you used advanced visualization techniques to
present data insights?
 A8: In one scenario, I used Seaborn's box plots to visualize the distribution of a
dataset that contained several outliers. By customizing the plot aesthetics, I was able
to highlight key statistics like the median, quartiles, and outliers, which made it
easier for stakeholders to understand the data distribution. This visualization was
crucial in identifying patterns that were not immediately apparent from the raw
data.
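
A minimal sketch of the kind of customized box plot A8 describes, with made-up
data containing outliers; the styling choices (color, flier markers, labels)
are illustrative:

import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd

# Made-up data with a few obvious outliers
df = pd.DataFrame({
    'segment': ['retail'] * 10 + ['corporate'] * 10,
    'value': [12, 14, 13, 15, 16, 14, 13, 90, 15, 14,    # 90 is an outlier
              40, 42, 41, 43, 44, 42, 5, 41, 43, 42],     # 5 is an outlier
})

ax = sns.boxplot(
    x='segment', y='value', data=df,
    color='lightblue',                                     # softer aesthetics
    flierprops={'marker': 'o', 'markerfacecolor': 'red'},  # highlight outliers
)
ax.set_title('Value distribution by segment (outliers in red)')
ax.set_xlabel('Customer segment')
ax.set_ylabel('Value')
plt.show()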

Q9: What was the main aim of this project?


A9: The main aim of the "Data Exploration and Insights Generation" project is to
analyze data to identify patterns, trends, and valuable insights that can guide decision-
making. This involves extracting, cleaning, and preparing data to ensure its quality and
reliability. Through exploratory data analysis (EDA), the project seeks to uncover meaningful
information that can drive strategic actions. Visualizations are created to effectively
communicate these findings to stakeholders. Overall, the project supports informed
decision-making and enhances the organization's ability to predict and respond to future
trends.
