Supriya Reddy
Data Analyst
Ph: 2169727312 | E: [email protected]
SUMMARY:
Data Analyst with 4+ years of hands-on experience in data analysis, manipulation, and visualization, with a strong
focus on driving actionable insights and decision-making processes in Healthcare and Real-State Sector.
Proficient in Python, SQL, and R, with expertise in utilizing packages like Scikit- Learn, ggplot2, Pandas, NumPy,
Matplotlib, and SciPy for data analysis and machine learning.
Specialized in data visualization using tools such as Tableau, Power BI, and Advanced Excel (Pivot Tables, VLOOKUP),
creating interactive dashboards for stakeholders.
Adept in the entire data lifecycle, from ETL processes using Apache Airflow to database management with MySQL,
MongoDB, PostgreSQL, and Oracle.
Experienced in leveraging cloud platforms such as Amazon Web Services (AWS) and GCP for scalable and efficient
data processing.
Proven track record of troubleshooting and resolving critical issues, supporting teams during challenging situations.
Possess excellent communication and problem-solving skills, contributing to a positive and collaborative work
environment.
Expert in developing visualization dashboards using calculations, Filters, Charts, parameters, calculated fields,
groups, sets and hierarchies.
Strong experience in Data Analysis, Data Migration, Data Cleansing, Transformation, Integration, Data Import,
and Data Export through the use of multiple ETL tools such as Ab Initio and Informatica Power Center Experience in
testing and writing SQL and PL/SQL statements - Stored Procedures, Functions, Triggers and packages.
TECHNICAL SKILL:
Methodologies SDLC, Agile (Scrum), Waterfall
Programming Languages Python, SQL, R
Packages Scikit-Learn, ggplot2, Pandas, NumPy, Matplotlib, SciPy,
Seaborn
Data Visualization Tools Tableau, Power BI, Advanced Excel (Pivot Tables,
VLOOKUP)
IDEs Visual Studio Code, PyCharm, Jupyter Notebook
Database Management MySQL, MongoDB, PostgreSQL, Oracle
Cloud Platforms Amazon Web Services (AWS), GCP
Other Technologies JIRA, SSIS, SSRS, Machine Learning Algorithms,
Mathematics, Probability distributions, Confidence
Intervals, Hypothesis Testing, Regression Analysis, Linear
Algebra, Advance Analytics, Data Mining, Data
Visualization, Data warehousing, Data transformation
Version Control Git, GitHub, Bitbucket, SVN
Operating Systems Windows, Linux, Mac OS
Education:
Master of Science in Information Studies from Trine University, USA
Bachelor of Engineering in Electronics and Telecommunication from JNTU University - India
Professional Experience
Client: BCBS, NJ Jul 2023 – Current
Role: Data Analyst
Responsibilities:
Collected and integrated diverse datasets, including demographics, health conditions, and utilization patterns,
ensuring comprehensive member information.
Employed Python with pandas for data cleaning and preprocessing, achieving a 20% reduction in data
inconsistencies and errors.
Conducted in-depth EDA using tools like Jupyter Notebooks and matplotlib, uncovering key insights into member
behavior and characteristics.
Utilized statistical techniques such as clustering algorithms (e.g., K-means) to identify distinct member segments
based on quantitative criteria.
Designed interactive dashboards using Tableau, providing stakeholders with real-time insights into member
segmentation and communication effectiveness.
Ensured efficient data extraction, transformation, and loading processes, improving data processing efficiency by
30% through Apache Airflow's automated workflows.
Implemented clustering algorithms using scikit-learn, resulting in a 15% increase in accuracy in identifying distinct
member segments.
Collected business requirements to set rules for proper data transfer from Data Source to Data Target in Data
Mapping.
Creating complex SQL queries and scripts to extract and aggregate data to validate the accuracy of the data.
Responsible for interacting with the business analysts and the business partners to identify information needs for
the business requirements for DAR.
Built strong credibility to become a trusted and sought-after partner for business teams seeking guidance on using
analytics.
Client: William Autonetics. NY Sep 2022 – Jun 2023
Role: Data Analyst
Responsibilities:
Predict the Purchase orders in the future, with resetting the metrics in model to improve, have medications in stock
when needed.
Build regression model in Python to do required time series analysis to predict and analyze the underlying causes of
trends in the purchase orders and produce Purchase orders for next 21 calendar days.
Provide a model so that you look at each day in future and see what is to be ordered. It would reset each day as
medications
Developed custom routines for generating test cases.
Created the Test Specifications, Test Scripts and Test Categories for the testing of data in the EDW.
Attended defect triage meetings with the end users and developers.
Developed Spark code to using Scala and Spark -SQL for faster processing and testing.
Experience in building Data Integration, Workflow Solutions and Extract, Transform, and Load (ETL) solutions for
data warehousing using SQL Server Integration Service (SSIS).
Created action filters, parameters, and calculated sets for preparing dashboards and worksheets in Tableau.
Involved in gathering and synthesizing business requirements and translated into functional and nonfunctional
requirements to be used as input to the functional design specifications.
Interacted with technical Architects to identify and analyze the given information, procedures and decision flows
and evaluated existing procedures.
Client: Adani, India Aug 2019 - Jun 2021
Role: Data Analyst
Responsibilities:
Developed real-time dashboards in Tableau to visualize and monitor key metrics, leading to a 20% improvement in
data-driven decision-making.
Leveraged AWS S3 for effective data storage and retrieval, enhancing data accessibility and backup solutions by 25%,
leading to more efficient data management practices.
Performed data cleaning and processing for third-party spending data, utilizing Excel macros and Python libraries
(NumPy, Pandas and Matplotlib) to ensure data accuracy and efficiency, reducing processing time by 25%.
Executed data cleansing and staging of operational sources using ETL processes, resulting in improved data quality
and streamlined data pipelines, reducing data errors by 30%.
Conducted exploratory data analysis using Matplotlib and Seaborn, resulting in the identification of key insights and
actionable recommendations that improved click-through rates by 15%.
Proficiently managed NoSQL databases like MongoDB, leveraging their flexible schema design and efficient data
retrieval capabilities to handle large volumes of structured and unstructured data, resulting in improved data storage
and retrieval efficiency by 25%.
Implemented conditional filters and Action links to filter data on dashboards with Power BI Desktop.
Performing statistical data analysis and data visualization using Python.
Worked on creating filters, parameters and calculated sets for preparing dashboards and worksheets in Tableau.
Created new scripts for Splunk scripted input for collecting CPU, system and OS data.