Sohail
Summary:
A highly skilled Data Engineer and Scientist with a strong foundation in software engineering,
data science, and analytics.
Over 4 years of professional experience working with cutting-edge technologies in data
engineering, data science, product analytics, and machine learning.
Proven ability to drive impactful insights and innovative solutions using a data-driven
approach.
Adept at building scalable data pipelines, optimizing data systems, and creating models that
enhance business decision-making.
Strong expertise with Snowflake, AWS, Azure, RDS, Python, Data Pipelines, and Model
Testing, alongside a range of data analytics, science, and engineering technologies.
Data Engineering: ETL pipelines, Apache Spark, Snowflake, AWS, Azure, RDS, data
warehousing
Data Science: Machine learning, predictive modelling, testing, data analysis, visualization
Programming: Python, SQL, Java, R, Scala
Tools & Platforms: Snowflake, Git, Docker, Kubernetes, Tableau, Power BI
Front end: Experience building and integrating data dashboards for visualization
Analytics: Business Intelligence (BI), A/B testing, SQL
Professional experience:
Currently working on advanced data engineering projects focused on automating and scaling data
workflows. Involved in the development of machine learning models and data pipelines for
predictive analytics. Collaborating with cross-functional teams to optimize data storage and retrieval
processes.
Key Responsibilities:
Design and implement scalable data pipelines using Snowflake, AWS, and Azure
Build and deploy machine learning models for real-time decision-making
Test and optimize models using RDS and data-driven methodologies
Collaborate with product teams to deliver actionable insights
Sep 2022 to Dec 2023, US
Devita Healthcare - Data Science, Product, and Analysis
Worked on data analytics and product optimization for Devita Healthcare, focusing on enhancing
healthcare services and patient outcomes through data-driven insights.
Key Responsibilities:
Led data-driven initiatives to optimize healthcare product offerings using AWS, Azure, and
Snowflake data warehouses, streamlining data workflows and improving system efficiency.
Analyzed patient data and healthcare operations using advanced data science tools such as
Python and RDS to uncover key insights for better decision-making and identify growth
opportunities in service delivery.
Developed predictive models to forecast patient needs and optimize healthcare processes,
implementing model testing techniques to enhance product performance and overall
patient experience.
Focused on both the engineering and data science aspects, designing and building data pipelines for
various enterprise applications. Involved in data wrangling, feature engineering, and optimizing data
processes for scalability.
Key Responsibilities:
Built and optimized ETL pipelines using Python, Snowflake, and AWS
Performed exploratory data analysis (EDA) and model development using RDS, Azure,
and Python
Worked closely with data scientists and product managers to understand business needs
and implement front-end visualization tools for better insights
Implemented data pipeline architectures to ensure efficient data flows across platforms