Data Engineer
[email protected]
4707814773
PROFESSIONAL SUMMARY
I have 9+ years of experience in data engineering and data analysis/science, specializing in
enterprise-level data models.
I have extensive hands-on experience with AWS, including EC2, S3, EBS, Glue, Lambda,
Redshift, SNS, SQS, EMR, EKS, and RDS.
I am skilled in Azure cloud components such as HDInsight, Databricks, Data Lake, Blob Storage,
Data Factory, Storage Explorer, SQL DB, SQL DWH, and Cosmos DB.
I am proficient in designing, implementing, and optimizing end-to-end data pipelines for large-scale data
processing.
I have strong expertise in Extract, Transform, Load (ETL) processes, ensuring efficient data movement
and transformation across diverse systems.
I have in-depth knowledge of relational databases, including design, optimization, and administration of
schemas, with expertise in SQL for complex queries and performance tuning.
I have proven experience working with various NoSQL databases such as MongoDB, Cassandra, and
DynamoDB.
I am proficient in big data technologies such as Apache Hadoop and Apache Spark for distributed
processing and analytics.
I have knowledge of converting SQL queries into Spark transformations using Spark RDDs,
DataFrames, and Scala.
I have expertise in data modeling and schema design, including star and snowflake schemas, for
relational and non-relational databases.
I am skilled in working with real-time data streaming technologies, including Apache Kafka,
Apache Spark, and Apache Flink.
I have expertise in designing and maintaining data warehouses, leveraging technologies like Amazon
Redshift and Google BigQuery.
I am proficient in workflow orchestration tools like Apache Airflow for scheduling and monitoring
complex data workflows.
I have used Jenkins CI/CD pipelines to streamline deployment procedures, including software quality
tests and automated data integrity testing.
I have strong proficiency in version control systems such as Git and scripting languages such as Python
for building scalable and maintainable data solutions.
TECHNICAL SKILLS
Cloud Technologies: AWS (Amazon Web Services) [EMR, EC2, S3, Redshift, Glue, Route 53, Athena, Lambda, DynamoDB]
                    Azure (Microsoft Azure) [Azure Databricks, Azure SQL Database, Azure Data Factory, Azure Machine Learning, Azure Data Lake, Azure Functions]
Big Data Ecosystem: Hadoop, MapReduce, HDFS, Hive, Impala, HBase, Kafka, Spark [Scala, SQL]
PROFESSIONAL EXPERIENCE