Clarista - Data Engineer - JD
Clarista - Data Engineer - JD
Position Summary
At Clarista.io, we are driven to create a connected data world for enterprises,
empowering their employees with the information they need to compete in the digital
economy. Information is power, but only if it can be harnessed by people.
Clarista turns current enterprise data silos into a ‘Live Data Network’, easy to use, always
available, with flexibility to create any analytics with controls to ensure quality and
security of the information
Clarista is designed with business teams in mind, hence ensuring performance with large
datasets and a superior user experience are critical to the success of the product
What You'll Do
You will be part of our data platform & data engineering team. As part of this agile team,
you will work in our cloud native environment and perform following activities to support
core product development and client specific projects:
• You will develop the core engineering frameworks for an advanced self-service
data analytics product.
• You will work with multiple types of data storage technologies such as relational,
blobs, key-value stores, document databases and streaming data sources.
• You will work with latest technologies for data federation with MPP (Massive
Parallel Processing) capabilities
• Your work will entail backend architecture to enable data modeling, data queries
and API development for both back-end and front-end data interfaces.
• You will support client specific data processing needs using SQL and
Python/Pyspark
• You will integrate our product with other data products through Django APIs
• You will partner with other team members in understanding the functional / non-
functional business requirements, and translate them into software development
tasks
• You will follow the software development best practices in ensuring that the code
architecture and quality of code written by you is of high standard, as expected
from an enterprise software
• You will be a proactive contributor to team and project discussions
Who you are
• Strong education track record - Bachelors or an advanced degree in Computer
Science or a related engineering discipline from Indian Institute of Technology or
equivalent premium institute.
• 2-3 years of experience in data queries, data processing and data modeling
• Excellent ANSI SQL skills to handle complex queries
• Excellent Python and Django programming skills.
• Strong knowledge and experience in modern and distributed data stack
components such as the Spark, Hive, Airflow, Kubernetes, Docker etc.
• Experience with cloud environments (AWS, Azure) and native cloud technologies
for data storage and data processing
• Experience with relational SQL and NoSQL databases, including Postgres, Blobs,
MongoDB etc.
• Familiarity with ML models is highly preferred
• Experience with Big Data processing and performance optimization
• Should know how to write modular, optimized and documented code.
• Should have good knowledge around error handling.
• Experience in version control systems such as GIT
• Strong problem solving and communication skills.
• Self-starter, continuous learner.
• Be an integral part of the founding team. You will work directly with the founder
• Work Life Balance. You can't do a good job if your job is all you do!
• Prepare for the Future. Academy – we are all learners; we are all teachers!
• Diversity & Inclusion. HeForShe!
• Internal Mobility. Grow with us!
• Business knowledge of multiple sectors