0% found this document useful (0 votes)
4 views3 pages

JD - LLM Data Analyst

The document outlines a job description for an Entry-Level LLM & Data Analyst position, focusing on Natural Language Processing, data analytics, and machine learning workflows. Key responsibilities include data collection, preprocessing, model training, and collaboration with cross-functional teams. Candidates should have programming skills in Python, familiarity with LLMs and NLP frameworks, and a strong analytical mindset, with 0-1 years of relevant experience.

Uploaded by

dxsm6996
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views3 pages

JD - LLM Data Analyst

The document outlines a job description for an Entry-Level LLM & Data Analyst position, focusing on Natural Language Processing, data analytics, and machine learning workflows. Key responsibilities include data collection, preprocessing, model training, and collaboration with cross-functional teams. Candidates should have programming skills in Python, familiarity with LLMs and NLP frameworks, and a strong analytical mindset, with 0-1 years of relevant experience.

Uploaded by

dxsm6996
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

JOB DESCRIPTION

1 Job Title Department


LLM & Data Analyst (Entry-Level)

Direct Supervisor Job Number


010
2 Job Purpose
We are seeking an enthusiastic and detail-oriented LLM & Data Analyst (Entry-Level)
to join our growing team. In this role, you will work at the intersection of Natural
Language Processing (NLP), data analytics, and machine learning workflows—with
a strong focus on Large Language Models (LLMs).

You will be involved in end-to-end tasks such as data collection, preprocessing,


exploratory data analysis, model training, evaluation, and deployment. This role is
ideal for recent graduates or early-career professionals who are excited to work with
cutting-edge AI technologies in a collaborative and innovation-driven environment.
3 Operating Environment, Framework and Boundaries, Working Relationships
Project Development:

• Assist in the end-to-end development of NLP and AI solutions involving Large


Language Models (LLMs) and machine learning workflows
• Collect, clean, preprocess, and manage structured and unstructured datasets for
model training and evaluation.
• Implement data pipelines (ETL) to automate data extraction, transformation,
and loading processes.
Support model training, fine-tuning, and evaluation using LLMs and other
machine learning techniques.
• Build dashboards and visualizations to present data insights and model outcomes
effectively

Code and Version Control:

• Use Python and relevant libraries (Pandas, NumPy, scikit-learn) for data
processing and model development
• Participate in code reviews and adhere to coding standards and best practices
• Utilize Git or other version control tools for collaborative development and
version management

Team Collaboration:

• Work closely with data scientists, ML engineers, and cross-functional teams to


deliver scalable AI-powered solutions
• Communicate progress, challenges, and technical insights clearly in team meetings
and documentation
• Collaborate in designing workflows that align with business and technical
requirements

Learning and Growth:

• Stay updated with the latest developments in AI, NLP, LLMs, and data
engineering tools and frameworks
• Be open to feedback and continuously improve technical and analytical skills
through mentorship and self-learning
• Participate in knowledge-sharing sessions and contribute to the team’s learning
culture

Process Improvement:

• Suggest and help implement improvements to data preprocessing, model training,


and deployment workflows
• Document data handling, model development, and evaluation processes to enhance
team efficiency and knowledge base
• Assist in building reusable tools and frameworks to streamline AI project
development

4 Knowledge, Skills and Experience


Key Responsibilities:

• Support the design and implementation of applications leveraging Large


Language Models (LLMs) and data-driven AI solutions
• Collect, clean, preprocess, and manage large-scale structured and unstructured
datasets using Python and relevant libraries
• Perform exploratory data analysis (EDA) and generate actionable insights
through data visualization
• Build and maintain ETL pipelines to automate data extraction, transformation,
and loading workflows
• Assist in selecting, training, fine-tuning, and evaluating machine learning
and LLM models using appropriate metrics
• Utilize NLP libraries and APIs (e.g., Hugging Face, OpenAI) to develop
intelligent AI applications
• Implement prompt engineering and experiment with embedding-based retrieval
or fine-tuning techniques
• Create visual reports and dashboards to communicate findings to both technical
and non-technical stakeholders
• Collaborate with cross-functional teams including data scientists, ML
engineers, and business analysts to deploy scalable AI solutions
• Participate in code reviews and stay current with advances in AI, NLP, and data
engineering best practices

Mandatory Skills:

• 0–1 years’ experience in data analysis, natural language processing, or machine


learning (internship/project experience acceptable)
• Strong programming skills in Python with hands-on experience in libraries such
as Pandas, NumPy, Scikit-learn, Matplotlib, and Seaborn.
• Understanding of data preprocessing, feature engineering, model training,
and evaluation techniques
• Basic familiarity with Large Language Models and NLP frameworks like
Hugging Face Transformers, or OpenAI API
• Experience working with SQL and NoSQL databases and constructing ETL
pipelines
• Knowledge of data visualization tools and ability to present insights effectively.
• Strong analytical, problem-solving, and communication skills.
• Ability to work collaboratively in a team-oriented environment.

Desired Skills:

• Experience with prompt engineering, or fine-tuning LLMs


• Familiarity with model selection, hyperparameter tuning, and ML evaluation
metrics
• Exposure to cloud platforms such as AWS, GCP, or Azure
• Understanding of MLOps practices, Docker, and version control systems like
Git
• Participation in AI/ML competitions (Kaggle), hackathons, or open-source
projects is a plus
• Knowledge of Agile or Scrum methodologies is advantageous

You might also like