0% found this document useful (0 votes)
54 views2 pages

JD DS Intern

Sembly is building a knowledge hub that combines ML, NLP and probabilistic models to classify information and drive collaboration towards establishing accurate knowledge at scale. They are looking for a Data Science (NLP) intern to help build NLP and deep learning tools for tasks like information retrieval, document classification, toxicity detection and more. The internship offers competitive pay and requires strong Python skills, experience with ML/NLP frameworks and research papers, and a passion for learning.

Uploaded by

ghch
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
54 views2 pages

JD DS Intern

Sembly is building a knowledge hub that combines ML, NLP and probabilistic models to classify information and drive collaboration towards establishing accurate knowledge at scale. They are looking for a Data Science (NLP) intern to help build NLP and deep learning tools for tasks like information retrieval, document classification, toxicity detection and more. The internship offers competitive pay and requires strong Python skills, experience with ML/NLP frameworks and research papers, and a passion for learning.

Uploaded by

ghch
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Sembly.

com - 2022 Tech Recruiting


(ML/NLP)

About Sembly
We’re building the world’s first real-time, grounded, peer-reviewed, scalable
knowledge hub on the web. In our platform, accurate information always wins
against other noises. Current approaches to determine accuracy don’t work ‒ AI is
still ineffective in reasoning, and naive human moderation can be biased and not
scalable. We are building a novel approach that combines the power of ML/NLP (to
classify elements of discourse) and probabilistic graphical models to drive
collaboration towards grounding of knowledge at scale, with intuitive and highly
usable UX. In this way, we will gradually transform and re-orient the web towards
accurate information, starting from niche science and research verticals before
broadening to the mass audience.

Our Plan
We are a team of experienced designers, engineers, machine learning & data
scientists, and second-time entrepreneurs hailing from Stanford, Harvard, and other
top institutions, geographically distributed across US/Bay Area, Singapore, and
Ireland, and we are not strangers to distributed teams and remote work.

We are currently still in stealth mode, with a product in alpha stage targeted for
medical professionals and related research domains, with a group of first users
showing initial signs of stickiness and organic word of mouth, despite a relatively
early product.

We are currently looking for a Data Science (NLP) intern to join our exciting journey.
Contact us via email: [email protected] 

Internship Positions

1) Data Scientist (NLP) Intern 


location: As a data scientist intern, you will help us build NLP and Deep Learning
anywhere based tools to cater many tasks in our platform such as Information
(remote or on- Retrieval, Document Classification (Topic Tagging & Recommendation),
site) Toxicity Detection, etc. 

starting You will work with some of our experienced Data Scientists in the team
month: who have many years of experience building research-based state-of-the-
immediate art Machine Learning (ML) and NLP models, and you will participate in
literature survey, code review, and productizing process.

Comp. level Qualifications:


Competitive  Have a firm grasp of computer science fundamentals with a top
tier educational background in Computer Science or AI/ML/DS
majors
 Strong Python coding skills (having experience in competitive
programming is a bonus e.g., good profiles/leaderboard positions
in platforms such as Leetcode)
 Familiarity with ML and NLP basics 
 Demonstrated capabilities in Data Science related coding e.g.
good leaderboard positions in Kaggle or winners in KDD/ CIKM/
NIPS coding challenges
 Solid hands-on experience with Deep Learning packages such as
PyTorch or Tensorflow (experience with NLP packages such as
Huggingface Transformers, Spacy, and NLTK is a plus)
 Extensive hands-on experience with supporting ML libraries such
as scikit-learn, pandas, numpy etc.
 Good theoretical understanding of Deep Learning models for NLP
(e.g., Transformers, BERT, LSTMs, etc.)
 Basic R&D skills such as reading, interpreting and critically
analysing research papers, doing literature surveys, etc.
 Good sense of ownership and a learning attitude
 Experiences in collaborative software
development/DevOPs/MLOPs with tools such as GitHub, Dockers
and its interfacing with cloud-based notebook environments like
Google Colab will be an advantage 

We accept flexible arrangements including custom time period, number of


hours per week, remote internship, and supporting publications and other
requirements if needed.

You might also like