A16 PPT
BY USING LATENT
DIRICHLET ALLOCATION
Team Members:
1. A.Abhishek (21H71A0503)
2. D.Trisanthi (21H71A0559)
3. P.Balaji (22H75A0509)
4. K.Tejesh (21H71A0558)
Batch No: A16
Name of the Guide: Ms. G. Rohini
Designation: Assistant Professor
AGENDA
INTRODUCTION
PROBLEM STATEMENT
LITERATURE REVIEW
EXISTING SYSTEM
DISADVANTAGES OF EXISTING SYSTEM
PROPOSED SYSTEM
ARCHITECTURE
METHODOLOGY
TOOLS
OUTPUT
CONCLUSION
THIS PROJECT LEVERAGES NATURAL
LANGUAGE PROCESSING (NLP)
TECHNIQUES, SPECIFICALLY LATENT
DIRICHLET ALLOCATION (LDA), TO
AUTOMATE THE IDENTIFICATION AND
ANALYSIS OF EMERGING INDUSTRY
TRENDS.
A proposed Academic Chatbot System using NLP Techniques
N. Raeesh, Nabi, N. Bavigha
Dataset preparation, NLP techniques, Hybrid model approach
Preprocessing, Hybrid intelligence, AI-driven approach, Feedback loops
Lack of emotional intelligence, outdated information
Difficulty with
Interpretability
Polysemy
PROPOSED SYSTEM
Data Collection -> Data Analysis -> Apply LDA -> LDA Predictions
LATENT DIRICHLET ALLOCATION (LDA) IS A PROBABILISTIC TOPIC MODELING ALGORITHM THAT DISCOVERS
UNDERLYING TOPICS WITHIN A COLLECTION OF DOCUMENTS BY MODELING DOCUMENTS AS MIXTURES OF TOPICS AND
TOPICS AS DISTRIBUTIONS OF WORDS.
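The "mixture" idea can be made concrete by running LDA's generative story forward. The following is a minimal sketch using only the Python standard library, not the project's actual code. The vocabulary, the two topic-word distributions, and the helper names `sample_dirichlet` and `generate_document` are all hypothetical and chosen for illustration.

```python
import random

def sample_dirichlet(alpha, rng):
    """Draw one sample from a Dirichlet distribution via normalized Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

def generate_document(topic_word, vocab, doc_alpha, length, rng):
    """Generate one document by following LDA's generative story."""
    # 1. Each document gets its own mixture over topics.
    theta = sample_dirichlet(doc_alpha, rng)
    words = []
    for _ in range(length):
        # 2. Pick a topic for this word position from the document's mixture.
        z = rng.choices(range(len(theta)), weights=theta)[0]
        # 3. Pick a word from that topic's distribution over the vocabulary.
        words.append(rng.choices(vocab, weights=topic_word[z])[0])
    return theta, words

rng = random.Random(0)
vocab = ["ai", "model", "cloud", "hiring", "salary", "remote"]
# Two made-up topics: one about technology, one about jobs.
topic_word = [
    [0.4, 0.3, 0.3, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.4, 0.3, 0.3],
]
theta, words = generate_document(topic_word, vocab, [0.5, 0.5], 8, rng)
print("topic mixture:", [round(p, 2) for p in theta])
print("generated words:", words)
```

Fitting LDA runs this story in reverse: given only the words, it estimates the topic mixtures and the topic-word distributions that most plausibly produced them.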
LDA TURNS UNSTRUCTURED TEXT INTO STRUCTURED DATA: AFTER NOISY DATA IS REMOVED, EACH DOCUMENT IS REPRESENTED IN AN
ORDERED WAY AS TOPIC PROPORTIONS, WHICH SUPPORTS MORE ACCURATE PREDICTIONS
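As an illustration of the noise-removal step, the sketch below cleans raw post text and turns it into word counts, the structured input a topic model works on. It assumes a simple pipeline (lowercasing, URL removal, stop-word filtering), because the slides do not specify the exact steps. The stop-word list is deliberately tiny, and the names `preprocess` and `bag_of_words` are hypothetical.

```python
import re
from collections import Counter

# A tiny illustrative stop-word list; a real project would use a fuller one,
# for example from NLTK or spaCy.
STOP_WORDS = {"a", "an", "the", "is", "are", "in", "on", "of", "for",
              "to", "and", "we", "our"}

def preprocess(text):
    """Lowercase, strip URLs, keep alphabetic tokens, drop stop words."""
    text = re.sub(r"https?://\S+", " ", text.lower())
    tokens = re.findall(r"[a-z]+", text)
    # Keep two-letter tokens so that terms such as "ai" survive.
    return [t for t in tokens if t not in STOP_WORDS and len(t) > 1]

def bag_of_words(tokens):
    """Count how often each token occurs in one document."""
    return Counter(tokens)

post = "We are hiring! Remote roles in AI and cloud: https://example.com/jobs"
tokens = preprocess(post)
print(tokens)
print(bag_of_words(tokens))
```

The token lists produced this way can be passed directly to a topic model. The counts give the bag-of-words view, in which word order is ignored.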
APPLIED TO A LARGE CORPUS, THE LDA TOPIC MODELING ALGORITHM CLUSTERS CO-OCCURRING WORDS INTO TOPICS AND REPORTS
THE PROPORTION OF EACH TOPIC IN EVERY DOCUMENT, EVEN ACROSS MILLIONS OF DOCUMENTS.
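To show how per-document topic proportions can be estimated, here is a compact sketch of LDA fitted by collapsed Gibbs sampling, written with the standard library only. It is not the project's implementation, which according to the tools slide relies on Gensim (and Gensim uses a different inference method). The function name `fit_lda`, the hyperparameter values, and the toy documents are assumptions made for illustration.

```python
import random

def fit_lda(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA on tokenized documents.

    Returns (doc_topic, topic_word, vocab); doc_topic[d][k] is the
    estimated share of topic k in document d.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)
    n_dk = [[0] * num_topics for _ in docs]      # topic counts per document
    n_kw = [[0] * V for _ in range(num_topics)]  # word counts per topic
    n_k = [0] * num_topics                       # total tokens per topic
    assignments = []
    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(num_topics)
            zs.append(k)
            n_dk[d][k] += 1
            n_kw[k][word_id[w]] += 1
            n_k[k] += 1
        assignments.append(zs)
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove this token's current assignment from the counts.
                n_dk[d][k] -= 1
                n_kw[k][v] -= 1
                n_k[k] -= 1
                # Conditional weight of each topic for this token.
                weights = [
                    (n_dk[d][t] + alpha) * (n_kw[t][v] + beta) / (n_k[t] + V * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                n_dk[d][k] += 1
                n_kw[k][v] += 1
                n_k[k] += 1
    # Smoothed estimates of the two families of distributions.
    doc_topic = [
        [(n_dk[d][k] + alpha) / (len(doc) + num_topics * alpha)
         for k in range(num_topics)]
        for d, doc in enumerate(docs)
    ]
    topic_word = [
        [(n_kw[k][v] + beta) / (n_k[k] + V * beta) for v in range(V)]
        for k in range(num_topics)
    ]
    return doc_topic, topic_word, vocab

# Toy, made-up documents (already tokenized).
docs = [
    ["ai", "model", "cloud", "ai"],
    ["cloud", "model", "ai"],
    ["hiring", "salary", "remote", "hiring"],
    ["remote", "salary", "hiring"],
]
doc_topic, topic_word, vocab = fit_lda(docs, num_topics=2)
for d, dist in enumerate(doc_topic):
    print(d, [round(p, 2) for p in dist])
```

Each row of `doc_topic` is the topic mixture of one document, and each row of `topic_word` is one topic's distribution over the vocabulary. A pure-Python sampler like this only suits small examples; corpora with millions of documents need an optimized library.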
TOOLS
PROGRAMMING LANGUAGE (BACK-END): PYTHON
NLP LIBRARIES: GENSIM, NLTK, SPACY
DATA PROCESSING: PANDAS, NUMPY
VISUALIZATION: MATPLOTLIB, SEABORN, PYLDAVIS
DATABASE: SQLITE OR MONGODB (OPTIONAL, FOR STORING LINKEDIN DATA)
FRAMEWORK: FLASK / DJANGO FOR WEB DEPLOYMENT
FRONT-END: HTML, CSS, REACTJS
OUTPUT
CONCLUSION
THIS PROJECT SUCCESSFULLY DEMONSTRATES THE APPLICATION OF LATENT
DIRICHLET ALLOCATION (LDA) IN ANALYSING LINKEDIN POST DATA TO
IDENTIFY EMERGING INDUSTRY TRENDS.
THE RESULTS REVEAL HOW TRENDS EVOLVE OVER TIME, ENABLING
DATA-DRIVEN DECISION-MAKING IN VARIOUS SECTORS.
USING PYTHON LIBRARIES LIKE GENSIM, NLTK, AND PANDAS, THE PROJECT
TRANSFORMS VAST UNSTRUCTURED TEXT INTO ACTIONABLE INSIGHTS.
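One simple way to see how trends evolve over time is to average each topic's share over the posts in each time period. The sketch below assumes every post already has a period label (for example its month) and a topic distribution from LDA. The function name `topic_share_by_period` and all numbers are made up for illustration and are not results from the project.

```python
def topic_share_by_period(periods, doc_topic):
    """Average each topic's share over the documents in each period."""
    sums = {}
    counts = {}
    for period, dist in zip(periods, doc_topic):
        if period not in sums:
            sums[period] = [0.0] * len(dist)
            counts[period] = 0
        for k, p in enumerate(dist):
            sums[period][k] += p
        counts[period] += 1
    # Sort periods so the output reads chronologically for ISO-style labels.
    return {
        period: [s / counts[period] for s in sums[period]]
        for period in sorted(sums)
    }

# Illustrative inputs only: one period label and one topic mixture per post.
periods = ["2024-01", "2024-01", "2024-02", "2024-02"]
doc_topic = [[0.9, 0.1], [0.7, 0.3], [0.4, 0.6], [0.2, 0.8]]
trend = topic_share_by_period(periods, doc_topic)
for period, shares in trend.items():
    print(period, [round(s, 2) for s in shares])
```

A topic whose average share rises from one period to the next is a candidate emerging trend. Plotting these averages with Matplotlib would give the trend-over-time view.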