Generate Insights From Unstructured Financial Data
[Mind map figure. Recoverable node labels: Investor Reports; ML Models; To store embeddings; On daily basis, get latest Articles / Reports / Transcripts; Why do we need it?; Links between Embeddings to give back related Information Quickly; Remove Stop words; Store; Lemmatization; Chunking; spaCy; pt Templates; Context Aware Sentiment Analysis; How?; Fine tuning the LLM]
2. End Product
2.1. UI for Users
2.3. ML Models
3. Steps for Individual Document
3.1. On a daily basis, get the latest Articles / Reports / Transcripts
3.2.2. Lemmatization
3.2.4. Vectorization
3.3.1.2. Limitations
3.3.1.2.1.1. Mitigation
3.3.2.2. Limitations?
3.3.2.2.1.2. Mitigation
3.3.2.2.1.2.1. Vectorization
3.4.1. NER
3.4.1.1.2.1. Designation
3.4.1.1.4. Companies
3.4.1.2.3. spaCy
3.5.1. How?
3.6. Document Summary
3.6.1. How?
3.7.1. Rule-based approach to identify and list the questions asked in the call?
3.8.1. How?
3.8.1.1. Neo4j
4. At Aggregate Level
4.1. Identify Trending Topics
5. Document Q&A
5.1. How scalable is it?
6.2.2. Chunking
7. Knowledge Graph
7.1. Why do we need it?
7.2.1. REBEL
7.3.1. Neo4j
8. LlamaIndex?
9. Vector Store
9.1. Why do we need it?
9.2.1.3. GloVe?
9.3.1. Chroma
9.3.1.1. Chroma is an open-source, AI-native vector database. It stores vector representations (embeddings) of documents, groups them into collections, and supports similarity search, so that documents whose vectors are closest to a query can be retrieved efficiently.
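To make the collection-and-similarity-search idea concrete, here is a minimal pure-Python sketch. It is not Chroma's API: `ToyCollection`, `embed`, and `cosine` are hypothetical names introduced only for illustration, and the bag-of-words "embedding" stands in for a real embedding model. The sample documents are invented.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding (assumption): lowercase token counts instead of a real model."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class ToyCollection:
    """Hypothetical in-memory collection: stores documents with their vectors."""

    def __init__(self, name):
        self.name = name
        self._items = {}  # doc_id -> (text, vector)

    def add(self, doc_id, text):
        self._items[doc_id] = (text, embed(text))

    def query(self, text, n_results=1):
        """Return the n_results most similar documents as (id, text, score)."""
        query_vec = embed(text)
        scored = [
            (cosine(query_vec, vec), doc_id, doc)
            for doc_id, (doc, vec) in self._items.items()
        ]
        scored.sort(key=lambda item: item[0], reverse=True)
        return [(doc_id, doc, score) for score, doc_id, doc in scored[:n_results]]


collection = ToyCollection("financial_docs")
collection.add("doc1", "quarterly revenue grew on strong cloud demand")
collection.add("doc2", "the board approved a new share buyback program")
collection.add("doc3", "management raised revenue guidance for next quarter")

results = collection.query("revenue guidance", n_results=2)
for doc_id, doc, score in results:
    print(doc_id, round(score, 3), doc)
```

A real vector store replaces the linear scan in `query` with an index built for nearest-neighbour search, and replaces `embed` with a learned embedding model; the add-then-query workflow is the part this sketch is meant to show.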
9.3.2. Pinecone
9.3.3. Faiss