Introductory Sheet
and sentences into numerical representations that computers can understand. This process is
called text vectorization or embedding. Below are different ways to convert a text sentence
into a numerical vector, ranging from simple to advanced methods.
Example:
Sentences:
Example:
If "NLP" appears 10 times in a document but is rare in the overall dataset, it gets a high TF-IDF score.
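The scenario above can be sketched in a few lines of Python. This is a minimal illustration that assumes the textbook formula, tf × log(N / df); the toy corpus and the helper name tf_idf are made up for this example, and libraries such as scikit-learn use a smoothed variant, so their numbers will differ.

```python
import math
from collections import Counter

def tf_idf(term, doc_tokens, corpus):
    """Textbook TF-IDF of `term` in one tokenized document (hypothetical helper)."""
    counts = Counter(doc_tokens)
    tf = counts[term] / len(doc_tokens)        # share of this document's tokens
    df = sum(1 for d in corpus if term in d)   # documents that contain the term
    idf = math.log(len(corpus) / df) if df else 0.0
    return tf * idf

# Toy corpus (assumed data): "nlp" appears 10 times in doc_a and nowhere else,
# while "the" appears in every document.
doc_a = ["nlp"] * 10 + ["the"] * 10
doc_b = "the cat sat on the mat".split()
doc_c = "the dog chased the ball".split()
corpus = [doc_a, doc_b, doc_c]

score_nlp = tf_idf("nlp", doc_a, corpus)
score_the = tf_idf("the", doc_a, corpus)

print(f"nlp: {score_nlp:.3f}")
print(f"the: {score_the:.3f}")
```

Both words occur equally often in doc_a, but "the" appears in every document, so its IDF is log(3/3) = 0 and its score drops to zero, while the rare word "nlp" keeps a high score.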
Example:
Example:
Example:
If we ask GPT:
"What does the word 'apple' mean?"
It uses the surrounding context to work out whether we are talking about the fruit or the tech company.
Conclusion
For beginners: