Elasticsearch Optimization
By Ketan Bansal
What is Elasticsearch?
What is Kibana?
● The best way to add data to the Elastic Stack is to use one of the
integrations from the Kibana dashboard, such as:
1. Add Data with Elastic Solutions - Website Search crawler, Elastic APM,
Endpoint Security
2. Add Data with Programming Languages - Add any data to Elasticsearch
using a programming language such as JavaScript, Java, Python, or Ruby
3. Add Sample Data - Sample data sets come with sample visualizations,
dashboards, and more, so you can explore data before you add your own
4. Upload a file - If you have a CSV, TSV, or JSON file, you can upload it
and optionally import it into Elasticsearch
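As a rough illustration of options 2 and 4 combined, the sketch below turns CSV text into the newline-delimited JSON body that the Elasticsearch `_bulk` API accepts: one action line followed by one document line per row. The index name `books`, the sample rows, and the helper name `csv_to_bulk_body` are hypothetical. Sending the body to a cluster is left out because it needs a running Elasticsearch instance.

```python
import csv
import io
import json


def csv_to_bulk_body(csv_text: str, index_name: str) -> str:
    """Convert CSV text into a newline-delimited JSON body for the _bulk API."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Action line: tells Elasticsearch to index the next line into index_name.
        lines.append(json.dumps({"index": {"_index": index_name}}))
        # Source line: the document itself (CSV values stay strings here).
        lines.append(json.dumps(row))
    # Every line, including the last one, must end with a newline.
    return "".join(line + "\n" for line in lines)


# Hypothetical sample data and index name, for illustration only.
sample_csv = "title,year\nDune,1965\nNeuromancer,1984\n"
body = csv_to_bulk_body(sample_csv, "books")
print(body)
```

The resulting text would typically be sent as a POST request to `/_bulk` with the content type `application/x-ndjson`.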
Kibana Query Language (KQL)
● KQL is a simple syntax for filtering Elasticsearch data using free-text
search or field-based search
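To make the two search styles concrete, here is a small sketch that builds KQL strings in Python. A free-text query is just the search term, while a field-based query has the form `field: "value"`, and clauses can be joined with `and`. The helper names `kql_field` and `kql_and` and the field names are assumptions chosen for illustration; they are not part of Kibana.

```python
def kql_field(field: str, value: str) -> str:
    """Build a field-based KQL clause that matches the value as a phrase."""
    # Escape backslashes and double quotes so the value stays inside the quotes.
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'{field}: "{escaped}"'


def kql_and(*clauses: str) -> str:
    """Join clauses so that all of them must match."""
    return " and ".join(clauses)


# Free-text search: matches the term in any default search field.
free_text_query = "timeout"

# Field-based search: hypothetical field names and values.
field_query = kql_and(
    kql_field("http.request.method", "GET"),
    kql_field("host.name", "web-01"),
)

print(free_text_query)
print(field_query)
```

Either string could then be pasted into the Kibana query bar.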
What is Logstash?
● Logstash can dynamically unify data from disparate sources and normalize
it before sending it to the destination of our choice
● It cleanses and democratizes all the data for diverse downstream
analytics and visualization use cases
Natural Language Toolkit (NLTK)
What is NLTK?
● Tokenizing: Split a whole text into smaller pieces that are still
relatively meaningful outside of the main text (converting unstructured
data into structured data)
NOTE: Before using these classes, you first need to import the relevant
part of NLTK
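A minimal sketch of the import-then-use pattern, splitting text into word and punctuation tokens. It assumes NLTK is installed and uses `wordpunct_tokenize`, a regular-expression tokenizer that needs no extra data download. The slides may have used a different tokenizer, such as `word_tokenize`, which requires downloading NLTK's Punkt data first.

```python
# Import only the part of NLTK that is needed.
from nltk.tokenize import wordpunct_tokenize

text = "NLTK splits text into tokens. Each token is a word or punctuation mark."

# Runs of word characters become one token; runs of punctuation become another.
tokens = wordpunct_tokenize(text)
print(tokens)
```

The result is a plain Python list of strings, which is the structured form later steps such as stemming and tagging work on.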
Stemming
● For example, “helping” and “helper” share the same root, i.e. “help”
● NLTK has more than one stemmer, but we’ll use Porter Stemmer
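A short sketch of the Porter stemmer in use, assuming NLTK is installed. The word list is chosen for illustration and includes "discovery", whose stem is the fragment "discoveri" rather than a full English word.

```python
from nltk.stem import PorterStemmer

stemmer = PorterStemmer()

# Illustrative words; the stemmer works on one word at a time.
words = ["helping", "helper", "discovery"]
stems = {word: stemmer.stem(word) for word in words}

for word, stem in stems.items():
    print(f"{word} -> {stem}")
```

Porter's rules do not necessarily reduce every related word to the same stem, so it is worth checking what the stemmer actually returns for a word like "helper".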
Tagging Parts of Speech
● nltk.pos_tag() is the function used for tagging; it returns a list of
(word, tag) tuples
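`nltk.pos_tag()` depends on a pretrained tagger model that has to be downloaded separately. The sketch below therefore uses `RegexpTagger` from `nltk.tag` as a stand-in, only to show the shape of the output: a list of (word, tag) tuples. The suffix rules are a deliberately crude assumption and are not how `nltk.pos_tag()` assigns tags.

```python
from nltk.tag import RegexpTagger

# Crude, illustrative suffix rules; the first matching pattern wins.
patterns = [
    (r".*ing$", "VBG"),  # gerund or present participle
    (r".*ed$", "VBD"),   # past-tense verb
    (r".*s$", "NNS"),    # plural noun
    (r".*", "NN"),       # fallback: singular noun
]

tagger = RegexpTagger(patterns)

# tag() takes a list of tokens and returns a list of (word, tag) tuples.
tagged = tagger.tag(["helping", "friends", "discovered", "help"])
print(tagged)
```

With the tagger model downloaded, `nltk.pos_tag(tokens)` returns tuples of the same form, using the Penn Treebank tag set by default.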
Lemmatizing: Like stemming, lemmatizing reduces words to their core
meaning, but it gives you a complete English word that makes sense on its
own instead of just a fragment of a word like “discoveri”
Elasticsearch practice:
https://github.com/S19CRXPP0098/Practice/blob/main/Elasticsearch_Practice.ipynb
NLTK practice:
https://github.com/S19CRXPP0098/Practice/blob/main/NLTK_Practice.ipynb
THANK YOU