This document outlines the steps for setting up a lab environment using Strigo, including creating an account, launching the lab, and starting an Elasticsearch cluster. It then walks through setting up a web crawler to index the Elastic documentation, configuring inference pipelines for vectorization, connecting to OpenAI, and launching a Streamlit web frontend to interact with Elastic and ChatGPT.

Lab 0 - Environment Setup

1. Go to this link
2. This will launch a Strigo page where you might have to create an account if you’re new
to Strigo.

3. Once you’ve created an account, enter token: Y33M and click on Enter the classroom
to launch the lab environment.
4. This will begin the lab creation process, which may take a few moments. You should now
be presented with a screen as shown below. Your browser may also ask you to allow
app.strigo.io to use your microphone and camera; you can allow this for now. Click on
the highlighted icon as shown below to start the lab.
5. Once the lab is loaded, you'll be greeted with a command prompt. The lab already has
Elastic installed; we will now start our Elastic cluster. Type "start_elastic" at the
command prompt. This starts the Elasticsearch cluster and should take about 5-7
minutes to come up.
6. A process will begin to create Docker containers for the required Elastic cluster nodes.
When finished, the result should look like below…
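
As an optional sanity check, you can inspect the containers and query the cluster directly from the same prompt. This is only a sketch; the container names, port, scheme, and password in your lab may differ…

# List the running Elasticsearch containers (names will vary)
docker ps

# Query cluster health; use the password shown by get_elastic_password (see Lab 1)
curl -k -u "elastic:<your-password>" "https://localhost:9200/_cluster/health?pretty"

A "status" of "green" or "yellow" in the response means the cluster is up.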

Congratulations, you are now ready for Lab 1!


Lab 1 - Setup the Web Crawler:
1. To get started, navigate to your Strigo lab in your browser. At the command prompt run
the script “get_elastic_password”. This will show the generated password for your Elastic
cluster.

2. Now we will need to change to our “Elastic” tab in Strigo. We might also have to “Reload
this view”…
3. Once the page loads you should see a login prompt. Use “elastic” for the user and paste
the password from the terminal window in the password field.
4. Once logged in you’ll be greeted by a message about adding integrations. We will be
skipping this step as we do not need any integrations for this lab. Click on “Explore on
my own”.

5. On the resulting page, click on “Enterprise Search”


6. In the middle of the next page click “Create an Elasticsearch Index”.
7. On the next screen choose “Use a Web Crawler”.

8. Be sure to name it "elastic-docs", then click "Create Index". Naming it elastic-docs is
important because the code for the next lab will reference this index by name.

9. Near the top of the screen select "Pipelines". Pipelines in this context refer to inference
pipelines, which are different from the ingest pipelines that process data before it is
indexed; inference pipelines become part of ingest pipelines.
10. Click on “Copy and customize” under Ingest Pipelines.
11. Click on “Add Inference Pipeline” in the Machine Learning Inference Pipelines box.

12. Enter "title-vector" for the name. Then select the "Dense Text Embedding" model, which
came preloaded into your cluster. Elastic allows for the import and use of multiple
transformer models for different use cases. At the bottom, click "Continue".
13. On the next screen, enter “title” for the Source Field. Leave the Target Field blank and
then click continue at the bottom. Here we are telling the transformer model which field
we want to apply the vectorization to.
14. Click "Continue" again to skip the optional test of the model, then click "Create Pipeline".

15. Now that the pipeline is created, we need to make an adjustment to the vector
dimensions. On the left-hand menu select "Dev Tools" in the Management section.
16. Paste the code below into the console to tell Elastic that we're going to use 768
dimensions. We could increase this to 2048; however, that would incur additional
resource cost during ingest processing…

POST search-elastic-docs/_mapping
{
  "properties": {
    "title-vector": {
      "type": "dense_vector",
      "dims": 768,
      "index": true,
      "similarity": "dot_product"
    }
  }
}

17. Check for the following response on the right side of the screen…

{
  "acknowledged": true
}
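
As an optional check, you can ask Elastic to echo the field mapping back in the same console; the response should report "type": "dense_vector" with "dims": 768…

GET search-elastic-docs/_mapping/field/title-vector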

Now we need to add an additional pipeline to compare vectorization with Elastic’s ELSER
model.
18. Navigate back to Enterprise Search
19. Click on “Indices” under overview

20. In the list of indices click on "search-elastic-docs". Notice that we entered "elastic-docs"
for the index name earlier; however, we're referencing it by "search-elastic-docs" here.
This is because Enterprise Search prefixes search indices with "search-".

21. Near the top of the next screen click on “Pipelines”...


22. In the inference pipeline section we’ll add another pipeline like we did for vectors. Click
on “Add Inference Pipeline”…

23. On this screen we will enter similar information as before with a few adjustments. Let’s
start with choosing “New Pipeline” and then setting the name to “title-elser”. Under
models we’ll choose “Elser Text Expansion”. Then click “Continue” at the bottom of the
page.
24. On the next screen we'll add a mapping. In the list of source fields select "title", then
click "Add" to the right. Notice that the target field is automatically named. At the bottom
click "Continue".

25. At the bottom click "Continue". We'll skip testing the model for now, so click "Continue"
again.
26. On the review page click “Create Pipeline”.
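
Once the crawl later in this lab has populated the index, you could try ELSER directly from Dev Tools with a text_expansion query. This is only a sketch: the target field below assumes the auto-generated name from the mapping step, and the model id may be different in your cluster…

GET search-elastic-docs/_search
{
  "query": {
    "text_expansion": {
      "ml.inference.title_expanded.predicted_value": {
        "model_id": ".elser_model_1",
        "model_text": "how do I set up a web crawler"
      }
    }
  }
}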

27. Now let's configure the crawler to capture the Elastic documentation.
On the navigation menu to the left, select Enterprise Search -> Overview

28. Under Content click on “Indices”.

29. Under “Available Indices” click on “search-elastic-docs”.


30. Click on the "Manage Domains" tab and enter "https://www.elastic.co/guide/en", then
click "Validate Domain". This checks that the domain we want to index is available and
doesn't have any limitations, such as a robots.txt file.

31. You’ll get a warning about robots.txt. This can be ignored.

32. After the checks complete click “Add Domain”.


33. Then click "Crawl Rules" and add the following rules one at a time. These rules make
sure that we don't index data we don't need or that won't help us in this use case. Rules
can use different match formats and can be ordered to apply specific logic.

Disallow Regex .*

Allow Regex /guide/en/.*/current/.*

Disallow Contains release-notes

The rules should look like this; note the order of the rules…

Note: If you need to reorder the rules click on the “=” sign and drag up or down until correct.
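
If you're curious how the rules combine, below is a rough Python approximation (not the crawler's actual implementation). Rules are checked top to bottom as they appear on screen and the first match decides, which is why the catch-all Disallow .* must sit last; the sample paths are made up for illustration.

import re

# Rough approximation of the crawl rules above; first matching rule wins.
RULES = [
    ("disallow", lambda path: "release-notes" in path),                      # Disallow / Contains
    ("allow", lambda path: re.fullmatch(r"/guide/en/.*/current/.*", path)),  # Allow / Regex
    ("disallow", lambda path: True),                                         # Disallow / Regex .*
]

def allowed(path):
    for policy, matches in RULES:
        if matches(path):
            return policy == "allow"
    return True

# Hypothetical sample paths
print(allowed("/guide/en/elasticsearch/current/search.html"))         # True
print(allowed("/guide/en/elasticsearch/current/release-notes.html"))  # False
print(allowed("/blog/some-post"))                                     # False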

34. Now scroll to the top of the page and click on the blue button titled "Crawl", then select
"Crawl all domains on this index".
The Crawl button will start spinning, and this will take some time to complete.

Lab 1 is complete.
Lab 2 - Setting Up the Web Front End

Connecting to OpenAI
While we wait for the crawler, we’ll set up the frontend for sending queries to Elastic.

1. In a browser navigate to https://platform.openai.com/, where you'll need to sign up for an
OpenAI account. (Don't worry, it's free.) (If you are unable to use your account or create
one, we will try to provide keys for use with this workshop.)

2. Click on your account and then click on "View API keys".

3. Now we’ll need to generate an API key to use for connecting in python. Click on “API
Keys”.

4. Click “Create New Secret”


5. Copy the new key and save it. It will not be displayed again.
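
This key is what the web frontend in the next section uses behind the scenes. If you want to test it directly, here is a minimal sketch using the pre-1.0 openai Python package (newer package versions use a different client interface, and the model name is only an example):

import openai

openai.api_key = "sk-..."  # paste the key you just saved

# Send a single chat message and print the reply
response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",  # example; availability depends on your account
    messages=[{"role": "user", "content": "What is Elasticsearch?"}],
)
print(response["choices"][0]["message"]["content"])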

Launching Streamlit
Now we will use an app called Streamlit to run a web-based frontend to submit our queries to
Elastic and ChatGPT.

1. Navigate back to the “Terminal” view in Strigo.

2. At the command prompt, type “cd src/elasticgpt/” and press enter.


cd src/elasticgpt/

3. To start the frontend application that will give us a webpage to interact with Elastic and
ChatGPT, run the following command…

streamlit run elasticdocs_gpt.py

There may be a couple of warnings; these can be ignored.
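
For context, the basic shape of a Streamlit app is quite small. The sketch below is not the workshop's elasticdocs_gpt.py (which also queries Elasticsearch and calls OpenAI); it only illustrates how Streamlit renders inputs and output:

import streamlit as st

st.title("ElasticDocs GPT")  # hypothetical title

query = st.text_input("Ask a question about the Elastic docs")
if query:
    # The real app would search Elasticsearch here and send the top hit
    # to the OpenAI API; this sketch just echoes the question back.
    st.write(f"You asked: {query}")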

4. Copy the External URL like below…

5. Paste the URL into a new browser tab. Your URL should look similar to this (if you're on
a VPN or corporate network, this port might be blocked)…

6. Press Enter and a page like below should appear. We’ll fill in the information for this
page during the next lab…
Lab 2 is complete.
Lab 3 - Ask Questions
1. Using the page we loaded from Lab 2, enter the API key for OpenAI.
(The other fields under the "Click here to enter Elasticsearch cluster connectivity
information" expansion can be ignored; they are included for those who want to run this
environment outside of this workshop.)

2. Below the inputs you'll see a dropdown box that allows you to select from different
OpenAI models. Feel free to choose whichever you like. Keep in mind that the models
that allow for more tokens will give better answers; however, they also cost more.
(Also note that depending on your account type and status, some models may be
unavailable.)

3. In the prompt within the browser window, enter the question: “What is ELSER?”
The response should look like this…

4. Another question to ask would be: “Generate an elasticsearch query to search for
cows in index cow-logs.” This will respond with a properly formatted query to
search the index for cows.
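
The generated query varies from run to run, but it is typically along these lines (illustrative only; the field name is whatever the model invents for the hypothetical cow-logs index):

GET cow-logs/_search
{
  "query": {
    "match": {
      "message": "cows"
    }
  }
}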
5. Lastly, we can try asking an unrelated question: "How do I build a boat?"
Due to the focused data we're using, ChatGPT is unable to answer.
