Using AI For OSINT - A Comprehensive Guide
By An0n Ali
Table of Contents
01 DATA SCRAPING TOOLS
02 SCRAPING DATA
03 PROMPTS & SCENARIOS
• Additional Prompts
• Cross-referencing data
04 CLOSING REMARKS
WHAT THIS GUIDE COVERS:
© COPYRIGHT NOTICE
INTRODUCING THE TOOLS...
Now that you've seen the video, let’s jump into my favorite
data scraping tools: Apify & PhantomBuster.
Pros:
Cons:
WHY PHANTOMBUSTER?
Pros:
Cons:
Scraping data
APIFY – PUBLIC ACCOUNTS
STEP 2: ACCESS THE APIFY STORE
In the left-hand panel of the Apify console, click on the
"Store" button.
This will take you to the Apify Store, where you can
browse and find various scripts for web scraping. These
are called “Actors” within Apify.
STEP 3: SEARCH FOR SCRAPERS
Concept: I picked this specific actor because it's built
to extract data from an Instagram profile, such as posts,
comments, followers, and more.
STEP 4: CONFIGURE SCRAPING SETTINGS
Input the Profile URL: Enter the URL(s) of the Instagram
profile(s) you want to scrape.
In our scenario, the appropriate option would be
“Scrape Posts” from each page.
Here’s why:
Max Items:
Optional:
1. Newer than: Only scrape posts newer than the date you
enter. This is useful if you’re only interested in recent data.
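If you already have a dataset on disk, the same kind of date cutoff can be applied locally. The snippet below is a minimal sketch, not Apify's own logic: it assumes each post is a dict with a timezone-aware ISO 8601 `timestamp` field, which is a hypothetical field name, so check it against your actual export.

```python
from datetime import datetime, timezone

def posts_newer_than(posts, cutoff):
    """Return the posts whose timestamp is strictly later than cutoff."""
    kept = []
    for post in posts:
        raw = post.get("timestamp")  # hypothetical field name
        if not raw:
            continue  # skip posts that carry no timestamp
        # Older Python versions reject a trailing "Z", so normalize it first.
        when = datetime.fromisoformat(raw.replace("Z", "+00:00"))
        if when > cutoff:
            kept.append(post)
    return kept

# Stand-in data for illustration only.
sample_posts = [
    {"url": "https://example.com/p/1", "timestamp": "2024-01-05T10:00:00Z"},
    {"url": "https://example.com/p/2", "timestamp": "2024-06-20T08:30:00Z"},
]
cutoff = datetime(2024, 3, 1, tzinfo=timezone.utc)
recent = posts_newer_than(sample_posts, cutoff)
```

Here `recent` keeps only the second post, because it is the only one dated after the cutoff.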
hashtags, rather than targeting a specific user profile
directly.
Run Options:
STEP 5: RUN THE ACTOR
Click “Save and Start” to begin scraping. Monitor the
progress as data is being fetched.
STEP 6: VIEW AND DOWNLOAD THE DATA
After the scraping is complete, the actor will display a
"Succeeded" prompt.
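Before handing the download to ChatGPT, it can help to check what the file actually contains. This is a minimal sketch that assumes you exported the dataset as JSON (a list of records). The file name and the sample fields are placeholders, not the actor's real schema.

```python
import json
import tempfile
from collections import Counter
from pathlib import Path

def summarize_export(path):
    """Load a JSON export and count how often each field appears."""
    records = json.loads(Path(path).read_text(encoding="utf-8"))
    field_counts = Counter()
    for record in records:
        field_counts.update(record.keys())
    return len(records), field_counts

# Tiny stand-in file; replace demo_path with the path to your download.
demo_records = [
    {"url": "https://example.com/p/1", "caption": "hi"},
    {"url": "https://example.com/p/2"},
]
with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / "dataset.json"
    demo_path.write_text(json.dumps(demo_records), encoding="utf-8")
    total, fields = summarize_export(demo_path)

print(total, dict(fields))
```

Knowing which fields are present, and how many records lack them, makes it easier to write prompts that refer to columns that really exist.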
STEP 7: ANALYZE THE DATA WITH CHATGPT
Finally, use the scraped data with ChatGPT for specific
analyses.
After you press Enter, ChatGPT will list links to all the
posts where that user commented.
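One way to spot-check the list ChatGPT returns is to run the same lookup locally. The sketch below assumes each post has a `url` and a `comments` list whose entries carry a `username`. Those field names are hypothetical, so adjust them to match your dataset.

```python
def posts_commented_by(posts, username):
    """Return the URLs of posts with at least one comment by username."""
    wanted = username.lower()
    urls = []
    for post in posts:
        for comment in post.get("comments", []):
            if comment.get("username", "").lower() == wanted:
                urls.append(post["url"])
                break  # one matching comment per post is enough
    return urls

# Stand-in data for illustration only.
sample_posts = [
    {"url": "https://example.com/p/1",
     "comments": [{"username": "alice", "text": "Nice!"}]},
    {"url": "https://example.com/p/2",
     "comments": [{"username": "bob", "text": "Cool"}]},
    {"url": "https://example.com/p/3",
     "comments": [{"username": "bob", "text": "Wow"},
                  {"username": "Alice", "text": "Agreed"}]},
]
hits = posts_commented_by(sample_posts, "alice")
```

If the two lists disagree, the local result is the one to trust, since it reads every row of the dataset.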
PRIVATE ACCOUNTS - PHANTOMBUSTER
STEP 1: SIGN UP
Go to www.phantombuster.com and click on “Start free
trial” to create an account.
Once done, open your email inbox and verify your
PhantomBuster account.
STEP 2: CHOOSE AN INSTAGRAM PHANTOM
From the left-hand panel under the Filters section, click
on “Instagram”.
STEP 3: SET UP THE PHANTOM
Click on “Use this Phantom” to add it to your
dashboard.
For simplicity, I’m going with the browser extension
option.
Enter the Profile URL: Next, input the URL of the private
Instagram account you want to scrape.
Launch Settings: Finally, select the Launch Frequency
Settings according to your needs.
2. After another Phantom: This option schedules your
task to run after another Phantom task has
completed. This is useful if you have a sequence of
tasks that need to run in a specific order.
STEP 4: LAUNCH THE PHANTOM
From the next window, click on “Launch” to start the
Phantom…
STEP 5: WE’RE NOT DONE YET!
After the Phantom has finished running, you’ll need to
return to the "Phantom Store" and choose the
"Instagram Post Commenters Export" Phantom.
2. Posts to Process: From the “Posts to Process” window,
click on “My Phantoms” and select the dataset from
the previous Phantom.
STEP 6: DOWNLOAD AND ANALYZE DATA
Once the Phantom has completed its run, you can
download the dataset in CSV format…
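A quick local pass over the CSV can show who comments most often before you move on to prompting. This is a minimal sketch: the `username` column is an assumed name, so check the header row of your export and pass the real one. The in-memory CSV stands in for the downloaded file.

```python
import csv
import io
from collections import Counter

def top_commenters(csv_file, column="username", n=3):
    """Count how many rows each commenter has and return the top n."""
    reader = csv.DictReader(csv_file)
    counts = Counter(row[column] for row in reader if row.get(column))
    return counts.most_common(n)

# Stand-in for the downloaded file; with a real export you would use
# open("your_export.csv", newline="", encoding="utf-8") instead.
demo_csv = io.StringIO(
    "username,comment\n"
    "alice,Nice shot\n"
    "bob,Where is this?\n"
    "alice,Love it\n"
)
result = top_commenters(demo_csv)
```

With the stand-in data, `result` ranks `alice` first with two comments and `bob` second with one.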
Prompts & scenarios
WHAT WE WILL COVER
A Few Things to Keep in Mind:
Before diving into the specific scenarios, here are a
few important points to consider:
EXPLORING INSIGHTS FROM POSTS, COMMENTS & TAGS
1. Through Posts
2. Through Tags/Mentions
3. Through Comments
For example:
Map Out Relationships / Connections
1. Through Posts
2. Through Tags/Mentions
3. Through Comments
Example Prompt:
Identify Behavioral Patterns
1. Through Posts
2. Through Tags/Mentions
• Prompt 2: "I've provided you with a dataset of all
the posts where [username] is tagged. Track the
types of events, places, or activities the user is
frequently tagged in, which can point to trends or
hobbies."
3. Through Comments
ADDITIONAL PROMPTS
4. Sentiment Analysis of Captions: "Perform a sentiment
analysis on the captions of all posts. Provide a summary
of the overall sentiment (positive, negative, neutral)."
For Analysis on Comments
5. Sentiment Analysis: "Analyze the sentiment of the
comments (positive, neutral, negative). Provide a
summary of the overall sentiment on the profile."
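For a rough baseline to compare against ChatGPT's summary, the comments can be labeled with a simple word-list count. This is a toy sketch that swaps in a much cruder technique than a language model: the word lists are illustrative assumptions, and it ignores negation, sarcasm, emoji, and any language other than English.

```python
import re
from collections import Counter

# Tiny illustrative word lists; a real analysis needs a proper lexicon or model.
POSITIVE = {"love", "great", "nice", "amazing", "beautiful"}
NEGATIVE = {"hate", "bad", "ugly", "terrible", "awful"}

def label_comment(text):
    """Label one comment as positive, negative, or neutral by word counts."""
    words = re.findall(r"[a-z']+", text.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

def sentiment_summary(comments):
    """Count how many comments fall under each label."""
    return Counter(label_comment(c) for c in comments)

summary = sentiment_summary(["Love this!", "Terrible lighting", "See you Monday"])
```

A large gap between this baseline and the model's summary is a cue to read a sample of the comments yourself.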
For Analysis on Tags/Mentions
5. Identify Relationships: "Analyze the tags and
mentions to identify key individuals that frequently
appear with this user. Summarize the nature of their
interactions."
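The "frequently appear with this user" part of that prompt comes down to counting co-occurrences, which can be sketched locally. The example assumes each post has a `tagged_users` list. That is a hypothetical field name, and the usernames are made up.

```python
from collections import Counter

def frequent_co_tags(posts, target, n=5):
    """Count accounts tagged in the same posts as target, most frequent first."""
    target = target.lower()
    counts = Counter()
    for post in posts:
        tagged = {u.lower() for u in post.get("tagged_users", [])}
        if target in tagged:
            counts.update(tagged - {target})  # everyone else in that post
    return counts.most_common(n)

# Stand-in data for illustration only.
sample_posts = [
    {"tagged_users": ["target_user", "alice", "bob"]},
    {"tagged_users": ["Target_User", "alice"]},
    {"tagged_users": ["carol"]},
]
pairs = frequent_co_tags(sample_posts, "target_user")
```

The counts only say who appears together. Describing the nature of the interaction is still a job for the captions and comments, which is where the model's summary comes in.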
CROSS-REFERENCE DATA BETWEEN MULTIPLE PROFILES
1. Track Common Connections: "Compare the list of
friends, followers, and connections from [Target's]
Instagram, Facebook, and LinkedIn profiles. Identify
individuals who appear across all three platforms and
analyze the nature of their relationship with [Target].
Highlight any key connections that might reveal close
relationships or professional ties."
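Finding the people who appear on all three lists is a set intersection. Below is a minimal sketch that assumes you have each platform's connections as a plain list of display names. Matching on names alone is a simplifying assumption: different people can share a name, and the same person can use different names on different platforms.

```python
def normalize(name):
    """Lowercase a name and collapse repeated whitespace."""
    return " ".join(name.lower().split())

def common_connections(*connection_lists):
    """Return the names present in every list, sorted alphabetically."""
    sets = [{normalize(n) for n in names} for names in connection_lists]
    return sorted(set.intersection(*sets)) if sets else []

# Stand-in data for illustration only.
instagram = ["Alice Smith", "Bob  Lee"]
facebook = ["alice smith", "Carol Diaz"]
linkedin = ["ALICE SMITH", "Bob Lee"]
shared = common_connections(instagram, facebook, linkedin)
```

Treat each match as a lead to verify, not as a confirmed identity.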
3. Uncover Potential Inconsistencies: "Compare the
employment history, educational background, and
other personal information listed on [Target's] LinkedIn
profile with the information shared on Facebook and
Instagram. Identify any discrepancies or inconsistencies
that may suggest misrepresentation or hidden aspects
of [Target's] life."
6. Identify Consistent Behavioral Patterns: "Cross-
reference the posts, comments, and interactions from
[Target's] Instagram and Facebook profiles. Identify
consistent behavioral patterns, such as recurring
themes, topics, or tone across both platforms, to gain
insights into [Target's] personality and interests."
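Recurring themes can be approximated by comparing the hashtags used on each platform. This is a rough sketch that uses hashtags as a stand-in for "themes", which is an assumption of this example: it misses any topic that is never tagged, and the sample captions are made up.

```python
import re
from collections import Counter

HASHTAG = re.compile(r"#(\w+)")

def hashtag_counts(texts):
    """Count lowercase hashtags across a list of captions or posts."""
    counts = Counter()
    for text in texts:
        counts.update(tag.lower() for tag in HASHTAG.findall(text))
    return counts

def shared_themes(texts_a, texts_b):
    """Return hashtags used on both platforms with the smaller of the two counts."""
    # Counter & Counter keeps keys present in both, with the minimum count.
    return dict(hashtag_counts(texts_a) & hashtag_counts(texts_b))

# Stand-in data for illustration only.
instagram_captions = ["Morning run #running #coffee", "Trail day #Running"]
facebook_posts = ["5k done #running", "Race recap #running", "New cafe #coffee"]
themes = shared_themes(instagram_captions, facebook_posts)
```

Tags that show up on only one platform are dropped, so what remains is the overlap the prompt asks about.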
9. Track Professional and Personal Networks: "Compare
the professional connections on [Target's] LinkedIn
profile with their personal connections on Facebook
and Instagram. Identify individuals who appear in both
networks and analyze whether there are any overlaps
between [Target's] work and personal life."
Closing remarks
BEFORE WE CONCLUDE…
My goal with this guide was to provide you with an
introduction to how emerging Large Language
Models, like ChatGPT, can be used in conjunction
with datasets to reveal valuable insights.
- An0n Ali