0% found this document useful (0 votes)

30 views6 pages

Open Facebook Crawler Based On Python

The document outlines requirements for developing an open Facebook crawler in Python. It specifies details like the crawler needing a GUI to input login credentials and a page URL, crawling posts and comments from the given page, and saving the extracted data to an Excel file with fields like brand, post ID, date, content, reactions.

Uploaded by

Asim Anayat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views6 pages

Open Facebook Crawler Based On Python

Uploaded by

Asim Anayat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

OFFICIAL (CLOSED) \ NON-SENSITIVE

Open Facebook Crawler based on Python

Requirements and Deliverables: To implement and deliver an Open Facebook
Crawler in python that will allow automatic collection of open Facebook posts
and comments based on a given Open Facebook page according to the
specifications 1) Open Facebook Crawler UI and 2) Open Facebook Python
Crawler. The Open Facebook Scraper based on Python will be able to run on
any Windows Notebook.

This will include: Python source code and necessary python libraries installers
and user instructions to set-up and run the crawler on a windows Notebook.

Specifications of the Open Facebook Crawler GUI

 There will be a Graphical User Interface (GUI) where users will be able to
enter their Facebook personal information: User_ID and Password
information and the Open Facebook Page URL for configuring the crawl as
shown in Figure 1.

 A SAVE button to save the information. Make an excel file in same where
the application exist. Save username, password and page url in that file.

 A START button to start the crawler.

 A STOP button to provide hard stop for the crawler.

 Status of the Crawler will be updated: “Scraper is Running or Scraper is not

Running”.

 Status of the Crawl will be updated every 5-10 secs according to the total
number of posts and number of comments crawled.

 Incorrect User_ID or/and Password will invoke a warning prompt to

encourage the user to check and re-enter their personal Facebook
information as shown in Figure 2.

1
OFFICIAL (CLOSED) \ NON-SENSITIVE

 Incorrect open Facebook page URL will also invoke a warning prompt to
encourage the user to check and re-enter the URL as shown in Figure 2.

Figure 1: GUI interface for configuring Open Facebook Crawl

Figure 2: Appearance of Prompts dialog boxes when information is not

correctly entered in the GUI interface for configuring Open Facebook Crawl

Specifications of the Open Facebook Python Crawler

2
OFFICIAL (CLOSED) \ NON-SENSITIVE

 The Open Facebook Python crawler must be able to crawl the following
information found on the Facebook Pages specifically: all Posts and
Comments found on the page.

 The following is a list of items to be extracted and placed in a output excel

file as shown In Figure 3 and Table 1 from the all the posts and comments
found in the Open Facebook Page:

Facebook POSTS: Brand, Post ID, Date, Content, No. Likes, No.
Shares, No. Comments
Facebook Comments: User Names and Comments, Post ID

Table 1: Typical items to be crawled

Figure 3: A typical Open Facebook Post and all the items to be crawled

3
OFFICIAL (CLOSED) \ NON-SENSITIVE

How to get post ID

To get post ID click on date. Post ID will visible in url.

Location of output excel file

4
OFFICIAL (CLOSED) \ NON-SENSITIVE

Output file should be saved in the same folder where application is placed.
Name of output excel file should be the name of crawled Facebook page.
How output file should formated
A sample output file is provided. Follow that
Meaning of open facebook pages
Open pages are those which are visible on facebook and published. If page is
unpublished and cannot be accessed, show error (as discussed above).
Note
 For the company Facebook pages let's say they have 300 posts and
maybe 3000 comments. You must ensure your tool is able to scrape all
the posts and comments as much as possible.
 Scrap data in a way that facebook should not block account due to
scrapping activities.

Final Delivery
As discussed, please follow the specifications, example output file and deliver
the following: 1) souce code of python facebook posts and comments scraper
2) user-guide on installation of code to run on window platform, 3) necessary
python installers libraries. so that we can run it on my end. Thanks

Example lists of Typical Open Facebook Pages for Potential Crawling

The following are some examples of open-Facebook pages
No Open-Facebook Pages

1 https://www.facebook.com/ShopeeSingapore
2 https://www.facebook.com/LazadaSingapore
3 https://www.facebook.com/cnn
4 https://www.facebook.com/ChannelNewsAsia
5 https://www.facebook.com/lovebonito
6 https://www.facebook.com/SonySingapore
7 https://www.facebook.com/GrabFoodSG

5
OFFICIAL (CLOSED) \ NON-SENSITIVE

Mercedes Wis Epc Installation Guide PDF
No ratings yet
Mercedes Wis Epc Installation Guide PDF
3 pages
Programming 2 Lectures
No ratings yet
Programming 2 Lectures
52 pages
Facebook Apps Secrets: Facebook Apps Secret For Businesses and Marketers
From Everand
Facebook Apps Secrets: Facebook Apps Secret For Businesses and Marketers
John Hawkins
No ratings yet
Data Analysis by Web Scraping Using Python
No ratings yet
Data Analysis by Web Scraping Using Python
6 pages
Web Scraping With Python
No ratings yet
Web Scraping With Python
21 pages
Web Crawling - python
No ratings yet
Web Crawling - python
34 pages
PDF Document 2
No ratings yet
PDF Document 2
24 pages
Christos Chen
No ratings yet
Christos Chen
42 pages
Chapter 11. Web Scraping
100% (1)
Chapter 11. Web Scraping
57 pages
Fun With Python
100% (5)
Fun With Python
113 pages
3252_ids_10
No ratings yet
3252_ids_10
5 pages
Web Scraping Report
No ratings yet
Web Scraping Report
14 pages
Api and data structure
No ratings yet
Api and data structure
3 pages
b
No ratings yet
b
77 pages
UI Ex 6 (61)-1
No ratings yet
UI Ex 6 (61)-1
3 pages
Unit 11 Application Development Using Python
No ratings yet
Unit 11 Application Development Using Python
19 pages
Web Scraping and Data Collection CheatSheet 1731972399
No ratings yet
Web Scraping and Data Collection CheatSheet 1731972399
10 pages
Web Scraping
No ratings yet
Web Scraping
28 pages
60004210188_RajSingh_WIexp4
No ratings yet
60004210188_RajSingh_WIexp4
7 pages
Data Engineering Concepts #2 - Sending Data Using An API - by Bar Dadon - Dev Genius
No ratings yet
Data Engineering Concepts #2 - Sending Data Using An API - by Bar Dadon - Dev Genius
14 pages
Development Web Scrapping
No ratings yet
Development Web Scrapping
14 pages
Software Engineering Project
No ratings yet
Software Engineering Project
55 pages
I) Web Crawling: Yash Pahlani D17B 49
No ratings yet
I) Web Crawling: Yash Pahlani D17B 49
7 pages
Instagram Automation Tool: Project Report of Major Project Bachelor of Technology
No ratings yet
Instagram Automation Tool: Project Report of Major Project Bachelor of Technology
15 pages
Upload PDF
No ratings yet
Upload PDF
11 pages
Web Scrapping
100% (1)
Web Scrapping
20 pages
Template
No ratings yet
Template
21 pages
Web Scrapper From Scratch
No ratings yet
Web Scrapper From Scratch
25 pages
Web Scraping With Python and Selenium: Sarah Fatima, Shaik Luqmaan Nuha Abdul Rasheed
No ratings yet
Web Scraping With Python and Selenium: Sarah Fatima, Shaik Luqmaan Nuha Abdul Rasheed
5 pages
Instagram Automation Tool: Project Description
100% (1)
Instagram Automation Tool: Project Description
10 pages
Web Scraping With Python Tutorials From A To Z
100% (2)
Web Scraping With Python Tutorials From A To Z
35 pages
DAP_4_module
No ratings yet
DAP_4_module
45 pages
Strip HTML Tags Using Python
No ratings yet
Strip HTML Tags Using Python
8 pages
PYTHON UNIT-4
No ratings yet
PYTHON UNIT-4
10 pages
Data - Collection Python
No ratings yet
Data - Collection Python
40 pages
web scraping using python
No ratings yet
web scraping using python
18 pages
Web Scraping Using Python: A Step by Step Guide: September 2019
No ratings yet
Web Scraping Using Python: A Step by Step Guide: September 2019
7 pages
Icrawler
No ratings yet
Icrawler
35 pages
basic_scraping_techniques
No ratings yet
basic_scraping_techniques
7 pages
Practical Web Scraping for Economists 1744341390
No ratings yet
Practical Web Scraping for Economists 1744341390
33 pages
Web-Scraping-With-Python
No ratings yet
Web-Scraping-With-Python
16 pages
Python Units 4 Notes
No ratings yet
Python Units 4 Notes
11 pages
Image Scrapper From Scratch To Proudction
No ratings yet
Image Scrapper From Scratch To Proudction
22 pages
4a82c633-5051-45ef-a932-6a6495641a0e_4F_IntroToWebScraping
No ratings yet
4a82c633-5051-45ef-a932-6a6495641a0e_4F_IntroToWebScraping
6 pages
Facebook Marketing Secrets
From Everand
Facebook Marketing Secrets
Anthony Ekanem
No ratings yet
Web Scraper Mini Project
No ratings yet
Web Scraper Mini Project
13 pages
Introduction to Web Crawling chapter -13
No ratings yet
Introduction to Web Crawling chapter -13
3 pages
SOCIAL NETWORKING
No ratings yet
SOCIAL NETWORKING
21 pages
06 WebScrapingData
No ratings yet
06 WebScrapingData
39 pages
Efficient Python Tricks and Tools For Data Scientists
100% (1)
Efficient Python Tricks and Tools For Data Scientists
23 pages
Web Scraping Job Portals: Ashutosh Kumar, Kinshuk Chauhan, Jaspreet Kaur Grewal
No ratings yet
Web Scraping Job Portals: Ashutosh Kumar, Kinshuk Chauhan, Jaspreet Kaur Grewal
13 pages
Web Scraping for SEO with Python
From Everand
Web Scraping for SEO with Python
Enrique Vicente
No ratings yet
20+ Real-World Java and Python Projects To Expand Your Dev Portfolio
100% (1)
20+ Real-World Java and Python Projects To Expand Your Dev Portfolio
25 pages
Scraping Book Python PDF
No ratings yet
Scraping Book Python PDF
50 pages
Scraping Book
No ratings yet
Scraping Book
50 pages
Web Scraping Using Python: A Step by Step Guide: September 2019
No ratings yet
Web Scraping Using Python: A Step by Step Guide: September 2019
7 pages
Web Scraping Using Python: A Step by Step Guide: September 2019
0% (1)
Web Scraping Using Python: A Step by Step Guide: September 2019
7 pages
Lab7 - Python Assisted Exploitation
No ratings yet
Lab7 - Python Assisted Exploitation
11 pages
Getting Started with SharePoint Framework (SPFx): Design and Build Engaging Intelligent Applications Using SharePoint Framework
From Everand
Getting Started with SharePoint Framework (SPFx): Design and Build Engaging Intelligent Applications Using SharePoint Framework
Vipul Jain
No ratings yet
Broad Crawls - Scrapy 2.12.0 Documentation
No ratings yet
Broad Crawls - Scrapy 2.12.0 Documentation
3 pages
Pdfsearch em Ingles
No ratings yet
Pdfsearch em Ingles
29 pages
Ebo Vision Thin Client Solution MaintenanceManual
No ratings yet
Ebo Vision Thin Client Solution MaintenanceManual
1 page
Linux Installation - Guide
No ratings yet
Linux Installation - Guide
9 pages
(Web Design / Web Technology / Web Engineering) : Follow Us On Facebook Join Our Telegram Channel Join Discussion Board
No ratings yet
(Web Design / Web Technology / Web Engineering) : Follow Us On Facebook Join Our Telegram Channel Join Discussion Board
31 pages
Mysql Connection Hijacking Over Rfi: - Important of The Mysql - Close
No ratings yet
Mysql Connection Hijacking Over Rfi: - Important of The Mysql - Close
3 pages
Oracle BI 11.1.1.6.1 and ADF Integration
No ratings yet
Oracle BI 11.1.1.6.1 and ADF Integration
17 pages
XSS and HTML Code Injection
No ratings yet
XSS and HTML Code Injection
23 pages
Cheat Sim House Party
No ratings yet
Cheat Sim House Party
2 pages
Window Control Buttons
No ratings yet
Window Control Buttons
3 pages
CRM Projects Using PHP and MySQL
No ratings yet
CRM Projects Using PHP and MySQL
8 pages
Digital Portfolio Structure
No ratings yet
Digital Portfolio Structure
1 page
Guide To AnimoSpace
No ratings yet
Guide To AnimoSpace
12 pages
Case Studeis: Netflix, Youtube, and Kankan: By: Qais Sheikh USN:4MW19CS072
No ratings yet
Case Studeis: Netflix, Youtube, and Kankan: By: Qais Sheikh USN:4MW19CS072
19 pages
How To Install Canon CanoScan LiDE 100 Scanner in Ubuntu Linux Mint 20
No ratings yet
How To Install Canon CanoScan LiDE 100 Scanner in Ubuntu Linux Mint 20
2 pages
MEDITAB: Job Description: Interesting Facts About Meditab Group of Companies: (Brief Introduction)
No ratings yet
MEDITAB: Job Description: Interesting Facts About Meditab Group of Companies: (Brief Introduction)
2 pages
Bhaskar PDF
No ratings yet
Bhaskar PDF
8 pages
Fastfood E-Order System
No ratings yet
Fastfood E-Order System
22 pages
Unit Test Script: Test Scenario No. Cost Center
100% (1)
Unit Test Script: Test Scenario No. Cost Center
4 pages
HL Upgrade Instructions
No ratings yet
HL Upgrade Instructions
5 pages
Advanced View Design - HTML Tag Helper Part 3
No ratings yet
Advanced View Design - HTML Tag Helper Part 3
19 pages
Dxdiag
No ratings yet
Dxdiag
26 pages
Lesson 1 - Introduction To IT
No ratings yet
Lesson 1 - Introduction To IT
24 pages
Sass and Compass Designer's Cookbook - Sample Chapter
No ratings yet
Sass and Compass Designer's Cookbook - Sample Chapter
41 pages
Nishant Shukla Resume
No ratings yet
Nishant Shukla Resume
1 page
WP Hostbridge Soap and Rest 090303
No ratings yet
WP Hostbridge Soap and Rest 090303
12 pages
2 Mark of HTML
No ratings yet
2 Mark of HTML
2 pages
Glamorous Documentation
No ratings yet
Glamorous Documentation
23 pages
Solutions: Answers Are Marked in Bold and Underline
No ratings yet
Solutions: Answers Are Marked in Bold and Underline
4 pages
++(*Watch*)Jobz Hunting Sajal Malik Viral Video
No ratings yet
++(*Watch*)Jobz Hunting Sajal Malik Viral Video
4 pages
The Chasm in Netflix: Figure 1 Shows Netflix's Net Income From 2000 To 2019 (In Million USD)
No ratings yet
The Chasm in Netflix: Figure 1 Shows Netflix's Net Income From 2000 To 2019 (In Million USD)
3 pages

Open Facebook Crawler Based On Python

Uploaded by

Open Facebook Crawler Based On Python

Uploaded by

OFFICIAL (CLOSED) \ NON-SENSITIVE

Open Facebook Crawler based on Python

Specifications of the Open Facebook Crawler GUI

 A START button to start the crawler.

 A STOP button to provide hard stop for the crawler.

 Status of the Crawler will be updated: “Scraper is Running or Scraper is not

 Incorrect User_ID or/and Password will invoke a warning prompt to

Figure 1: GUI interface for configuring Open Facebook Crawl

Figure 2: Appearance of Prompts dialog boxes when information is not

Specifications of the Open Facebook Python Crawler

 The following is a list of items to be extracted and placed in a output excel

Table 1: Typical items to be crawled

How to get post ID

Location of output excel file

Example lists of Typical Open Facebook Pages for Potential Crawling

You might also like