Part 4: Implementing The Solution in Python

The document describes implementing a word counting algorithm in Python. Students are asked to write a "search" function that takes a keyword as input, uses a provided Thesaurus to find synonyms, and counts occurrences of the keyword and synonyms in a provided Corpus, returning a list of tuples with words and counts. Hints are provided on iterating over lists, counting occurrences, and debugging.

Uploaded by

Huỳnh Đỗ Tấn Thành

Part 4: Implementing the Solution in Python

Summary
In this final part of the course project, you will implement the solution in Python.

Description
Throughout this course project, we have been applying computational thinking to solve the
problem of counting the number of occurrences of a word and its synonyms in a corpus of text
documents.

In Part 1, we applied the pillars of computational thinking to decompose this problem into two
smaller problems, then used pattern recognition to find commonalities between the problems,
used data representation and abstraction to identify the data we needed, and developed an
algorithm.

In Part 2, we represented that algorithm using a flowchart, and in Part 3 we developed the
pseudocode for that algorithm as follows:

All_words ← [Keyword]
For each Entry in Thesaurus
    If Keyword = Entry.Word
    Then
        For each Word in Entry.Synonyms
            Add Word to All_words
        Stop

For each Search_word in All_words
    Count ← 0
    For each Document in Corpus
        For each Word in Document
            If Search_word = Word
            Then Count ← Count + 1
    Output: Search_word, Count

Now, to finish the computational thinking problem solving process, implement this algorithm in
Python in the space below by completing the “search” function.

As you can see, the parameter to the function is the “keyword” for which to search. Your
program can also access two other variables that you will need to complete the program:

● Thesaurus, which is a list of Entry objects; each Entry object has a word attribute,
which is a string of characters; and an attribute called synonyms, which is a list of
strings
● Corpus, which is a list of lists of strings

Computational Thinking For Problem Solving | Assignment 4.7 | Property of Penn Engineering

For example, the Entry class and Thesaurus variable may be defined like this:

class Entry:
    def __init__(self, input_word, input_synonyms):
        self.word = input_word
        self.synonyms = input_synonyms

e1 = Entry("dog", ["doggie", "puppy"])
e2 = Entry("cat", ["kitty"])

Thesaurus = [e1, e2]

Note that the first argument to the Entry constructor is the word in the thesaurus, and the
second is the list of words that are its synonyms. All words consist only of lowercase letters.

And the Corpus variable may be defined like this:

doc1 = ["this", "is", "a", "single", "document"]
doc2 = ["here", "is", "another", "document"]

Corpus = [doc1, doc2]

Each document is represented as a list of words, which are all lowercase letters, and the
Corpus is a list containing those lists.
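
To make the list-of-lists structure concrete, here is a minimal sketch that rebuilds the example Corpus above and walks through it. The helper name total_words is hypothetical and is not part of the assignment; the data is the example data, not the Corpus used for grading.

```python
# Test data copied from the example above (not the graded Corpus).
doc1 = ["this", "is", "a", "single", "document"]
doc2 = ["here", "is", "another", "document"]
Corpus = [doc1, doc2]

print(len(Corpus))    # 2 documents
print(Corpus[0])      # the first document, itself a list of strings
print(Corpus[1][2])   # third word of the second document: another

def total_words():
    """Hypothetical helper: count every word in every document."""
    total = 0
    for document in Corpus:   # outer loop: one list per document
        for word in document: # inner loop: one string per word
            total = total + 1
    return total

print(total_words())  # 9
```

The same outer and inner loops appear in the second half of the pseudocode; only the action inside the inner loop changes.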

You can access the Thesaurus and Corpus variables in your program without having to define
them; they have already been defined and initialized with words and phrases describing a
person’s emotions -- happy, sad, angry, etc. -- in the setup of this activity. For your own testing
purposes, you can create your own Thesaurus and Corpus and populate them with your own
data, but please do so outside the function definition, and keep in mind that the correctness of
your function will be determined using the Thesaurus and Corpus that we have provided.

Your “search” function should implement the algorithm described by the pseudocode above by
using the Thesaurus to find the keyword’s synonyms, and then reporting the number of
occurrences of the keyword and its synonyms in all documents in the Corpus.

The output of your function should be a list of tuples, in which the first element of the tuple is the
word that was searched for (either the keyword or one of its synonyms) and the second is its
total number of occurrences.

For instance, if the keyword was “cat” and it occurred 120 times, and its synonym was “kitty” and
it occurred 84 times, the output should be:

[("cat", 120), ("kitty", 84)]

Hint: You can create a tuple variable like this:

result = ("cat", 120)

And then add it to a list using the list’s “append” function.
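
As a small illustration of this hint, the sketch below builds the example output one tuple at a time. The variable name results is an arbitrary choice, and the counts are simply the numbers from the example above.

```python
results = []                   # start with an empty list

result = ("cat", 120)          # a tuple pairing a word with its count
results.append(result)         # add the tuple to the end of the list
results.append(("kitty", 84))  # a tuple can also be built inline

print(results)                 # [('cat', 120), ('kitty', 84)]

# Each tuple can be unpacked back into its two parts.
word, count = results[0]
print(word, count)             # cat 120
```

Note that Python prints strings inside a list with single quotes; this is the same value as the output shown with double quotes above.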

As in Parts 2 and 3, you may assume that:

● The Thesaurus is not empty, i.e., there is at least one Entry in the Thesaurus.
● The Corpus is not empty, i.e., there is at least one document (list of strings) in
the Corpus.
● Each document contains at least one string.

You can also assume that all words consist only of lowercase letters and that you don’t have to
account for punctuation.

def search(keyword):

    # implement the function here

    return  # modify to return a list of tuples

input = "happy"
output = search(input)  # invoke the method using a test input
print(output)           # prints the output of the function
                        # do not remove this line!

Your program will be evaluated using the inputs “happy” and “sad” for the Thesaurus and
Corpus we have provided, along with other test inputs as well. Keep in mind that you can print
out the Thesaurus and Corpus objects to see what is in them and to get an idea of whether your
program is producing the correct output for those inputs.
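
One possible way to look inside these variables is sketched below. It uses stand-in data built from the earlier examples, since the provided Thesaurus and Corpus are only available inside the activity; the helper name thesaurus_contents is hypothetical. It also assumes the Entry class is defined as in the example, in which case printing the Thesaurus directly shows only object descriptions rather than words.

```python
class Entry:
    def __init__(self, input_word, input_synonyms):
        self.word = input_word
        self.synonyms = input_synonyms

# Stand-in data for illustration; in the activity these already exist.
Thesaurus = [Entry("dog", ["doggie", "puppy"]), Entry("cat", ["kitty"])]
Corpus = [["this", "is", "a", "single", "document"],
          ["here", "is", "another", "document"]]

# With the Entry class above, this prints object descriptions, not words.
print(Thesaurus)

def thesaurus_contents():
    """Hypothetical helper: pair each word with its list of synonyms."""
    contents = []
    for entry in Thesaurus:
        contents.append((entry.word, entry.synonyms))
    return contents

for word, synonyms in thesaurus_contents():
    print(word, synonyms)

print(len(Corpus), "documents")
print(Corpus[0][:10])  # at most the first ten words of the first document
```

Printing only a slice of each document keeps the output readable if the provided Corpus turns out to be large.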

Hints
● Recall that using the computational thinking pillar of decomposition, we broke this
problem into two small problems: finding the synonyms and counting the number of
occurrences for each word. Rather than writing the entire function all at once, start on
the first part (finding the synonyms) and make sure that works before moving on to the
second.
● Review previous activities and lessons for examples of iterating over lists, looking for
individual values, and updating variables to keep count of something.
● Part of the challenge of this activity is knowing in advance whether your program is
generating the correct output. As mentioned above, you can create your own Thesaurus
and Corpus based on the examples described above, and use these to determine
whether your program is correctly finding the synonyms and correctly counting the
number of occurrences; be sure to do this outside the function definition. Keep in mind,
though, that your function will be graded using the Thesaurus and Corpus that we have
provided.
● If you’d like to see the Thesaurus and Corpus that we have provided, don’t forget that
these are variables just like any others, and you can print out their values.
● As in previous activities, don’t forget that you can use the “print” function to print out
intermediate values as your code is performing operations so that you can see what is
happening, and you can use the “Run” button to run the program and see those outputs.
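
The decomposition described in these hints could be sketched as follows. This is one possible arrangement, not the official solution: the helper names find_all_words and count_occurrences are hypothetical, the Thesaurus and Corpus are small test data rather than the provided ones, and reading the pseudocode's "Stop" as a break out of the thesaurus loop is an assumption.

```python
class Entry:
    def __init__(self, input_word, input_synonyms):
        self.word = input_word
        self.synonyms = input_synonyms

# Small test data, defined outside the function as the hints suggest.
Thesaurus = [Entry("dog", ["doggie", "puppy"]), Entry("cat", ["kitty"])]
Corpus = [["my", "dog", "is", "a", "puppy"],
          ["the", "cat", "saw", "the", "dog"]]

def find_all_words(keyword):
    """First sub-problem: the keyword followed by its synonyms, if any."""
    all_words = [keyword]
    for entry in Thesaurus:
        if entry.word == keyword:
            for word in entry.synonyms:
                all_words.append(word)
            break  # assumed meaning of "Stop": no need to keep looking
    return all_words

def count_occurrences(search_word):
    """Second sub-problem: count one word across all documents."""
    count = 0
    for document in Corpus:
        for word in document:
            if word == search_word:
                count = count + 1
    return count

def search(keyword):
    results = []
    for search_word in find_all_words(keyword):
        results.append((search_word, count_occurrences(search_word)))
    return results

print(find_all_words("dog"))  # ['dog', 'doggie', 'puppy']
print(search("dog"))          # [('dog', 2), ('doggie', 0), ('puppy', 1)]
print(search("bird"))         # [('bird', 0)]
```

Checking find_all_words on its own first, then count_occurrences, mirrors the advice to get the first part working before moving on to the second. A keyword that is missing from the Thesaurus is still counted by itself, as in the pseudocode.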

Common Errors
You may run into Python syntax or runtime errors while developing your solution. Here are
some of the common ones:
● If you encounter a TypeError or AttributeError related to the Entry class, be sure you are
correctly using its attributes, based on the example above. Note that you should not
need to create any new Entry objects unless you are creating a test Thesaurus.
● Likewise, if you encounter a NameError or AttributeError related to the Thesaurus or
Corpus variables, check the examples above and make sure you understand their
structure. Note that the Corpus is a list of lists, and not a list of “documents,” i.e. there is
no “document” class in this program.
● Last, if you encounter an IndentationError, this means that your code is not properly
indented. Keep in mind that all the code you write as part of the solution should be within
the body of the “search” function and indented by at least one level, using tabs or spaces
consistently rather than a mix of both; you may have additional code outside the search
function, e.g. creating a test Thesaurus or Corpus, but the code inside the function must
be indented.
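
To show how the first two kinds of error typically arise, the sketch below deliberately triggers each one on test data and reports the error's name. The helper error_name and the specific mistakes are illustrative assumptions, not taken from the activity.

```python
class Entry:
    def __init__(self, input_word, input_synonyms):
        self.word = input_word
        self.synonyms = input_synonyms

# Test data for illustration only.
Thesaurus = [Entry("dog", ["doggie", "puppy"])]
Corpus = [["my", "dog", "is", "a", "puppy"]]

def error_name(action):
    """Hypothetical helper: run action and name the error it raises."""
    try:
        action()
    except Exception as exc:
        return type(exc).__name__
    return "no error"

# A document is a plain list, so it has no attribute called "words".
print(error_name(lambda: Corpus[0].words))       # AttributeError

# An Entry is an object, not a dictionary; use entry.word instead.
print(error_name(lambda: Thesaurus[0]["word"]))  # TypeError

# Names are case-sensitive: "thesaurus" is not the same as "Thesaurus".
print(error_name(lambda: thesaurus))             # NameError

# The correct forms raise nothing.
print(error_name(lambda: Thesaurus[0].word))     # no error
```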

After you have run your program using the “Run” button and believe that it is producing the
correct output, accept the terms of the Coursera Honor Code and then click the “Submit Quiz”
button below to submit this assignment and have it graded.

A few seconds after you submit the quiz, you will see the result at the top of the screen. If it
reads “Congratulations! You passed!” then you are done!

However, if it reads “Try again once you are ready.” then this means that the automatic grading
program indicated that your program is not correct. In that case, click the “Retake” button to go
back to the quiz and try again.

If you believe that your code was correct, be sure to click the “Run” button before submitting and
inspect the result of the “print” statement that is right before the “return” statement. This allows
you to see the output that your code is producing before it is evaluated. If it looks right, but the
automatic grading utility is still indicating that it is incorrect, then post a message on the
discussion board to ask for assistance.
