THE UNIVERSITY OF WESTERN AUSTRALIA

SCHOOL OF COMPUTER SCIENCE & SOFTWARE ENGINEERING

CITS3242: Programming Paradigms

Lab sheet 2: Indexing a text document.

In this lab, you will write a program that creates an index for a text document. The index will list
every word that occurs in the document, together with the number of every line on which the word
appears. The words in the index will be sorted alphabetically, the line numbers listed for each
word will be in ascending order, and there will be no duplicated entries.
This lab uses some features that we have not seen in lectures yet: polymorphism and type
abbreviations. The code that uses them is already in the starting point, however, so you do not
need to know how to write such code, just how to make use of the provided definitions, which
should be easy.

Make sure that you have the following files, which are available via the unit web page:
    Lab2index.fs    contains type declarations and some trivial functions
    inp             contains an example text file
    inp.index       contains the index produced for inp

Compare inp and inp.index to make sure that you understand the problem specification – put inp in
an appropriate folder somewhere to use for testing once your program is complete.
Create a new project, add the file Lab2index.fs to it, then add the functions below. The types for
each function include type abbreviations – these are in the starting point, and you should use these
types to guide you in writing your program.

1. Define a function
numberLines : fileContents -> (line * lineNumber) list

that takes the contents of a file and returns a list containing the lines from the file, each paired
with its line number. E.g.,
numberLines "a zz\r\nb yy\r\ncx d\r\n\r\nd zz d"
// yields: [ ("a zz",1); ("b yy",2); ("cx d",3); ("",4); ("d zz d",5) ]
(Hint: use lines and zip.)

[Note: "\r\n" is the end-of-line sequence under Windows. Outside of test cases like this one,
however, it is generally better to use Environment.NewLine, so that the code also works on
other operating systems.]
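
As a rough illustration of task 1, here is a minimal sketch rather than the expected solution. The type abbreviations and the `lines` helper below are assumed stand-ins for whatever Lab2index.fs actually provides, and `List.zip` from the F# core library is used in place of the `zip` mentioned in the hint.

```fsharp
// Assumed stand-ins for the type abbreviations in Lab2index.fs.
type fileContents = string
type line = string
type lineNumber = int

// Hypothetical version of the starter helper `lines`:
// splits on the Windows end-of-line sequence and keeps empty lines.
let lines (contents: fileContents) : line list =
    contents.Split([| "\r\n" |], System.StringSplitOptions.None)
    |> List.ofArray

// Pair every line with its 1-based line number.
let numberLines (contents: fileContents) : (line * lineNumber) list =
    let ls = lines contents
    List.zip ls [ 1 .. List.length ls ]

// The example from the lab sheet: the empty fourth line is kept.
printfn "%A" (numberLines "a zz\r\nb yy\r\ncx d\r\n\r\nd zz d")
```

Keeping empty lines matters here: dropping them would shift the numbers of all later lines.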


2. Use numberLines to define a function
numberWords : fileContents -> (wordStr * lineNumber) list

that takes the contents of a file and returns a list containing the words of the file, each paired
with its line number. E.g.,
numberWords "a zz\r\nb yy\r\ncx d\r\n\r\nd zz d\r\n"
// yields: [("a",1); ("zz",1); ("b",2); ("yy",2);
//          ("cx",3); ("d",3); ("d",5); ("zz",5); ("d",5)]
(Hint: use words and a list comprehension.)
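
One possible shape for task 2 is sketched below. It is only an illustration: the type abbreviations, `numberLines` and `words` are written out as assumed stand-ins so that the block runs on its own, and the real definitions in Lab2index.fs may differ (for example in which characters separate words).

```fsharp
// Assumed stand-ins for the type abbreviations in Lab2index.fs.
type fileContents = string
type wordStr = string
type lineNumber = int

// Hypothetical numberLines: split on "\r\n" and number from 1.
let numberLines (contents: fileContents) : (string * lineNumber) list =
    contents.Split([| "\r\n" |], System.StringSplitOptions.None)
    |> List.ofArray
    |> List.mapi (fun i l -> (l, i + 1))

// Hypothetical version of the starter helper `words`:
// here a word is any maximal run of non-space characters.
let words (l: string) : wordStr list =
    l.Split([| ' ' |], System.StringSplitOptions.RemoveEmptyEntries)
    |> List.ofArray

// A nested list comprehension: every word of every line,
// paired with the number of the line it came from.
let numberWords (contents: fileContents) : (wordStr * lineNumber) list =
    [ for (l, n) in numberLines contents do
        for w in words l do
            yield (w, n) ]

printfn "%A" (numberWords "a zz\r\nb yy\r\ncx d\r\n\r\nd zz d\r\n")
```

Empty lines contribute no pairs, which is why line 4 does not appear in the result.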

3. Use loseAdjacentDuplicates (see below) to define a function
distinctfsts : (wordStr * lineNumber) list -> wordStr list

that takes a list of pairs xys and returns the list of distinct first fields in xys. xys is assumed
to be sorted. E.g.,
distinctfsts [("a",1); ("b",2); ("cx",3); ("d",3); ("d",5);
              ("d",5); ("yy",2); ("zz",1); ("zz",5)]
// yields: ["a"; "b"; "cx"; "d"; "yy"; "zz"]
(Hint: use a list comprehension.)

Note – you should use the following function from the starting point code:
loseAdjacentDuplicates : 'a list -> 'a list

Here the 'a is a type variable, and means that the function can be used for every type obtained
by replacing 'a by some type. This function will remove duplicates from a list, as long as the
duplicates are grouped together – which they will be if the list is sorted.
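
To make the behaviour described above concrete, the following sketch gives one plausible definition of loseAdjacentDuplicates together with a distinctfsts built on top of it. The body of loseAdjacentDuplicates is an assumption for illustration only; the version supplied in the starting point may be written differently. The type abbreviations are likewise assumed.

```fsharp
// Assumed stand-ins for the type abbreviations in Lab2index.fs.
type wordStr = string
type lineNumber = int

// Hypothetical stand-in for the starter function: drops an element
// whenever it is equal to the element immediately after it.
// Inferred type: 'a list -> 'a list (for any 'a supporting equality).
let rec loseAdjacentDuplicates xs =
    match xs with
    | x :: (y :: _ as rest) when x = y -> loseAdjacentDuplicates rest
    | x :: rest -> x :: loseAdjacentDuplicates rest
    | [] -> []

// Project out the first fields with a list comprehension, then
// collapse runs of equal words (the input is assumed to be sorted).
let distinctfsts (xys: (wordStr * lineNumber) list) : wordStr list =
    loseAdjacentDuplicates [ for (x, _) in xys -> x ]

// The same polymorphic function works on ints and on strings.
printfn "%A" (loseAdjacentDuplicates [ 1; 1; 2; 3; 3; 3 ])
printfn "%A" (distinctfsts [ ("a", 1); ("d", 3); ("d", 5); ("zz", 1) ])
```

Note that on an unsorted list such as [1; 2; 1] nothing is removed, because the two 1s are not adjacent.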

4. Use loseAdjacentDuplicates again to define a function
snds : wordStr -> (wordStr*lineNumber) list -> lineNumber list

that takes a value x and a list of pairs xys and returns the list of distinct second fields
associated with x in xys. x is assumed to occur in xys, and xys is assumed to be sorted. E.g.,
snds "d" [("a",1); ("b",2); ("cx",3); ("d",3); ("d",5);
          ("d",5); ("yy",2); ("zz",1); ("zz",5)]
// yields: [3; 5]
(Hint: use a list comprehension.)
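
A minimal sketch of task 4 follows, under the same assumptions as before: the type abbreviations and the body of loseAdjacentDuplicates are hypothetical stand-ins for the definitions in the starting point.

```fsharp
// Assumed stand-ins for the type abbreviations in Lab2index.fs.
type wordStr = string
type lineNumber = int

// Hypothetical stand-in for the starter function.
let rec loseAdjacentDuplicates xs =
    match xs with
    | x :: (y :: _ as rest) when x = y -> loseAdjacentDuplicates rest
    | x :: rest -> x :: loseAdjacentDuplicates rest
    | [] -> []

// Keep the line numbers of the pairs whose word equals x (a filtering
// list comprehension), then collapse adjacent repeats.
let snds (x: wordStr) (xys: (wordStr * lineNumber) list) : lineNumber list =
    loseAdjacentDuplicates [ for (w, n) in xys do if w = x then yield n ]

printfn "%A" (snds "d" [ ("a", 1); ("d", 3); ("d", 5); ("d", 5); ("zz", 1) ])
```

Because the pairs are sorted, equal line numbers for the same word sit next to each other, so removing adjacent duplicates is enough.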


5. Use distinctfsts and snds to define a function
combineWords : (wordStr*lineNumber) list -> index

that takes a list of words paired with individual line numbers wis and returns an index, i.e. a
list of words each paired with a list of line numbers. wis is assumed to be sorted. E.g.,
combineWords [("a",1); ("b",2); ("cx",3); ("d",3); ("d",5);
              ("yy",2); ("zz",1); ("zz",5)]
// yields: [("a",[1]); ("b",[2]); ("cx",[3]); ("d",[3; 5]); ("yy",[2]); ("zz",[1; 5])]
(Hint: use a list comprehension.)
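
Task 5 can be sketched as a single comprehension over the distinct words. As in the earlier sketches, everything other than combineWords itself is an assumed stand-in: the type abbreviations (including the shape of `index`), loseAdjacentDuplicates, distinctfsts and snds are written out only so that the block is self-contained.

```fsharp
// Assumed stand-ins for the type abbreviations in Lab2index.fs.
type wordStr = string
type lineNumber = int
type index = (wordStr * lineNumber list) list

// Hypothetical stand-in for the starter function.
let rec loseAdjacentDuplicates xs =
    match xs with
    | x :: (y :: _ as rest) when x = y -> loseAdjacentDuplicates rest
    | x :: rest -> x :: loseAdjacentDuplicates rest
    | [] -> []

// Possible answers to tasks 3 and 4, repeated for self-containment.
let distinctfsts (xys: (wordStr * lineNumber) list) : wordStr list =
    loseAdjacentDuplicates [ for (x, _) in xys -> x ]

let snds (x: wordStr) (xys: (wordStr * lineNumber) list) : lineNumber list =
    loseAdjacentDuplicates [ for (w, n) in xys do if w = x then yield n ]

// One index entry per distinct word: the word and all of its line numbers.
let combineWords (wis: (wordStr * lineNumber) list) : index =
    [ for w in distinctfsts wis -> (w, snds w wis) ]

printfn "%A" (combineWords [ ("a", 1); ("d", 3); ("d", 5); ("zz", 1); ("zz", 5) ])
```

This rescans the whole list once per distinct word, which is simple but quadratic in the worst case; that is acceptable for a lab-sized input.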

6. Use numberWords, combineWords and sort to define a function
makeIndex : fileContents -> index

that takes the contents of a file and returns an index for the file. See inp and inp.index for
examples.
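
Putting the pieces together, makeIndex can be sketched as a three-stage pipeline: number the words, sort the pairs, combine them. The block below is a self-contained illustration under stated assumptions, not the reference solution: all helpers and type abbreviations are hypothetical stand-ins for the starting point, and `List.sort` from the F# core library is used where the lab sheet refers to `sort`.

```fsharp
// Assumed stand-ins for the type abbreviations in Lab2index.fs.
type fileContents = string
type wordStr = string
type lineNumber = int
type index = (wordStr * lineNumber list) list

// Hypothetical helpers, condensed from the earlier tasks.
let rec loseAdjacentDuplicates xs =
    match xs with
    | x :: (y :: _ as rest) when x = y -> loseAdjacentDuplicates rest
    | x :: rest -> x :: loseAdjacentDuplicates rest
    | [] -> []

let numberWords (contents: fileContents) : (wordStr * lineNumber) list =
    let ls = contents.Split([| "\r\n" |], System.StringSplitOptions.None)
    [ for i in 0 .. ls.Length - 1 do
        for w in ls.[i].Split([| ' ' |], System.StringSplitOptions.RemoveEmptyEntries) do
            yield (w, i + 1) ]

let combineWords (wis: (wordStr * lineNumber) list) : index =
    [ for w in loseAdjacentDuplicates [ for (x, _) in wis -> x ] ->
        (w, loseAdjacentDuplicates [ for (v, n) in wis do if v = w then yield n ]) ]

// Sorting the pairs orders them by word and then by line number,
// which is exactly the precondition that combineWords relies on.
let makeIndex (contents: fileContents) : index =
    contents |> numberWords |> List.sort |> combineWords

printfn "%A" (makeIndex "a zz\r\nb yy\r\ncx d\r\n\r\nd zz d\r\n")
```

Sorting before combining is what guarantees the three properties asked for in the introduction: alphabetical words, ascending line numbers, and no duplicated entries.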

7. Test main by giving it the full path to a copy of inp, and check the index that it writes to
inp.index.
