
QT 1 - Group 5 - R Assignment 1

Uploaded by

Gagan Hasija


Quantitative Techniques 1

Increasing COVID-19 testing capability through pooled testing

R Assignment

Group 5

B24061 Aman Sharma
B24071 Gagan Hasija
B24081 Manu Agarwal
B24092 Rishabh Saboo
B24102 Soham Parija
BL24004 Anjaney Srinivas

Task 1

We use the function “expected_tests_first_twenty_pool_size” to find the best pool size.
The total sample is N = 10000, and the number of people in a group (pool) is denoted by n.
Total number of groups (NG) = floor(N/n)
Samples that do not fit into a complete group have to be tested individually.
Total number of individual samples (N_INDV) = N - NG*n
P(an individual tests positive) = p = 0.01
P(an individual tests negative) = 1-p = 0.99
Therefore, P(a group tests negative) = (1-p)^n
P(a group tests positive) = 1-(1-p)^n
A group needs 1 test if the pooled test is negative and n+1 tests if it is positive (the pooled test plus n individual retests).
Taking the expectation over these two outcomes gives the expected number of tests in a group:

E(number of tests in a group) = 1*(1-p)^n + (n+1)*(1-(1-p)^n)

Multiplying this expectation by the number of groups and adding the individually tested samples gives the total expected number of tests:
E(total number of tests) = E(number of tests in a group)*NG + N_INDV
We then use a loop to compute the expected number of tests for pool sizes from 1 to 20 and store them in a vector. The function “best_pool_size_row_pool” returns the pool size with the lowest expected number of tests, i.e. the largest saving in tests.
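
The calculation above can be sketched in R as follows. This is a minimal illustration and not the group's submitted code: the function and variable names are hypothetical, and treating n = 1 as plain individual testing (N tests) is an assumption made so that a pool of one is not charged an extra pooled test.

```r
# Illustrative helper (hypothetical name): expected total tests for pool size n.
expected_tests_row_pool <- function(n, p = 0.01, N = 10000) {
  if (n == 1) return(N)                    # assumption: n = 1 is individual testing
  n_groups <- N %/% n                      # NG = floor(N / n)
  n_indiv  <- N - n_groups * n             # N_INDV: leftover samples tested one by one
  q <- (1 - p)^n                           # P(a group tests negative)
  per_group <- 1 * q + (n + 1) * (1 - q)   # E(number of tests in a group)
  per_group * n_groups + n_indiv           # E(total number of tests)
}

pool_sizes <- 1:20
expected <- sapply(pool_sizes, expected_tests_row_pool)

# Pool size with the smallest expected number of tests
best_pool_size <- pool_sizes[which.min(expected)]

print(data.frame(pool_size = pool_sizes, expected_tests = round(expected, 3)))
print(best_pool_size)
```

Using `which.min` on the stored vector keeps the search separate from the per-pool-size formula, so the same helper can be reused with a different range or prevalence rate.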

Result:
Pool size (n)    Expected number of tests
1                10000.000
2                5199.000
3                3630.980
4                2894.040
5                2490.100
6                2254.964
7                2111.075
8                2022.553
9                1976.741
10               1956.179
11               1956.513
12               1972.697
13               1996.422
14               2030.017
15               2074.017
16               2110.422
17               2161.940
18               2218.208
19               2269.271
20               2320.931

The same data is shown in the attached graph of expected tests against pool size.

For p = 0.01, the optimal pool size is 10, with an expected 1956.179 tests.

Task 2

This task extends Task 1: instead of picking the best option from the pool sizes 1 to 20, we find the optimal value of n for a given prevalence rate p.

The function “optimal_pool_size_row_pool” iterates over all possible pool sizes and returns the one that minimizes the expected number of tests.

It uses a while loop that runs over n from 2 to 10000. Whenever the expected test count for the current n is lower than the stored minimum, we update the stored minimum and set the optimal pool size to the current n.
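
The search described above might look like the following in R. This is a sketch under stated assumptions, not the group's actual function: the names are hypothetical, and the starting baseline of N tests (everyone tested individually, n = 1) is an assumption.

```r
# Illustrative helper (hypothetical name): expected total tests for pool size n >= 2.
expected_tests_row_pool <- function(n, p, N = 10000) {
  n_groups <- N %/% n                      # NG = floor(N / n)
  n_indiv  <- N - n_groups * n             # N_INDV: leftover samples
  q <- (1 - p)^n                           # P(a group tests negative)
  (1 * q + (n + 1) * (1 - q)) * n_groups + n_indiv
}

# Running-minimum search over all pool sizes from 2 to N.
optimal_row_pool_size <- function(p, N = 10000) {
  best_n <- 1
  best_tests <- N                          # assumption: baseline is individual testing
  n <- 2
  while (n <= N) {
    tests <- expected_tests_row_pool(n, p, N)
    if (tests < best_tests) {              # strictly better: update minimum and pool size
      best_tests <- tests
      best_n <- n
    }
    n <- n + 1
  }
  best_n
}

prevalence_rates <- c(0.02, 0.05, 0.10)
print(sapply(prevalence_rates, optimal_row_pool_size))
```

For p = 0.02, 0.05 and 0.10 this sketch returns pool sizes 8, 5 and 4.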

Results:

● For p = 0.02, the optimal pool size is 8.
● For p = 0.05, the optimal pool size is 5.
● For p = 0.10, the optimal pool size is 4.

Task 3
CALCULATING THE CUT-OFF VALUE OF ‘p’ FOR ANY GIVEN VALUE OF ‘n’

Let p be the prevalence rate, i.e. the probability that an individual has the disease, and let n be the size of the group.

X = number of tests needed for the group

P(group result negative) = (1-p)^n

Number of tests when the group result is negative = 1

P(group has at least one positive) = 1-(1-p)^n

Number of tests when the group has at least one positive = n+1

Therefore, the expected number of tests is E = 1*(1-p)^n + (n+1)*(1-(1-p)^n)

Pooling saves nothing once the expected number of tests for a group is greater than or equal to n, the number of tests needed to test every member individually. The cut-off value of p is the smallest p at which this happens.

This condition is E >= n:

=> 1*(1-p)^n + (n+1)*(1-(1-p)^n) >= n

=> n+1-n*(1-p)^n >= n

=> n+1-n*(1-p)^n-n >= n-n

=> 1-n*(1-p)^n >= 0

=> 1 >= n*(1-p)^n

=> 1/n >= (1-p)^n (dividing by n is valid because n >= 1)

=> (1/n)^(1/n) >= 1-p

=> p >= 1-(1/n)^(1/n)

Therefore, the cut-off value of p = 1-(1/n)^(1/n)

We use this formula to find the cut-off value of p for a given value of n.
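
As a quick numerical check of the cut-off formula, the following R sketch evaluates it for the pool sizes found in Task 2 and confirms that the expected number of tests per group equals n at the cut-off. The function names are hypothetical and chosen only for this illustration.

```r
# Cut-off prevalence for pool size n: p = 1 - (1/n)^(1/n)
cutoff_prevalence <- function(n) {
  1 - (1 / n)^(1 / n)
}

# Expected number of tests for one group of size n at prevalence p
expected_tests_per_group <- function(n, p) {
  q <- (1 - p)^n
  1 * q + (n + 1) * (1 - q)
}

pool_sizes <- c(8, 5, 4)
cutoffs <- sapply(pool_sizes, cutoff_prevalence)

# At the cut-off, (1-p)^n = 1/n, so the expected tests per group should equal n
tests_at_cutoff <- mapply(expected_tests_per_group, pool_sizes, cutoffs)

print(data.frame(n = pool_sizes,
                 cutoff_p = round(cutoffs, 7),
                 tests_at_cutoff = tests_at_cutoff))
```

Below the cut-off the expected tests per group fall under n, so pooling saves tests; above it, individual testing is cheaper.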

Pool size (n)    Prevalence rate (Task 2)    Cut-off value of p
8                0.02                        0.2288946
5                0.05                        0.2752203
4                0.10                        0.2928932

Task 4

We take n*n samples from the N samples at a time and arrange them in an n x n square. Each such square of n*n samples is referred to as a group.
Here N = 10000 and the prevalence rate of the disease is p = 0.01.
Number of groups (NG) = floor(N/n^2)
Number of individual tests (N_INDV) = N - NG*n^2, the samples that are not part of any group
All samples in a row are tested together, and all samples in a column are tested together. If both the row and the column of a sample test positive, the sample at their intersection is tested individually.
FOR A GROUP:
P(row negative) = (1-p)^n
P(column negative) = (1-p)^n
P(both row and column negative) = (1-p)^(2*n-1), because a row and a column share one sample and together cover 2*n-1 distinct samples
P(row or column negative) = 2*(1-p)^n - (1-p)^(2*n-1)
P(row and column positive) = 1 - P(row or column negative)
Number of row and column tests = 2*n
E(number of intersection tests) = n*n*P(row and column positive)
E(number of tests per group) = 2*n + n*n*P(row and column positive)
FOR THE WHOLE SAMPLE:
E(number of tests) = E(number of tests per group)*NG + N_INDV

We loop over n from 1 to 30 and compute the expected number of tests for each value. The value of n that gives the minimum is taken as the best pool size.
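
The row-and-column scheme can be sketched in R as below. This is an illustration rather than the group's submitted code: the names are hypothetical, and treating n = 1 as plain individual testing (N tests) is an assumption.

```r
# Illustrative helper (hypothetical name): expected total tests for an n x n square.
expected_tests_cross_pool <- function(n, p = 0.01, N = 10000) {
  if (n == 1) return(N)                        # assumption: n = 1 is individual testing
  n_groups <- N %/% (n * n)                    # NG = floor(N / n^2)
  n_indiv  <- N - n_groups * n * n             # N_INDV: samples outside any square
  q <- 1 - p
  p_row_or_col_negative <- 2 * q^n - q^(2 * n - 1)   # inclusion-exclusion
  p_both_positive <- 1 - p_row_or_col_negative
  per_group <- 2 * n + n * n * p_both_positive # row/column tests + intersection retests
  per_group * n_groups + n_indiv
}

square_sizes <- 1:30
expected <- sapply(square_sizes, expected_tests_cross_pool)
best_square_size <- square_sizes[which.min(expected)]

print(data.frame(square_size = square_sizes, expected_tests = round(expected, 3)))
print(best_square_size)
```

The expected totals are not smooth in n because the number of leftover samples, N - NG*n^2, jumps around as n changes; square sizes that divide N evenly, such as 20 and 25, leave no leftovers.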

Results
Square size (n)    Expected number of tests
1                  10000
2                  10100.99
3                  6770.91
4                  5108.733
5                  4115.371
6                  3475.433
7                  2993.85
8                  2657.457
9                  2409.498
10                 2174.045
11                 2071.028
12                 1927.111
13                 1790.133
14                 1680.411
15                 1687.848
16                 1557.408
17                 1642.901
18                 1694.564
19                 1640.729
20                 1399.152
21                 1637.501
22                 1643.745
23                 1772.168
24                 1534.84
25                 1354.745
26                 1821.143
27                 1815.904
28                 1884.139
29                 2030.509
30                 1485.499

The same data is shown in the attached graph.

Running the R code, the best option for the pool size is n = 25 (a 25 x 25 square), with an expected 1354.745 tests.

Task 5

● The goal is to find the optimal square size for a given prevalence rate p.
● The function “optimal_pool_size_cross_pool” iterates over the possible square sizes and returns the one that minimizes the expected number of tests.
● It builds on the previous task. The only change is the search range for n: instead of 1 to 30, we search from 1 to 100, the floor of the square root of N.
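
The points above can be sketched in R as follows. The names are hypothetical and this is not the group's submitted function; as in Task 4, treating n = 1 as individual testing is an assumption.

```r
# Illustrative helper (hypothetical name): expected total tests for an n x n square.
expected_tests_cross_pool <- function(n, p, N = 10000) {
  if (n == 1) return(N)                        # assumption: n = 1 is individual testing
  n_groups <- N %/% (n * n)                    # NG = floor(N / n^2)
  n_indiv  <- N - n_groups * n * n             # samples outside any square
  q <- 1 - p
  p_both_positive <- 1 - (2 * q^n - q^(2 * n - 1))
  (2 * n + n * n * p_both_positive) * n_groups + n_indiv
}

# Search every square size from 1 to floor(sqrt(N)).
optimal_cross_pool_size <- function(p, N = 10000) {
  square_sizes <- 1:floor(sqrt(N))
  expected <- sapply(square_sizes, expected_tests_cross_pool, p = p, N = N)
  square_sizes[which.min(expected)]
}

prevalence_rates <- c(0.02, 0.05, 0.10)
print(sapply(prevalence_rates, optimal_cross_pool_size))
```

The upper bound floor(sqrt(N)) is the largest n for which at least one complete n x n square fits into the N samples.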

Results:

● For p = 0.02, the optimal square size is 16.

● For p = 0.05, the optimal square size is 10.
● For p = 0.10, the optimal square size is 7.

Task 6
From the given data, the Acbott test was conducted on 2000 individuals, and the results are summarized in the following table.

                                 Positive                     Negative
Actual total                     1000                         1000
Correctly predicted              990                          950
Probability of correct result    p1 = 0.99 (true positive)    p3 = 0.95 (true negative)
Probability of wrong result      p2 = 0.01 (false negative)   p4 = 0.05 (false positive)

p_pos = P(an individual tests positive) = p1*p + p4*(1-p), where p is the prevalence rate of the disease in the population (assumed to be 0.01). For p = 0.01 this gives p_pos = 0.99*0.01 + 0.05*0.99 = 0.0594.

We assume that whenever a pool tests positive, all the samples in that pool are retested individually.

With this information, we approach the problem in the same way as Task 1.
E(number of tests per group) = 1*(1-p_pos)^n + (n+1)*(1-(1-p_pos)^n)

Number of groups (NG) = floor(N/n)

Number of individual tests (N_INDV) = N - NG*n

E(total number of tests) = E(number of tests per group)*NG + N_INDV

After computing the expected number of tests for one value of n in a function (‘acbott_test_optimization’), we run a loop in another function (‘expected_test_for_first_twenty_values_acbott_test1’), which stores the expected values for all n from 1 to 20. A third function (‘best_pool_size_acbott_test1’) returns the best pool size among these twenty values.

For the general optimization of pool size, we keep track of the minimum expected value found so far. We iterate over all possible values of n, and whenever the expected value is smaller than the current minimum, we update both the minimum and the optimal pool size. This happens in another function (“best_pool_size_acbott_test2”).
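
A compact R sketch of this approach is shown below. It is an illustration under stated assumptions, not the group's submitted functions: all names are hypothetical, the argument names `sensitivity` and `false_positive_rate` stand for p1 and p4, and treating n = 1 as individual testing is an assumption.

```r
# P(an individual tests positive) = p1 * p + p4 * (1 - p)
p_test_positive <- function(p, sensitivity = 0.99, false_positive_rate = 0.05) {
  sensitivity * p + false_positive_rate * (1 - p)
}

# Expected total tests for pool size n, using p_pos in place of p (as in Task 1).
expected_tests_imperfect <- function(n, p = 0.01, N = 10000) {
  if (n == 1) return(N)                    # assumption: n = 1 is individual testing
  p_pos <- p_test_positive(p)
  n_groups <- N %/% n                      # NG = floor(N / n)
  n_indiv  <- N - n_groups * n             # N_INDV: leftover samples
  q <- (1 - p_pos)^n                       # modelled P(a pool tests negative)
  (1 * q + (n + 1) * (1 - q)) * n_groups + n_indiv
}

# First twenty pool sizes, then the best among them
pool_sizes <- 1:20
expected <- sapply(pool_sizes, expected_tests_imperfect)
best_pool_size <- pool_sizes[which.min(expected)]

print(p_test_positive(0.01))
print(round(expected, 3))
print(best_pool_size)
```

Replacing `1:20` with `1:N` in the same pattern gives the general search over all pool sizes.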
