0% found this document useful (0 votes)

31 views29 pages

Today: Random Testing Again

The document discusses random testing and provides advice for testing the YAFFS filesystem code. It introduces random testing and its advantages and disadvantages. It recommends testing core YAFFS functions like open, write, read, etc. on the /ram2k partition. Students should submit a tester program that runs YAFFS and reports success or failure, along with test cases for their own discovered bugs. Various tools for testing like CUTE and CIL are mentioned but warned to have issues. The document aims to help students get started on the YAFFS testing project.

Uploaded by

Ankur Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views29 pages

Today: Random Testing Again

Uploaded by

Ankur Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 29

Today

Random testing again

Some background (Hamlet) Why not always use random testing?

More YAFFS & project Grill Alex! Maybe Ive forgotten something important CUTE: concolic testing
1

Random Testing
Random testing is, of course, the most used and least useful method

Original slang meaning of random to mean wrong or disorganized and useless

We mean random in the mathematical sense Take a stream of pseudo-random numbers and map them into test operations/cases

Random Testing
Hamlet talks about one advantage of random testing (that often doesnt really appear):

With random testing and an operational profile giving usage patterns for the program, with probabilities

random testing can establish statistically meaningful estimates of program reliability

In program testing, with systematic methods we know what we are doing, but not what it means; only by giving up all systematization can the significance of testing be known. - Hamlet, Random Testing

Operational Profiles & Reliability

Can make statements like:

Its 99% certain that P will fail no more than 1 in 1,000,000 times. Its 95% certain that P has a mean-time-tofailure greater than 100 hours of operation. Real statistics!

Sadly, usable operational profiles with probabilities attached are very rare

And the numbers mean nothing if the profile is something you make up
4

Random Testing
Hamlet also notes that random testing is a good baseline for other methods to compare to

Keeps us honest If systematic is no better, then it may not be a very good approach Whats good about 80% (no loop) path coverage?

If, on the other hand, a comparison with random testing as the standard were available, it might help us to write better standards, or to improve the significance of systematic methods. - Hamlet, Random Testing

Hamlets Claims
Two cases when only random testing will do (Hamlet, Workshop on Random Testing 06)

Well, maybe not only random testing

Cases where systematic testing is meaningless (no plan has a rational basis) Cases where systematic testing is too difficult to carry out Hamlet emphasizes the dangers of adding systematic choice without justification: confusing what software should do with what it does do
6

Hamlets Claims
Danger of ignoring a test case because

Oh come on, it couldnt possibly fail to handle that correctly or Nobody would ever do that

Compare to game theory: cases where if we really know something about opponents play we can take advantage

But, lacking that, random strategy may be inefficient but is the only strategy that cannot be gamed if opponent knows what were up to This is not to imply that programs we test
are adversaries, out to get us but its sometimes useful to act as if they are
7

...

Sidebar: Proving an Assumption

> cbmc discharge.c file discharge.c: Parsing Converting Checking discharge Generating GOTO Program Pointer Analysis Adding Pointer Checks Starting Bounded Model Checking Unwinding loop 1iteration 1 Unwinding loop 1iteration 1 ... size of program expression: 111 assignments removed 11 assignments Generated 111 claims, 1remaining Passing problem to MiniSAT Passing to decision procedure Running MiniSAT Solving with MiniSAT 1 1 1variables, 1 1 1 clauses 11 111 SAT checker: negated claim is UNSATISFIABLE, i.e., holds VERIFICATION SUCCESSFUL

int fs_read (int fd, char* buf, size_t nbytes) { if (buf == NULL) { errno = EINVAL; return -1 ; } if (!in_table(fd)) { errno = EBADF; return -1 ; } assert(1 ); ... } int main () { int i; int fd = nondet_int(); int nbytes = nondet_size_t(); havoc(file_system_state); file_system_state_old = file_system_state; ... int res = fs_read (fd, NULL, nbytes); assert(file_system_state = old_file_system_state); }

Problems with Random Testing

Why not use random testing for everything? Oracle problem: figuring out if a random test is successful is often much harder than with a systematic test

Sometimes we cant do differential testing

Problems with Random Testing

Why not use random testing for everything? Generation problem: how do we make a random input?

What, exactly, is a random C program? Is a random C program going to fit any sane (but unknown) operational profile? Are these the bugs we care about most? For some programs, producing wellformed input that makes for interesting tests is fundamentally hard
10

Problems with Random Testing

Why not use random testing for everything? Even with feedback, produces lots of redundant or uninteresting operations Not good at testing boundary conditions where the boundaries are drawn from a large range

If the program only breaks when x = 2^31 dont expect to find that randomly

Problems with Random Testing

Why not use random testing for everything? Related problem: not good when an error depends on an unlikely relationship between inputs

Program only fails when x + y = MAXINT?

Good luck finding that if you dont bake it into the random tester explicitly. . .

Now, Lets Talk About YAFFS

Project due date: May 13 What to submit: Test report

Document, preferably a pdf More on how to submit in a second Submit as .c (or .h I guess) file, where the name is original_yaffs_name.login.bug#.c And two test cases (more on this too)

Tester

Two buggy versions of YAFFS

Submitting the Tester

Give me a tarball I want to be able to go to a YAFFS install
cd direct tar xvf login.tester.tar make clean; make ./directtest2k

And see it run Use whatever language you see fit, so long as that holds true Admittedly, if I cant make head or tail of your tester (say its in FORTRAN or unlambda), grading it fairly will be harder
14

Tester Output
If YAFFS passes the test, your tester should terminate with error code 0 and print (on standard output) the string:

TEST SUCCESSFULLY COMPLETED

If YAFFS fails terminate with code 2 and print (again, on stdout):

TEST FAILED

See my (very) stupid tester on the website

Tester Output
Ill count it as a case where you find a bug in YAFFS if the program hangs: Hasnt terminated by the time limit of 60 minutes Is not producing any new output

Tester Output
Sanity check Im going to make sure none of your testers say TEST FAILED or hang when run with the original YAFFS So let me know if you have found a YAFFS bug

Tester Output
If you want to use a script to have your directtest2k run another tool on YAFFS, and then parse the output to produce that result, thats ok with me Its worth some points, but not strictly required, that your tool also be able to produce a test case when a test fails something more specific than run the tester
18

Tester Output
Bonus if you include delta-debugging tools for your test case format C programs (or python scripts) are very nice test cases, and easily deltadebuggable Document your test case format and why you chose it in the test report

Test Cases for Your Own Bugs

Again, any format you like, so long as I can replay and see exactly what to do with YAFFS to produce the bug One-minimal test cases are worth more credit Oh, I forgot to mention your bugs should be ones that my stupid tester cant find Shouldnt be hard, I give some examples for you to look at
20

Test Case Output

Use the same output

TEST SUCCESSFULLY COMPLETED TEST FAILED

vs.

About the Time Limit

I know our hardware will vary If you want, send me early versions of your tester and Ill try to run them on my machine and let you know

If it works How long it takes to run

What to Test
You must test these functions:

yaffs_StartUp yaffs_mount yaffs_unmount yaffs_open yaffs_write yaffs_read yaffs_close yaffs_mkdir yaffs_rmdir yaffs_unlink

What to Test
Can use other functions to figure out whats going on with YAFFS Might make it easier to find some bugs But only use these basics in the test cases you submit for your bugs make sure the bug can be exposed using only the core operations! For open, need to test these options:

O_TRUNC, O_APPEND, _O_RDONLY, O_WRONLY, O_EXCL, O_CREAT, O_RDWR

How to Test
Perform all tests on /ram2k Use my replacement yaffscfg2k.c On the website

Tools for Testing

Might want to look at CUTE and SPLAT (links on the web page)

Warning: academic software, dont expect it to work (Im having difficulties right now)

CIL is a very useful tool if your testing ambitions involve instrumenting the code somehow (http://hal.cs.berkeley.edu/cil)

E.g., want to compute path coverage? Instrument every branch with a bit vector insertion
26

Tools for Testing

Lots of other tools out there for testing

Look around you might find something useful that will save you a lot of work Warning, again: the academic software is often not-quite-ready-for-prime-time

Questions?

Some advice
Take a look at my tester Take a look at one of the bugs Figure out why my tester cant find it

Notes - CompTIA A+ (220-801) PDF
0% (2)
Notes - CompTIA A+ (220-801) PDF
29 pages
Course ID 51803 RiO Application Operation Competency
75% (4)
Course ID 51803 RiO Application Operation Competency
9 pages
SAP Finance
100% (1)
SAP Finance
94 pages
Automatix Art of RPA
50% (2)
Automatix Art of RPA
25 pages
TOSCA Automation
67% (3)
TOSCA Automation
3 pages
C-Programming-Class 9
No ratings yet
C-Programming-Class 9
47 pages
Azure DevOps
100% (1)
Azure DevOps
2 pages
Cloud Foundry
0% (1)
Cloud Foundry
30 pages
ST - Module 5
No ratings yet
ST - Module 5
24 pages
Software Testing Seminar: Mooly Sagiv Tel Aviv University 640-6706 Sunday 16-18 Monday 10-12 Schrieber 317
100% (2)
Software Testing Seminar: Mooly Sagiv Tel Aviv University 640-6706 Sunday 16-18 Monday 10-12 Schrieber 317
39 pages
Google Cloud Essential
100% (1)
Google Cloud Essential
19 pages
G and T Awareness PDF
100% (1)
G and T Awareness PDF
5 pages
Fresco Play 4
No ratings yet
Fresco Play 4
22 pages
RND Testing
No ratings yet
RND Testing
49 pages
Microsoft Teams Developer
100% (6)
Microsoft Teams Developer
3 pages
Fresco Play Training 2
No ratings yet
Fresco Play Training 2
12 pages
WorkFusion RPA Express
No ratings yet
WorkFusion RPA Express
10 pages
Credit Card
100% (1)
Credit Card
26 pages
Project
No ratings yet
Project
22 pages
Ericsson RBS Series
100% (1)
Ericsson RBS Series
2 pages
Data Mining Nostos
100% (1)
Data Mining Nostos
39 pages
D3.2 Part1 Guidelines Dependability Hazard Analysis
No ratings yet
D3.2 Part1 Guidelines Dependability Hazard Analysis
340 pages
Stats With Python
No ratings yet
Stats With Python
4 pages
An Enlightenment To Machine Learning
100% (1)
An Enlightenment To Machine Learning
16 pages
Implementing Design Thinking
100% (1)
Implementing Design Thinking
9 pages
EMC SAN Intro
100% (1)
EMC SAN Intro
40 pages
Testing 1
No ratings yet
Testing 1
132 pages
Microsoft Office Interview Questions and Answers PDF
100% (1)
Microsoft Office Interview Questions and Answers PDF
15 pages
Microsoft Teams Developer Final Assesment
100% (1)
Microsoft Teams Developer Final Assesment
2 pages
Mobile App Security Quiz
100% (1)
Mobile App Security Quiz
2 pages
Auto Test
No ratings yet
Auto Test
80 pages
1.white Box Testing
No ratings yet
1.white Box Testing
72 pages
CG Programs
No ratings yet
CG Programs
72 pages
Software Testing: Dr. R. Mall
No ratings yet
Software Testing: Dr. R. Mall
87 pages
Lect 15
No ratings yet
Lect 15
41 pages
Reg Testing
No ratings yet
Reg Testing
41 pages
C Interview Questions: Abdul Kalam
No ratings yet
C Interview Questions: Abdul Kalam
69 pages
08 Testing
No ratings yet
08 Testing
58 pages
1 Introduction
No ratings yet
1 Introduction
55 pages
CP 05 Testing
No ratings yet
CP 05 Testing
59 pages
Unittest Python
No ratings yet
Unittest Python
26 pages
Chapter 3
No ratings yet
Chapter 3
31 pages
Symbolic Exec
No ratings yet
Symbolic Exec
38 pages
14 Debugging
No ratings yet
14 Debugging
27 pages
Installation Document - SPPL - BLR
No ratings yet
Installation Document - SPPL - BLR
29 pages
Introduction To Simio
No ratings yet
Introduction To Simio
98 pages
Fuzzing and Patch Analysis - SAGEly Advice
No ratings yet
Fuzzing and Patch Analysis - SAGEly Advice
61 pages
W09C Full
No ratings yet
W09C Full
19 pages
STM PPT 1 Revision
No ratings yet
STM PPT 1 Revision
15 pages
Random Testing
No ratings yet
Random Testing
25 pages
Software Verification and Validation
No ratings yet
Software Verification and Validation
60 pages
FIDES GroundSlab e
No ratings yet
FIDES GroundSlab e
37 pages
19 Testing
No ratings yet
19 Testing
87 pages
EE485 DebuggingTechniques
No ratings yet
EE485 DebuggingTechniques
24 pages
A Novel Three-Factor Authentication Protocol For Wireless Sensor Networks With IoT Notion
No ratings yet
A Novel Three-Factor Authentication Protocol For Wireless Sensor Networks With IoT Notion
10 pages
1 Simbound Tabletto Case
No ratings yet
1 Simbound Tabletto Case
2 pages
PYQ With Solution-3
No ratings yet
PYQ With Solution-3
14 pages
Introduction To Interactive Content
No ratings yet
Introduction To Interactive Content
8 pages
RabbitMQ Training Daywise
No ratings yet
RabbitMQ Training Daywise
6 pages
Lecture 3 Test Planning
No ratings yet
Lecture 3 Test Planning
32 pages
Lecture 02 - Data Communication Networks 2022.01.06
No ratings yet
Lecture 02 - Data Communication Networks 2022.01.06
32 pages
Public Space Acupuncture PDF
No ratings yet
Public Space Acupuncture PDF
54 pages
Pythonlearn 04 Functions
No ratings yet
Pythonlearn 04 Functions
25 pages
Testing
No ratings yet
Testing
7 pages
Polytechnic University of The Philippines Paranaque Campus Bachelor of Science in Computer Engineering
No ratings yet
Polytechnic University of The Philippines Paranaque Campus Bachelor of Science in Computer Engineering
18 pages
Random Testing For C and C++ Compilers With Yarpgen: Vsevolod Livinskii, Dmitry Babokin, John Regehr
No ratings yet
Random Testing For C and C++ Compilers With Yarpgen: Vsevolod Livinskii, Dmitry Babokin, John Regehr
25 pages
Testing C Code
No ratings yet
Testing C Code
22 pages
Web Crawler
No ratings yet
Web Crawler
28 pages
Transport Layer Numerical
No ratings yet
Transport Layer Numerical
29 pages
Software Testing Lect 2-3
No ratings yet
Software Testing Lect 2-3
34 pages
Project 3 Approach
No ratings yet
Project 3 Approach
5 pages
Systematic Software Testing
No ratings yet
Systematic Software Testing
17 pages
Project 2: University Course & Result Management System: Feature List & Score SL# Feature Score
No ratings yet
Project 2: University Course & Result Management System: Feature List & Score SL# Feature Score
14 pages
272: Software Engineering Fall 2012: Instructor: Tevfik Bultan
No ratings yet
272: Software Engineering Fall 2012: Instructor: Tevfik Bultan
55 pages
RRL
No ratings yet
RRL
24 pages
Testing and Debugging: Chapter Goals
No ratings yet
Testing and Debugging: Chapter Goals
28 pages
Lecture 1 Introduction
No ratings yet
Lecture 1 Introduction
31 pages
CTest
No ratings yet
CTest
35 pages
CP4P Testing Activity Instructions
No ratings yet
CP4P Testing Activity Instructions
4 pages
Testing2: - White Box Testing - Black Box Testing - Testing OO Programs
No ratings yet
Testing2: - White Box Testing - Black Box Testing - Testing OO Programs
22 pages
White Box Testing: Len Schroath September 18, 2007
No ratings yet
White Box Testing: Len Schroath September 18, 2007
32 pages
Testing and Design Styles
No ratings yet
Testing and Design Styles
3 pages
Lab+3 Control Flow - Testing
No ratings yet
Lab+3 Control Flow - Testing
35 pages
Hybrid Apps Introduction
No ratings yet
Hybrid Apps Introduction
21 pages
STQA Experiment 5 8 PDF
No ratings yet
STQA Experiment 5 8 PDF
14 pages
Digital Malware Analysis
No ratings yet
Digital Malware Analysis
2 pages
Testing
No ratings yet
Testing
63 pages
ATS 12-13 Security Testing 1
No ratings yet
ATS 12-13 Security Testing 1
37 pages
Exp-3 STM
No ratings yet
Exp-3 STM
28 pages
Exploring Quantum Computing Use Cases For Manufacturing - IBM
No ratings yet
Exploring Quantum Computing Use Cases For Manufacturing - IBM
8 pages
EMC2
No ratings yet
EMC2
37 pages
Module01 LimitsAndObjectivesOfTesting
No ratings yet
Module01 LimitsAndObjectivesOfTesting
37 pages
Introduction To Software Testing Graph Coverage For Source Code
No ratings yet
Introduction To Software Testing Graph Coverage For Source Code
25 pages
Controlflow Testing
No ratings yet
Controlflow Testing
48 pages
Whitebox Testing: CS 420 - Spring 2007
No ratings yet
Whitebox Testing: CS 420 - Spring 2007
52 pages
Lecture 2 DBMS
No ratings yet
Lecture 2 DBMS
20 pages
Chapter5 Testing The Software With Blinders On
No ratings yet
Chapter5 Testing The Software With Blinders On
42 pages
Eight Axioms That Apply To All Validation Testing
No ratings yet
Eight Axioms That Apply To All Validation Testing
24 pages
RMB Consulting: A C' Test: The 0x10 Best Questions For Would-Be Embedded Programmers
No ratings yet
RMB Consulting: A C' Test: The 0x10 Best Questions For Would-Be Embedded Programmers
9 pages
Web Services Security 1
No ratings yet
Web Services Security 1
1 page
ABBYY FlexiCapture
No ratings yet
ABBYY FlexiCapture
7 pages
Java Innards
No ratings yet
Java Innards
3 pages
User Research Methods Q A
No ratings yet
User Research Methods Q A
3 pages
Regression Analysis Q A
No ratings yet
Regression Analysis Q A
2 pages
Ansible Automation Coral
No ratings yet
Ansible Automation Coral
2 pages
Difference Between SAP Memory and ABAP Memory: Answers 1
No ratings yet
Difference Between SAP Memory and ABAP Memory: Answers 1
2 pages
2 Years Software Testing Resume Software Testing
No ratings yet
2 Years Software Testing Resume Software Testing
9 pages
Mobile App Security PDF
No ratings yet
Mobile App Security PDF
3 pages
Automatic Test Case Generation of C Program Using CFG
No ratings yet
Automatic Test Case Generation of C Program Using CFG
5 pages
Python Web Flask
No ratings yet
Python Web Flask
118 pages
Black Box Testing Examples
No ratings yet
Black Box Testing Examples
9 pages
Amazon Storage S3
No ratings yet
Amazon Storage S3
4 pages
AXE Telephone Exchange - Wikipedia, The Free Encyclopedia
No ratings yet
AXE Telephone Exchange - Wikipedia, The Free Encyclopedia
2 pages
Ciphering Procedure in GSM Call Flow
No ratings yet
Ciphering Procedure in GSM Call Flow
3 pages
Mastering Test Automation: A Practical Guide to Scalable & Efficient Testing
From Everand
Mastering Test Automation: A Practical Guide to Scalable & Efficient Testing
Chizitere Sylvia Olebu
No ratings yet
Confident Programmer Problem Solver: Six Steps Programming Students Can Take to Solve Coding Problems
From Everand
Confident Programmer Problem Solver: Six Steps Programming Students Can Take to Solve Coding Problems
Cloudy Heaven Games
No ratings yet
Practical Design of Experiments: DoE Made Easy
From Everand
Practical Design of Experiments: DoE Made Easy
Colin Hardwick
4.5/5 (7)
Laboratory Practice, Testing, and Reporting: Time-Honored Fundamentals for the Sciences
From Everand
Laboratory Practice, Testing, and Reporting: Time-Honored Fundamentals for the Sciences
Dwayne Phillips
No ratings yet
Learn Software Testing in 24 Hours
From Everand
Learn Software Testing in 24 Hours
Alex Nordeen
No ratings yet
Software Testing: A Guide to Testing Mobile Apps, Websites, and Games
From Everand
Software Testing: A Guide to Testing Mobile Apps, Websites, and Games
Mark Garzone
4.5/5 (3)

Today: Random Testing Again

Uploaded by

Today: Random Testing Again

Uploaded by

Today

Random testing again

Some background (Hamlet) Why not always use random testing?

Original slang meaning of random to mean wrong or disorganized and useless

random testing can establish statistically meaningful estimates of program reliability

Operational Profiles & Reliability

Well, maybe not only random testing

Sidebar: Proving an Assumption

Problems with Random Testing

Sometimes we cant do differential testing

Problems with Random Testing

Problems with Random Testing

Problems with Random Testing

Program only fails when x + y = MAXINT?

Now, Lets Talk About YAFFS

Two buggy versions of YAFFS

Submitting the Tester

TEST SUCCESSFULLY COMPLETED

If YAFFS fails terminate with code 2 and print (again, on stdout):

See my (very) stupid tester on the website

Test Cases for Your Own Bugs

Test Case Output

TEST SUCCESSFULLY COMPLETED TEST FAILED

About the Time Limit

If it works How long it takes to run

O_TRUNC, O_APPEND, _O_RDONLY, O_WRONLY, O_EXCL, O_CREAT, O_RDWR

Tools for Testing

Tools for Testing

You might also like