software testing and reliability southern methodist university cse 7314

Software Testing and Reliability

Southern Methodist University

CSE 7314

“A working program remains an elusive thing of beauty”

Robert Dunn

Syllabus

• Instructor: Rob Oshana

• Office hours: By appointment

• Phone: (281) 274-3211

• Fax: (214) 768-3085

• E-mail: oshana@airmail.net

• Web site: www.engr.smu.edu/cse/roshana/cse7314

Syllabus

• Required Text Book: Systematic Software Testing, by Rick Craig and Stefan Jaskiel, Artech House, ISBN 1-58053-508-9

• A Practical Guide to Testing Object-Oriented Software, by David A. Sykes and John D. McGregor, Addison-Wesley, ISBN 0-201-32564-0

Syllabus

• Student Evaluation: The course grade will be computed as follows:

– Midterm Exam 30%

– Final Exam 30%

– Homework 15%

– Project 25%

Course Outcomes

• Upon successful completion of this course the student will be able to:

• 1. Determine the test techniques applicable to a given program

• 2. Construct a test suite using the techniques discussed in class

Course Outcomes

• 3. Determine various test and quality metrics of a program

• 4. Create and manage an effective software testing team

Course is / is not

• Is a roadmap approach for test professionals

• Is not an implementation course

• Is not a software testing tools course

Outline

• Trip 1: Overview of the testing process; risk analysis; master test planning; detailed test planning. Readings: Craig, Chapters 1-4

• Trip 2: Analysis and design; test implementation; test execution. Readings: Craig, Chapters 5-7

• Trip 3: Test organization; the software tester; the test manager; improving the process. Readings: Craig, Chapters 8-11

Outline

• Trip 4: Statistical testing techniques; testing OO systems. Readings: Sykes, Chapters 1-6

• Trip 5: Testing OO systems. Readings: Sykes, Chapters 7-10

• Trip 6: Testing RT systems; testing safety-critical systems; testing web-based systems

Outline

• Selected readings will be sent to the students on a periodic basis

• Homework: assignments and schedule will be posted on the web site shortly

• Project will be discussed next trip

Testing style

Competence: Test question cues

• Knowledge: List, describe

• Comprehension: Summarize, discuss, describe

• Evaluation: Explain, compare

• Analysis: Analyze, explain, compare

“Errors are more common, more pervasive, and more troublesome in software than with other technologies”

David Parnas

Homework 1a

• Please send me a couple of paragraphs describing your background and experience, and what you want to get out of the course

Homework 1a

• Please read the paper entitled “Improving Software Testability”

• www.stlabs.com/newsletters/testnet/docs/testability.htm

CSE 7314

Software Testing and Reliability

What is testing?

• How does testing software compare with testing students?

What is testing?

• “Software testing is the process of comparing the invisible to the ambiguous so as to avoid the unthinkable.” James Bach, Borland Corp.

What is testing?

• "Software testing is the process of predicting the behavior of a product and comparing that prediction to the actual results." R. Vanderwall

Purpose of testing

• Build confidence in the product

• Judge the quality of the product

• Find bugs

Finding bugs can be difficult

[Figure: a mine field, with a path through the mine field representing a use case]

Why is testing important?

• Therac-25: 6 radiation overdose accidents, several fatal

• Ariane 5 Rocket: Cost $500M

• Denver Airport: Cost $360M

• Mars missions (Mars Climate Orbiter and Mars Polar Lander): Cost $300M

Why is testing so hard?

Reasons for customer reported bugs

• User executed untested code

• Order in which statements were executed in actual use different from that during testing

• User applied a combination of untested input values

• User’s operating environment was never tested

Interfaces to your software

• Human interfaces

• Software interfaces (APIs)

• File system interfaces

• Communication interfaces

– Physical devices (device drivers)

– Controllers

Selecting test scenarios

• Execution path criteria (control)

– Statement coverage

– Branch coverage

• Data flow

– Initialize each data structure

– Use each data structure

• Operational profile

• Statistical sampling...
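
The difference between statement coverage and branch coverage can be seen in a minimal Python sketch. The function `apply_discount` and its values are hypothetical and are not part of the course material; they only illustrate the two criteria.

```python
def apply_discount(price, is_member):
    """Hypothetical example: members get 10 off the price."""
    discount = 0
    if is_member:
        discount = 10
    return price - discount


# This single test executes every statement (full statement coverage),
# because the body of the `if` is entered.
assert apply_discount(100, True) == 90

# Branch coverage also requires the case where the condition is false,
# so that both outcomes of the decision are exercised.
assert apply_discount(100, False) == 100
```

A suite containing only the first test would report every statement as executed while never checking what happens when the condition is false.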

What is a bug?

• Error: mistake made in translation or interpretation (many taxonomies exist to describe errors)

• Fault: manifestation of the error in implementation (very nebulous)

• Failure: observable deviation in behavior of the system

Example

• Requirement: “print the speed, defined as distance divided by time”

• Code: s = d/t; print s

Example

• Error: I forgot to account for t = 0

• Fault: omission of code to catch t=0

• Failure: exception is thrown
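
The error, fault, and failure in this example can be reproduced with a short Python sketch. The function names `speed` and `safe_speed` are assumptions made for illustration, and the repair shown is only one possibility, since the requirement does not say what should happen when t is 0.

```python
def speed(d, t):
    # Fault: no code handles t == 0 (the omission described above).
    return d / t


def safe_speed(d, t):
    # One possible repair (an assumption, not stated in the requirement):
    # reject t == 0 with an explicit, documented error.
    if t == 0:
        raise ValueError("time must be non-zero")
    return d / t


print(speed(100, 20))  # normal case: no failure is observed

try:
    speed(100, 0)  # the fault is triggered and becomes a failure
except ZeroDivisionError as exc:
    print("failure observed:", exc)
```

The fault is present in `speed` on every run, but a failure is observed only when an input with t = 0 reaches it.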

Severity taxonomy

• Mild - trivial

• Annoying - minor

• Serious - major

• Catastrophic - Critical

• Infectious - run for the hills

What is your taxonomy?

IEEE 1044-1993

Life cycle

Requirements

Design

Testing

Errors can be introduced at each of these stages

Resolve

Isolate

Classify

Testing and repair process can be just as error prone as the development process (more so??)

OK, so let's just design our systems with “testability” in mind...

Testability

• How easily a computer program can be tested (Bach)

• We can relate this to “design for testability” techniques applied in hardware systems

A standard Integrated Circuit

[Figure: core IC surrounded by boundary scan cells, linked in a boundary scan path between the I/O pads (data in, data out) and driven by a test access port controller with Test mode select (TMS), Test clock (TCK), Test data in (TDI), and Test data out (TDO) signals]

Operability

• “The better it works, the more efficiently it can be tested”

– System has few bugs (bugs add analysis and reporting overhead)

– No bugs block execution of tests

– Product evolves in functional stages (simultaneous development and testing)

Observability

• “What you see is what you get”

– Distinct output is generated for each input

– System states and variables are visible and queriable during execution

– Past system states are ….. (transaction logs)

– All factors affecting output are visible

Observability

– Incorrect output is easily identified

– Internal errors are automatically detected through self-testing mechanisms

– Internal errors are automatically reported

–Source code is accessible

Visibility Spectrum

DSP visibility

GPP visibility

Factory visibility

End customer visibility

Controllability

• “The better we can control the software, the more the testing can be automated and optimized”

– All possible outputs can be generated through some combination of input

– All code is executable through some combination of input

Controllability

–SW and HW states and variables can be controlled directly by the test engineer

– Input and output formats are consistent and structured

Decomposability

• “By controlling the scope of testing, we can more quickly isolate problems and perform smarter testing”

– The software system is built from independent modules

– Software modules can be tested independently

Simplicity

• “The less there is to test, the more quickly we can test it”

– Functional simplicity (feature set is minimum necessary to meet requirements)

– Structural simplicity (architecture is modularized)

– Code simplicity (coding standards)

Stability

• “The fewer the changes, the fewer the disruptions to testing”

– Changes to the software are infrequent, controlled, and do not invalidate existing tests

–Software recovers well from failures

Understandability

• “The more information we have, the smarter we will test”

– Design is well understood

– Dependencies between external, internal, and shared components are well understood

– Technical documentation is accessible, well organized, specific and detailed, and accurate

“Bugs lurk in corners and congregate at boundaries”

Boris Beizer

Types of errors

• What is a testing error?

– Claiming behavior is erroneous when it is in fact correct

– ‘Fixing’ this type of error actually breaks the product

Errors in classification

• What is a classification error?

– Classifying the error into the wrong category

• Why is this bad?

– This puts you on the wrong path for a solution

Example Bug Report

• “Screen locks up for 10 seconds after ‘submit’ button is pressed”

• Classification 1: Usability error

• Solution may be to catch user events and present an hour-glass icon

• Classification 2: Performance error

• Solution may be a modification to a sort algorithm (or vice versa)

Isolation error

• Incorrectly isolating the erroneous modules

• Example: consider a client-server architecture. An improperly formed client request results in an improperly formed server response

• Isolation determined (incorrectly) that the server was at fault, and the server was changed

• This resulted in regression failures for other clients

Resolve errors

• Modifications to remediate the failure are themselves erroneous

• Example: Fixing one fault may introduce another

What is the ideal test case?

• Run one test whose output is "Modify line n of module i."

• Run one test whose output is "Input Vector v produces the wrong output"

• Run one test whose output is "The program has a bug" (Useless, we know this)

More realistic test case

• One input vector and expected output vector

– A collection of these makes up a test

• Typical (naïve) test case

– Type or select a few inputs and observe output

– Inputs not selected systematically

– Outputs not predicted in advance

Test case definition

• A test case consists of:

– an input vector

– a set of environmental conditions

– an expected output

• A test suite is a set of test cases chosen to meet some criteria (e.g. Regression)

• A test set is any set of test cases
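
The definitions above can be sketched as a small Python data structure. The names `Case`, `run_suite`, and `format_speed` are hypothetical, and modelling the environmental conditions as keyword arguments is a simplifying assumption made only to keep the sketch short.

```python
from dataclasses import dataclass, field


@dataclass
class Case:
    inputs: tuple      # the input vector
    expected: object   # the expected output, decided before the run
    environment: dict = field(default_factory=dict)  # environmental conditions


def run_suite(function, suite):
    """Run every case and return the (case, actual output) pairs that failed."""
    failures = []
    for case in suite:
        actual = function(*case.inputs, **case.environment)
        if actual != case.expected:
            failures.append((case, actual))
    return failures


def format_speed(d, t, unit="m/s"):
    # Hypothetical program under test.
    return f"{d / t} {unit}"


# A test suite: test cases chosen to meet some criterion (here, regression).
regression_suite = [
    Case(inputs=(100, 20), expected="5.0 m/s"),
    Case(inputs=(100, 20), expected="5.0 km/h", environment={"unit": "km/h"}),
]

print(run_suite(format_speed, regression_suite))  # an empty list means all passed
```

Because `expected` is a required field, a case cannot be written down without predicting its output in advance.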

Requirements as theory model

• Suppose we consider a specification to be a theory describing a program

• How do we test theories?

• By examining the theory and using it to make predictions

• First Principle of testing: The expected results of a test should be known before the test is run

Requirements as theory

• "Accumulating evidence to support a theory is not the appropriate way to test it. What you should do is try to falsify it, to challenge it with your best efforts at proving it false."–Karl Popper

• Implications for us doing testing

• Testing should not be used to build confidence; that is too easy

• Testing should attempt to find deviations from the theory, that is, bugs

• Any other purpose sets up the wrong goal

• "Program testing can be used to show the presence of bugs, but never show their absence!" O.-J. Dahl, E. W. Dijkstra, and C.A.R. Hoare, Structured Programming, New York: Academic, 1972.

• "Absence of proof (of bugs) is not proof of absence." Logic 101

A few words about computability

• From the theory of computability, we know:

• It is undecidable whether a given program will halt on a given input. (Halting problem)

• It is undecidable whether two programs will always output the same answer for a given input. (Equivalence)

Implications for testing

• There is no general solution for the automated oracle problem

– No automatic testing strategy can be devised that will work in all cases

• There is no general way to find the input that causes a specific line of code to be executed

• Coverage is undecidable

All of the following are undecidable

• Will a given statement ever be exercised by any input?

• What input will exercise a given statement?

• Will a given input exercise some specified statement?

• Will a given path ever be exercised by any input?

• What input will exercise a given path?

• Will a given input exercise some specified path?

Computability

• Note that even though it is in general undecidable, there is a large class of programs for which these issues can be decided

– A large testing tools industry has emerged because of this

• When examining a tool, make sure that the class of programs for which it works is well understood

Reference book

A few words on combinatorics

• Based on the Cartesian product of sets, we can count the number of possible inputs that a program has, i.e. | I |

Example

• Assume a program has a single input, Customer ID (CID)

• May be any value in the domain {00000-99999}

• What is | I | ?

• | I | = 100000

Example

• Now assume we add a second input, the Order ID (OID)

• This may be any value in the domain {00000-99999} as well

• Now what is | I |?

• 100000*100000 = 10,000,000,000

Example

• Finally, add a credit card number to the input

• This is a 12 digit number

• | I | has now reached 10**22

• If we can execute 1 million tests per second, it will take 10**16 seconds, or about 300 million years!!
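
The arithmetic in these examples can be checked with a few lines of Python. The variable names are illustrative, and a year is taken as 365.25 days, which is an assumption of this sketch.

```python
from math import prod

# Size of each input domain, taken from the examples above.
domain_sizes = {
    "customer_id": 10**5,   # 00000-99999
    "order_id": 10**5,      # 00000-99999
    "credit_card": 10**12,  # 12 digit number
}

# |I| is the size of the Cartesian product of the input domains.
input_space = prod(domain_sizes.values())

tests_per_second = 10**6
seconds = input_space // tests_per_second
years = seconds / (365.25 * 24 * 3600)

print(f"|I| = {input_space:.0e}")
print(f"{seconds:.0e} seconds, about {years / 1e6:.0f} million years")
```

Under these assumptions the result is roughly 317 million years, which matches the rounded figure on the slide.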

Example

• Since we cannot know what data may exercise a given statement/path in general, we may attempt to resort to exhaustive testing

• This attempt is doomed to fail due to the combinatorial explosion

Functional and structural approaches to testing

Engineering the testing process

• Any engineered product (and most other things) can be tested in one of two ways

– Knowing the specified function that a product has been designed to perform, tests can be conducted that demonstrate each function is fully operational while at the same time searching for errors in each function

Engineering the testing process

–Knowing the internal workings of a product, tests can be conducted to ensure that “all gears mesh” (internal operations performed according to specifications)

Structural testing

• Uses knowledge of the internal workings

• Also known as clear box/glass box

• Code based

• Can be useful for finding interesting inputs

• Misses an entire class of faults: missing code

Behavioral

• Uses knowledge of the specific function that is to be performed

• Based solely on the specification without regard for the internals

• Also known as black box

• More user oriented

• Misses an entire class of faults, extra code (surprises), except by accident
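
The blind spots of the two approaches can be illustrated with a small Python sketch. Everything here is hypothetical: the specification is assumed to say "the cost is 5 per kg, and orders over 20 kg are refused", and the implementation contains both a fault of omission and a piece of extra code.

```python
def shipping_cost(weight_kg):
    # Implementation under test (hypothetical).
    if weight_kg == 13:       # extra code: not in the specification
        return 0
    return 5 * weight_kg      # missing code: no check for weight_kg > 20


def expected(weight_kg):
    # Oracle derived from the assumed specification.
    return "refused" if weight_kg > 20 else 5 * weight_kg


def observed(weight_kg):
    try:
        return shipping_cost(weight_kg)
    except ValueError:
        return "refused"


def failing_inputs(inputs):
    return [w for w in inputs if observed(w) != expected(w)]


structural_inputs = [13, 2]      # picked from the code: one input per branch
behavioral_inputs = [2, 20, 21]  # picked from the spec: typical, limit, over limit

print(failing_inputs(structural_inputs))  # finds the extra code, misses the omission
print(failing_inputs(behavioral_inputs))  # finds the omission, misses the extra code
```

Neither input set finds both faults, which is one reason the two approaches are normally combined.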

Passing criteria

• How do we know when

• 1. a single test has passed?

• 2. we are done testing?

Passing criteria

• A single test passes when its output is correct

– This requires a specific definition of correct and ties into the automated oracle problem

When are we done?

• Conway Criteria:

• No syntactic errors (it compiles)

• No compile errors or immediate execution failures

• There exists some set of data for which the program gives the correct output

• A typical set of data produces the correct output

When are we done?

• Difficult sets of data produce the correct output.

• All possible data sets in the problem specification produce the correct output

• All possible data sets and likely erroneous inputs succeed.

• All inputs produce the correct output

Nature of software defects

• Logic errors and incorrect assumptions are inversely proportional to the probability that a program path will be executed

Nature of software defects

• We often believe that a logical path is not likely to be executed when, in fact, it may be executed on a regular basis

• Typographical errors are random

More of a case for WHITE box testing……

Summary

• Zeroth Principle of Testing: The purpose of testing is to find bugs

• Corollary to the zeroth principle: "The program is wrong"

Summary

• First Principle of Testing: The results of a test must be known before the test is run

• Second Principle of Testing: Testing is difficult

Summary

• Exhaustive testing is doomed by the combinatorial explosion

• Any other technique is undecidable

• Third Principle of Testing: No single technique will suffice for any non-trivial testing effort

Homework 1b

• Discuss a software failure from your experience or knowledge and attempt to explain the role of testing (or lack thereof) in that failure

Another reference

• Testing techniques newsletter

• www.testworks.com/News/TTN-Online
