Resources

CS 530

Fault Tolerant Computing

Preliminary: Will be updated time to time

 

Readings for Lecture Notes:

The lecture notes contain the concepts you need to know.

The Supporting Reading materials contain more detailed information about some of the topics mentioned in the Lecture Notes. The Further reading materials mentioned in the slides may provide further insight.

1 Introduction: Lecture Notes 1 (pdf)

Supporting reading: A Conceptual Framework for System Fault Tolerance; Later we will discuss some of the reliability measure in more detail.

2 Digital Systems: Lecture Notes 2

Supporting reading: Use your Logic Design/Computer Architecture text. You can find a quick introduction to Karnaugh maps at karnaugh.pdf. We will talk about faults and testing in more detail later, but if you want, you can take a peek at http://www.cs.colostate.edu/~cs530/digital_testing.pdf

3 Fault Modeling: Lecture Notes 3

Supporting reading: Design for Testability in Digital Integrated circuits, Bob Strunz, Colin Flanagan, Tim Hall http://www.cs.colostate.edu/~cs530/digital_testing.pdf

4 Combinational Circuit Testing: Lecture Notes 4

Supporting reading: Design for Testability in Digital Integrated circuits, Bob Strunz, Colin Flanagan, Tim Hall http://www.cs.colostate.edu/~cs530/digital_testing.pdf

5 Sequential Circuit Modeling: Lecture Notes 5

Supporting reading: Design for Testability in Digital Integrated circuits, Bob Strunz, Colin Flanagan, Tim Hall http://www.cs.colostate.edu/~cs530/digital_testing.pdf

6 Probabilistic Methods: Overview: Lecture Notes 6

Supporting reading: Markov Processes http://www.sics.se/~aeg/report/node10.html, Poisson process http://en.wikipedia.org/wiki/Poisson_process

7 Random Testing: Overview: Random Testing

Supporting reading:  Partial Detectability Profile, An Examination of Fault Exposure Ratio

7a Reliability Part 1: Lecture Notes 7a

Supporting reading: Understanding Series and Parallel Systems Reliability Don't tell anyone I suggested these: failure rate MTBF

7b Reliability Part 2: Lecture Notes 7b

Advanced classic paper: The Use of Triple-Modular Redundancy to Improve Computer Reliability Another interesting paper: TMR for process control A related controversial concept for software: N-version programming (has some well-known references).

8 Software Reliability: Lecture Notes 8a, Lecture Notes 8b, Lecture Notes 8c

Read this article written for an encyclopedia. Supporting reading: Software Reliability Handbook by Lakey and Neufelder

9 More Software Reliability: Lecture Notes 8d

Read this article written for an encyclopedia. Supporting reading: Software Reliability Handbook by Lakey and Neufelder

 

Texts:

 

Testing, Reliability & Security research group web page (includes links to several articles)

Other interesting Tutorials and articles on the web: (this will be updated soon)

Hardware Testing:

Software Testing

Reliability:

Software Reliability:

Coding Theory: