01/25/2021
Use the left and right arrow keys to navigate the presentation forward and backward respectively. You can also use the arrows at the bottom right of the screen to navigate with a mouse.
FAIR USE ACT DISCLAIMER: This site is for educational purposes only. This website may contain copyrighted material, the use of which has not been specifically authorized by the copyright holders. The material is made available on this website as a way to advance teaching, and copyright-protected materials are used to the extent necessary to make this class function in a distance learning environment. The Fair Use Copyright Disclaimer is under section 107 of the Copyright Act of 1976, allowance is made for “fair use” for purposes such as criticism, comment, news reporting, teaching, scholarship, education and research.
The following topics will be covered in this lecture:
Statistics is a science that helps us make decisions and draw conclusions in the presence of variability.
A typical problem related to transportation would involve data regarding this specific system’s number of non-work, home-based trips, the number of persons per household, and the number of vehicles per household.
The objective would be to produce a trip-generation model relating trips to the number of persons per household and the number of vehicles per household.
A statistical technique called regression analysis can be used to construct this model.
The trip-generation model is an important tool for transportation systems planning.
Regression methods are among the most widely used statistical techniques in engineering.
The hospital emergency department (ED) is an important part of the healthcare delivery system.
The service process is also highly variable, depending on the types of services that the patients require, the number of patients in the ED, and how the ED is staffed and organized.
An ED’s capacity is also limited;
How long patients wait on average is an important question for healthcare providers.
If waiting times become excessive, some patients will leave without receiving treatment (LWOT).
Patients who LWOT do not have their medical concerns addressed and are at risk for further problems and complications.
Therefore, another important question is: What proportion of patients LWOT from the ED?
These questions can be solved by employing probability models to describe the ED;
Courtesy of Montgomery & Runger, Applied Statistics and Probability for Engineers, 7th edition
Courtesy of Montgomery & Runger, Applied Statistics and Probability for Engineers, 7th edition
The field of statistics deals with the collection, presentation, analysis, and use of data to make decisions, solve problems, and design products and processes.
Statistical techniques can be powerful aids in designing new products and systems, improving existing designs, and designing, developing, and improving production processes.
Statistical methods give a systematic framework to describe and understand variability.
By variability, we mean that successive observations of a system or phenomenon do not produce exactly the same result.
Suppose that an engineer is designing a nylon connector to be used in an automotive engine application.
The engineer is considering establishing the design specification on wall thickness at 3∕32
inch but is somewhat uncertain about the effect of this decision on the connector pull-off force.
If the pull-off force is too low, the connector may fail when it is installed in an engine.
Eight prototype units are produced and their pull-off forces measured, resulting in the following data (in pounds):
12.6, 12.9, 13.4, 12.3, 13.6, 13.5, 12.6, 13.1.
Courtesy of Montgomery & Runger, Applied Statistics and Probability for Engineers, 7th edition
12.9, 13.7, 12.8, 13.9, 14.2, 13.2, 13.5, 13.1
Courtesy of Montgomery & Runger, Applied Statistics and Probability for Engineers, 7th edition
To handle these questions, we must systematically reason from a finite collection of measurements to infer what the pull-off force and its variability will be in production.
When we describe the prototypes and the hypothetical future production units, we often refer to them as the sample and population respectively.
We will introduce two new definitions related to samples and populations:
Imagine now that we are flipping a fair coin.
Suppose we take a sample of 4 flips and get 3 heads and 1 tails.
In our sample it may appear that it was a \( 75\% \) probability of getting heads and \( 25\% \) probability of getting tails.
If we flipped the coin 100 more times, our sample statistics will approach the true population parameter on average.
The random discrepancy between the sample statistic and the population parameter is known as sampling error.
Considering the model for the pull-off strength,
\[ X= \mu+\epsilon \] we might say that the sample-based average approximates population parameter \( \mu \), but will usually be different due to the observed disturbances in realizations of \( \epsilon \).
However, under certain conditions, with a large sample size will get a better approximation of \( \mu \) on average with the sample statistic from the prototypes.
Sometimes the data are all of the observations in the population;
Three basic methods of collecting data are
An effective data-collection procedure can greatly simplify the analysis and lead to improved understanding of the population or process that is being studied.
Montgomery, Peck, and Vining (2012) describe an acetone-butyl alcohol distillation column for which concentration of acetone in the distillate (the output product stream) is an important variable.
Factors that may affect the distillate are the reboil temperature, the condensate temperature, and the reflux rate.
Production personnel obtain and archive the following records:
The reflux rate should be held constant for this process and, consequently, production personnel change this very infrequently.
A retrospective study would use either all or a sample of the historical process data archived over some period of time.
The study objective might be to discover the relationships among the two temperatures and the reflux rate on the acetone concentration in the output product stream.
However, this type of study presents some problems:
A retrospective study may involve a significant amount of data, but those data may contain relatively little useful information about the problem.
Furthermore, some of the relevant data may be missing, there may be transcription or recording errors resulting in outliers (or unusual values), or data on other important factors may not have been collected and archived.
In the distillation column, for example, the specific concentrations of butyl alcohol and acetone in the input feed stream are very important factors, but they are not archived because the concentrations are too hard to obtain on a routine basis.
As a result of these types of issues, statistical analysis of historical data sometimes identifies interesting phenomena, but solid and reliable explanations of these phenomena are often difficult to obtain.
In an observational study, the engineer observes the process or population, disturbing it as little as possible, and records the quantities of interest.
Because these studies are usually conducted for a relatively short time period, sometimes variables that are not routinely measured can be included.
In the distillation column, the engineer would design a form to record the two temperatures and the reflux rate when acetone concentration measurements are made.
It may even be possible to measure the input feed stream concentrations so that the impact of this factor could be studied.
Generally, an observational study tends to solve problems 1 and 2 discussed earlier and goes a long way toward obtaining accurate and reliable data.
However, observational studies may not help resolve problems 3 and 4 discussed earlier.
Courtesy of Montgomery & Runger, Applied Statistics and Probability for Engineers, 7th edition
Courtesy of Montgomery & Runger, Applied Statistics and Probability for Engineers, 7th edition
Courtesy of Montgomery & Runger, Applied Statistics and Probability for Engineers, 7th edition