The flashcards below were created by user
lazvertiigo
on FreezingBlue Flashcards.

For Z = [Xμ]/σ, the random variable Z is a _______________.
Standard Normal Distribution

There are several ways to calculate the area under the standard normal curve. The three different area calculations are...
area to the left, to the right, and in between the zscore(s).

Many of the statistical tests that we perform on small data sets (sample size less than ___) require that the population from which the sample is drawn be _______________.
Many of the statistical tests that we perform on small data sets (sample size less than 30) require that the population from which the sample is drawn be normally distributed.

We have said that a random variable X is normally distributed, or at least approximately normal, provided the histogram of the data is ______ and__________
we have said that a random variable X is normally distributed, or at least approximately normal, provided the histogram of the data is symmetric and bellshaped

What defines a graph that plots observed data versus normal scores.
A normal probability plot is a graph that plots observed data versus normal scores.

A normal probability plot is a graph that plots observed data versus __________.
A normal probability plot is a graph that plots observed data versus normal scores.

What defines the expected zscore of the data value, assuming that the distribution of the random variable is normal.
A normal score is the expected zscore of the data value, assuming that the distribution of the random variable is normal.

The expected Zscore of an observed value will depend upon the number of _________ in the data set.
The expected Zscore of an observed value will depend upon the number of observations in the data set.

The idea behind finding the expected zscore is that, if the data comes from normally distributed population, we could predict _______ _________ _______.
The idea behind finding the expected zscore is that, if the data comes from normally distributed population, we could predict the area to the left of each of the data value.

T/F:
If sample data is taken from a population that is normally distributed, a normal probability plot of the actual values versus the expected Zscores will be approximately linear.
True

A new sample mean can be calculate each time a new sample is taken. In this way, the sample mean can be analyzed as _____________.
A new sample mean can be calculate each time a new sample is taken. In this way, the sample mean can be analyzed as a random variable.

Being able to approximately calculate the distribution of the sample mean is a critical tool for _________.
Being able to approximately calculate the distribution of the sample mean is a critical tool for inference.

Describe the sampling distribution of the sample mean.
Because the sample mean is a random variable, the sample mean has a mean, and standard deviation, and probability distribution. This is called the sampling distribution of the sample mean.

Because the sample mean is a random variable, the sample mean has three things; what are they?
Because the sample mean is a random variable, the sample mean has a mean, and standard deviation, and probability distribution.

T/F:
If we know that the population has a normal distribution, then the sampling distribution will not be normal.
 False:
 If we know that the population has a normal distribution, then the sampling distribution will also be normal.

T/F:
If we know that the population has a normal distribution then the sampling distribution will be normally distributed, have a mean equal to the mean of the population, and have a standard deviation less than the standard deviation of the population.
True

The standard deviation of the sampling distribution of 𝑥̅ is called the _______ _______and is denoted σ_{𝑥̅}.
standard error of the mean

T/F:
If a random variable X is normally distributed, the distribution of the sample mean σ_{𝑥̅} is normally distributed.
 False:
 If a random variable X is normally distributed, the distribution of the sample mean 𝑥̅ is normally distributed.

What is the Central Limit Theorem (conceptual)?
Regardless of the shape of the distribution of a population, the sampling distribution of 𝑥̅ is approximately normal as the sample size n increases.

T/F:
The rule of thumb is if n≥10, this is a good approximation.
The rule of thumb is if n≥30, this is a good approximation.

What defines the process of using sample data to estimate the value of a population parameter?
Estimation is the process of using sample data to estimate the value of a population parameter.

What defines the value of a statistic that estimates the value of a parameter.
A point estimate is the value of a statistic that estimates the value of a parameter.

A _____ _______ for an unknown parameter consist of an interval of numbers.
A confidence interval for an unknown parameter consist of an interval of numbers.

The ____ ______ represents the expected proportion of intervals that will contain the parameter if a large number of different samples is obtained.
The level of confidence represents the expected proportion of intervals that will contain the parameter if a large number of different samples is obtained.

T/F:
Confidence interval estimates for the population proportion are of the form
Point estimate ± population proportion
 False:
 Confidence interval estimates for the population proportion are of the form
 Point estimate ± margin of error

The margin of error of a confidence interval estimate of a parameter depends on three factors: What are they?
The margin of error of a confidence interval estimate of a parameter depends on three factors: level of confidence, sample size, and standard deviation of the population.

T/F:
For Sample Size: As the size of the random sample increases, the margin of error decreases.
True

T/F:
For the standard deviation of the population: The more spread there is in the population, the smaller the interval will be for a given confidence level.
 False:
 The more spread there is in the population, the wider the interval will be for a given confidence level.

T/F:
For level of confidence: as the level of confidence increases, the margin of error increases.
True

Interpret the Confidence Interval.
A [(1 − α) ∙ 100%] confidence interval indicates that, if we obtain many simple random samples of size n from the population whose parameter is unknown, then [(1 − α) ∙ 100%] of the intervals will contain the parameter.

T/F:
The number of degrees of freedom, n−1, is crucial for the tdistribution since this depends on the population proportion size.
 False:
 The number of degrees of freedom, n−1, is crucial for the tdistribution since this depends on the sample size.

T/F:
Properties of the tDistribution
The tdistribution is the same for different degrees of freedom.
 False:
 The tdistribution is different for different degrees of freedom.

T/F:
Properties of the tDistribution
The tdistribution is centered at 0 and is symmetric about 0.
True

T/F:
Properties of the tDistribution
The area under the curve is 0.5.
 False:
 The area under the curve is 1.

T/F:
Properties of the tDistribution
As t increases or decreases without bound, the graph approaches, but never equals, zero.
True

T/F:
Properties of the tDistribution
The area in the tails of the tdistribution is smaller than the area in the tails of the standard normal distribution, because we are using s as an estimate of σ, thereby introducing further variability into the t statistic.
 False:
 The area in the tails of the tdistribution is a little greater than the area in the tails of the standard normal distribution, because we are using s as an estimate of σ, thereby introducing further variability into the t statistic.

T/F:
Properties of the tDistribution
As the sample size n increases, the density curve of t gets closer to the standard normal density curve.
True.

Properties of the tDistribution:
As the sample size n increases, the density curve of t gets closer to the standard normal density curve. This result occurs because, as the sample size n increases, the values of s get closer to the values of σ, by the Law of ____________.
This result occurs because, as the sample size n increases, the values of s get closer to the values of σ, by the Law of Large Numbers.

T/F:
When constructing a [(1−α)∙100%] Confidence Interval for μ with unknown σ, the interval is exact when the population is normally distributed, but approximately correct for nonnormal population, where n is large enough.
True.

T/F:
Hypothesis testing and estimation are similar approaches to two similar problems.
 False:
 Hypothesis testing and estimation are two different approaches to two similar problems.

Hypothesis testing and estimation are part of _________.
Inferential Statistics.

What defines a statement or claim regarding a characteristic of one or more populations?
A hypothesis is a statement or claim regarding a characteristic of one or more populations.

What defines the procedure, based on sample evidence and probability used to test statements regarding a characteristic of one or more populations?
Hypothesis testing is a procedure, based on sample evidence and probability used to test statements regarding a characteristic of one or more populations.

T/F:
If population data are available, there is no need for inferential statistics.
True.

What are the steps in Hypothesis Testing? (3 steps)
Step 1. A statement is made regarding the nature of the population.
Step 2. Sample data is collected to test the statement.
Step 3. The data are analyzed to assess the plausibility of the statement.

Since claims can be either true or false, hypothesis testing is based on two types of hypothesis: ______ and ______.
Since claims can be either true or false, hypothesis testing is based on two types of hypothesis: null and alternative.

What defines the statement to be tested. We denote this by H_{0}?
The null hypothesis is the statement to be tested. We denote this by H_{0}.

What defines the claim to be tested. We denote this by H_{1}?
The alternative hypothesis is the claim to be tested. We denote this by H_{1}.

What are the different types of null hypothesis and alternative hypothesis pairs?
Twotailed, Lefttailed, and Righttailed.

T/F:
Twotailed test: test whether the parameter is either equal to, versus not equal to, some random variable.
 False:
 Twotailed test: test whether the parameter is either equal to, versus not equal to, some value.

T/F:
Lefttailed test: test whether the parameter is either equal to, versus less than, some value.
True.

T/F:
Righttailed test: test whether the parameter is either equal to, versus greater than, some value.
True.

Define the type of error:
If we reject stating the null hypothesis is false, but the null is true.
Type I error.

Define the type of error:
If we do not reject, stating null hypothesis could be true, but the null hypothesis is actually false.
Type II error.

T/F:
The level of significance, α, is the probability of making a Type II error.
 False:
 The level of significance, α, is the probability of making a Type I error.

The probability of making a Type II error is represented by ___.
β

T/F:
As the probability of Type I error increases, the probability of a Type II error decreases, and viceversa.
True.

When observed results are unlikely under the assumption that the null hypothesis is true, we say the result is _____ _____.
When observed results are unlikely under the assumption that the null hypothesis is true, we say the result is statistically significant.

T/F:
When results are found to be statistically significant, we accept the null hypothesis.
 False:
 When results are found to be statistically significant, we reject the null hypothesis.

What are the three equivalent ways to perform a hypothesis test that reach the same conclusion?
The methods are the classical approach, Pvalue approach, or confidence interval approach.

T/F:
Classical Approach: If the sample proportion is too many standard deviations from the proportion stated in the null hypothesis, we accept the null hypothesis.
 False:
 Classical Approach: If the sample proportion is too many standard deviations from the proportion stated in the null hypothesis, we reject the null hypothesis.

Pvalue approach: If the sample proportion as extreme or more extreme than the one obtained is _____ under the assumption the statement in the null hypothesis is true, reject the null hypothesis.
Pvalue approach: If the sample proportion as extreme or more extreme than the one obtained is small under the assumption the statement in the null hypothesis is true, reject the null hypothesis.

What are the initial conditions for Testing Hypothesis Regarding a Population Proportion, p?
• The sample is obtained by simple random sampling
• np_{0}(1 – p_{0}) ≥ 10
• The sampled values are independent of each other.

What are the initial conditions for Testing Hypotheses Regarding a Population Mean, μ?
• The sample is obtained using simple random sampling.
• The sample has no outliers, and the population from which the sample is drawn is normally distributed or the sample size is large (n ≥ 30).
• The sampled values are independent of each other.

What do we call a procedure with minor departures from normality that will not adversely affect the results of the test?
The procedure is robust.

