Home > Preview
The flashcards below were created by user
Radhika316
on FreezingBlue Flashcards.

Hypothesis Testing
 Discover the relationships that exist between events/things
 Accomplished by:Asking questions and Getting answers =>In accord with certain rules ... the scientific method.
 Question is a hypothesis & Answer is obtained by testing the hypothesis Which gives the general model………

Some IMPORTANT restrictions about How the hypothesis is formed.
 How the hypothesis is tested…
 Forming hypotheses is an "everydayeverybody" activity
 ex: I will do better on examinations if relax the night before> Is a OK "hypothesis" ... a statement of a relationship OK, BUT NOT a scientific hypothesis
 A scientific hypothesis must be: Specific,Empirically testable,Strictly related to some experimental procedure
 actually consists of two separate mutually exclusive hypotheses: 1. A null hypothesis and/or 2. An alternative hypothesis

Null Hypothesis
A statement reflecting the possibility that there are no differences between the objects and/or events that are being observed
In formal terms: Ho: µ1 = µ2Where: µ1 and µ2 are the mean or average of several observations

Alternative Hypothesis
A statement reflecting the possibility that there are differences between the objects and/or events that are being observed
 In formal terms:
 H1: µ1 <> µ2 or H1: µ1 < µ2 or H1: µ1 > µ2

Testing between the null and alternative hypothesis Accomplished through
collection of data that must be scientifically acceptable, i.e. Observable, Public, Replicable
The test concentrates on the null hypothesis which you either Reject or Fail to reject

If there are no differences between your observations you
Fail to reject the null hypothesis and Disregard the alternative hypothesis

If there are differences between your observations you
Reject the null hypothesis and Accept the alternative hypothesis

Some things to note about hypothesis testing
Failing to reject the null hypothesis does not mean that the null hypothesis is TRUE
The null hypothesis can never be proven ;You can only fail to reject it
Rejecting the null hypothesis Means you accept the alternative hypothesis It does not establish the validity of a relationship
Validity is a function of experimental design

When testing a hypothesis; how many outcomes?
Two possible outcomes regarding a null hypothesis
Two possible states of real world Thus four possible decisions: Two are incorrect ... i.e. errors

Type I error
You can directly "set" this
It is the chance (probability of the making the error) you are willing to accept when you test your hypothesis.

Type II error
You cannot directly "set" this You can attempt to control it through good experimental design.

Alpha Level or the level of significance
a probability value that is used to define the very unlikely sample outcomes if the null hypothesis is true.

Critical region
If sample data fall in the critical region, the null hypothesis is...?
composed of extreme sample values that are very unlikely to be obtained if the null hypothesis is true.
The boundaries for the critical region are determined by the alpha level.
If sample data fall in the critical region, the null hypothesis is rejected.

**Estimating Population Parameters from Samples
biased or unbiased, why?
Sample meanUnlikely to be exactly equal to population mean BUT Not more likely to be greater OR Not more likely to be less sooo sample mean is an unbiased estimate of population mean

**Sample standard deviation biased or unbiased?
 Unlikely to be exactly equal to population standard deviation BUT More likely to be less
 Is usually an under estimate of population parameter
 So sample standard deviation is a biased estimate of the population standard deviation
more likely to be smaller than population variance and Pop standard deviation > "Degrees of freedom"And so must correct any estimate of the population variance increase it (i.e. use "n1" when calculating the estimate)

Parametric Tests
Tests that do make assumptions and test hypotheses about population parameters.
z & t, ANOVA, and F test=>Involves an assessment of whether your observed data is related to your independent variable
Or is simply what might be expected by chance random sampling (i.e. no relation between Independent variable and Dependent measure)
 Requires knowing or estimating population parameters
 Mean: (μ)
 Standard deviation: (σ)
 Assumption of normality

**For example: consider Pat (individual score) = 64 Population mean (μ) = 50 Population standard deviation (σ) = 8
 ==>And if population is normal, then you know
 ~68 (68.26) % data points between + 1
 ~95 (95.44) % data points between + 2
 ~99 (99.74) % data points between + 3
 And remember: these are percentages, not absolute values
 Areas under the normal curve
 ==>Remember the zdistribution?
 Provided areas (proportion of scores) under a normal distribution according to zequation (xu/Pop SD) =1.76
==>And if convert Pat's raw score to a zscore And then look up Pat's zscore in the Ztable Meaning that ~96% (1.00  .0401 = .9599) of scores in distribution are below Pat. (page 699, G&W).
 =>OR PUT OTHERWISE: If we were to randomly select a score from Pat's distribution The probability that the score would be greater than Pat's would only be 4 in 100And we have done what we set out to doAccomplished a statistical test (comparing Observed data and What would be expected by chance)
 And thus, Pat's score is significant at p < .05

Pvalue
smaller the Pvalue?
A moderate to large Pvalue means?
The probability, when Ho is true, of a test statistic value at least as contradictory to Ho as the value actually observed.
 The smaller the Pvalue, the more strongly the data contradict Ho.
 The Pvalue summarizes the evidence in the data about the null hypothesis.
A moderate to large Pvalue means that the data are consistent with Ho. (fail to reject Null)
=>Ex. Pvalue .26 or .83 indicates that the observed data would not be unusual if Ho were true; However, a Pvalue such as .001 means that such data would be very unlikely, if Ho were true.
The Pvalue is the primary reported result of a significance test. If the Pvalue is sufficiently small, one rejects Ho and accepts H1.

Standard Error of the Mean (ch 7)
When comparing an individual to a population needed to know two things about the population
sampling dist?
the measure of "variability"?
 When comparing an individual to a population needed to know two things about the population
 Mean: (μ)
 Standard deviation: (σ)
 Only slightly different when comparing a sample to a population
 BUT NOWa SAMPLE: Since you are not concerned with a single individual but with a sample of individuals;
 The "population" of interest is not A population of individuals but rather A population of samples, i.e. a SAMPLING DISTRIBUTION
 And the measure of "variability" is not "The standard deviation" of a population but rather "The standard deviation of a sampling distribution", IE...STANDARD ERROR OF THE MEAN

The calculation details of Standard Error of Mean: (2 steps)
 Step 1: The standard error of the mean
 σm = σ ∕ √n; where: n = sample size
 Step 2:The z comparison
 Z = (M – μ) ∕ σm
 M=Sample Mean
==>Which is not really different than what we did when comparing an individual to a population

** Herd Of Cows Example:
Suppose a Herd of 10 cows (n=10); Mean milk production is 1.8 gallons/cow
Question: How unique is this herd?
Given: μ=1.5 (population mean), σ (population)=.55, n=10, M (sample mean)=1.8
 => Calculate the standard Error of Mean using:
 Z = (M – μ) ∕ σm
 Z=.1724; using Z table: .0427 which is less than .05 (means our cows are special :D )
 =>Thus herd is pretty unique since the likelihood that any random sample of 10 cows from the population would produce more milk is less than 5 times in 100 (study!)
 Or in "statistics": The probability of selecting a herd of greater milk producers is p < 0.05

Problem: compare a sample to a population; what is method?
 Method:
 1. Use population parameters to calculate the standard error of the mean of a sampling distribution. (σm = σ ∕ √n)
2. Use the standard error of the mean to compare sample mean with population mean by calculating a zscore (Z = (M – μ) ∕ σm)
3. Use ztable to determine the probability that a random sample would yield a mean greater than the mean of the sample

**A word on the logic and requirements of the statistic
The "uniqueness" of your sample
Two requirements
The "uniqueness" of your sample is the probability that another random sample of the same size would have the same mean as your sample.
Or put otherwise, is your sample mean, is what would be expected by chance, a random selection?
=>The more unique your sample, the more likely it reflects a relationship between:Your independent variable & Your dependent measure
 Two requirements
 1.The population is normally distributed
 2. You know the population Mean (μ) & Standard deviation (σ)

The t statistic
An alternative to z
 ==>MUST know the population mean But can estimate population standard deviation from sample data (SD is missing)
 A sample standard deviation is given by (as you know) BY THE SS FORMULA> Sigma/ Population SD
==> And so an ESTIMATED standard error of the mean is: "sm = s ∕ √n"
And to use the estimated standard error of the mean to compare your sample to the population must make one adjustment (Adjustment is necessary to account for the fact that you are estimating)

Comparisons When Estimating Population Parameters & TDistribution
The adjustment part: Sm (estimated Standard Error of the mean)
Estimating the standard deviation requires a different sampling distribution which is the tdistribution==>The tdistribution :Normal distribution, More platokurtic than zdistribution, Tails more elevated
==>And comparison becomes ==> step 2 of TTest after SS and finding sm (step 1) t = (M – μ) ∕ sm
Thus: Because use estimate of population standard deviation to estimate standard error of mean , Must use tdistribution to get probability of randomly selecting a sample with a mean similar to the mean of your sample,
But not conceptually different  just an adjustment

**TTest Steps.
1. SS
2. S^2 =SS/(n1)
3. S
4. Sm: Estimated Standard error of the mean BECAUSE the SD is missing! (Adjustment is necessary to account for the fact that you are estimating)
 5. Final Comparision: t = (M – μ) ∕ sm;
 M= Sample mean
 μ= Population mean
 Sm=Estimated Standard error of the mean

What does that "tvalue" mean ?
What is the probability of a random sample of 10 cows being like your cows
To find out, consult a ttable

THIS IS IMPORTANT REGARDING TTESTS Values!
Relationship between TValue obtained and TCritical?
when do you fail to reject?
 To reject the null?
 > In order to reject the null (There's a change), the Tvalue obtained (from the formula)must be greater than the Tcritical (obtained from the table values); meaning SPECIAL (Less than .05 alpha level)
 T_{Obtained}>T_{Critical} (change)
 >Failing to Reject: (No changeconfirm the Null) TCritical is greater than Obtained T value
 T_{Obtained}<T_{Critical} (change)

Z vs T Tables
Ztable gives exact probabilities
ttable gives ranges of probabilities

**THREE STREPS TO LOOKING UP VALUES OF TCRITICAL:
 1. Two Tails by Default
 2. degrees of Freedom (from your sample> n1)
 3. Alpha level: .05

**Degrees of freedom:
 Number of values in a calculation that are free to vary
 That is: The degrees of freedom for a mean of 10 values is 9 ... because
 If the mean of 10 numbers is, for example, 5 Nine of the numbers "free" to be any value but when these are established, the 10th number is determined if the mean is to be 5

Directionality of Statistical Tests
 Statistical tests have a property called "directionality"
 Nondirectional, called "twotailed" tests
Directional, called "onetailed" tests

**Looking up tcritical on table
reject null when?
moving left to right?
Directionality? two tailed vs one tailed??
reject when t crit is less than t obtained
moving left to right, t values get smaller, and t critical gets bigger
two tailed test: 2.04 is smaller than 2.262
directionality: prior knowledge, predict outcome from prior knowledge (classical music makes more milk) meaning now i should perform a one tail test... more likely to reject the null because the event is much more different and chance of rejecting is larger. so using one tail: more likely to reject null because you increase the area
Your ability to predict an outcome means that you are better able to determine whether an event is a chance occurrence;More likely to reject Null hypothesis
In statistical terms the region of the sampling distribution indicating that an event is something different than what would be expected by chance is larger

Review:Ztest a statistical test
used to decide if a sample mean does or does not come from a specified populationwhen the standard deviation of the population is known.When the standard deviation of the population is unknown then a ttest is performed.

Review:Hypothesis testing
the goal is to decide whether to reject the null hypothesis.

Alpha level
traditionally set at .05 where, also, the acceptance and rejection regions are determined.

Critical value
the absolute value of that defines the rejection region(s).

Directionality:
Nondirectional vs Directional
Nondirectional (twotailed), where rejection of the sample mean is either above or below hypothesized population mean.
Directional (onetailed), where rejection of the sample mean is determined prior to experimentation.

**Compared a Sample to a population:When population parameters are known…..
**Compared a Sample to a population: When population parameters are unknown….
 When population parameters are known…
 assume a normal population and known standard deviation
 Z = (M – μ) ∕ σm
 When population parameters are unknown…
 Sample to population: assume a normal population and unknown standard deviation
 t = (M – μ) ∕ sm

Hypothesis Test": .
 a statistical method that uses sample data to evaluate a hypothesis about a population
 1. We state a hypothesis about a population
2. Use the hypothesis to predict the characteristics that the sample should have (meaning sample should be similar to the population; along with a certain amount of error)
3. Then we obtain a sample randomly from a sample and measure the changing variables.
4. Compare the obtained sample data with the Hypothesis Prediction. If sample is consistent, then hypothesis is reasonable. BUT if there's a big discrepancy, we must conclude the hypothesis is wrong.
note: there are different versions of hypothesis testing; but the simplest version is using a sample mean to test a hypothesis about a population mean

The Unknown Population:
Reseearch begins with a known population with a set of individuals that exists BEFORE treatment; the purpose of the reasearch is to determine the effect of a treatment on the indivdiuals in the population.
assume that if treatment has any effect, it is simply to add a constant amount or subtract from the individual's scores (which wouldn't change the shape of the distribution or SD)
unknown population (after treatment) is the focus of the research question

**THE FOUR STEPS OF A HYPOTHESIS TEST:
 ==> STEP ONE: STATE THE HYPOTHESIS <==
 state a hypothesis about the unknown popultion; there are TWO OPPOSING hypothesis stated in terms of population parameters (mean)
 Null & Alternative: (directionality)
 ==> STEP 2: SET THE CRITERIA FOR A DECISION:
 => Alpha level/Level of significance: The boundaries that sepearate the high0probablility samples from the low probability samples; if a=.05 (5%) then 95% lies in the center and the tail ends are the "critical regions"(very unlikely)
 => Critical Regions: composed of the extreme sample values that are very unlikely (as defined by the Alpha level) to be obtained if the null hypothesis is true.boundaries for the critical region: use the alphalevel probability and the unit normal table; 5% of the critical regions are divided into two tails of the distribution so that the Zscore boundary is 1.96 or 1.96. (column c)
 **Whenever athe data froma research study produce a sample mean that is located in the critical region, we can conclude that the data are NOT consistent with the Null and we REJECT the null.
 ==> STEP 3: COLLECT DATA AND COMPUTE SAMPLE STATISTICS
 compare sample mean (data) to the Null hypothesis (heart of the hypothesis test)
comparisiion computed by computered a zscore that describes exactly where the sample mean is located realtive to the hypotheisized population mean from Null/H0.z= sample mean subtract the Value of the mean from Null Hypothesis and divde by "standard error between M and U" (pg 211)
 ==> STEP 4: MAKE A DECISIONUse the zscore value to make a decision about the Null Hypothesis according to the criteria established in step 2 with two possible outcomes:
 Final decision is made by comparing treated sample (those who get treatment) with the distribution of sample means that would be obtained for untreated samples (the majority)

TYPE 1 and TYPE 2 ERRORS (Hypothesis Conclusions)
 ==> Type 1 Errors: When you reject the null hypothesis when in actuality, there was no effect from the treatment;
 it is a false reportdetermined by the alpha level: the probablility that the test will lead to a type 1 error if the null hypothesisis true; it determines the probability of obtaining sample data in the critical region even though there is no treatment effect
==> Type 2 Errors: occurs when a researcher fails to reject a null hypothesis(saying there is no effect) that IS REALLY FALSE, meaning the treatment did effect the sample but the hypothesis test failed to detect it (fail to detect the actual effect or change that resulted).likely to occur when the treatment effect is very small; so a study is more likely to fail to detect the effect.

**Statistically significant:
if it is very unlikely to occur when the Null Hyp is true; meaning result is significant enough to reject the null hypothesis. So treatment has a significant effect if the decision from the hypothesis is to reject the null

Increasing Sample size increases the ...
zscore and produces a smaller standard error and a larger value for zscore,

Higher variability can reduce ...
the chances of finding a significant treatment effect, increases the standard error and bring the zscore closer to the mean/center of distribution making it more likely to be concluded as a "fail to reject the null" (no change occurs/not special)

Directional hypothesis Test/One Tailed:
the stat hypotheses (both null and H1) specify either an increase or a decrease in the population mean

statistical power:
Power of a statistical test is the probability that the test will correctly reject a false null hypothesis. Meaning power is the probability that the test will identify a treatment effect if one really exists.
Alpha level: Reducing alpha level reduces power of test; there would be lower probability of rejecting the null hypothesis and a power value of test because area increases?

One Tail vs Two tailed tests
Going from 2 tails to 1 Tail increases the power of the hypothesis test.

Sampling error:
the natural discrpancy, or amount of error, between a sample statistic and it's corresponding population parameter.

Distribution of sample means:
 the collection of sample means for all of the possible random smaples of a particular size (n) that can be obtained from a population; contains all the possible samples.
 Necessary to have all the values in order to compute probability

Sampling distribution:
is a distribution of statistics obtained by selecting all of the possible samples of a specific size from a population (Msampling distribution); the larger the sample, the closer to the Population mean

Standard Error of the Sample mean:
describes the distribution of sample means, shows how much diff is experceted from one sample to another; also how well an individual sample mean respresents the entire distribution

EXTRA CREDIT: Remember to write this...
A TTest was performed and revealed a significant/nonsignificant difference between the sample mean and population, t(df)=t_{OBT}, P<.05

σ_{m}
 Standard Deviation of Sample distribution or Standard Error of Mean (sample vs population)
 Find by: σ/_{}
 n= Number of samples

S_{m}
 Estimated Standard Error of the mean (SD is missing)
 find by: s/_{}

