x
Quote information
Message
Your list is empty, add products to the list to send a request
Shop

Hypothesis Testing

What is Hypothesis Testing?Â

Hypothesis testing is a method used to find out the representable validity of a given probable outcome being tested for a defined significance value in a sample. For a set of potential probability distributions, a hypothesis will be tested in comparison to an alternative hypothesis with a pre-defined significance level, also called a confidence level. By hypothesis testing, we want to come to a statistical inference based on the comparison of hypotheses with a defined significance level so that the resulting data value does not fall under the null hypothesis. (Rice, John A, 2007).Â

The process of hypothesis testing starts with establishing a preliminary hypothesis to be proven, with its corresponding null and alternative hypotheses. Once the hypotheses are defined, some valid assumptions regarding the sample are made to assess whether any interdependence (or lack of it) exists. Then the test statistic (T) is determined so that the distribution under the null hypothesis can be determined to be either simple or composite. For subsequent testing, a specific significance level (Î±) is chosen, which is normally between 1% and 5%. In order to determine the critical region, the point where the test statistic rejects or accepts the null hypothesis, the distribution partition is selected. Once all these are set, the values of the test statistic are observed repeatedly and calculated so that the decision to either accept or reject the null hypothesis can be taken.Â

At this point, it will be critical to understanding the distinguishing factor between ‘accept the null hypothesis’ and ‘fail to reject.’ Simply accepting the null hypothesis means the initial assumption regarding the test was true, which is not always the case. Failing to reject the null hypothesis, however, means that after testing, no significant confirmatory or contradictory results can be observed. Therefore, this must be either re-tested, or the initial hypotheses must be re-phrased.Â

Â

TerminologiesÂ

Before getting into any further detail for a technical understanding, we need to know the following key concepts:Â

Alternative hypothesis H1

It is the new hypothesis based on literature review and previous studies, in contrast, but consistent with the null hypothesis.Â

Critical region (Region of rejection)Â

It is that area of test statistic values where the null hypothesis is appropriately rejected.Â

Critical valueÂ

It is the bracket for test statistic value where either it is accepted or rejected.Â

ErrorsÂ

There are two types of errors that help to differentiate the null hypothesis from the alternative hypothesis:Â

• Type 1 Error: Null hypothesis is incorrectly rejectedÂ
• Type 2 Error: Null hypothesis is incorrectly acceptedÂ
Null hypothesis H0Â

A null hypothesis is the default state of a chosen argument proposing a lack of inter-relationship between the compared hypotheses being tested. The subsequent acceptance or rejection of the null hypothesis provides a reliable benchmark to move forward. Prior to reaching any definitive conclusion, the null hypothesis is implicitly accepted to be true unless the testing process proves otherwise.Â

P-value conceptÂ

It is the best probability of the null hypothesis being true.Â

Power of a test (1 âˆ’ Î²)Â

It is the probability of accepting the alternative hypothesis, thereby appropriately rejecting the null hypothesis, where ‘power’ means the sensitivity of the test.Â

Region of acceptanceÂ

It is that area of test statistic values where the null hypothesis is failed to be rejected.Â

Statistical hypothesisÂ

It is an assumption based on the specific features of a population, not just a sample.Â

Â

Core Concept when used in Machine LearningÂ

In hypothesis testing, the p-value results determine the number of probable outcomes of null hypothesis conditionality (Wasserman, L. 2004) where:Â

• Probability of p-value falling within the significance threshold or critical regionÂ
• Probability of p-value being less than the significance thresholdÂ
• Probability of p-value falling outside the significance thresholdÂ

It must be noted that the focus of hypothesis testing is on the principle of rejection, which means more rigorous logic is applied to determine the validity of a probable outcome. There are five main factors under which the probability of rejection functions.

1. The test being one-tailed or two-tailedÂ
2. Significance levelÂ
3. Standard deviationÂ
4. Extent of deviation from the null hypothesisÂ
5. Subjective appearance of the results controlled by the experimenterÂ

A number of cautionary steps are advised to avoid misuse or misrepresentation of data within the hypothesis testing framework. In order to reduce type 2 errors, it is advised to consider larger sample sizes. It must be noted that the statistical significance of data in no way asserts the practical significance of the outcomes. Similarly, correlation does not equate to causation; therefore, it is not enough to simply reject the null hypothesis in order to reach a definitively correct outcome.Â

Â

Practical ApplicationÂ

In data sciences, statistical tools like hypothesis testing play a critical role in justifying probable inferences, especially where no previous scientific theory or practice exists. Most significantly, the field of social sciences has benefitted greatly from hypothesis testing, although there has been some criticism of the application as well. Some of the important applications of hypothesis testing in the practical world are as follows:Â

Courtroom trialsÂ

In a courtroom trial setting, the default mode or null hypothesis works best since the assumption, ‘innocent until proven guilty,’ correlates to the H0 probability. Therefore the two hypotheses can be very clearly formulated to be not guilty vs. guilty.Â

Gender ratioÂ

Hypothesis testing was initially used to prove the assumption of the equal gender distribution of human births back in the 1700s. The two hypotheses being as simple as true or false, it came out to be males having a greater probability of birth than females at that time, without any considerable explanation.Â

Other areas of application include, but are not limited to:Â

• Handwriting analysis claimsÂ
• Best ways to quit smoking for goodÂ
• The extent of behavioral effects of a full moon on humans and animalsÂ
• Verifying the origin of manuscriptsÂ

Still, a great deal of criticism exists on the validity of hypothesis testing since the results of an experiment are only as valid as the sample selection criteria and design. Hence, caution must be taken before admitting any results from a single source.Â

Â

ConclusionÂ

In the modern world, hypothesis testing has far-reaching consequences and can be seen in a variety of fields, from opinion polls to trends in biomedicine. For a mature statistical method, the process of hypothesis testing can be easily summarized as follows:Â

• Establish hypothesesÂ
• Determine a significance levelÂ
• Evaluate point estimateÂ
• Compute test statisticÂ
• Assess p-valueÂ
• Deduce outcomesÂ

In effect, it serves as an important â€˜filterâ€™ before investing time and money into any statistical outcome of consequence. Prior to building on a previous result, it will be practical to read into the details of the said experiment to look for any design or execution errors of the study. As mentioned earlier, more than a single source should be sought before taking the results of a single study for granted, since the most prevalent use of hypothesis testing remains the scientific deductions of experimental stats. Instead of repeating a faulty experiment to confirm a subsequently erroneous result, it will be prudent to initiate a critical test of the existing results to avoid misrepresentation and abuse of this sometimes misunderstood statistical method.Â

Weâ€™ve collected the items for you to purchase for your convenience.

Get the entire package for up to 50% discount with our Replication program.