This statistics tutorial is a guide to help you understand key concepts of statistics and how these concepts relate to the scientific method and research.
Scientists frequently use statistics to analyze their results. Why do researchers use statistics? Statistics can help understand a phenomenon by confirming or rejecting a hypothesis. Statistics is often vital to change scientific theories.
You don't need to be a scientist though; anyone wanting to learn about how researchers can get help from statistics may want to read this statistics tutorial for the scientific method.
RESEARCH DATA
This section of the statistics tutorial is about understanding how data is acquired and uses.
The results of a science investigation often contain much more data or information than the researcher needs. This data-material, or information, is called raw data.
To be able to analyze the data sensibly, the raw data is processed into "output data". There are many methods to process the data, but basically the scientist organizes and summarizes the raw data into a more sensible chunk of data. Any type of organized information may be called a "data set".
Then, researchers may apply different statistical methods to analyze and understand the data better (and more accurately). Depending on the research, the scientist may also want to use statistics descriptively or for exploratory research.
What is great about raw data is that you can go back and check things if you suspect something different is going on than you originally thought. This happens after you have analyzed the meaning of the results.
The raw data can give you ideas for new hypotheses, since you get a better view of what is going on. You can also control the variables which might influence the conclusion (e.g. third variables).
CENTRAL TENDENCY AND NORMAL DISTRIBUTION
This part of the statistics tutorial will help you understand distribution, central tendency and how it relates to data sets.
The central tendency may give a fairly good idea about the nature of the data (mean, median and mode shows the "middle value"), especially when combined with measurements on how the data is distributed. Scientists normally calculate the standard deviation to measure how the data is distributed.
But there are various methods to measure ho data is distributed: variance, standard deviation, standard error of the mean, standard error of the estimate or "range" (which states the extremities in the data).
To create the graph of the normal distribution for something, you'll normally use the arithmetic mean of a "big enough sample" and you will have to calculate the standard deviation.
But, the distribution will not be normal distributed if the distribution is skewed (naturally) or has outliers (often rare outcomes or measurement errors) messing up the data. One example of a distribution which is not normally distributed is the F-distribution, which is skewed to the right.
So, often researchers double check that their results are normally distributed using range, median and mode. If the distribution is not normally distributed, this will influence which statistical test/method to choose for the analysis.
HYPOTHESIS TESTING - STATISTICS TUTORIAL
How do we know whether a hypothesis is correct or not?
Why use statistics to determine this?
Using statistics in research involves a lot more than make use of statistical formulas or getting to know statistical software.
Making use of statistics in research basically involves
learning basic statistics
understanding the relationship between probability and statistics
comprehension of inferential statistics.
knowledge of how statistics relates to the scientific method.
Statistics in research is not just about formulas and calculation. (Many wrong conclusions have been conducted from not understanding basic statistical concepts)
Statistics inference helps us to draw conclusions from samples of a population.
When conducting experiments, a critical part is to test hypotheses against each other. Thus, it is an important part of the statistics tutorial for the scientific method.
This means that not all differences between the experimental group and the control group can be accepted as supporting the alternative hypothesis - the result need to differ significantly statistically for the researcher to accept the alternative hypothesis. This is done using a significance test (another article).
Depending on the hypothesis, you will have to choose between one-tailed and two tailed tests.
Often there is a publication bias when the researcher finds the alternative hypothesis correct, rather than having a "null result", concluding that the null hypothesis provides the best explanation.
If applied correctly, statistics can be used to understand cause and effect between research variables.
It may also help identify third variables, although statistics can also be used to manipulate and cover up third variables if the person presenting the numbers does not have honest intentions (or sufficient knowledge) with their results.
Misuse of statistics is a common phenomenon, and will probably continue as long as people have intentions about trying to influence others. Proper statistical treatment of experimental data can thus help avoid unethical use of statistics. Philosophy of statistics involves justifying proper use of statistics and establishing the ethics in statistics.
Here is another great statistics tutorial which integrates statistics and the scientific method.
RELIABILITY AND EXPERIMENTAL ERROR
Statistical tests make use of data from samples. These results are then generalized to the general population. How can we know that it reflects the correct conclusion?
Contrary to what some might believe, errors in research are an essential part of significance testing. Ironically, the possibility of a research error is what makes the research scientific in the first place. If a hypothesis cannot be falsified (e.g. the hypothesis has circular logic), it is not testable, and thus not scientific, by definition.
If a hypothesis is testable, to be open to the possibility of going wrong. Statistically this opens up the possibility of getting experimental errors in your results due to random errors or other problems with the research. Experimental errors may also be broken down into Type-I error and Type-II error.
A power analysis of a statistical test can determine how many samples a test will need to have an acceptable p-value in order to reject a false null hypothesis.
Replicating the research of others is also essential to understand if the results of the research were a result which can be generalized or just due to a random "outlier experiment". Replication can help identify both random errors and systematic errors (test validity).
What you often see if the results have outliers, is a regression towards the mean, which then makes the result not be statistically different between the experimental and control group.
STATISTICAL TESTS
At this stage in the statistics tutorial for the scientific method, we're introducing some commonly used statistics tests/methods.
What is the difference between correlation and linear regression? Basically, correlation is about the strength between the variables whereas linear regression is about the best fit line in a graph.
a Path Analysis is an extension of the regression model
A Factor Analysis attempts to uncover underlying factors of something.
The Meta-Analysis frequently make use of effect size
Bayesian Probability is a way of predicting the likelihood of future events in an interactive way, rather than to start measuring and then get results/predictions.
TESTING HYPOTHESES STATISTICALLY
Student's t-test is a test which can indicate whether the null hypothesis is correct or not. In research it is often used to test differences between two groups (e.g. between a control group and an experimental group).
The t-test assumes that the data is more or less normally distributed and that the variance is equal (this can be tested by the F-test).
A Z-Test is similar to a t-test, but will usually not be used on sample sizes below 30.
A Chi-Square can be used if the data is qualitative rather than quantitative.
COMPARING MORE THAN TWO GROUPS
An ANOVA, or Analysis of Variance, is used when it is desirable to test whether there are different variability between groups rather than different means. Analysis of Variance can also be applied to more than two groups. The F-distribution can be used to calculate p-values for the ANOVA.